{"id":7511,"date":"2019-11-05T10:17:43","date_gmt":"2019-11-05T15:17:43","guid":{"rendered":"https:\/\/blog.brainstation.io\/?p=7511"},"modified":"2020-04-03T13:19:25","modified_gmt":"2020-04-03T17:19:25","slug":"how-to-prep-your-data-scientist-for-success","status":"publish","type":"post","link":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success","title":{"rendered":"How to Prep Your Data Scientist for Success"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">We take it for granted that there is a deluge of data, and that\u2019s not wrong. <\/span><span style=\"font-weight: 400;\">A report from Domo<\/span><span style=\"font-weight: 400;\"> estimates that 90 percent of the data recorded in human history was generated in the last two years. Organizations are now collecting data at an unprecedented rate, and business leaders are working to leverage this mountain of data into actionable business insights and intelligent data-driven products. And to really do that, you need a Data Scientist.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">However, a near-universal problem for Data Scientists is that although data is being <\/span><i><span style=\"font-weight: 400;\">collected<\/span><\/i><span style=\"font-weight: 400;\"> at an unprecedented rate, obtaining access to that data and working with it can be anywhere from difficult to impossible. <\/span><\/p>\n<p><span style=\"font-weight: 400;\"><a href=\"https:\/\/brainstation.io\/blog\/3-things-you-need-to-do-before-hiring-a-data-scientist\" target=\"_blank\" rel=\"noopener\">In a recent post<\/a>,<\/span><span style=\"font-weight: 400;\"> we talked about building a data science roadmap: a set of ideas framed in the language and structure of data science and machine learning that a newly hired Data Scientist can work on. 
In this post, we will dive deeper into what it takes to ensure that the Data Scientist can actually obtain the data required to begin working on this roadmap.<\/span><\/p>\n<h2>What Data Does a Data Scientist Need, Anyway?<\/h2>\n<p><span style=\"font-weight: 400;\">A typical data science project centers on building a <\/span><i><span style=\"font-weight: 400;\">model<\/span><\/i><span style=\"font-weight: 400;\">. A model is a mathematical system that in some way mimics the process that is generating your data. If you have a reasonably good model, you can examine it to learn about the process that you\u2019re dealing with, or you can use the model to predict how that process will behave in the future. <\/span><span style=\"font-weight: 400;\">Say you run an eCommerce site and you wish to build a model to predict how likely a customer is to purchase the product they\u2019re looking at. This is the type of project that a Data Scientist would be responsible for.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Data Scientists build models by showing a computer many previous examples of the outcome in question. To build a model to predict whether a customer will buy a certain product, we need many examples of customers who looked at a product, and we then need to know whether they actually purchased it. This last point has a key subtlety that occasionally trips up <a href=\"https:\/\/brainstation.io\/blog\/what-is-data-science\" target=\"_blank\" rel=\"noopener\">data science newbies<\/a>: we need examples of customers who purchased the product and of customers who did not. Without both outcomes, the model has nothing to compare and cannot learn what separates a buyer from a browser.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">So, for the Data Scientist to work on this problem, she will need a record of visitors\u2019 product browsing history, and an indication of whether each product viewed was purchased or not. 
Additionally, the Data Scientist requires data about the visitor and their history: demographic data, behavioral data, purchasing history, and any other data that could help in building this type of model.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Before setting your Data Scientist off to work on this type of model, it\u2019s a worthwhile exercise to go through this brainstorming process and identify all of the data that will be necessary for her to complete the project.<\/span><\/p>\n<h2><strong>You Know the Data Exists, but Where Is It?<\/strong><\/h2>\n<p><span style=\"font-weight: 400;\">Although we are collecting data rapidly, most of this collection is incidental, and not for the purpose of <a href=\"https:\/\/brainstation.io\/course\/online\/data-science\" target=\"_blank\" rel=\"noopener\">data science<\/a> or machine learning. Data within a single organization is collected through invoices, ledgers, content management systems, customer relationship systems, analytics services, email inboxes, spreadsheets, and so on. As a result, the data is often messy, hard to reach, and difficult to interpret. For each of these data sources, credentials must be created and permissions granted, and access may come with non-disclosure agreements and privacy restrictions.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">While Data Scientists are increasingly trained to move data from all of these sources and transform it into the formats required by modern machine learning tools, it\u2019s a time-consuming process they can rarely do alone. For this reason, there is a growing job market for <\/span><i><span style=\"font-weight: 400;\">Data Engineers<\/span><\/i><span style=\"font-weight: 400;\">: Software Engineers trained to seamlessly move data from where it was collected to where it needs to be. 
If your data architecture is particularly complex, a dedicated Data Engineer can be an invaluable asset to the team, helping the Data Scientist build models more efficiently.<\/span><\/p>\n<h2><strong>Think About Data Engineering<\/strong><\/h2>\n<p><span style=\"font-weight: 400;\">Your Data Scientist might be willing to handle data engineering on her own, but even then it\u2019s important to consider the challenges she will face. For the eCommerce site, the Data Scientist requires access to a host of services, and she will be able to hit the ground running if access credentials, privacy considerations, data residency issues, and any other hurdles are taken care of before she even begins. This is another thing that managers can do, without any special knowledge of <a href=\"https:\/\/brainstation.io\/course\/online\/remote-data-science-bootcamp\" target=\"_blank\" rel=\"noopener\">data science<\/a> or machine learning, to smooth the transition to a data-driven organization.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Looking to hire a data scientist but don\u2019t know how your organization\u2019s data is collected? You might want to read this. <\/p>\n","protected":false},"author":7,"featured_media":7512,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[343],"tags":[332,419,564,577,405,578,548,550],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v18.9 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>How to Prep Your Data Scientist for Success | BrainStation\u00ae Blog<\/title>\n<meta name=\"description\" content=\"Looking to hire a data scientist but don\u2019t know how your organization\u2019s data is collected? Read on to find out how to set up your data scientist for success.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to Prep Your Data Scientist for Success | BrainStation\u00ae Blog\" \/>\n<meta property=\"og:description\" content=\"Looking to hire a data scientist but don\u2019t know how your organization\u2019s data is collected? 
Read on to find out how to set up your data scientist for success.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success\" \/>\n<meta property=\"og:site_name\" content=\"BrainStation\u00ae Blog\" \/>\n<meta property=\"article:published_time\" content=\"2019-11-05T15:17:43+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2020-04-03T17:19:25+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/d2re7sjnpekmig.cloudfront.net\/prod\/wp-content\/uploads\/2018\/09\/dose-media-424257-unsplash.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1160\" \/>\n\t<meta property=\"og:image:height\" content=\"400\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"BrainStation\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebSite\",\"@id\":\"https:\/\/brainstation.io\/blog\/#website\",\"url\":\"https:\/\/brainstation.io\/blog\/\",\"name\":\"BrainStation\u00ae Blog\",\"description\":\"The Digital Learning Company\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/brainstation.io\/blog\/?s={search_term_string}\"},\"query-input\":\"required 
name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#primaryimage\",\"url\":\"https:\/\/d2re7sjnpekmig.cloudfront.net\/prod\/wp-content\/uploads\/2018\/09\/dose-media-424257-unsplash.jpg\",\"contentUrl\":\"https:\/\/d2re7sjnpekmig.cloudfront.net\/prod\/wp-content\/uploads\/2018\/09\/dose-media-424257-unsplash.jpg\",\"width\":1160,\"height\":400,\"caption\":\"data science\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#webpage\",\"url\":\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success\",\"name\":\"How to Prep Your Data Scientist for Success | BrainStation\u00ae Blog\",\"isPartOf\":{\"@id\":\"https:\/\/brainstation.io\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#primaryimage\"},\"datePublished\":\"2019-11-05T15:17:43+00:00\",\"dateModified\":\"2020-04-03T17:19:25+00:00\",\"author\":{\"@id\":\"https:\/\/brainstation.io\/blog\/#\/schema\/person\/9f37983a6c4da6cf5dd422481ac8cf11\"},\"description\":\"Looking to hire a data scientist but don\u2019t know how your organization\u2019s data is collected? 
Read on to find out how to set up your data scientist for success.\",\"breadcrumb\":{\"@id\":\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/brainstation.io\/blog\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to Prep Your Data Scientist for Success\"}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/brainstation.io\/blog\/#\/schema\/person\/9f37983a6c4da6cf5dd422481ac8cf11\",\"name\":\"BrainStation\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/brainstation.io\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/80c14b8388838ae1453aec36606b232d?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/80c14b8388838ae1453aec36606b232d?s=96&d=mm&r=g\",\"caption\":\"BrainStation\"},\"description\":\"BrainStation is a global leader in digital skills training, empowering businesses and brands to succeed in the digital age. Established in 2012, BrainStation has worked with over 250 instructors from the most innovative companies, developing cutting-edge, real-world digital education that has empowered more than 50,000 professionals and some of the largest corporations in the world.\",\"url\":\"https:\/\/brainstation.io\/blog\/author\/brainstation\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"How to Prep Your Data Scientist for Success | BrainStation\u00ae Blog","description":"Looking to hire a data scientist but don\u2019t know how your organization\u2019s data is collected? 
Read on to find out how to set up your data scientist for success.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success","og_locale":"en_US","og_type":"article","og_title":"How to Prep Your Data Scientist for Success | BrainStation\u00ae Blog","og_description":"Looking to hire a data scientist but don\u2019t know how your organization\u2019s data is collected? Read on to find out how to set up your data scientist for success.","og_url":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success","og_site_name":"BrainStation\u00ae Blog","article_published_time":"2019-11-05T15:17:43+00:00","article_modified_time":"2020-04-03T17:19:25+00:00","og_image":[{"width":1160,"height":400,"url":"https:\/\/d2re7sjnpekmig.cloudfront.net\/prod\/wp-content\/uploads\/2018\/09\/dose-media-424257-unsplash.jpg","type":"image\/jpeg"}],"twitter_card":"summary_large_image","twitter_misc":{"Written by":"BrainStation","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebSite","@id":"https:\/\/brainstation.io\/blog\/#website","url":"https:\/\/brainstation.io\/blog\/","name":"BrainStation\u00ae Blog","description":"The Digital Learning Company","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/brainstation.io\/blog\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#primaryimage","url":"https:\/\/d2re7sjnpekmig.cloudfront.net\/prod\/wp-content\/uploads\/2018\/09\/dose-media-424257-unsplash.jpg","contentUrl":"https:\/\/d2re7sjnpekmig.cloudfront.net\/prod\/wp-content\/uploads\/2018\/09\/dose-media-424257-unsplash.jpg","width":1160,"height":400,"caption":"data science"},{"@type":"WebPage","@id":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#webpage","url":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success","name":"How to Prep Your Data Scientist for Success | BrainStation\u00ae Blog","isPartOf":{"@id":"https:\/\/brainstation.io\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#primaryimage"},"datePublished":"2019-11-05T15:17:43+00:00","dateModified":"2020-04-03T17:19:25+00:00","author":{"@id":"https:\/\/brainstation.io\/blog\/#\/schema\/person\/9f37983a6c4da6cf5dd422481ac8cf11"},"description":"Looking to hire a data scientist but don\u2019t know how your organization\u2019s data is collected? 
Read on to find out how to set up your data scientist for success.","breadcrumb":{"@id":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/brainstation.io\/blog\/how-to-prep-your-data-scientist-for-success#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/brainstation.io\/blog"},{"@type":"ListItem","position":2,"name":"How to Prep Your Data Scientist for Success"}]},{"@type":"Person","@id":"https:\/\/brainstation.io\/blog\/#\/schema\/person\/9f37983a6c4da6cf5dd422481ac8cf11","name":"BrainStation","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/brainstation.io\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/80c14b8388838ae1453aec36606b232d?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/80c14b8388838ae1453aec36606b232d?s=96&d=mm&r=g","caption":"BrainStation"},"description":"BrainStation is a global leader in digital skills training, empowering businesses and brands to succeed in the digital age. 
Established in 2012, BrainStation has worked with over 250 instructors from the most innovative companies, developing cutting-edge, real-world digital education that has empowered more than 50,000 professionals and some of the largest corporations in the world.","url":"https:\/\/brainstation.io\/blog\/author\/brainstation"}]}},"_links":{"self":[{"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/posts\/7511"}],"collection":[{"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/comments?post=7511"}],"version-history":[{"count":6,"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/posts\/7511\/revisions"}],"predecessor-version":[{"id":10896,"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/posts\/7511\/revisions\/10896"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/media\/7512"}],"wp:attachment":[{"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/media?parent=7511"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/categories?post=7511"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/brainstation.io\/blog\/wp-json\/wp\/v2\/tags?post=7511"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}