Kaggle is a predictive modeling, analytics, and data sharing site where more than 500,000 users compete and collaborate to explore datasets made available by business partners who believe crowdsourced analytics may bring them greater insight. Cash prizes are awarded to top performers, and community members learn from one another by sharing strategy and code. More than 200 competitions have been run since Kaggle launched in 2010, and the site has become a playground of sorts for data scientists and others interested in honing their analytic skills.
This year Kaggle conducted its first ‘State of Data Science & Machine Learning’ survey and received input from more than 16,000 respondents. The full survey data, along with visualizations of some responses, can be found at kaggle.com.
The tools and methods we teach in our Data Science Bootcamp align with the results of this survey. Our first cohort is currently learning to write Python in Jupyter Notebooks and will learn R this spring. We also practice and build on those skills by focusing each new unit on understanding and answering a specific data question, because we agree that understanding the question at hand is a key part of the data science process. We’ve also acquired some real-life messy data to help students get acquainted with the challenges of data munging. As the Data Science Bootcamp continues to evolve, we aspire to help grow the talent pool in Nashville and demonstrate how data science can add value to a business.
As the field of data science grows, particularly here in Nashville, we’ll be curious to see how these challenges change.