Getting Data (The Course)
Course Info
This course includes 1 attempt.
Description
Data science is one of the most exciting and fastest growing careers in the world. The goal of this series is to help people with no background and limited resources transition into data science. It would be helpful to have already taken the previous courses Organizing Data Projects, Introduction to R, Version Control, and Data Tidying. We guide you through the rest!
Learning objectives
After taking this course you will be able to:
- Get data from online, databases, and other resources
- Pull the data from these sources in a variety of formats
- Save them and organize them so you can tidy them for analysis
Things you need to do this course
This course is designed for people with no background with Chromebooks and no background in data science. So it is a great introduction for high-school students or people looking for a career change into the tech industry. The only requirements are:
- A computer with a web browser and an internet connection.
- The ability to type and follow instructions.
- The accounts you have set up in previous courses.
How you will be graded
The course has a series of short quizzes, one for each chapter. You will get two attempts at each quiz and your best score for each quiz will count toward your final score. If you receive more than 70% of the points across all quizzes you will pass. If you receive more than 90% of the points across all quizzes you will pass with honors. You get two attempts at the class with each class purchase.
How to report an error
If you find a bug, typo, or issue in the material, feel free to contact us using this form.
Course Material
- 1 Where Does Data Come From?
- 2 CSV, Excel, and TSV Files
- 3 Importing Data from Google Sheets
- 4 Getting Data From the Internet
- 5 Relational Data
- 6 Unconventional Sources of Data
- 7 Finding Data
- 8 Internet Safety
- 9 Data Privacy
- 10 Ethical Data Science
- 11 References
- About this Course
- About the Authors
Instructors
Jeff is Chief Data Officer, Vice President, and J Orin Edson Foundation Chair of Biostatistics at the Fred Hutchinson Cancer Center. Previously, he was a professor of Biostatistics and Oncology at the Johns Hopkins Bloomberg School of Public Health and co-director of the Johns Hopkins Data Science Lab. His group develops statistical methods, software, data resources, and data analyses that help people make sense of massive-scale genomic and biomedical data. As the co-director of the Johns Hopkins Data Science Lab he helped to develop massive online open programs that have enrolled more than 8 million individuals and partnered with community-based non-profits to use data science education for economic and public health development. He is a Fellow of the American Statistical Association and a recipient of the Mortimer Spiegelman Award and Committee of Presidents of Statistical Societies Presidential Award.
Shannon Ellis is an Associate Teaching Professor in the Cognitive Science Department at UC San Diego.
Aboozar Hadavand is a postdoctoral fellow at Johns Hopkins Bloomberg School of Public Health. His current research involves analyzing MOOC data. He has previously taught at Barnard College (Columbia University), Brooklyn College, and Yeshiva University.
Community
This course has a private forum for learners who are taking this course.
The Leanpub 60 Day 100% Happiness Guarantee
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.
You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!
So, there's no reason not to click the Add to Cart button, is there?
See full terms...
Earn $8 on a $10 Purchase, and $16 on a $20 Purchase
We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.
(Yes, some authors have already earned much more than that on Leanpub.)
In fact, authors have earnedover $14 millionwriting, publishing and selling on Leanpub.
Learn more about writing on Leanpub
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Learn more about Leanpub's ebook formats and where to read them