Click here to Skip to main content
15,881,967 members
Articles / Web Development / HTML

Web Scraping - Learn The Fundamentals In 60 Minutes

Rate me:
Please Sign up or sign in to vote.
4.87/5 (5 votes)
14 Sep 2018CPOL2 min read 8.4K   12   2
A free video course to learn the fundamentals of web scraping - usually missed by other sources!

Introduction

One of the things I talk to the community about a lot and always get a good response to is the black-art of web scraping. As the world becomes more and more hungry for data, it seems that more often than not, the data we want can be found on a website somewhere. But how to get that data so you can change it the way you want and do something interesting with it and add value to it? This is where web scraping comes in.

Web scraping involves two core skills - obviously, some programming knowledge to get started, and the other arguably more important, is the ability to know how a website is built, to be able to identify where the data you want actually comes from. Is the data you need embedded on the page? (really?) or is it in a CSS file, a tangled up mess encoded in Java-script, or being generated based on a JSON feed from an AJAX call to an API somewhere. Web scraping at the start is less about programming and more about the investigation - this is something that a lot of folks who start to web scrape miss.

One of my most popular series of articles is all about web-scraping - sometimes, however, the text is not enough, and we need visual help to see what's going on. With this in mind, I have created a free webscraping course that teaches the fundamentals of web scraping - it will take you from knowing little about the subject to having a solid knowledge of what you need to do when you start web scraping.

Image 1

Most of the resources (both free and paid) I have come across assume the reader/student has more knowledge than they do - my aim with this course is to give you that important foundation knowledge, of course, for free :)

I am really enjoying the process of making courses (especially free ones) as a new way to pass skills and knowledge to others in the community - here is some of the amazing feedback I have received so far - I am delighted with it and it is the support of the community over the years that has allowed me to get to this stage.

"Finally, an instructor who knows how to teach students so that they are not lost or confused. Thank you!"

"Great course, wish I would have watched it before reading Python Scrapy documentation."

"I loved this course. It's a great introduction to web scraping! It's short, sweet and to the point!"

"Really impressed about theory and strategies to use... goes beyond my expectation"

If you want to learn the very fundamentals of web scraping - this is a great place to start, and it's free, and we like free:    

Free web scraping course link

History

  • 14/Sep/2018: Version 1

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Chief Technology Officer SocialVoice.AI
Ireland Ireland
Allen is CTO of SocialVoice (https://www.socialvoice.ai), where his company analyses video data at scale and gives Global Brands Knowledge, Insights and Actions never seen before! Allen is a chartered engineer, a Fellow of the British Computing Society, a Microsoft mvp and Regional Director, and C-Sharp Corner Community Adviser and MVP. His core technology interests are BigData, IoT and Machine Learning.

When not chained to his desk he can be found fixing broken things, playing music very badly or trying to shape things out of wood. He currently completing a PhD in AI and is also a ball throwing slave for his dogs.

Comments and Discussions

 
SuggestionWeb Scraping Legalities Pin
alexcoder9917-Sep-18 5:58
alexcoder9917-Sep-18 5:58 
GeneralMy vote of 5 Pin
David A. Gray16-Sep-18 8:09
David A. Gray16-Sep-18 8:09 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.