Click here to Skip to main content
Click here to Skip to main content
 
Add your own
alternative version

Tagged as

Web scraping with regular expressions

, 18 Jan 2013 CPOL
Using regular expressions for web scraping is sometimes criticized, but I believe they still have their place, particularly for one-off scrapes. Let's say I want to extract the title of a particular webpage - here is an implementation using BeautifulSoup, lxml, and regular expressions:import reimpor

Revisions


  

Compare Revision Minor Date Status Editor
2 - publicly available No 18-Jan-13 13:28 Available Sean Ewington
Updates in content.
1 No 17-Jan-13 22:04 Draft Richard Penman

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Share

About the Author

Richard Penman

Australia Australia
No Biography provided

| Advertise | Privacy | Terms of Use | Mobile
Web01 | 2.8.141220.1 | Last Updated 18 Jan 2013
Article Copyright 2013 by Richard Penman
Everything else Copyright © CodeProject, 1999-2014
Layout: fixed | fluid