+ 16
Python is really good for webscraping and crawling. Its scrapy and beautifulsoup4 libraries are great, easy and intuitive tools for even commercially viable webscraping.
27th Mar 2018, 9:21 PM
Kuba Siekierzyński
Kuba Siekierzyński - avatar
+ 8
Depending on how complex your program is file_get_contents and explode() do the job for me.
27th Mar 2018, 6:25 PM
Kevin Eldurson
Kevin Eldurson - avatar
+ 6
Disclaimer: I️ have never used a language for web scrapping. I️ would rule out any language that does not support parallelism as many computer these days have multiple cores. Most languages will probably be fast enough for your needs, however I️ would recommend c and golang for speed.
27th Mar 2018, 7:28 PM
Freetoast
Freetoast - avatar
+ 4
I agree with Kuba Siekierzyński beautifulsoup is training 1/2 an hour, if you ha e basic knowlesge of Python
29th Mar 2018, 10:46 AM
Oma Falk
Oma Falk - avatar
+ 2
I have done web scraping with PHP (cURL) and Python (requests, beautifulSoup, Selenium) so far and Python is my preferred choice. Be aware that good web scraping is most often not about speed. Depending on your needs, it might be to look as human (check your headers, referrers etc.) as possible and to be intentionally slower than possible in order to not being detected if your IP is hitting the target too many times each minute!
28th Mar 2018, 7:42 AM
Pe Kie
Pe Kie - avatar
+ 2
Since you are asking for "FASTEST", people share good experiences with GO scrappers: Simple web scraper: https://schier.co/blog/2015/04/26/a-simple-web-scraper-in-go.html List of scrapers and related tools: https://github.com/lorien/awesome-web-scraping/blob/master/golang.md Web Scraper in Go, similar to BeautifulSoup https://github.com/anaskhan96/soup How to speed it up? https://juliensalinas.com/en/how-to-speed-up-web-scraping-with-go-golang-concurrency/ Go is so much faster than Python or Ruby and uses so much less memory than Java, it’s really the natural choice for both enterprises and startups. — Nate Finch.
28th Mar 2018, 1:06 PM
Bill Zelenko
Bill Zelenko - avatar
+ 2
Kuba Siekierzyński please make a course on web scraping. I really loved your explanation in Google Maps course
29th Mar 2018, 3:49 PM
Aditya Rana
+ 1
in my opinion, it's python
28th Mar 2018, 7:38 AM
Ahmed Hossam Gamaleldin
Ahmed Hossam Gamaleldin - avatar
+ 1
The speed is not necessary something you really need if by speed refers to the peed of a binary executable compared to scripts. I have seen many languages used for web scraping but I saw a YouTube tutorial for using Ruby for just that purpose that blow my mind. The thing was not speed but the ability to load different "modules" that was for example behaving like a Firefox browser without the usual interface and the ability to assign what it fetched to suitable variables for processing according to your needs. The simplicity and almost exactly "human" like interaction (which could be tweaked further) from the "targets" perspective made me realize the wast capabilities it presented.
28th Mar 2018, 9:50 AM
Jan B. Michanek
Jan B. Michanek - avatar
0
Python Is the Best for web scrapping and crawling . and also it is easy to learn as compare to other programming languages like java , c++ etc.
28th Mar 2018, 8:06 AM
Altahaan Rauf
Altahaan Rauf - avatar
28th Mar 2018, 2:28 PM
Benson Nwarie
Benson Nwarie - avatar
0
google it, you will get perfect answer!
28th Mar 2018, 6:51 PM
Faisel Nur
Faisel Nur - avatar
0
Python is the best.
13th Apr 2018, 9:28 AM
Dee Flomo
Dee Flomo - avatar