+ 1

Why does my web crawler only work for selected websites?

I built a web crawler based on thenewboston's Python tutorials and had the program output all the links on Wikipedia's main page, which it did without any problems. I also tried having the code output all the section captions and a few other things. It all worked perfectly. But when I simply wanted to output the links on Amazon's main page, it did nothing. Is Amazon not allowing me to do that, or is it the modules I used? (I used BeautifulSoup from the bs4 module built into PyCharm.)

python pycharm webcrawler

23rd Mar 2019, 4:57 PM

SohndesZeus

2 Answers

+ 4

Try adding a header to your request, as suggested in this Stack Overflow topic: https://stackoverflow.com/questions/23555283/why-cant-i-scrape-amazon-by-beautifulsoup
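For illustration, here is a minimal sketch of what sending a header could look like. It is not the code from the linked topic: it uses only the standard library (`urllib.request` and `html.parser`) in place of bs4 so that it runs without extra installs, and the `HEADERS` value and the names `LinkCollector`, `build_request`, `extract_links`, and `crawl` are made up for this example. A browser-like User-Agent is only an assumption about what the site checks, so the site may still refuse the request.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen

# Assumed browser-like User-Agent string; the exact value is illustrative.
HEADERS = {"User-Agent": "Mozilla/5.0 (X11; Linux x86_64)"}


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag that has one."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def build_request(url):
    # Attach the headers so the request does not carry urllib's default User-Agent.
    return Request(url, headers=HEADERS)


def extract_links(html):
    # Parse an HTML string and return the collected links in document order.
    parser = LinkCollector()
    parser.feed(html)
    return parser.links


def crawl(url):
    # Download the page with the custom headers, then extract its links.
    with urlopen(build_request(url), timeout=10) as response:
        html = response.read().decode("utf-8", errors="replace")
    return extract_links(html)


# Example call (needs network access):
# for link in crawl("https://www.amazon.com/"):
#     print(link)
```

If you fetch pages with the requests library instead, the same dictionary can be passed as `requests.get(url, headers=HEADERS)`, and the response text can still go to BeautifulSoup as before.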

27th Mar 2019, 2:13 PM

Tibor Santa

+ 2

Thank you, I'm going to test that.

27th Mar 2019, 6:45 PM

SohndesZeus
