+ 2

Webscraping

Hi there So for school i have to do these massive projects, but i always need the date and author of the website i use to get my information. Sometimes getting the date is hard, because it's not always on it. So i thought i would webscrape it. I would use the "inurl:" google operater, but BeautifulSoup can't request the url. For example: "https://www.sololearn.com/Discuss/New/" would work. inurl:https://www.sololearn.com/Discuss/New/ doesn't work. I've been searching the web for a while but didn't find anything. Do you have some ideas i can try? I would really appreciate it. Thank you in advance Jordi

python webscraping

11th Jan 2020, 2:17 PM

Jordi

2 Réponses

"inurl:https://sololearn.com/Discuss/New" is not a valid url. A valid url is in the following form <http/https>://<www/subdomain/none>.<domain>.<domain-extention>/Path like: https://example.com/example Anyway, what data you want to scrape from what URL? Maybe I can help you with that

11th Jan 2020, 2:49 PM

Sarthak

Well i was planning to make a program that outputs the date of a website. When the user enters the url, it ouputs the date. It sucks that the "inurl" doesn't work, cuz it would always be in the same place for every website. anyways, thank you. I will find something

11th Jan 2020, 2:58 PM

Jordi