Manually Extracted Content from Wikipedia on Web Scraping:


Manually Extracted Content from Wikipedia on Web Scraping:

This section presents a good basic introduction to this topic, and hence, I have kept the content as it is. While Wikipedia is a good information tool, prudence is required while scanning through Wikipedia. Diverse educational and experiential background certainly helps to validate the content, not only over at Wikipedia, but also, at other sites.

Coming to web scraping, as we know, garbage in garbage out. Web scraping extracts what is presented, and if the presented info is a garbage, it will give out garbage as well. Further, using this technique, information can be extracted without active website participation. That is why I had mentioned in srikanthkidambi.com several years back that information can reach places we don't even know irrespective of whatever data analytics tools in place.

#Web #Scraping #Information #Copying #Crawler #WebCrawler

Comments