Questions

  1. What library does Scrapy use to extract content from web pages with expressions, in a manner similar to regular expressions?
  2. What XPath expression could we use to extract the images from the HTML code retrieved from a given URL?
  3. What XPath expression could we use to extract the links from the HTML code retrieved from a given URL?
  4. What method of the BeautifulSoup module returns all the elements with a given tag?
  5. What basic files and folders make up a Scrapy project?
  6. In which part of our Scrapy project do we define the extraction procedure for each of the items?
  7. In which part of our Scrapy project do we define the classes that allow us to validate the extracted data or save it in a database?
  8. What is the main Scrapy class that allows us to define our spider?
  9. What is the main method you must implement when building an item pipeline?
  10. What is the main platform for deploying spiders in the cloud, and which commands are used to do so?