Manual extraction

Manual extraction lets you choose which image and content are extracted from a page and displayed in the search results. If the pages on your website follow a consistent pattern, you can use this feature to configure the search results to your liking. Leave it off if you are unsure how to use it.

Examples:

  1. Extract images from your product pages and display them as the images in the search results
  2. Extract content from product pages and show it as the search snippet

Here is how to use the custom extractor:

  1. Go to cse.expertrec.com and log in to access your dashboard
  2. Navigate to Crawl->Advanced->Manual Extraction
  3. Enter the URL or URL pattern (a regular expression) in the field provided. If you are unfamiliar with regex, third-party regex builders and validators can help you build a pattern and check it before saving
  4. There are two components that you can control using manual extraction:
    1. Image
    2. Search content
  5. To extract the image, open a product page and find the CSS selector for the image. Copy it into the CSS selector field for the image in your control panel
  6. Repeat the same for search content
  7. You can add multiple conditions and reorder them (drag and drop) if you want. Keep in mind that the extractor processes each URL with the first condition that matches it, so order the conditions appropriately
  8. Click on Update once you have made all the changes. A recrawl will be scheduled and changes will start to appear once it completes
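
The first-match rule in step 7 can be sketched in a few lines of Python. This is only an illustration of why ordering matters, not the extractor's actual implementation: the URLs, the selectors, the `CONDITIONS` list, and the `first_matching_condition` helper are all hypothetical, and it assumes a pattern matches when it is found anywhere in the URL.

```python
import re

# Hypothetical conditions in dashboard order:
# (URL pattern, image CSS selector, search content CSS selector)
CONDITIONS = [
    (r"^https://yourwebsite\.com/products/featured/", "img.hero", "div.summary"),
    (r"^https://yourwebsite\.com/products/", "img.product-photo", "div.description"),
]


def first_matching_condition(url, conditions=CONDITIONS):
    """Return the first condition whose pattern matches the URL, or None."""
    for pattern, image_selector, content_selector in conditions:
        # Assumption: a condition applies if its regex is found in the URL.
        if re.search(pattern, url):
            return pattern, image_selector, content_selector
    return None


featured = first_matching_condition("https://yourwebsite.com/products/featured/acr-1926")
regular = first_matching_condition("https://yourwebsite.com/products/acr-1926")
other = first_matching_condition("https://yourwebsite.com/about")
```

If the two conditions were swapped, the broader `/products/` pattern would also catch the featured pages and the more specific condition would never be used. Placing specific patterns above general ones avoids that.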

 

Let’s walk through an example.

Here our aim is to create a custom search engine for products. We will take a sample product page and use its structure to create the search engine components.

Suppose we have a product page whose search result we want to configure using selectors. If all your product pages have a similar URL pattern (say, yourwebsite.com/products/), you can enter it in the URL pattern field. Otherwise, skip that part.

The product here is ACR 1926. The same settings will also apply to other products whose pages have a similar structure.

How to get the image CSS selector?

  1. Right-click on the required image
  2. Click on Inspect element
  3. Right-click on the highlighted code
  4. Go to Copy -> Copy selector
  5. Paste it in the CSS selector field for image
  6. Click on Update

How to get the search content CSS selector?

  1. Right-click on the required text
  2. Click on Inspect element
  3. Right-click on the highlighted code
  4. Go to Copy -> Copy selector
  5. Paste it in the CSS selector field for search content
  6. Click on Update
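
To get a feel for what a copied selector points at, here is a rough Python sketch that pulls an image URL and a text snippet out of a sample page. It is a simplified illustration under stated assumptions, not how the extractor works internally: it only understands selectors of the form `#some-id`, and the sample HTML, the element ids, and the `IdExtractor` and `extract` names are made up for this example.

```python
from html.parser import HTMLParser


class IdExtractor(HTMLParser):
    """Collects the src attribute and the text of the element with a given id."""

    # Elements that never have a closing tag.
    VOID = {"img", "br", "hr", "input", "meta", "link"}

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.depth = 0  # greater than 0 while inside the target element
        self.src = None
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if self.depth > 0:
            if tag not in self.VOID:
                self.depth += 1
        elif attrs.get("id") == self.target_id:
            self.src = attrs.get("src")
            if tag not in self.VOID:
                self.depth = 1

    def handle_endtag(self, tag):
        if self.depth > 0 and tag not in self.VOID:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth > 0:
            self.text_parts.append(data)


def extract(html, target_id):
    """Return (src, text) for the element whose id equals target_id."""
    parser = IdExtractor(target_id)
    parser.feed(html)
    text = " ".join("".join(parser.text_parts).split())
    return parser.src, text


# Hypothetical product page markup.
PAGE = """
<html><body>
  <img id="product-image" src="/images/acr-1926.jpg">
  <div id="product-description">ACR 1926 <b>sample</b> description</div>
</body></html>
"""

image_src, _ = extract(PAGE, "product-image")
_, snippet = extract(PAGE, "product-description")
```

Here `image_src` plays the role of the image shown in the search result, and `snippet` the role of the search content. If a selector matches nothing on a page, both values come back empty, which is a useful thing to check on a few product pages before clicking Update.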

Now you can check the search engine for this product.

As you can see, the components you configured in the control panel are reflected in the search results.