Help Center

List of all supported fields

  1. Article content: Automatically identified as the most important content on a page. This usually contains the relevant part of a page after ignoring elements like <header>, <footer>, <nav>, and sidebars. This is a good field to use in search weights and as a fallback for the search description.
  2. Brand: This is extracted from a page’s microdata and holds the brand name.
  3. Brand.name: This is extracted from a page’s microdata and holds additional information about brands like a sub-brand when available. Empty if not available. 
  4. Content: All the visible content extracted from a page is stored in this field. This is usable in search weights and search description. Empty if not available.
  5. Date: Taken from the page’s microdata and stores the date of publishing. Empty if not present. Generally used to boost recent content.
  6. Description: Taken automatically from a page. The crawler identifies the part of the page that best describes it and stores that text in this field. This is best used for the search results snippet.
  7. Extracted Content: This contains all the content manually extracted from a page using the manual extraction feature available on the dashboard. Empty if no content is extracted. 
  8. Host: Stores the domain of a page. This is useful to boost a particular domain/sub-domain in a large index. It can also be used to set up filters.
  9. Id: A unique identifier for a page. For the crawl-based solution, it is the same as the page’s URL.
  10. Image: Stores the URL to the extracted image. Can be used to boost pages that have images or contain a particular image.
  11. is_Product: Stores true/false, based on the page’s microdata, to identify product pages. It is useful for setting up boosts and filters.
  12. Last modified: Taken from the sitemap’s last modified field. Generally used to boost recent content. Empty if not present. 
  13. Meta description: Taken as is from the <meta name="description"> tag of a webpage. Empty if the tag is not present. This usually contains a brief description of the page’s content and is an ideal choice for the search result snippet.
  14. Meta keywords: Taken from the <meta name="keywords"> tag of a page. This is useful in assigning search weights.
  15. Price: Taken from the page’s microdata and holds the value of the price. Can be used to set up filters.
  16. Price currency: Holds the currency of the price, taken from the microdata.
  17. Priority: Taken from the sitemap priority field. Empty if not present.
  18. Title: This is extracted from the <title> tag of a page.
  19. Type: Stores the file type of the indexed content. Can be HTML, PDF, etc. Useful for setting up filters and boosts.
  20. URL: The URL of a page is stored as such in this field.
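
Several of the fields above are described as candidates for the search result snippet or as inputs to filters. The sketch below shows one way a client could pick a snippet with a fallback chain and filter product pages by host and price. It is only an illustration: the `PageRecord` class, its attribute names, the fallback order, and the helper functions are all assumptions made for this example and are not part of the product's API or its actual field schema.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PageRecord:
    """Hypothetical in-memory view of an indexed page.

    Attribute names are illustrative stand-ins for the fields listed
    above; they are not the service's real schema.
    """
    url: str
    title: str = ""
    host: str = ""
    meta_description: str = ""
    description: str = ""
    article_content: str = ""
    content: str = ""
    is_product: bool = False
    price: Optional[float] = None


def pick_snippet(page: PageRecord, max_len: int = 160) -> str:
    """Return the first non-empty candidate field, truncated to max_len.

    The order (meta description, description, article content, content)
    is an assumed preference, not a documented default.
    """
    candidates = (
        page.meta_description,
        page.description,
        page.article_content,
        page.content,
    )
    for text in candidates:
        text = text.strip()
        if text:
            return text[:max_len]
    return ""


def filter_products(pages: List[PageRecord], host: str,
                    max_price: float) -> List[PageRecord]:
    """Keep product pages on the given host priced at or below max_price."""
    return [
        p for p in pages
        if p.is_product
        and p.host == host
        and p.price is not None
        and p.price <= max_price
    ]
```

For a page whose meta description and description are both empty, `pick_snippet` falls through to the article content, which matches the fallback role described for that field in item 1.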