Skip to main content

How to manage exclusions in the warm queue

Use exclusions to optimize the extension's performance and avoid warming up unnecessary pages.

Excluding pages by page types

Use exclusions to filter out a group of pages by their type. For example, catalog pages with selected filters are unique and are visited only 1-2 times, but these pages are numerous and can impact the warming of popular pages. Therefore, you can exclude them from the warming process. To do this, navigate to System -> Page Cache Warmer -> Settings -> Mirasvit extensions -> Page Cache Warmer -> General warming and exclusion configuration -> Exclusions for warming queue -> Page types to exclude, and select Catalog · Category · View · * in the field. You can select multiple page types at once.

Save the changes. When warming jobs are run, the extension will check each URL. If it belongs to the specified type, it will be ignored and removed from the list of URLs in the Pages section. The same URL will be ignored if you run crawler or synchronization commands.


Excluding pages by expressions

Use exclusions to exclude specific pages or a page from warming. To add an exclusion, go to System -> Page Cache Warmer -> Settings -> Mirasvit extensions -> Page Cache Warmer -> General warming and exclusion configuration -> Exclusions for warming queue -> Pages to exclude. For example, to exclude all pages that have GET parameters (except for paging, like ?p=2), add the following expression and save the settings:

/\?(?!p=).*/

Other popular examples:

  • Ignore all pages that have GET parameters (filtered category pages, search results pages, etc.)

    /\?/
  • Ignore pages with more than 2 GET parameters.

    /\?[\w=]*&[\w=]*&[\w=]*/ 

Before using, we recommend studying the rules for creating regular expressions and testing your queries.


Excluding pages by User-Agents

Use this if you want to ignore bot visits or visitors with specific User-Agents, especially if your server is already configured to block them. For example, setting /.*Googlebot.*/ will ignore the Google crawler, preventing pages visited by Googlebot from being added to the cache warmer queue.