The July Release of our RheinInsights Retrieval Suite

July 1, 2025

The July release of our Retrieval Suite brings another level of significant crawl performance improvements and hybrid search for retrieval augmented generation. This means, we integrated improved query pipelines which give the most relevant search results for natural language queries.

Screenshot showing a Crawl Rate of a File Share Connector with Microsoft SQL

Crawl Rates in Development Environments

Our connectors achieve reliably high and stable crawl rates. Of course, crawl performance always depends on the following factors:

  • content source,

  • the sizing of the host machine where our Suite runs,

  • the data target

  • and the connected database

An extreme example is our file connector, which can crawl a local file system with a locally connected Microsoft SQL Server as data store and postponed text extraction with up to 1,500 documents per second in a full scan and 1,000 documents for a recrawl.

For remote sources, such as Atlassian Confluence data center, the connector achieves crawl rates with stable 100-120 pages per second for a full scan.

Of course, one always needs to balance index freshness and load on the source systems. Therefore, our connectors provide options to limit the load on the content sources.

More insights
Permission-Based Grounding in Microsoft Copilot > The July Release of our RheinInsights Retrieval Suite > Adding On-Premises Actions to Copilot