November 30, 2022
xx min read

Convr AI's Data Lake Demystified

Retrieving data can be time consuming and expensive. Our Convr AI data pipelines automate the process of retrieving data from hundreds of different sources.

Data is an essential component of our products at Convr. This article describes the structure and method by which we deliver data to the Convr platform and applications.\nThe following diagram depicts how large volumes of data are ingested, processed, transformed, stored, and made accessible to users.

Components of Convr’s data architecture:

Ingest

Convr’s data pipelines ingest data from private, public, and social media sites. A few examples of data that is ingested:

  • Firmographic, property, and geolocation data,
  • Business inspection, violation, permit, and license data
  • Business reviews from social media sites such as Yelp, Google

Retrieving data is time consuming and expensive. Our data pipelines automate the process of retrieving data from hundreds of different sources.

Standardize

Since data sources collect information for different purposes, similar data is very often stored within different data models and on different database systems. Hence, the same information may be represented in a variety of ways and in different formats.\nData standardization converts information into a common format and common terminology. This delivers the ability to build reports and applications without having to create multiple mappings of the same data. For example, phone numbers are converted into one format:\n

Combine/Enrich

The data pipelines pull together the information for each company from all data sources. In addition, the data is enhanced with statistical risk scoring generated by data science models.

Integrate

In addition to ingesting data from various sources, the Convr application performs real-time integration of data from data analytics companies (such as Verisk, Core Logic, etc.) and social media sites (such as Google places, Yelp, and Bing)

Deliver

Data is delivered to our customers via Convr application UI (User Interface) and via data API’s.

Convr Data Infrastructure

Convr data pipelines run on AWS cloud resources. Data pipelines use S3 (Simple Storage Service) for storage and EC2 (Elastic Compute Cloud) nodes for processing.

Convr applications and data pipelines are implemented using open-source software:

  • Python programming language
  • ElasticSearch for our data lake
  • Apache Spark™, Apache Kafka™ for data transformation
  • Flask framework for web services and data API’s
  • mySQL, Maria for our application databases

Our data stack philosophy:

  • Look to mature open-source software with large community adoption and support rather than re-inventing the wheel
  • Ensure the software scales easily as data is constantly increasing and new products are being added as the business grows
  • Automate processes as much as possible by using CI/CD (Continuous Integration/Continuous Deployment) and other orchestration tools

Given the critical importance of data to our business and the business of our customers, Convr is committed to ensuring the standards and security of our data platform. We are also committed to the education of our customers and the transparency of our operations. Please look forward to other articles describing our infrastructure and capabilities.

blogs

Keep Reading

More articles on AI, underwriting and the future of commercial P&C.

XX MIN READ

The Convr Risk Context Engine: Why it Matters to the Chief Underwriting Officer

The problem every Chief Underwriting Officer (CUO) faces isn't a shortage of AI tools. It's a shortage of AI they can trust and embed seamlessly within their workflow.

Generative models, agentic assistants, and large language models have proliferated across the insurance industry at an unprecedented pace. Nearly all of them share the same critical flaw: they’re built on general-purpose foundation models that were never trained on commercial insurance and have never seen a real submission, a real loss run, or a real underwriter's decision. They can produce fluent text about underwriting without understanding it. For a CUO responsible for combined ratios, regulatory defensibility, and the consistency of thousands of risk decisions per year, that gap is a material liability.

The Convr Risk Context Engine (RCE) is the answer to that problem and it’s the only answer of its kind.

What makes the RCE unique

Unveiled on June 9, 2026, the RCE is a commercial P&C knowledge graph and semantic ontology that encodes the language, structures, exposures, classifications, and decision logic of underwriting into a unified, machine-readable model, calibrated against a decade of real submissions, real exposures, and real underwriter feedback from leading carriers in production.

The practical implication of that architecture is profound. The RCE does not approximate what a painting contractor is; it knows the difference between a painting contractor and a roofing contractor at the classification level and it knows how that difference should affect appetite, coverage, and pricing. It understands that "general liability for a habitational account in coastal Florida" carries a specific set of exposure signals that have nothing in common with "general liability for a light manufacturing operation in the Midwest." Rather than just pattern recognition over text, the RCE is structured knowledge about commercial insurance, expressed as a machine-readable graph that every AI capability in the Convr workbench runs on top of.

Calibrated on a decade of production data and more than 2,500 integrated sources, the RCE powers every AI capability across the Convr AI Underwriting Workbench from intake to business classification, risk scoring, data enrichment, and workflows.

Why this matters operationally to the CUO

The CUO's mandate is to make good risk decisions, consistently, at scale, in ways the organization can defend and document exactly with reliability. The RCE advances all these dimensions simultaneously.

Consistency: One of the most persistent sources of combined ratio deterioration is inconsistent appetite application . . . underwriters in different territories or teams making materially different decisions on similar risks. The RCE delivers consistent, traceable, verifiable risk data, in-line, which means the same exposure in the same class code is evaluated against the same criteria every time, regardless of which underwriter opens the file or which office processes the submission. The CUO sets the appetite rules; the RCE enforces them uniformly.

Defensibility: Every classification, appetite call, and risk-score output traces back through the ontology to the source submission documents, loss data, and underwriter decisions that informed it. The regulatory direction is reinforcing the value of the RCE. When a regulator, reinsurer, or internal audit function asks why a particular account was accepted or declined, the answer is not a probability score from a black box. It is a documented chain of reasoning tied to real data. In a regulatory environment that is increasingly scrutinizing AI-driven decisioning, audit-ready outputs are not a nice-to-have. They are becoming a condition of doing business.

Scale: The RCE reduces submission-through-quote times by 70% and increases new business win rates. Carriers using the Convr AI Underwriting Workbench have documented an 8% combined ratio improvement on commercial auto lines, 20,000 submissions per month processed fully automatically on non-admitted lines, and 38% more quotes generated per underwriting assistant on financial lines, with quote generation time dropping from two hours to 20 minutes. These are not projections. They are outcomes from carriers already running the RCE in production.

The distinction that separates the RCE from everything else

As Convr Chief Executive Officer John Stammen stated at the RCE launch, "Everyone talks about models. The real question is what they're grounded in. Without a commercial P&C knowledge graph and ontology underneath them, generative and agentic AI are confident guessers. The RCE supplies the missing context . . . what a submission means, what an exposure is, what an underwriter decides . . . and turns outputs into decisions a carrier can defend."

That framing captures the CUO's core concern precisely. A CUO does not need AI that sounds right. They need AI that is right, and that can prove it. Convr unifies fragmented insurance data into a structured data model powered by ontology, schema, semantics, and a knowledge graph within the context engine . . . preserving risk relationships and enabling assistive AI to deliver decision-ready underwriting insights and trace those insights to any historical moment in time. The RCE is the infrastructure that transforms raw submission data, third-party enrichment, and historical loss experience into a single, coherent view of a risk, in real time, at the point of decision.

What this means for the CUO's book of business

For small commercial and BOP books, the RCE eliminates the premium leakage and adverse selection that accumulates when submissions are classified by hand. Business classification errors, misapplied territory codes, and underclassed risks . . . the chronic sources of ratio deterioration on high-volume books are caught at intake before they ever reach a rating engine.

For mid-market and multi-line accounts, the RCE compresses the enrichment cycle that consumes the most underwriter time. Rather than spending two to three days researching an account before rating, an underwriter opens the submission to find the business already classified, the exposure already verified, prior loss signals already surfaced, and appetite already scored against the carrier's own guidelines. The judgment call that makes underwriting valuable happens in minutes rather than days.

For large and complex accounts, the RCE provides the CUO with something that has historically been impossible to achieve at scale: a portfolio-level view of risk concentration, exposure accumulation, and appetite consistency across the entire book, updated continuously as new submissions are processed. The CUO who can see the book in real time, rather than waiting for a quarterly report is the CUO who can act on emerging trends before they become loss events.

The bottom line

The Convr Risk Context Engine is the foundational infrastructure that makes AI in commercial underwriting legitimate. It’s grounded in a decade of real production data, structured around the actual language and logic of commercial P&C insurance, and designed to produce outputs that underwriters, CUOs, and regulators can all defend. For the CUO who is already being asked by their board and their reinsurers how they are using AI, and who cannot afford the answer to be "we're experimenting," the RCE is the answer that closes the gap between AI's promise and underwriting's requirements.

XX MIN READ

Glean Insights on Hard-to-Find Small Businesses with Convr’s Biz Intel Feature

A huge portion of commercial property and casualty (P&C) insurance applicants barely exist online. Many small and mid-size commercial insureds (the bread and butter of commercial insurance underwriting) are nearly invisible online.

Think about it . . . landscapers, contractors, florists and more. The  food truck owners, small town auto mechanics and mom and pop shops . . . many don’t have:

  • a website
  • a strong social media presence
  • consistent business filings
  • complete insurance applications

Underwriting team members call this a low digital footprint risk and it’s a problem for them. When the submission comes in, they need to know if the business is real, if the owners do what they claim to do, and if the exposure is what the agent says it is.

But if the business has no digital presence, the underwriter is lost without their normal verification tools including website and online reviews, access to pertinent safety records and satellite exposure checks as well as prior filings.

That’s where Convr’s AI Underwriting Workbench shines. With our Biz Intel web search feature for low digital footprint companies, that hard to find information easily turns up for the underwriter within our underwriting platform.

The Convr Underwriting Workbench’s Biz Intel can uncover:

1) Business Classification

2) Appetite relevant exposures

3) Number of employees

4) Revenue

It turns an unknown into a knowable risk, giving the underwriter the opportunity to decide whether or not to write the risk rather than to spend time investigating it further. It’s a shortcut for underwriting team members of all levels as they spend less time searching for the details that move the decision.

All in one place:

In the Convr AI Underwriting Workbench, every new submission with the web option enabled, runs Biz Intel and returns the results inline. The hard-to-find details land next to the submission you're working on, not three tabs away from it.

Why it matters:

Low-digital-footprint submissions take time that underwriters often can't justify spending. Enrichment surfaces the missing data automatically, so accounts that would have been deprioritized or declined for lack of information become writable.

Convr’s Biz Intel users get:

1) First-quote advantage: Brokers place business with the first to quote. If your underwriting team is out searching Google, the Secretary of State, checking maps and emailing questions – you could be missing out on deals. With Convr AI data enrichment, the data comes to the underwriter instead of the other way around – and the first quote is more often yours.

2) Reduced referral dependency: When reliable information on low digital footprint companies is available in the file, more submissions can be decided where they land. Junior underwriters escalate only the accounts that genuinely need a second set of eyes. Senior underwriters spend their time on the complex risks and judgment calls that actually require their experience – not on questions a richer file would have answered on its own. Across the team, consistency improves and cycle times tighten.

3) Greater portfolio profitability: This is the real return on investment. Commercial carriers rarely lose money on catastrophic risks. Instead, they lose money on thousands of slightly mispriced/misunderstood small and mid-size policies – and low-visibility insureds are exactly where this is most common.

Convr's AI Underwriting Workbench isn't a productivity system. It's a loss ratio control system. If thin-file submissions are costing your team time or premium, we should talk – visit us at convr.com today.

XX MIN READ

Convr Accelerates MGA Growth

From Intake automation efficiency to data modeling for hidden insights, Convr is helping Managing General Agents (MGAs) turn fragmented submission documents into structured, enriched data – accelerating clearance, rating, and quote times to unleash profitable growth.

Everyone knows the best models win at taming documents!\nOur Intake module ingests and enriches data from both structured and unstructured documents including PDFs, Excel and emails across commercial property and casualty (P&C) insurance asset types, including ACORDs, Inspection Forms, SOVs, Loss Runs, Schedules, and more.

Powered by Convr AI and the Risk Context Engine – a purpose-built commercial insurance ontology, knowledge graph, and semantic layer that powers a multi-line schema – transforming fragmented submissions into structured, decision-ready intelligence. By grounding every document, application and data source into a consistent schema, Convr Intake ensures contextually complete, consistent, and reliable data from the start. The result is faster processing, fewer manual touchpoints, and improved risk clarity for accelerated MGA Growth.

Through Risk 360 – a commercial insurance data lake comprised of the digital footprints of millions of businesses – Convr standardizes addresses, performs geo-coding, enriches submissions with CAT modeling codes, and adds property intelligence data such as distance to coast and other hazard indicators. The enrichment delivers a holistic decision-ready view of risk prepared for underwriting, rating, and carrier reporting.

By eliminating re-keying and reducing back-and-forth data gathering, submissions are ready to quote in less than 10 minutes! This is how our MGA customers underwrite smarter and faster to unlock substantial written-premium growth without adding to headcount.

If you’re exploring ways to scale faster with AI, better data and meaningful operational efficiency, Convr welcomes the opportunity to share how leading MGAs are using Convr today.\nJust reach out to Convr today to see how we can help!

Realize End-to-End Underwriting Excellence with Convr AI

Experience how commercial P&C insurance organizations benefit from submission through quote with a frictionless process enriched by AI decisioning, empowering them to make better decisions, faster.