08. April 2025
Extracting data from PDFs with AI and the GRID API

PDFs are everywhere — invoices, reports, product catalogs, policy documents. While they’re great for sharing information, they’re not exactly easy to work with when you want to do something useful with the data inside.
This is where the combination of a large language model (LLM) and the GRID API comes in handy. LLMs can be used to extract structured values from a PDF, and the GRID spreadsheet engine can take those values and plug them directly into spreadsheet models that calculate, compare, and return accurate calculations — all via an API.
The spreadsheet engine plays a critical role in this process. While LLMs are impressive, they weren’t built for math. They regularly hallucinate numbers, miscalculate logic and confidently return incorrect results. That’s a deal-breaker when your AI needs to handle real business logic and provide outputs people can trust.
By pairing the language and document parsing capabilities of an LLM with the calculation power of the GRID spreadsheet engine, you get the best of both worlds: smart extraction and reliable math.
Together, this setup unlocks a wide range of use-cases: auto-calculating responses to form submissions, generating quotes or offers, or powering intelligent assistants that reply with real, accurate insights.
A practical example: From PDF to mortgage comparison tool
One use-case we’ve explored with a customer in the mortgage space (Win the House You Love) involves helping homebuyers understand their loan options more easily. Their team was looking to simplify the process of interpreting standard loan estimate PDFs.Imagine turning a static document like that into a live, interactive chat experience where key details are extracted using AI, the calculations are handled behind the scenes by a spreadsheet model, and the results are displayed in clear, interactive charts.Here’s what that flow could look like in practice:
1. Start with a PDF loan estimate
Loan estimates contain important numbers — interest rate, loan amount, loan term, taxes, insurance, fees — all of which shape the monthly payment and total cost of a loan. But the format they come in is static, typically a PDF, and not designed for easy data extraction or automation.
2. Extract the relevant values using an LLM
An LLM reads the PDF and pulls out the necessary values and labels. LLMs are well-suited for this task as they are especially good at understanding semi-structured documents like these, where the data is consistent but the formatting can vary slightly between sources. They’re able to interpret headings, fields, and context — pulling out standard data like loan amount, term length, and escrow payments without needing custom logic.

3. Send the values to a spreadsheet model via the GRID API
The extracted values are passed into a spreadsheet model hosted in GRID which already contains the formulas and logic to calculate monthly payments, compare loan types, and summarize costs.
This is where business logic lives — in a format many teams already know and trust. With the GRID API, there’s no need to recode this logic into a custom backend. The spreadsheet is the backend.

4. Power the chat with calculated spreadsheet outputs
The spreadsheet runs in the background, and the results — monthly payment, total loan cost, comparison metrics or even visualizations like charts — are delivered to users through the interface.
The GRID API acts as the connective layer, ensuring each response is accurate, verifiable, and reflects the latest inputs.
As users interact with the chat, their inputs are sent to the spreadsheet via the API, triggering live calculations. These are done dynamically — the original model remains untouched. Once connected, a spreadsheet is effectively transformed into a scalable web service that can respond to queries in real time, run calculations on demand, and return results, while preserving the original logic and data.
This makes it easy to compare options, test different scenarios, or support multiple users without risking unwanted edits to the source file.

Why it works
This setup has a few major advantages:
Flexible data extraction: The LLM handles variability in formats — pulling out key values like interest rate, down payment, and insurance even when formatting differs between documents.
Accurate, verifiable calculations: All the logic lives in a spreadsheet — where it likely already exists. By running extracted values through a spreadsheet engine, you get accurate, verifiable results. With the GRID API, your spreadsheet becomes a live backend that recalculates on the fly as users interact.
Everything connected by API: The workflow is powered by simple, flexible API calls. That makes it easy to plug into a chat assistant, an internal tool, or even a public-facing application.
It’s a great example of how modern language models and spreadsheet logic can work together — turning static content into actionable insights grounded in accurate calculations.
Want to try it yourself?
You can sign up and start using the GRID API to build workflows like this with your own spreadsheet data and models. If you’re working with LLMs, PDFs, or any kind of data extraction, it’s a fast and reliable way to automate real business logic — without rebuilding what already works.
Leave the language to LLMs and the numbers to us.
👉 Sign up
News
Updates and announcements
02.12.2024
Bringing spreadsheets into the AI-first era
Introducing GRID's new mission The current wave of AI is arguably the biggest shift in user interfaces since the advent of the GUI. Meanwhile, spreadsheets remain a cornerstone of the business world — resilient, ubiquitous, and indispensable despite repeated predictions of their demise. The fusion of AI and spreadsheets is poised to be big, but it requires a fundamental rethinking — not bringing AI to traditional spreadsheets, but reimagining spreadsheets and their workflows for the AI-first era. With a unique set of cutting-edge spreadsheet technologies, GRID is uniquely positioned — and determined — to lead this transformation. AI-First The AI-first paradigm is defined by three key characteristics: Language-oriented: We interact with computers in our language, not theirs. Task-centric: Work starts with the task at hand, not the hunt for the right software. Agentic: Computers will act on our behalf, even when we’re not there. Spreadsheets Spreadsheets are not just tools; they are foundational to modern business: Ubiquitous: Over 1 billion users worldwide rely on them. Empowering: As the original low-code solution, they enable business users to solve problems independently. The fabric of business: Spreadsheets likely hold more business logic and data than any formal IT system. Bringing them together With the world’s most advanced independent spreadsheet engine — designed for lightning-fast performance and seamless compatibility with Excel and Google Sheets — and a suite of other powerful spreadsheet technologies, GRID is uniquely positioned to redefine the future of spreadsheets in an AI-first world. We’re bridging the gap between AI and spreadsheets, delivering the reliable and verifiable calculations that AI solutions currently lack. Bringing spreadsheets to ChatGPT Today, we’re taking a major step forward by expanding the Alpha testing of our ChatGPT solution and opening registrations for early access. Sign up now to secure your spot in the Alpha and see GRID’s solution in action!
26.08.2020
GRID closes $12M in Series A funding round led by NEA
We’re thrilled to announce that we have closed a $12M Series A funding round led by New Enterprise Associates (NEA), with participation from our existing investors BlueYard Capital, Slack Fund, Acequia Capital and other strategic partners! This funding will enable us to bring GRID to market and power accelerated product development. For more information see our press release. Additional coverage: Tech Crunch: GRID raises $12M Series A to turn spreadsheets into 'visual narratives' SiliconANGLE: Iceland's Grid lands $12M to help workers make their spreadsheets more visual Tech Target: Analytics startup Grid raises $12 million in funding
27.03.2019
GRID closes $3.5M seed funding
We are thrilled to share some great news with you: We just closed $3.5M in seed funding! The investment is led by BlueYard Capital, with participation from strategic investors such as Slack Fund, Acequia Capital and angel investor Charlie Songhurst. We are happy to work with this group, as they add a lot of value to our mission other than their funding. Needless to say, they deeply believe in our mission to empower people to turn any spreadsheet into a beautiful web report, dashboard or interactive application. After our private Alpha launch a few weeks ago, we are now all heads-down again working on product, strategy, network expansion and go-to-market planning. This investment - on top of our $1M angel round in October - fuels current plans well into 2021. It gives us breathing room to focus on building the initial version of our product, take it to market and grow it from there - by delighting our users. We will be adding a few people to our team in the coming weeks and months. This is a fantastic opportunity to join an exciting startup at an inflection point. Take a look at our open positions, and keep an eye on our tweets.