How to Convert a PDF Invoice to Excel (.xlsx) the Easy Way
Excel is still where most bookkeeping work happens. Even if your accounting software is the system of record, the day-to-day — reconciliations, supplier checks, month-end reviews — usually lives in a spreadsheet.
Which makes it frustrating when a supplier sends you a PDF invoice. You want it in Excel, you want it clean, and you want it now. This guide shows you how to get there without typing a single number.
Excel or CSV — which one do you actually need?
Quick decision first. Both formats hold the same data, but they behave differently.
Pick Excel (.xlsx) when:
- You'll be working on the file directly — adding formulas, formatting, notes
- You need multiple sheets in one file (header on one, line items on another)
- You're sharing it with a client or colleague who'll open it in Excel
- You care about preserving number formatting, dates and currency symbols
Pick CSV when:
- You're importing it into accounting software, an ERP, or a database
- You want maximum compatibility (every tool reads CSV)
- The file is large and you want it lightweight
If you're leaning CSV, we cover that path in how to convert a PDF invoice to CSV. Otherwise, keep reading.
The four ways to get a PDF invoice into Excel
There's no shortage of methods. They're not all worth your time.
1. Manually retyping
Open PDF, open Excel, start typing. Most people start here. Nobody who processes more than a handful of invoices a month stays here.
A typical invoice with 10–15 line items takes 5–10 minutes to retype properly, and that's assuming you don't make any mistakes. At the end of a long day, you definitely do.
2. Copy-paste from the PDF into Excel
Select the text in your PDF reader, paste into a worksheet. In theory the columns line up. In practice they almost never do.
What you usually get is everything dumped into column A, with weird spacing where the PDF had multi-line cells. You then spend 15 minutes splitting columns with Text to Columns, fixing dates and re-aligning line items. By the time you're done, retyping would have been faster.
This method can work for a single, simple invoice in a pinch. It's not a workflow.
3. Excel's built-in "Get Data from PDF"
If you're on Microsoft 365 or Excel 2021+, there's a feature under Data → Get Data → From File → From PDF. It uses Power Query to pull tables out of a PDF.
It works reasonably well on PDFs where the invoice is a clean, single table. It struggles with:
- Multi-page invoices
- Invoices where line items wrap onto two rows
- Anything with merged cells in the source PDF
- Invoices where the header info (number, date, supplier) is laid out in boxes rather than a table
For about 30% of invoices it gets you 80% of the way. For the rest, you'll spend more time fixing the output than you saved.
It's also Windows-only in any practical sense. The Mac version of Excel doesn't have the same Power Query support.
4. A dedicated PDF-to-Excel tool
This is where most accounting teams end up. You upload the PDF, the tool reads it, and you get an .xlsx file back with the data already structured into sensible columns.
The quality difference between tools is huge. We compare the realistic options in the best tools to extract data from PDF invoices, but the short version: look for one that understands invoice structure, not just one that pulls text out of PDFs.
What a good Excel export should contain
Before you commit to a tool or a method, know what you're looking for in the output. A clean Excel export of an invoice usually has:
Sheet 1 — Invoice header:
- Invoice number
- Invoice date and due date
- Supplier name, address, VAT/tax ID
- Customer name (you, usually)
- Currency
- Subtotal, total VAT, grand total
Sheet 2 — Line items:
- A column linking back to the invoice number
- Description
- Quantity
- Unit price
- Line total
- VAT rate and VAT amount
Some tools dump everything on one sheet. That's fine too, as long as the structure is consistent across invoices. The thing you want to avoid is an export where header info is mixed in with line items in a single column — that's a sign the tool is just OCR-ing text without understanding what it's reading.
Step-by-step: PDF invoice to Excel in under a minute
Here's the practical workflow most bookkeepers settle on.
Step 1: Check the PDF is text-based, not a scan
Open the PDF. Try to highlight a word with your cursor. If the text selects normally, you're working with a machine-readable PDF and any decent tool will handle it. If the whole page selects as one big image, it's a scan — you'll need a tool with OCR built in.
Most supplier invoices these days are machine-readable. Phone photos and old scanned receipts often aren't.
Step 2: Upload to your tool of choice
For browser-based options (the easy path) you just drag the PDF into a tab. No install, no plugin, nothing to set up. For desktop apps, open the file from there.
Step 3: Choose Excel as the output
Pick .xlsx. If the tool offers options like "preserve formatting" or "include formulas", default to off for invoices — you want clean, raw data, not a recreation of the PDF's visual layout.
Step 4: Spot-check the output before you do anything else
Open the Excel file and check three things:
- Totals add up. Sum the line items. Does it match the invoice subtotal?
- Dates parsed correctly. A "03/04/2026" silently read as April 3rd instead of March 4th will hurt later.
- Decimal separators. European invoices use commas (1.250,00), US invoices use periods (1,250.00). The export should match what your downstream system expects.
If those three things look right, the rest almost always is.
Step 5: Use the data
From here, do whatever you'd normally do with an invoice in Excel — categorize, reconcile, pivot, import into your accounting software. We have walkthroughs for the main ones:
- Import invoices to QuickBooks
- Import invoices to Xero
- Import invoices to FreshBooks
- Import invoices to Sage
Common Excel-specific gotchas
Excel adds a few wrinkles that pure CSV doesn't have. Watch for these.
Numbers stored as text. If amounts come through with a leading apostrophe or aligned left in the cell, Excel is treating them as text. They won't sum. Fix with VALUE() or by re-formatting the column as Number.
Dates that look right but aren't. Excel sometimes displays a date correctly but stores it as text underneath. If you sort by date and it sorts alphabetically, that's why.
Leading zeros disappearing. Invoice numbers like "0042" become "42" when Excel auto-converts. Either format the column as Text before pasting, or pick a tool that handles this for you.
Long descriptions truncated in cells. This is purely visual — the data is there, the column is just narrow. Don't waste time "fixing" it before you've imported it elsewhere.
Merged cells from the source PDF. Some tools try to recreate visual layout with merged cells. Avoid that. Merged cells break filtering, sorting and most imports.
When converting to Excel makes sense (and when it doesn't)
If you're processing a few invoices a month and you live in Excel, converting to .xlsx is the natural choice. You open the file, work on it, save it, done.
If you're processing dozens or hundreds, especially with a goal of importing into accounting software, CSV is usually the better target format — it's leaner, more predictable, and doesn't carry Excel's quirks. You can always open a CSV in Excel anyway.
For high-volume work, the format matters less than the workflow. What matters is whether you can batch process PDF invoices and whether the export is consistent enough that you can build downstream automations on top of it.
If you're new to this whole space, what is invoice data extraction is a good primer on what's actually happening under the hood.
A quick word on time
The reason any of this matters: invoice data entry is one of those tasks that quietly eats more hours than it has any right to. Five minutes here, eight minutes there, and suddenly two days of your month are gone.
We dug into the real numbers — and a simple way to calculate the cost for your own setup — in save time on invoice data entry. The short version: above 20 invoices a month, automation pays for itself in the first week.
Try it on a real invoice
CsvInvoice converts PDF invoices to Excel (or CSV) in about 10 seconds, in the browser, with no install. It handles multi-page invoices, line items that wrap, and any supplier layout — not just templates you've configured ahead of time. You pay $0.29 per invoice, with no subscription and no base fee. Files are encrypted in transit and never shared.
Pick a PDF you'd normally spend ten minutes on. See how it compares.