Insights · AI · Field Notes

Data before AI: how small businesses make AI actually work.

Q: How can a small business prepare its data for AI without spending a lot?

Start with cheap, proven tools: QR codes to tag locations or equipment, mobile forms to replace paper checklists, smartphone photos with a few simple rules, and spreadsheets for monitoring. Consistency and good process matter far more than expensive software.

Q: Do I need AI before I fix my data?

No. Fix your data first. Decide which operational events you need to capture and how you will capture them reliably. Once that foundation is in place, AI has accurate inputs to work with and a real chance to deliver useful results.

8 min read · June 2026 · Sergio Gomez, Shinpo Capital

Most AI projects don't fail because the technology is bad. They fail because the data feeding it is messy, incomplete, or never recorded at all. Here's what I learned fixing that at scale, and how any small business can do the same with cheap, simple tools.

Quick answer

AI is only as good as the data you feed it. Most small-business AI projects fail on messy or missing data, not bad technology. Fix the data first: decide which operational events you need to capture, then capture them consistently with cheap tools like QR codes, mobile forms, smartphone photos, and spreadsheets. Get that foundation right and AI finally has accurate inputs to work with.

Every week, another business owner hears the same advice: you need to adopt AI. It's not wrong. But in practice, a large share of AI projects quietly fall apart, and the reason usually has nothing to do with the AI itself.

The reason is data.

AI is only as good as the information you feed it. If the records going in are incomplete, inconsistent, or scribbled on paper and never entered anywhere, even the most advanced model will hand you confident-sounding nonsense. Good data in, useful answers out. Messy data in, expensive disappointment out.

I learned this the hard way, on the ground. Before founding Shinpo Capital, I worked as a product implementation specialist for a national facilities services company responsible for cleaning and maintenance inside one of the busiest airports in the United States. Leadership wanted to roll out an AI system to grade service quality across several hundred zones, each with its own standards and inspection requirements. The ambition was right. The data underneath it was not ready, and no amount of clever software was going to fix that.

This is the story of how we fixed the data first, using tools cheap enough for any small business to afford. It's also the single most important conversation I have with every client who tells me they want to start using AI.

Why AI projects really fail

There's a comforting myth that AI success comes down to picking the smartest model. In reality, the most common reason these initiatives collapse is far less glamorous: the data was never captured properly in the first place.

This is especially true for small and mid-sized businesses, where information tends to live in scattered places: paper logs, a manager's memory, a half-updated spreadsheet, three different apps that don't talk to each other. Budgets are tight, IT help is thin, and the daily grind always wins over data housekeeping. So the records that should teach an AI system end up patchy and unreliable.

When that happens, the AI gets blamed for poor performance. But the real problem sits upstream, in how the work is recorded long before any model sees it.

The problem on the ground

When I started, recording was inconsistent. Some work was logged, some logged late or only partly, and some was never written down at all. Reporting leaned on recollection, paper logs, and one-off spreadsheets, which made it nearly impossible to get a clear, real-time picture of what was actually getting done.

From the AI's point of view, this was a disaster. A job that was completed but never recorded simply didn't exist. A task logged without a clear location or a standard category was just noise it couldn't interpret. The model wasn't going to be wrong because it was unintelligent. It was going to be wrong because we were feeding it a blurry, half-finished picture of reality.

If a service was completed but never recorded, the AI treated it as if it never happened.

So we set the fancy grading system aside and asked a much simpler question first: how do we capture better data about the work being done, using the cheapest tools that will reliably get the job done?

The fix: cheap tools, recorded consistently

We didn't buy specialized hardware or an enterprise analytics platform. We used tools almost any business already has access to, and we used them with discipline. Here's what actually moved the needle.

QR codes to anchor every record to a place

We tagged each service area with a unique QR code. When a staff member scanned it with a phone, it opened a short form tied to that exact location. Printed on standard labels and scanned with phones people already carry, this cost almost nothing, and every single scan automatically carried a location with it. No more guessing where the work happened. For a small business, the same trick works on equipment, rooms, vehicles, or job sites.

Mobile forms instead of paper

We replaced paper checklists with short digital forms. The design rules mattered more than the technology: drop-down menus and required fields instead of free typing, only the essential questions like task type, time, and condition, and forms that still worked in spots with weak signal and synced later. A small business can do this today with Google Forms, Microsoft Forms, or a simple low-code app.

Photos, with three simple rules

Staff snapped photos to document results. To make those photos useful rather than random, we gave three rules: shoot from a consistent angle, get decent lighting, and include the key part of the area in the frame. That's it. No special equipment, just phone cameras and a short training module. Suddenly the images were consistent enough for both people and AI to actually evaluate.

Spreadsheets to track that the work got logged

Instead of a heavy business-intelligence stack, we used spreadsheets to track whether associates were actually validating their work throughout the day. That was enough to see the percentage of expected work that got recorded by shift, team, and location, spot where logging was falling off, and watch whether training actually changed behavior. Plain spreadsheets, kept up consistently, did the job. (Worth being clear: the spreadsheets tracked that work was being recorded, not the quality of the photos themselves; that was a separate, hands-on process, below.)

Listening to the people doing the work

Short surveys and small group conversations told us which parts of the form were confusing, where scanning interrupted the natural flow of a task, and what small tweaks would make recording feel easier. We folded that feedback straight back into the forms and training. Better tools, better buy-in.

The part most people skip: making it stick

Here's the truth that no software vendor will tell you: capturing data is a human-behavior problem, not a technology problem. The tools were the easy part. Getting hundreds of people, with very different comfort levels around technology, to use them consistently was the real work.

Three things made the difference. First, someone had to translate between the technical team and the people on the floor, so the tools fit how the work actually happened instead of how a developer imagined it. Second, training had to explain the why, not just the buttons, so logging felt like support rather than surveillance. And third, I spent time watching how people actually moved through their day, which showed exactly where a scan or a form could slip in without slowing anyone down.

What changed

Over time, the share of work that got recorded climbed and, more importantly, stayed consistent across shifts and teams. As scanning, forms, and photos became second nature, reliable recording became the norm and gaps became the exception.

That changed everything for the AI. Now it was working from a complete, well-labeled picture: events tied to real locations, times, and people, with standardized photos it could actually interpret. The same model that would have failed on the old data now had a fair shot at being accurate. And managers gained real visibility from nothing more than well-kept spreadsheets, knowing which teams needed support and whether changes were working.

How we checked the work, and kept a human in the loop

Logging that work happened was only half the job. We also had to know whether the records told the full story. So right after associates took their photos and uploaded them, we audited those areas in person. The question we were answering: did the images actually capture the depth and condition of the space, or did they leave something important out? Those audits told us when we needed more photos, a different angle on a particular fixture, or, just as usefully, when a shot was unnecessary. That's how the photo guidelines got sharper over time, grounded in what the pictures were and weren't showing.

Later, once AI was built into the associates' app, we ran the same discipline on the model itself. When the AI sent feedback to an associate about an area, we reviewed it and then went to that area in person to verify whether the assessment held up. Where it didn't, we fed what we found back in to refine the output. We never treated the AI's judgment as the final word. People on the ground kept it honest, and kept it improving.

The takeaways for your business

Start with data, not models. Decide which events you need to capture, completed jobs, inspections, customer interactions, exceptions, and how you'll capture them every time.
Use the cheapest tool that works. Printed codes, mobile forms, and spreadsheets beat expensive platforms when used with discipline. Consistency wins, not price.
Make recording part of the job. If it feels like extra work, it gets skipped under pressure. Train the why, and show people how good records protect them.
Start small, then standardize. Pilot with one team or location, refine the forms, then turn the winning version into a template you can roll out everywhere.
Keep humans in the loop. Spot-check the data and the AI's output regularly, and feed what you learn back into both your process and your model.

The bottom line

The most powerful lever for AI success in a small business usually isn't fancier technology. It's better, simpler data capture. Combine low-cost tools with thoughtful workflow design, real training, and human oversight, and a noisy, incomplete operation can become a reliable foundation that AI can actually build on.

If you're being told to adopt AI, start one step earlier. Make your operational data trustworthy and structured, using the cheapest tools that will reliably support your work. Get that foundation right, and AI finally has a real chance to deliver accurate insights, better decisions, and genuine improvements for your customers.

Start the conversation → AI in construction →

Frequently asked questions

Why do most small business AI projects fail?

Almost always because of poor data, not poor technology. When the records feeding an AI system are incomplete, inconsistent, or never captured at all, even an advanced model produces unreliable results. Build a simple, disciplined way to record your operations first.

How can a small business prepare its data for AI without spending a lot?

Begin with cheap, proven tools: QR codes to tag locations or equipment, mobile forms to replace paper, smartphone photos with a few simple rules, and spreadsheets for monitoring. Good process matters far more than expensive software.

Do I need AI before I fix my data?

No, fix the data first. Decide what you need to capture and how you'll capture it reliably. Once that foundation exists, AI has accurate inputs to work with and a real chance to deliver value.

Note: Some operational details have been generalized to protect client confidentiality.