AI Classification — Tax Optimization Training Guide

In this guide

1. What gets classified
2. The 49 categories
3. The 17 tax tags
4. The 4-layer memory system
5. How to correct classifications
6. Reaching 95%+ accuracy

1. What gets classified

Every transaction flowing into HaraPro — whether via Plaid or PDF statement — gets passed to the Classifier Agent. The agent looks at:

Merchant name and transaction description
Amount (+/–, exact value, magnitude pattern)
Date and recurrence pattern (one-time vs monthly)
The business the transaction belongs to (business vs individual)
Your prior corrections on similar transactions
Global patterns across all HaraPro tenants (anonymized)

It then assigns one category (e.g., "Meals & Entertainment") and up to 3 tax tags (e.g., "Deductible", "Business Meal 50%", "Travel-related"). Each assignment gets a confidence score 0–100.

2. The 49 categories

Categories are the "what is this?" layer. We use 49 that map cleanly to IRS Schedule C, Schedule E, and common S-Corp and partnership accounts:

Revenue / Sales

Cost of Goods Sold

Contractor Fees

Professional Fees

Legal Fees

Accounting

Advertising

Marketing

Software & SaaS

Office Supplies

Rent — Office

Rent — Equipment

Utilities

Internet / Phone

Insurance — Business

Insurance — Health

Vehicle Expense

Vehicle Depreciation

Meals & Entertainment

Travel

Dues & Subscriptions

Bank Fees

Interest Expense

Loan Principal

Credit Card Payment

Owner Draw

Owner Contribution

Payroll — Wages

Payroll Taxes

Benefits

Retirement Contributions

Education

Charitable Contributions

Taxes — Federal

Taxes — State

Taxes — Property

Repairs & Maintenance

Depreciation

Amortization

Investment Income

Investment Expense

Rental Income

Rental Expense

Distribution

Transfer

Refund / Reversal

Personal — Living

Personal — Medical

Uncategorized

You can rename categories per business (e.g., "Software & SaaS" → "Software — Dev Tools") and create custom sub-categories.

3. The 17 tax tags

Tax tags are the "how does this affect taxes?" layer. They stack on top of categories and drive the tax forecast, deduction simulator, and CPA export. The 17:

Deductible

Partially Deductible

Non-Deductible

Capital Expense

§179 Eligible

Bonus Depreciation

§280F Limited

Business Meal 50%

Home Office

Vehicle — Mileage

Vehicle — Actual

Self-Employment Tax

Retirement Contribution

HSA Contribution

Charitable

Personal (no deduction)

Under Review

Tip: Tax tags are where the real leverage lives. A $65,000 SUV is a "Vehicle Expense" category — but the tax tag "§179 Eligible" (if over 6,000 lbs GVWR) could mean a $65K deduction in year 1 instead of $13K. The Deduction Simulator in Pro+ lets you model this before you buy.

4. The 4-layer memory system

The Classifier doesn't learn in isolation — it uses four layers of memory, stacked in priority order (top wins):

Session context

The immediate context of the transaction under review — adjacent transactions, current business, recent batch uploads. This resolves ambiguity for things like "Amazon" appearing 40 times in a week (all business supplies vs mixed personal).

User corrections (your tenant)

Every time you override a classification, that correction is stored. The next time the same merchant (or similar description) appears in the same business, your prior correction wins. Corrections propagate within 24 hours.

Tenant preferences

Patterns specific to your HaraPro account: "This tenant tags all Shell purchases as Vehicle — Actual, not Personal" or "This tenant categorizes Uber consistently as Travel for businesses A, B and Personal for business C." Tenant preferences apply across all businesses in your account.

Global patterns (anonymized)

Industry-wide patterns from all HaraPro users, fully anonymized. "Intuit" is overwhelmingly "Software & SaaS" in the SMB segment. "Delta Airlines" is overwhelmingly "Travel". This is the fallback when your own tenant has no prior signal.

5. How to correct classifications

Open the Transactions tab. Low-confidence classifications land in Needs review automatically. To correct:

Click any transaction row
Click the Category dropdown to change it
Click Tax tags to add/remove tags
Optionally: Apply to all similar — to retroactively fix every matching past transaction

The "Apply to all similar" option uses merchant name + description fuzzy match. Useful when you realize "Aws" has been miscategorized as "Uncategorized" for months — one click fixes all 47.

6. Reaching 95%+ accuracy

Typical accuracy trajectory for a new tenant:

Day 1: ~85% — global patterns are doing the work
Week 1: ~90% — your first 20–40 corrections kick in
Week 3: ~95%+ — tenant preferences stabilize
Month 3: ~98% — long tail merchants are known

To accelerate: when you do your first review, go through 200–300 transactions in one sitting. The AI learns your patterns much faster from a concentrated burst than from scattered fixes over months.

Privacy note: Your corrections only train your tenant. They never leak to other users. Global pattern learning uses anonymized, aggregated signals only — never individual transaction text. AI processing runs under enterprise no-training agreements with OpenAI and Anthropic. Read the privacy policy →

Next up

📊 Tax forecast & deduction simulator → 📄 Run reports for your CPA → 🏦 Connect more banks → ⚡ Back to getting started →

Understanding AI classification

In this guide

1. What gets classified

2. The 49 categories

3. The 17 tax tags

4. The 4-layer memory system

Session context

User corrections (your tenant)

Tenant preferences

Global patterns (anonymized)

5. How to correct classifications

6. Reaching 95%+ accuracy

Next up