Services

Backend Engineering, DevOps

Industry

B2B Marketplace

Year

2022-2024

Search agents that know when to talk and when to shut up.

A B2B marketplace let users save search criteria - sectors, regions, price ranges - and receive new matches by email. The problem: the first implementation sent every match immediately. Users with broad filters got 30 emails a day. Users with narrow filters got nothing for weeks, then a dump of stale results. The notification system needed frequency control, per-result deduplication, and multi-level rate limiting without a dedicated messaging platform.

01.

THE CHALLENGE

Every User Gets a Different Inbox

The platform had 4,000 active users, each with one to five saved search agents. Agents ran at four frequencies: hourly, daily, weekly, monthly. A naive implementation would query the entire listing database on every run, compare against every agent, and send an email for every match. At scale, this produced duplicate results across frequencies, email provider throttling from SendGrid, and user churn from notification fatigue. The system needed to track which results had already been shown to each agent and stop sending when limits were hit.

4,000 users, five agents each, four frequencies. The naive approach sent 30 emails a day to one user and zero to another. Both churned.

02.

THE SOLUTION

Celery Beat with Layered Rate Limiting

Each search agent is a ScheduledMailReport record storing the user's filter criteria and preferred frequency. A Celery beat schedule triggers four tasks - one per frequency tier. Each task queries only listings created since the agent's last execution, annotates eligible users with an Exists subquery, and collects matches. A many-to-many relation (companies_reported) tracks every listing already shown to each agent, preventing duplicates across frequencies. Three rate limits stack: max_mails_per_run caps the total emails per execution, max_companies caps results per email, and the M2M table ensures a listing never appears twice.

The eligible-user annotation with Exists subquery:

Python

eligible_users = (
    User.objects
    .filter(is_active=True)
    .exclude(email='')
    .annotate(
        has_verified_email=Exists(
            EmailAddress.objects.filter(
                user=OuterRef('pk'),
                verified=True,
            )
        ),
        is_spam_reported=Exists(
            SpamReport.objects.filter(
                email=OuterRef('email'),
            )
        ),
    )
    .filter(has_verified_email=True)
    .exclude(is_spam_reported=True)
)

Watch It Run

Three agents firing at different intervals. Watch matches get collected and duplicates get filtered in real time.

Hourly Daily Weekly

Waiting for first agent run…

0Matched

0Deduplicated

0Emails Sent

0sElapsed

Production Challenges

Eligibility Gating

Not every user should receive emails. The query filters out unverified email addresses, spam-reported accounts, blocked users, and inactive accounts. This is implemented as a chain of Exists subquery annotations on the User queryset rather than post-query filtering, keeping database load constant regardless of the total user count.

Auto-Created Default Agents

Every new user gets a default search agent via a post_save signal. This ensures the onboarding email contains relevant listings from day one, without requiring the user to configure anything. The default agent uses broad criteria and weekly frequency. Users can narrow or disable it later.

SendGrid Throttle Management

SendGrid enforces per-second and per-minute rate limits. The dispatch loop introduces a configurable delay between sends and respects max_mails_per_run as a hard ceiling. If the limit is reached mid-batch, remaining emails are deferred to the next scheduled run rather than queued. This avoids API 429 errors and keeps deliverability scores high.

The rate-limited dispatch loop:

Python

def send_scheduled_reports(frequency):
    agents = ScheduledMailReport.objects.filter(
        frequency=frequency,
        is_active=True,
        user__in=eligible_users,
    ).select_related('user')

    sent_count = 0
    for agent in agents:
        if sent_count >= settings.MAX_MAILS_PER_RUN:
            break

        new_companies = (
            Company.objects
            .filter(**agent.get_criteria())
            .filter(created_at__gte=agent.last_run)
            .exclude(
                pk__in=agent.companies_reported.all()
            )[:settings.MAX_COMPANIES_PER_MAIL]
        )

        if new_companies.exists():
            send_report_email(agent.user, new_companies)
            agent.companies_reported.add(*new_companies)
            agent.last_run = timezone.now()
            agent.save(update_fields=['last_run'])
            sent_count += 1
            time.sleep(0.1)  # SendGrid throttle

03.

THE RESULT

From Noise to Signal

Email open rates increased from 12% to 41% after switching to frequency-controlled agents. User complaints about notification spam dropped to zero. The deduplication layer eliminated an average of 23% redundant results per run. The system processes all 4,000 users across four frequency tiers in under 90 seconds. No third-party messaging platform was needed. The entire system runs on Celery beat, Django ORM queries, and a single M2M table.

KEY METRICS

0%Open Rate

0,000+Active Agents

0%Avg Dedup Rate

WHAT THE CLIENT SAYS

"Before, our users either drowned in emails or forgot we existed. Now the search agents send exactly the right amount. Support tickets about notifications went from ten a week to zero."

Product Manager

B2B Marketplace · Growth Team

FAQ

Why Celery beat instead of a cron job?

How does the M2M deduplication scale?

What happens when a listing matches multiple agents for the same user?

TECHNOLOGY STACK

Django

Python

PostgreSQL

Daniel KDK👋

Search agents that know when to talk and when to shut up.

Every User Gets a Different Inbox

Celery Beat with Layered Rate Limiting

Watch It Run

Production Challenges

Eligibility Gating

Auto-Created Default Agents

SendGrid Throttle Management

From Noise to Signal

KEY METRICS

FAQ

Why Celery beat instead of a cron job?

How does the M2M deduplication scale?

What happens when a listing matches multiple agents for the same user?

TECHNOLOGY STACK

Daniel KDK