Docs
Overview

Getting Started

Cuebird is a real-time AI copilot that streams intelligent suggestions as the other person speaks — not after. It works for job interviews, sales calls, negotiations, customer support, language practice, and live translation.

Quick start: Create a free account, open the Dashboard, click Interview Copilot, select your audio source, and you're live. No installation required.

1. Create an account

Visit the landing page and click Sign In → Create account. You can sign up with email/password or continue with Google. After creating your account, you'll be taken to the Dashboard.

2. Open the Interview Copilot

From the Dashboard, click Interview Copilot. This opens the main session interface with a left sidebar for configuration and a right panel for AI suggestion output.

Figure: The Dashboard. Access all tools from here.

3. Set your context

In the left sidebar, enter your target Role and Company. Paste the job description or drop in a job URL to auto-fill it. This context is injected into every AI prompt so suggestions are highly relevant.

4. Choose your audio source

Click one of the capture buttons:

  • Mic — captures your microphone only (useful if the interviewer's audio comes through speakers)
  • Screen — captures system audio from your screen share (picks up everything in the video call)
  • Screen + Mic — both simultaneously, merged into one stream

The status indicator in the top bar turns green when audio is flowing. Transcription starts immediately.

5. Get suggestions

As the interviewer speaks, their words appear in the transcript panel. When a question or prompt is detected, the AI suggestion panel begins streaming the response token-by-token. The first words typically appear within 1–2 seconds of the interviewer finishing their sentence.

How it Works

Understanding the full pipeline helps you get the most out of Cuebird and troubleshoot if something isn't working.

The real-time pipeline

1. Audio capture

Your browser captures audio from the chosen source (mic, screen audio, or both) and streams it to Cuebird in real time.

2. Speech-to-text

The audio is transcribed as it arrives, producing a live rolling transcript with word-level timing. Transcription starts within a second of audio beginning.

3. Conversation tracking

Cuebird tracks who is speaking and maintains a running history of recent conversation turns so the AI always has the right context.

4. AI suggestion

When a new question or prompt is detected, the AI engine builds a response using your profile, resume, STAR stories, and job context, then streams it word by word to your screen.

5. Output

The suggestion appears in your browser and, if connected, in the desktop overlay. If TTS is on, it's read aloud sentence by sentence as it arrives.
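
To make the "sentence by sentence" behavior in the Output step concrete, here is a minimal Python sketch of how a streamed response could be cut into speakable sentences as tokens arrive. It is an illustration under stated assumptions, not Cuebird's actual implementation: the function name `sentences_from_tokens` and the punctuation-based splitting rule are hypothetical.

```python
import re
from typing import Iterable, Iterator

# Assumption: a sentence ends at ., ! or ? followed by whitespace.
_SENTENCE_END = re.compile(r"([.!?])\s+")


def sentences_from_tokens(tokens: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as the streamed text contains them."""
    buffer = ""
    for token in tokens:
        buffer += token
        while True:
            match = _SENTENCE_END.search(buffer)
            if match is None:
                break
            # Emit everything up to and including the punctuation mark.
            yield buffer[: match.end(1)].strip()
            buffer = buffer[match.end():]
    # Flush whatever is left when the stream ends.
    if buffer.strip():
        yield buffer.strip()
```

A splitter this simple would also break on abbreviations such as "e.g. ", so a real system would likely need a more careful rule.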

Session lifecycle

Your session stays live as long as the tab is open. If the connection drops briefly, the UI shows a disconnection indicator and automatically reconnects. Closing the tab ends the session and discards all in-memory transcript data.
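
The docs above only say that the UI reconnects automatically. As a rough mental model, many real-time clients retry with increasing delays; the Python sketch below shows one such schedule. The function name `backoff_delays` and every number in it are assumptions chosen for illustration, not Cuebird's actual reconnect policy.

```python
from typing import Iterator


def backoff_delays(base: float = 0.5, factor: float = 2.0,
                   cap: float = 8.0, attempts: int = 5) -> Iterator[float]:
    """Yield capped exponential retry delays in seconds (hypothetical values)."""
    delay = base
    for _ in range(attempts):
        yield min(delay, cap)
        delay *= factor
```

With the default arguments the delays are 0.5, 1.0, 2.0, 4.0, and 8.0 seconds.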

Speaker detection

When using the Standard transcription mode with speaker identification enabled, each transcript word is tagged with a speaker label (Speaker A, Speaker B, etc.). In the Context panel, use the Speaker dropdown to tell Cuebird which speaker label is you. Cuebird then knows to generate suggestions only in response to the other speaker's words.
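
The effect of the Speaker dropdown can be pictured with a short Python sketch: speaker-tagged words are grouped into turns, and only turns from speakers other than you are kept as candidates for suggestions. The function name `interviewer_turns`, the `(speaker, word)` tuple shape, and the `max_turns` limit are assumptions for illustration only.

```python
from collections import deque
from itertools import groupby


def interviewer_turns(words, my_label, max_turns=10):
    """Group (speaker, word) pairs into turns; keep recent turns by other speakers."""
    recent = deque(maxlen=max_turns)  # rolling window of the latest turns
    for speaker, group in groupby(words, key=lambda pair: pair[0]):
        if speaker != my_label:
            recent.append((speaker, " ".join(word for _, word in group)))
    return list(recent)
```

For example, with `my_label="B"`, the words `[("A", "Tell"), ("A", "me"), ("B", "Sure"), ("A", "Why"), ("A", "us?")]` produce two Speaker A turns, "Tell me" and "Why us?", and your own "Sure" is ignored.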

Core Features

Interview Copilot

The Interview Copilot page at /interview is the primary tool for job interviews. It combines real-time AI suggestions with context-loading, interview mode controls, and session analytics.

Figure: The Interview Copilot, with the live transcript on the left and AI suggestions on the right.

Left sidebar

Audio capture buttons
Mic / Screen / Screen+Mic / Stop. Clicking a capture button starts audio. The status indicator in the top bar shows green when the connection is live.
Job URL auto-fill
Paste a job posting URL and click the auto-fill button. Cuebird scrapes the page and extracts the company name, role title, and job description automatically.
Role & Company
Shown in every AI prompt. The more specific, the more relevant the suggestions.
Job Description
Pasted or auto-filled JD. Excerpts are included in the prompt so the AI can reference required skills and responsibilities.
Interview Mode
Switches the behavioral framing of the AI. Options: General · Behavioral · Technical · System Design · Coding. Behavioral strongly favors STAR-format answers; Technical emphasizes code, architecture, or data answers; Coding adds pseudocode-friendly formatting.
Response Style
Controls the surface format: Standard · Bullets · Short · STAR · Smart Q's. Smart Q's makes the AI suggest a clarifying follow-up question instead of a direct answer — useful when the question is ambiguous.
Tone & Seniority
Professional / Casual / Enthusiastic tone and Intern through Director+ seniority level. These affect word choice and depth.
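
To show why specific sidebar values matter, here is a hedged Python sketch of how these fields might be combined into the context sent with each question. The `SessionContext` class, the `build_prompt` function, the line layout, and the 500-character excerpt length are all hypothetical; the real prompt format is not documented here.

```python
from dataclasses import dataclass


@dataclass
class SessionContext:
    role: str
    company: str
    job_description: str = ""
    mode: str = "General"        # General, Behavioral, Technical, System Design, Coding
    style: str = "Standard"      # Standard, Bullets, Short, STAR, Smart Q's
    tone: str = "Professional"
    seniority: str = "Intern"


def build_prompt(ctx: SessionContext, question: str, jd_excerpt_chars: int = 500) -> str:
    """Assemble an illustrative prompt from the sidebar fields."""
    lines = [
        f"Role: {ctx.role}",
        f"Company: {ctx.company}",
        f"Interview mode: {ctx.mode}",
        f"Response style: {ctx.style}",
        f"Tone: {ctx.tone}; seniority: {ctx.seniority}",
    ]
    if ctx.job_description:
        # Only an excerpt of the JD is included (excerpt length is an assumption).
        lines.append(f"Job description excerpt: {ctx.job_description[:jd_excerpt_chars]}")
    lines.append(f"Interviewer asked: {question}")
    return "\n".join(lines)
```

An empty Role or Company simply leaves the model with less to work with, which is why filling them in precisely tends to improve relevance.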

More options (collapsible)

Notes
Free-form notes injected into the prompt. Use this for key points you want the AI to reference.
AI Model
Choose between faster models (great for most interviews) and more powerful models (better for complex technical or senior-level questions). The default is optimized for speed and quality.
Q&A Prep pairs
Add specific question–answer pairs. When the AI detects a close match to a prep question, it incorporates your prepared answer. Add pairs with the + Add Pair button.
Active Stories
Shows how many Knowledge Base stories are currently injected. Click the indicator to open the Knowledge Base and manage your stories.
Resume
Upload or select a saved resume. The AI reads your background and uses it to personalize suggestions.
Speaker
Select your speaker label from the diarization output.
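
The "close match" behavior described for Q&A Prep pairs can be approximated with simple string similarity. The Python sketch below uses the standard library's `difflib.SequenceMatcher`; this is a stand-in technique, and the function name `best_prep_match` and the 0.6 threshold are assumptions. Cuebird's actual matching method is not documented here.

```python
from difflib import SequenceMatcher


def best_prep_match(question, pairs, threshold=0.6):
    """Return the prepared answer whose question is most similar, or None."""
    best_answer, best_score = None, 0.0
    for prep_question, prep_answer in pairs:
        score = SequenceMatcher(None, question.lower(), prep_question.lower()).ratio()
        if score > best_score:
            best_answer, best_score = prep_answer, score
    # Below the (assumed) threshold, no prepared answer is used.
    return best_answer if best_score >= threshold else None
```

Writing prep questions in the wording an interviewer is likely to use makes a match of this kind more likely.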

Main panel controls

Manual prompt
Type a question into the input at the bottom and press Enter or click Send. This injects the text directly into the suggestion engine as if the interviewer said it — useful for testing or when audio capture isn't working.
Suggest button
Forces the AI to generate a suggestion from the most recent interviewer turn, even if no new transcript has arrived. Use this when the AI skipped a turn or you want to regenerate.
Image attach
Drag-and-drop or paste a screenshot into the prompt area. The image is sent as a base64 data URL to the AI alongside the text question — useful for whiteboard problems or UI screenshots.
Scorecard
Opens a quick in-session scorecard generated from the current transcript turns. Shows a score and brief feedback for each dimension.
Generate question
Generates a single practice question based on your session context (role, JD, mode).
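
For reference, the "base64 data URL" mentioned under Image attach is a standard way to embed binary data in a text string. A minimal Python sketch of the encoding follows; the function name `to_data_url` and the default MIME type are illustrative assumptions.

```python
import base64


def to_data_url(image_bytes: bytes, mime_type: str = "image/png") -> str:
    """Encode raw bytes as a data URL of the form data:<mime>;base64,<payload>."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"
```

Base64 encoding makes the payload roughly a third larger than the original file, so smaller screenshots upload faster.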

Audio Setup

Getting the right audio source is the most critical setup step. Here's how each option works and when to use it.

Microphone capture

Captures audio from your device's microphone only. Use this when:

  • The interviewer's audio is loud enough to be picked up by your mic (in-person, or via speakers near your mic)
  • You want to minimize system latency

The browser will prompt you to grant microphone permission on first use.

Screen audio capture

Captures system audio from your screen. When you click this button, a browser dialog asks which screen or window to share — select your screen and check the Share audio checkbox. This captures everything the interviewer says through your video call software.

Tip: Screen audio capture is the most reliable way to transcribe the interviewer's voice in remote interviews. Pair it with microphone capture (Screen + Mic) to capture both sides of the conversation.

Screen + Mic (combined)

Captures both system audio and your microphone simultaneously, merging them into a single mixed stream. This is the recommended mode for full two-way transcription with speaker diarization.
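
Conceptually, merging two audio sources into one mixed stream means combining their samples. The Python sketch below sums two mono signals and clips the result to the valid range. It is a simplified illustration: the function name `mix_mono` is hypothetical, and in practice the browser's audio stack does the mixing, possibly with a different method.

```python
def mix_mono(a, b):
    """Mix two mono float sample sequences (-1.0 to 1.0) by summing and clipping."""
    length = max(len(a), len(b))
    mixed = []
    for i in range(length):
        sample_a = a[i] if i < len(a) else 0.0  # pad the shorter input with silence
        sample_b = b[i] if i < len(b) else 0.0
        mixed.append(max(-1.0, min(1.0, sample_a + sample_b)))
    return mixed
```

Because both voices end up in one stream, speaker identification is what lets the two sides be told apart afterwards.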

Tab-specific capture (Chrome extension)

The Chrome extension enables capture of a specific browser tab's audio without displaying any share banners or dialogs. When the extension is installed, a Tab capture button appears. Clicking it opens a tab picker where you can select which tab to capture.

This is useful when you want to avoid the screen-share overlay visible to the interviewer.

Transcription backend

Cuebird offers multiple transcription backends with different speed and accuracy trade-offs:

  • Fast mode — lowest latency, great for most interviews. The default.
  • Standard mode — slightly more lag, but adds speaker identification (who said what).
  • High-accuracy mode — best for complex speech or accents.
  • Language-optimized — automatically selected when using Chinese, Japanese, Korean, or Portuguese for highest accuracy in those languages.
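
The selection rules in the list above can be summarized in a few lines of Python. This is a sketch of the described behavior only; the function name `pick_backend` and the returned labels are assumptions, not real configuration values.

```python
# Languages the docs say are routed to a language-optimized engine.
LANGUAGE_OPTIMIZED = {"Chinese", "Japanese", "Korean", "Portuguese"}
MODES = {"fast", "standard", "high-accuracy"}


def pick_backend(language: str, mode: str = "fast") -> str:
    """Return an illustrative backend label for a language and chosen mode."""
    if language in LANGUAGE_OPTIMIZED:
        # Selected automatically, regardless of the chosen mode.
        return "language-optimized"
    if mode not in MODES:
        raise ValueError(f"unknown transcription mode: {mode}")
    return mode
```

So `pick_backend("English")` falls back to the fast default, while `pick_backend("Korean", "standard")` is routed to the language-optimized engine.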

AI Suggestions

The suggestion feed is the heart of Cuebird. Here's how to read it and get the most from it.

Reading the feed

The suggestion panel on the right shows streamed AI responses. Each bubble represents one suggestion, showing:

  • The trigger transcript (what the interviewer said that prompted this suggestion)
  • The AI response, rendered as it streams token-by-token
  • A timestamp and turn sequence number

Smart filtering

Cuebird automatically detects when no suggestion is needed — for example, when the other person is just acknowledging something or the conversation doesn't require a response. In those cases, no bubble appears, keeping the feed clean and focused.

Force Suggest

If the AI skips a turn you wanted a suggestion for, click the Suggest button. This bypasses the gating logic and forces a response from the most recent interviewer turn. Force Suggest also suppresses the "don't repeat" instruction, so the AI can re-answer the previous topic when you want a regenerated response.
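
The relationship between Smart filtering and Force Suggest can be sketched as a gate with an override. The Python below is a deliberately naive illustration: the function name `should_suggest` and the acknowledgement list are hypothetical, and the real detection is presumably more sophisticated than a phrase lookup.

```python
# Assumption: short acknowledgements never need a suggestion.
ACKNOWLEDGEMENTS = {"ok", "okay", "right", "got it", "i see", "sure", "thanks"}


def should_suggest(turn_text: str, force: bool = False) -> bool:
    """Decide whether a turn deserves a suggestion; force bypasses the gate."""
    if force:
        return True
    normalized = turn_text.strip().lower().rstrip(".!?")
    if not normalized or normalized in ACKNOWLEDGEMENTS:
        return False
    return True
```

In this model, clicking Suggest corresponds to calling the gate with `force=True`, so even a turn such as "Okay." produces a response.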

Manual prompt injection

Use the input box at the bottom of the suggestion panel to type a question directly and press Send. This is equivalent to the interviewer saying those words. You can also attach an image by dragging a file or pasting a screenshot (Ctrl+V) into the input.

Suggestion controls

Suggestions on/off
Toggle whether new suggestions are generated. Does not clear the existing feed.
TTS toggle
Enables/disables audio playback of suggestions as they stream.
Font size
Cycles through three sizes for readability.
Clear feed
Clears all suggestion bubbles from the current session view.

Page Scanner

Scan any browser tab — a LeetCode problem, a job description, a company website — and let the AI read and understand it instantly. The scanned content appears in your transcript and the AI generates suggestions based on what it sees.

Figure: Interview page with the Scan button highlighted. Click the Scan button in the toolbar to start.

How it works

  1. Click Scan — the button is in the bottom toolbar next to the manual prompt input.
  2. Pick a tab — a tab picker appears showing all your open browser tabs. Select the one you want to scan (e.g., a LeetCode problem, a job posting).
  3. Wait 1–2 seconds — the extension extracts the page's main content, code blocks, and title. The Scan button shows "Scanning…" during extraction.
  4. Results flow in — the scanned content appears as a 📄 Page Scan entry in your transcript, and the AI immediately generates a suggestion analyzing the page's content.

Figure: The tab picker lists all open tabs (here showing a LeetCode Two Sum tab). Select the one to scan.

What gets extracted

The scanner locates the main content area using common selectors: article and main elements, the page structures used by LeetCode, HackerRank, and CodeSignal, and generic content containers. It extracts:

  • Page title — shown in the transcript entry
  • Page URL — included for reference
  • Main text — the readable content with scripts, navigation, and footers stripped out
  • Code blocks — extracted from <pre> and <code> tags, deduplicated

All of this is sent to the AI as context, so the AI "sees" the same page you're looking at.
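
To illustrate the code-block step, here is a small Python sketch that pulls text out of <pre> and <code> tags and drops duplicates. The real scanner runs as a content script in the browser, so this is only a model of the idea; the names `CodeBlockExtractor` and `extract_code_blocks` are hypothetical.

```python
from html.parser import HTMLParser


class CodeBlockExtractor(HTMLParser):
    """Collect the text of <pre>/<code> elements, skipping exact duplicates."""

    def __init__(self):
        super().__init__()
        self._depth = 0      # nesting depth inside <pre>/<code>
        self._parts = []     # text pieces of the block being read
        self.blocks = []     # unique blocks in document order

    def handle_starttag(self, tag, attrs):
        if tag in ("pre", "code"):
            self._depth += 1

    def handle_endtag(self, tag):
        if tag in ("pre", "code") and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                text = "".join(self._parts).strip()
                self._parts = []
                if text and text not in self.blocks:
                    self.blocks.append(text)

    def handle_data(self, data):
        if self._depth > 0:
            self._parts.append(data)


def extract_code_blocks(html: str) -> list:
    parser = CodeBlockExtractor()
    parser.feed(html)
    parser.close()
    return parser.blocks
```

Treating a nested <pre><code> pair as one block is what prevents the same snippet from being counted twice.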

Figure: The page content appears as a 📄 Page Scan entry in your transcript.

Figure: The AI generates an analysis of the scanned page (here, the LeetCode Two Sum problem).

Figure: Full view, with the transcript on the left and the AI suggestion on the right, all from one click.

Requirements

  • Chrome Extension — the page scanner requires the Interview Copilot Chrome extension to be installed. It injects a lightweight content script into the target tab to extract the page content.
  • Active session — your interview session must be connected (the status indicator shows "Connected").
  • HTTP/HTTPS tabs only — the scanner can only read normal web pages, not Chrome internal pages (chrome://, edge://, etc.).
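
The HTTP/HTTPS restriction amounts to a check on the URL scheme. A minimal Python sketch, with the hypothetical function name `is_scannable`:

```python
from urllib.parse import urlparse


def is_scannable(url: str) -> bool:
    """Return True only for ordinary web pages served over http or https."""
    return urlparse(url).scheme in ("http", "https")
```

Under this rule a LeetCode problem URL passes, while chrome://extensions or a local file:// page does not.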

Use cases

  • Live coding interviews — scan the LeetCode/HackerRank problem and get solution hints, complexity analysis, and approach suggestions.
  • Job description review — scan a job posting and get tailored talking points about your experience match.
  • Company research — scan a company's About page or product page during an interview to reference details naturally.
  • Documentation lookup — scan API docs or technical references for quick answers during technical discussions.

Tip: You can scan multiple pages in a single session. Each scan adds a new 📄 Page Scan entry to your transcript, and the AI keeps all scanned content as context for future suggestions.

TTS Playback

Cuebird can read suggestions aloud as they stream, using one of three TTS providers.

Providers

Provider      Quality     Notes
Standard      High        Multiple natural-sounding voices to choose from
Premium       Very high   Highly realistic voices; requires a separate account
Low-Latency   High        Ultra-low latency for the fastest possible playback

Speed control

Adjust TTS playback speed from 0.75× (slower, clearer) to 3.0× (very fast). The default is 1.0×. Speed is applied per-sentence as audio is generated.

Output language

Set a different output language to have the AI translate suggestions into another language before speaking them. This is independent of the session's input language and the AI's suggestion language. Useful for real-time spoken translation.

Input audio toggle

When enabled, Cuebird plays back the transcribed input speech (in the input language) before speaking the AI's suggestion. This is used in Translation Mode so you hear the original, then the translation.

Tools

Knowledge Base

The Knowledge Base at /knowledge-base is your personal library of STAR stories and Q&A pairs. Stories you activate are automatically injected into every interview session's AI context.

Figure: The Knowledge Base, where you manage and activate your STAR stories.

What is a Story?

A story is a structured answer — typically in STAR format (Situation, Task, Action, Result) — that you've written in advance. When the AI detects a behavioral question that relates to a story you've saved, it can incorporate the story naturally into its suggestion.

Creating a story

  1. Open Knowledge Base from the Dashboard sidebar.
  2. Click + New Story.
  3. Enter a title (e.g., "Led a high-stakes product launch under pressure").
  4. Write your story in the text area, or click AI Generate to have Cuebird draft a full STAR story from the title alone.
  5. Review, edit, and save.

Activating stories

Each story has an Active toggle. Only active stories are injected into interview sessions. Activate the 3–5 most relevant stories for each job application. The Interview Copilot sidebar shows an Active Stories indicator with the current count.

AI story generation

Click the AI Generate button on any story editor. Cuebird will create a complete, coherent STAR story from just your title. The generated story uses placeholder metrics — replace them with your real numbers and specifics for maximum impact.

Best practice: Keep each story focused on one specific achievement. Use concrete numbers. Write in the first person as you'd say it naturally in an interview. 150–250 words per story is ideal.

Job Tracker

The Job Tracker at /jobs organizes your job applications in one place and connects them to interview sessions.

Figure: The Job Tracker, where you manage applications and link them to interview sessions.

Creating a job

Click + New Job in the left list pane. Fill in the job title, company, and status. Paste the job description into the JD field, or paste the job URL and use Auto-fill from URL to have Cuebird scrape and parse the posting automatically.

Job fields

Title & Company
Used to pre-fill the Interview Copilot context when you launch from this job.
Job Description
Excerpts are included in AI prompts. The more complete the JD, the more relevant the suggestions.
Resume
Attach a specific saved resume to this job. It will be used automatically when you open an interview session from this listing.
Status & Stage
Track where you are in the process: Applied → Phone Screen → Technical → Offer → Closed.

Session transcripts & reports

After each interview session, you can generate a Post-session Report from the transcript. The report includes a summary, overall score, key strengths, improvement areas, and a breakdown by question. Reports are stored with the job listing for future reference.

Plan limits

  • Free: up to 10 job listings
  • Pro: up to 50 job listings

Mock Interview

Mock Interview mode lets you practice with AI-generated questions before the real interview. Questions are tailored to your target role and job description, and each answer is AI-evaluated.

Figure: Mock Interview, where you generate questions and get AI-evaluated feedback on your answers.

Starting a mock session

  1. Open Mock Interview from the Dashboard.
  2. Select an interview domain (behavioral, software engineering, data analytics, product management, finance, consulting, system design, or general).
  3. Choose how many questions (1–10).
  4. Click Generate Questions. The AI tailors questions to your session context (role, JD, profile) if provided.

Answering questions

For each question, type or speak your answer. When you're done, click Evaluate. The AI evaluates your answer and returns:

  • Score (1–10 overall)
  • Relevance — how well you addressed what was actually asked
  • Clarity — how clear and organized the response is
  • Depth — amount of insight, specificity, and evidence provided
  • Feedback — concrete improvement suggestions
  • Strengths — what you did well

Supported domains

Behavioral · Software Engineering · Data Analytics · Product Management · Finance · Consulting · System Design · General

Resume Upload

Uploading your resume gives the AI real context about your background, making suggestions specific to your actual experience rather than generic examples.

Supported formats

PDF, DOCX, and TXT. Maximum file size: 5 MB.
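
If you want to check a file before uploading, the format and size rules can be expressed as a short Python sketch. The function name `validate_resume` is hypothetical, and interpreting 5 MB as 5 × 1024 × 1024 bytes is an assumption; the docs do not say which definition of a megabyte applies.

```python
from pathlib import Path

ALLOWED_EXTENSIONS = {".pdf", ".docx", ".txt"}
MAX_BYTES = 5 * 1024 * 1024  # assumption: 5 MB counted in binary megabytes


def validate_resume(filename: str, size_bytes: int):
    """Return (ok, reason) for a candidate resume file."""
    extension = Path(filename).suffix.lower()
    if extension not in ALLOWED_EXTENSIONS:
        return False, "unsupported format"
    if size_bytes > MAX_BYTES:
        return False, "file too large"
    return True, "ok"
```

The extension check is case-insensitive, so a file named Resume.PDF is treated the same as resume.pdf.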

Uploading a resume

In the Interview Copilot sidebar (under More Options), click Upload Resume or drag a file onto the upload area. The resume is parsed and saved to your account. You can save multiple resumes and switch between them per job listing.

Plan limits

  • Free: 1 saved resume
  • Pro: up to 10 saved resumes
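
The resume limits here and the job listing limits in the Job Tracker section can be captured in one lookup table. The Python sketch below is illustrative only; the names `PLAN_LIMITS` and `can_add` are hypothetical and do not describe how Cuebird enforces limits internally.

```python
# Limits as stated in the docs: job listings and saved resumes per plan.
PLAN_LIMITS = {
    "free": {"jobs": 10, "resumes": 1},
    "pro": {"jobs": 50, "resumes": 10},
}


def can_add(plan: str, resource: str, current_count: int) -> bool:
    """Return True if one more item of this resource fits within the plan."""
    return current_count < PLAN_LIMITS[plan][resource]
```

For example, a Free account that already has one saved resume cannot add a second, while a Pro account with nine can still add one more.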

Session Analytics

Cuebird provides two types of AI-generated analytics from your session transcript: in-session scorecards and post-session reports.

Quick scorecard

Click the Scorecard button in the Interview Copilot panel at any point during a session. Cuebird sends the current transcript turns to the AI and returns a rapid scorecard covering:

  • Overall score (1–10)
  • Communication clarity
  • Answer structure (STAR adherence)
  • Relevance to the question asked
  • Brief feedback and suggestions

Post-session report

After a session ends, open the Job Tracker, find the linked session, and click Generate Report. The full transcript is analyzed and produces a comprehensive report with:

  • Executive summary of the session
  • Turn-by-turn question and answer review
  • Aggregate scores across all dimensions
  • Top 3 strengths
  • Top 3 areas for improvement
  • Recommended follow-up preparation
Configuration

Settings & Options

Configuration options are available in the Interview Copilot sidebar and the General app (/). Settings are applied per-session and are not persisted across sessions.

Audio settings

Transcription mode
Choose between fast (lowest latency), standard (adds speaker identification), or high-accuracy modes.
Input language
The language being spoken. Cuebird uses this to select the best transcription engine for that language automatically.

TTS settings

TTS provider
Choose your preferred voice provider, or turn TTS off entirely.
Voice
Select from the available voices for your chosen provider.
Speed
0.75× to 3.0×. Applied to each sentence as it's read aloud.
Response language
Language in which TTS speaks. Can differ from the input language — useful for live translation.
Input audio
Play back the original transcribed speech before the AI's suggestion. Used in Translation Mode.

AI settings

AI model
Switch between faster models (best for most interviews) and more powerful models (better for complex or senior-level questions).
Suggestions on/off
Pause automatic suggestions. You can still trigger them manually with the Suggest button.

Language Support

Cuebird supports 12 languages for speech-to-text and can generate AI suggestions in any language the model supports.

Supported STT languages

English · Spanish · French · German · Italian · Japanese¹ · Chinese¹ · Korean¹ · Portuguese¹ · Hindi · Arabic · Russian

¹ These languages automatically use an optimized speech recognition engine for higher accuracy.

Automatic language routing

When you select Chinese, Japanese, Korean, or Portuguese, Cuebird automatically switches to a transcription engine optimized for those languages. You don't need to configure anything — just pick your language and Cuebird handles the rest.

AI suggestion language

AI suggestions are generated in whatever language you're conversing in by default. To force a specific output language, set the Response Language in the TTS settings. This instructs both the AI and the TTS engine to use that language.

Desktop & Extensions

Desktop Overlay

The desktop overlay is a transparent, always-on-top window that displays suggestions directly on your screen — without switching apps, and without appearing in screen-share captures.

How it works

The overlay connects to your running Cuebird session automatically. Suggestions that appear in your browser are mirrored to the overlay in real time. The overlay has no audio capture of its own — all transcription and AI processing happens in the web app.

Installation

Download the Cuebird desktop app installer and run it. Once installed, launch it alongside the Cuebird web app — it will connect to your session automatically.

The overlay window appears as a small 440×700 px frameless window with rounded corners and a translucent dark background. You can drag it to any screen position.

Overlay controls

Manual Prompt
Type a question in the overlay's input and press Enter to inject it directly into the active session — the same as using the web app's manual prompt.
Suggest button
Triggers a Force Suggest from the overlay — generates a suggestion from the most recent interviewer turn.
Opacity slider
Adjusts the overlay window transparency (20% – 100%).
Close / Minimize
Standard window controls accessible via the overlay UI.

Screen-share invisibility

On Windows, the overlay is protected from screen-sharing software. It renders normally on your physical display but is invisible in recordings and screen shares — the interviewer won't see it.

Chrome Extension

The Chrome extension provides two capabilities described in this section: silent tab audio capture and suggestion feed relay to the desktop overlay. It is also required by the Page Scanner.

Tab capture

The extension lets Cuebird capture audio from a specific browser tab — like the tab where your video call is running — without showing any screen-share banners or popups. When you click the Tab capture button in the Interview Copilot, a tab picker opens. Select your video call tab and Cuebird starts transcribing it silently.

Overlay relay

The extension keeps the desktop overlay in sync with the suggestion feed even when the Cuebird tab is in the background or minimized.

Installation

  1. In Chrome, open the extensions page (⋮ menu → Extensions → Manage extensions).
  2. Enable Developer mode (toggle in the top right).
  3. Click Load unpacked and select the Cuebird extension folder.
  4. The Cuebird extension icon appears in your Chrome toolbar.
Account

Account & Billing

Manage your account, plan, and billing from the Account page at /account.

Figure: The Account page, where you view your plan and usage and manage billing.

Free plan

  • Interview & Conversation agent modes
  • Real-time AI suggestions
  • 1 saved resume
  • Up to 10 job listings
  • Knowledge Base (unlimited stories)

Pro plan (coming soon)

  • All six agent modes
  • Unlimited STT minutes
  • TTS playback (multiple voice providers)
  • Access to the most powerful AI models
  • Up to 10 saved resumes
  • Up to 50 job listings
  • Mock interview mode
  • Session scorecards & reports
  • Desktop overlay
  • Multi-language STT

Billing

All payments are processed securely. Manage your subscription, view invoices, and update payment methods from the Billing button on the Account page. This opens the billing portal in a new tab.

Authentication

Sign in with email/password or Google. Passwords are never stored by Cuebird — all credential handling is done by a secure third-party auth provider. You can reset your password at any time via the "Forgot password" link on the sign-in screen.

Privacy & Data

Cuebird is designed to be privacy-preserving by default. Here's exactly what is and isn't stored.

What is NOT stored

  • Audio — Audio is streamed directly to the transcription service. Cuebird's servers never store audio files.
  • Transcripts — Live transcripts exist only in server memory for the duration of the live session. Closing the tab discards all transcript data.
  • AI suggestions — Suggestions are generated on-demand and discarded when the session ends.

What IS stored

  • Your profile — Email, name, and plan status.
  • Resumes — Uploaded resume text, accessible only to your account.
  • Knowledge Base stories — Your STAR stories.
  • Job listings — Job titles, companies, JD text, status, and any generated session reports.

Third-party providers

Cuebird passes audio to a transcription service and text to an AI service to generate suggestions. Each provider's privacy policy governs how they handle that data. Cuebird does not share your profile data with any third party beyond what is necessary for transcription, AI, and billing.