mirror of https://github.com/Monadical-SAS/reflector.git synced 2026-02-04 09:56:47 +00:00

Files

Mathieu Virbel f6ca07505f feat: add transcript format parameter to GET endpoint (#709 )

* feat: add transcript format parameter to GET endpoint

Add transcript_format query parameter to /v1/transcripts/{id} endpoint
with support for multiple output formats using discriminated unions.

Formats supported:
- text: Plain speaker dialogue (default)
- text-timestamped: Dialogue with [MM:SS] timestamps
- webvtt-named: WebVTT subtitles with participant names
- json: Structured segments with full metadata

Response models use Pydantic discriminated unions with transcript_format
as discriminator field. POST/PATCH endpoints return GetTranscriptWithParticipants
for minimal responses. GET endpoint returns format-specific models.

* Copy transcript format

* Regenerate types

* Fix transcript formats

* Don't throw inside try

* Remove any type

* Toast share copy errors

* transcript_format exhaustiveness and python idiomatic assert_never

* format_timestamp_mmss clear type definition

* Rename seconds_to_timestamp

* Test transcript format with overlapping speakers

* exact match for vtt multispeaker test

---------

Co-authored-by: Sergey Mankovsky <sergey@monadical.com>
Co-authored-by: Igor Loskutov <igor.loskutoff@gmail.com>

2025-11-26 18:51:14 +01:00

docs

feat: dailyco poll (#730 )

2025-11-24 22:24:03 -05:00

images

Upload logo

2025-02-03 16:11:01 +01:00

migrations

feat: link transcript participants (#737 )

2025-11-25 19:13:19 +01:00

reflector

feat: add transcript format parameter to GET endpoint (#709 )

2025-11-26 18:51:14 +01:00

scripts

feat: link transcript participants (#737 )

2025-11-25 19:13:19 +01:00

tests

feat: add transcript format parameter to GET endpoint (#709 )

2025-11-26 18:51:14 +01:00

.gitignore

feat: search frontend (#551 )

2025-08-20 20:56:45 -04:00

.python-version

build: move to uv (#488 )

2025-07-16 18:10:11 -06:00

alembic.ini

server: add basic sql migration

2023-08-29 10:58:27 +02:00

Dockerfile

fix: docker image not loading libgomp.so.1 for torch (#560 )

2025-08-21 16:41:35 -06:00

env.example

feat: daily.co support as alternative to whereby (#691 )

2025-11-12 21:21:16 -05:00

pyproject.toml

fix: security review (#656 )

2025-09-29 23:07:49 +02:00

README.md

feat: api tokens (#705 )

2025-10-20 12:55:25 -04:00

runserver.sh

feat: frontend openapi react query (#606 )

2025-09-05 16:01:31 -06:00

test.ics

feat: calendar integration (#608 )

2025-09-17 16:43:20 -06:00

uv.lock

feat: calendar integration (#608 )

2025-09-17 16:43:20 -06:00

README.md

API Key Management

Finding Your User ID

# Get your OAuth sub (user ID) - requires authentication
curl -H "Authorization: Bearer <your_jwt>" http://localhost:1250/v1/me
# Returns: {"sub": "your-oauth-sub-here", "email": "...", ...}

Creating API Keys

curl -X POST http://localhost:1250/v1/user/api-keys \
  -H "Authorization: Bearer <your_jwt>" \
  -H "Content-Type: application/json" \
  -d '{"name": "My API Key"}'

Using API Keys

# Use X-API-Key header instead of Authorization
curl -H "X-API-Key: <your_api_key>" http://localhost:1250/v1/transcripts

AWS S3/SQS usage clarification

Whereby.com uploads recordings directly to our S3 bucket when meetings end.

SQS Queue (AWS_PROCESS_RECORDING_QUEUE_URL)

Filled by: AWS S3 Event Notifications

The S3 bucket is configured to send notifications to our SQS queue when new objects are created. This is standard AWS infrastructure - not in our codebase.

AWS S3 → SQS Event Configuration:

Event Type: s3:ObjectCreated:*
Filter: *.mp4 files
Destination: Our SQS queue

Our System's Role

Polls SQS every 60 seconds via /server/reflector/worker/process.py:24-62:

Every 60 seconds, check for new recordings

sqs = boto3.client("sqs", ...) response = sqs.receive_message(QueueUrl=queue_url, ...)

Requeue

uv run /app/requeue_uploaded_file.py TRANSCRIPT_ID

Pipeline Management

Continue stuck pipeline from final summaries (identify_participants) step:

uv run python -c "from reflector.pipelines.main_live_pipeline import task_pipeline_final_summaries; result = task_pipeline_final_summaries.delay(transcript_id='TRANSCRIPT_ID'); print(f'Task queued: {result.id}')"

Run full post-processing pipeline (continues to completion):

uv run python -c "from reflector.pipelines.main_live_pipeline import pipeline_post; pipeline_post(transcript_id='TRANSCRIPT_ID')"