reflector

mirror of https://github.com/Monadical-SAS/reflector.git synced 2026-02-05 02:16:46 +00:00

Author	SHA1	Message	Date
Mathieu Virbel	84a381220b	fix: make webhook secret/url allowing null (#590 )	2025-08-29 11:55:18 -06:00
Mathieu Virbel	88ed7cfa78	feat(rooms): add webhook for transcript completion (#578 ) * feat(rooms): add webhook notifications for transcript completion - Add webhook_url and webhook_secret fields to rooms table - Create Celery task with 24-hour retry window using exponential backoff - Send transcript metadata, diarized text, topics, and summaries via webhook - Add HMAC signature verification for webhook security - Add test endpoint POST /v1/rooms/{room_id}/webhook/test - Update frontend with webhook configuration UI and test button - Auto-generate webhook secret if not provided - Trigger webhook after successful file pipeline processing for room recordings * style: linting * fix: remove unwanted files * fix: update openapi gen * fix: self-review * docs: add comprehensive webhook documentation - Document webhook configuration, events, and payloads - Include transcript.completed and test event examples - Add security considerations and best practices - Provide example webhook receiver implementation - Document retry policy and signature verification * fix: remove audio_mp3_url from webhook payload - Remove audio download URL generation from webhook - Update documentation to reflect the change - Keep only frontend_url for accessing transcripts * docs: remove unwanted section * fix: correct API method name and type imports for rooms - Fix v1RoomsRetrieve to v1RoomsGet - Update Room type to RoomDetails throughout frontend - Fix type imports in useRoomList, RoomList, RoomTable, and RoomCards * feat: add show/hide toggle for webhook secret field - Add eye icon button to reveal/hide webhook secret when editing - Show password dots when webhook secret is hidden - Reset visibility state when opening/closing dialog - Only show toggle button when editing existing room with secret * fix: resolve event loop conflict in webhook test endpoint - Extract webhook test logic into shared async function - Call async function directly from FastAPI endpoint - Keep Celery task wrapper for 
background processing - Fixes RuntimeError: event loop already running * refactor: remove unnecessary Celery task for webhook testing - Webhook testing is synchronous and provides immediate feedback - No need for background processing via Celery - Keep only the async function called directly from API endpoint * feat: improve webhook test error messages and display - Show HTTP status code in error messages - Parse JSON error responses to extract meaningful messages - Improved UI layout for webhook test results - Added colored background for success/error states - Better text wrapping for long error messages * docs: adjust doc * fix: review * fix: update attempts to match close 24h * fix: add event_id * fix: changed to uuid, to have new event_id when reprocess. * style: linting * fix: alembic revision	2025-08-29 10:07:49 -06:00
Mathieu Virbel	6f0c7c1a5e	feat(cleanup): add automatic data retention for public instances (#574 ) * feat(cleanup): add automatic data retention for public instances - Add Celery task to clean up anonymous data after configurable retention period - Delete transcripts, meetings, and orphaned recordings older than retention days - Only runs when PUBLIC_MODE is enabled to prevent accidental data loss - Properly removes all associated files (local and S3 storage) - Add manual cleanup tool for testing and intervention - Configure retention via PUBLIC_DATA_RETENTION_DAYS setting (default: 7 days) Fixes #571 * fix: apply pre-commit formatting fixes * fix: properly delete recording files from storage during cleanup - Add storage deletion for orphaned recordings in both cleanup task and manual tool - Delete from storage before removing database records - Log warnings if storage deletion fails but continue with database cleanup * Apply suggestion from @pr-agent-monadical[bot] Co-authored-by: pr-agent-monadical[bot] <198624643+pr-agent-monadical[bot]@users.noreply.github.com> * Apply suggestion from @pr-agent-monadical[bot] Co-authored-by: pr-agent-monadical[bot] <198624643+pr-agent-monadical[bot]@users.noreply.github.com> * refactor: cleanup_old_data for better logging * fix: linting * test: fix meeting cleanup test to not require room controller - Simplify test by directly inserting meetings into database - Remove dependency on non-existent rooms_controller.create method - Tests now pass successfully * fix: linting * refactor: simplify cleanup tool to use worker implementation - Remove duplicate cleanup logic from manual tool - Use the same _cleanup_old_public_data function from worker - Remove dry-run feature as requested - Prevent code duplication and ensure consistency - Update documentation to reflect changes * refactor: split cleanup worker into smaller functions - Move all imports to the top of the file - Extract cleanup logic into separate functions: - 
cleanup_old_transcripts() - cleanup_old_meetings() - cleanup_orphaned_recordings() - log_cleanup_results() - Make code more maintainable and testable - Add days parameter support to Celery task - Update manual tool to work with refactored code * feat: add TypedDict typing for cleanup stats - Add CleanupStats TypedDict for better type safety - Update all function signatures to use proper typing - Add return type annotations to _cleanup_old_public_data - Improves code maintainability and IDE support * feat: add CASCADE DELETE to meeting_consent foreign key - Add ondelete="CASCADE" to meeting_consent.meeting_id foreign key - Generate and apply migration to update existing constraint - Remove manual consent deletion from cleanup code - Add unit test to verify CASCADE DELETE behavior * style: linting * fix: alembic migration branchpoint * fix: correct downgrade constraint name in CASCADE DELETE migration * fix: regenerate CASCADE DELETE migration with proper constraint names - Delete problematic migration and regenerate with correct names - Use explicit constraint name in both upgrade and downgrade - Ensure migration works bidirectionally - All tests passing including CASCADE DELETE test * style: linting * refactor: simplify cleanup to use transcripts as entry point - Remove orphaned_recordings cleanup (not part of this PR scope) - Remove separate old_meetings cleanup - Transcripts are now the main entry point for cleanup - Associated meetings and recordings are deleted with their transcript - Use single database connection for all operations - Update tests to reflect new approach * refactor: cleanup and rename functions for clarity - Rename _cleanup_old_public_data to cleanup_old_public_data (make public) - Rename celery task to cleanup_old_public_data_task for clarity - Update docstrings and improve code organization - Remove unnecessary comments and simplify deletion logic - Update tests to use new function names - All tests passing * style: linting\ * style: typing 
and review * fix: add transaction on cleanup_single_transcript * fix: naming --------- Co-authored-by: pr-agent-monadical[bot] <198624643+pr-agent-monadical[bot]@users.noreply.github.com>	2025-08-29 08:47:14 -06:00
Igor Loskutov	009590c080	feat: search frontend (#551 ) * feat: better highlight * feat(search): add long_summary to search vector for improved search results - Update search vector to include long_summary with weight B (between title A and webvtt C) - Modify SearchController to fetch long_summary and prioritize its snippets - Generate snippets from long_summary first (max 2), then from webvtt for remaining slots - Add comprehensive tests for long_summary search functionality - Create migration to update search_vector_en column in PostgreSQL This improves search quality by including summarized content which often contains key topics and themes that may not be explicitly mentioned in the transcript. * fix: address code review feedback for search enhancements - Fix test file inconsistencies by removing references to non-existent model fields - Comment out tests for unimplemented features (room_ids, status filters, date ranges) - Update tests to only use currently available fields (room_id singular, no room_name/processing_status) - Mark future functionality tests with @pytest.mark.skip - Make snippet counts configurable - Add LONG_SUMMARY_MAX_SNIPPETS constant (default: 2) - Replace hardcoded value with configurable constant - Improve error handling consistency in WebVTT parsing - Use different log levels for different error types (debug for malformed, warning for decode, error for unexpected) - Add catch-all exception handler for unexpected errors - Include stack trace for critical errors All existing tests pass with these changes. 
* fix: correct datetime test to include required duration field * feat: better highlight * feat: search room names * feat: acknowledge deleted room * feat: search filters fix and rank removal * chore: minor refactoring * feat: better matches frontend * chore: self-review (vibe) * chore: self-review WIP * chore: self-review WIP * chore: self-review WIP * chore: self-review WIP * chore: self-review WIP * chore: self-review WIP * chore: self-review WIP * remove swc (vibe) * search url query sync (vibe) * search url query sync (vibe) * better casts and cap while * PR review + simplify frontend hook * pr: remove search db timeouts * cleanup tests * tests cleanup * frontend cleanup * index declarations * refactor frontend (self-review) * fix search pagination * clear "x" for search input * pagination max pages fix * chore: cleanup * cleanup * cleanup * cleanup * cleanup * cleanup * cleanup * cleanup * lockfile * pr review	2025-08-20 20:56:45 -04:00
Mathieu Virbel	1311714451	ci: add pre-commit hook and fix linting issues (#545 ) * style: deactivate PLC0415 only on part that it's ok + re-run pre-commit run --all * ci: add pre-commit hook * build: move from yarn to pnpm * build: move from yarn to pnpm * build: fix node-version * ci: install pnpm prior node (?) * build: update deps and pnpm trying to fix vercel build * feat: docker www corepack * style: pre-commit --------- Co-authored-by: Igor Loskutov <igor.loskutoff@gmail.com>	2025-08-14 20:59:54 -06:00
Mathieu Virbel	9eab952c63	feat: postgresql migration and removal of sqlite in pytest (#546 ) * feat: remove support of sqlite, 100% postgres * fix: more migration and make datetime timezone aware in postgres * fix: change how the database is obtained, and use contextvar to have a different instance between different loops * test: properly use client fixture that handles lifetime/database connection * fix: add missing client fixture parameters to test functions This commit fixes NameError issues where test functions were trying to use the 'client' fixture but didn't have it as a parameter. The changes include: 1. Added 'client' parameter to test functions in: - test_transcripts_audio_download.py (6 functions including fixture) - test_transcripts_speaker.py (3 functions) - test_transcripts_upload.py (1 function) - test_transcripts_rtc_ws.py (2 functions + appserver fixture) 2. Resolved naming conflicts in test_transcripts_rtc_ws.py where both HTTP client and StreamClient were using variable name 'client'. StreamClient instances are now named 'stream_client' to avoid conflicts. 3. Added missing 'from reflector.app import app' import in rtc_ws tests. Background: Previously implemented contextvars solution with get_database() function resolves asyncio event loop conflicts in Celery tasks. The global client fixture was also created to replace manual AsyncClient instances, ensuring proper FastAPI application lifecycle management and database connections during tests. All tests now pass except for 2 pre-existing RTC WebSocket test failures related to asyncpg connection issues unrelated to these fixes. * fix: ensure tasks are correctly closed * fix: make separate event loop for the live server * fix: make default settings point at postgres * build: remove pytest-docker deps out of dev, just tests group	2025-08-14 11:40:52 -06:00
Igor Loskutov	6fb5cb21c2	feat: search backend (#537 ) * docs: transient docs * chore: cleanup * webvtt WIP * webvtt field * chore: webvtt tests comments * chore: remove useless tests * feat: search TASK.md * feat: full text search by title/webvtt * chore: search api task * feat: search api * feat: search API * chore: rm task md * chore: roll back unnecessary validators * chore: pr review WIP * chore: pr review WIP * chore: pr review * chore: top imports * feat: better lint + ci * feat: better lint + ci * feat: better lint + ci * feat: better lint + ci * chore: lint * chore: lint * fix: db datetime definitions * fix: flush() params * fix: update transcript mutability expectation / test * fix: update transcript mutability expectation / test * chore: auto review * chore: new controller extraction * chore: new controller extraction * chore: cleanup * chore: review WIP * chore: pr WIP * chore: remove ci lint * chore: openapi regeneration * chore: openapi regeneration * chore: postgres test doc * fix: .dockerignore for arm binaries * fix: .dockerignore for arm binaries * fix: cap test loops * fix: cap test loops * fix: cap test loops * fix: get_transcript_topics * chore: remove flow.md docs and claude guidance * chore: remove claude.md db doc * chore: remove claude.md db doc * chore: remove claude.md db doc * chore: remove claude.md db doc	2025-08-13 10:03:38 -04:00
Mathieu Virbel	f5b82d44e3	style: use ruff for linting and formatting (#524 )	2025-07-31 17:57:43 -06:00
Igor Loskutov	7e3027adb6	fix: room concurrency (theoretically) (#511 ) * fix: room concurrency (theoretically) * cleanup * cleanup	2025-07-25 17:37:51 -04:00
Mathieu Virbel	033bd4bc48	feat: improve transcript listing with room_id (#496 ) Added a new room_id field to transcript; room_id/meeting_id are now set on each transcript. This field is used to list the transcripts. URL is now very fast.	2025-07-17 15:43:36 -06:00
Mathieu Virbel	f3ae187274	fix: waveform can generate NaN in json database (#481 ) * refactor: fixes transcript duration type, NaN in waveform, and prepare for postgres migration * fix: ensure we don't have NaN in waveform * fix: missing assertionerror Co-authored-by: pr-agent-monadical[bot] <198624643+pr-agent-monadical[bot]@users.noreply.github.com> * fix: potential empty array --------- Co-authored-by: pr-agent-monadical[bot] <198624643+pr-agent-monadical[bot]@users.noreply.github.com>	2025-07-15 20:46:19 -06:00
Mathieu Virbel	9deb717e5b	refactor: improve transcript list performance (#480 ) * refactor: improve transcript list performance * fix: sync openapi * fix: frontend types * fix: remove drop table _alembic_tmp_meeting * fix: remove create table too * fix: remove uq_recording_object_key	2025-07-15 15:10:05 -06:00
Mathieu Virbel	3d370336cc	fix: alembic migrations (#470 ) * fix: alembic migrations This commit fixes all the migrations that were half-baked, due to auto creation in the db init before. The process was to check out the commit where each migration was created, and use --autogenerate to regenerate it at the state of the migration. 4 migrations were fixed. It also includes a workflow to ensure migrations apply correctly. * fix: db migration check * fix: nullable on meeting_consent * fix: try fixing tests	2025-06-27 12:03:10 -06:00
Mathieu Virbel	542a277001	fix: re-add missing migration (#468 )	2025-06-26 11:09:58 -06:00
Igor Loskutov	c23e0e07ef	update audio-deleted flow	2025-06-18 15:43:50 -04:00
Igor Loskutov	fdf42cf60b	slop removal	2025-06-17 19:48:46 -04:00
Igor Loskutov	0c91f5dd59	slop review WIP	2025-06-17 19:26:11 -04:00
Igor Loskutov	91c7c8b83a	meeting consent vibe	2025-06-17 16:30:23 -04:00
Sergey Mankovsky	f43045b41c	Add recordings	2025-03-11 15:12:25 +01:00
Sergey Mankovsky	dd021e9e71	Deactivate meeting when session ends	2025-01-28 12:41:23 +01:00
Sergey Mankovsky	159bd82e1c	Create new meeting after previous has ended	2024-12-24 14:18:35 +01:00
Sergey Mankovsky	2cbcfefb3f	Remove viewer room url	2024-10-11 13:50:22 +02:00
Sergey Mankovsky	c99add09e8	Fix recording processing	2024-10-08 13:59:11 +02:00
Sergey Mankovsky	ecb91bedc3	Add shared rooms	2024-10-04 17:20:35 +02:00
Sergey Mankovsky	39d02ab265	Add transcript source kind	2024-10-04 16:38:29 +02:00
Sergey Mankovsky	83857507ea	Make sure room names are unique	2024-09-25 13:13:18 +02:00
Sergey Mankovsky	6d976044d0	Update zulip message	2024-09-06 16:09:44 +02:00
Sergey Mankovsky	5c89a07996	Room config	2024-09-04 12:34:28 +02:00
Sergey Mankovsky	55697e670d	Permanent room urls	2024-08-19 17:56:32 +02:00
Sergey Mankovsky	2381428ae2	Link recorded meeting to a transcript	2024-08-09 17:30:45 +02:00
Mathieu Virbel	0976cf3eb5	server: fix migration script	2023-12-13 15:58:03 +01:00
Mathieu Virbel	a15a63bc8d	server: add reviewed field in transcript	2023-12-13 15:42:17 +01:00
Mathieu Virbel	7ac6d25217	server: add participant API Also break out views into different files for easier reading	2023-11-30 19:13:37 +01:00
Mathieu Virbel	f8407874f7	server: fixes share_mode script	2023-11-23 12:41:39 +01:00
Sara	2212d440d4	Merge branch 'main' of github.com:Monadical-SAS/reflector into feat-sharing	2023-11-22 19:28:45 +01:00
Mathieu Virbel	5ffa931822	server: update backend tests results (rpc does not work with chords)	2023-11-22 14:41:40 +01:00
Mathieu Virbel	06b29d9bd4	server: add audio_location and move to external storage if possible	2023-11-22 14:41:40 +01:00
Sara	fe7f1a0e78	Merge branch 'main' of github.com:Monadical-SAS/reflector into feat-sharing	2023-11-21 12:11:58 +01:00
Mathieu Virbel	e18a7c8d4e	server: correctly save duration, when filewriter is finished	2023-11-11 01:00:09 +01:00
Mathieu Virbel	226b92c347	www/server: introduce share mode	2023-11-07 12:39:48 +01:00
Mathieu Virbel	eb76cd9bcd	server/www: rename topic text field to transcript This alleviates the current issue with vercel deployment	2023-11-02 19:59:56 +01:00
Mathieu Virbel	37f6fe6345	server: rename migration script for readability	2023-11-02 19:17:34 +01:00
Mathieu Virbel	239fae6189	hotfix/server: add migration script to migrate transcript field to text	2023-11-02 19:02:02 +01:00
projects-g	9fe261406c	Feature additions (#210 ) * initial * add LLM features * update LLM logic * update llm functions: change control flow * add generation config * update return types * update processors and tests * update rtc_offer * revert new title processor change * fix unit tests * add comments and fix HTTP 500 * adjust prompt * test with reflector app * revert new event for final title * update * move onus onto processors * move onus onto processors * stash * add provision for gen config * dynamically pack the LLM input using context length * tune final summary params * update consolidated class structures * update consolidated class structures * update precommit * add broadcast processors * working baseline * Organize LLMParams * minor fixes * minor fixes * minor fixes * fix unit tests * fix unit tests * fix unit tests * update tests * update tests * edit pipeline response events * update summary return types * configure tests * alembic db migration * change LLM response flow * edit main llm functions * edit main llm functions * change llm name and gen cf * Update transcript_topic_detector.py * PR review comments * checkpoint before db event migration * update DB migration of past events * update DB migration of past events * edit LLM classes * Delete unwanted file * remove List typing * remove List typing * update oobabooga API call * topic enhancements * update UI event handling * move ensure_casing to llm base * update tests * update tests	2023-09-13 11:26:08 +05:30
Mathieu Virbel	68dce235ec	server: pass source and target language from api to pipeline	2023-08-29 11:16:23 +02:00
Mathieu Virbel	cce8a9137a	server: add basic sql migration	2023-08-29 10:58:27 +02:00

46 Commits
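
Commit 88ed7cfa78 above mentions HMAC signature verification for webhook payloads. As a rough illustration of what a receiver-side check might look like, here is a minimal sketch. The log does not state the hash algorithm, the signature encoding, or the header that carries the signature, so HMAC-SHA256 with a hex digest is an assumption here, and the function names `sign_payload` and `verify_signature` are hypothetical, not taken from the reflector codebase.

```python
import hashlib
import hmac


def sign_payload(secret: str, body: bytes) -> str:
    """Return a hex HMAC digest of the raw request body.

    Assumption: SHA-256 and hex encoding; the commit log does not say
    which algorithm or encoding the project actually uses.
    """
    return hmac.new(secret.encode("utf-8"), body, hashlib.sha256).hexdigest()


def verify_signature(secret: str, body: bytes, received_signature: str) -> bool:
    """Check a received signature against one recomputed from the body."""
    expected = sign_payload(secret, body)
    # compare_digest runs in constant time, which avoids leaking how many
    # leading characters of the signature matched.
    return hmac.compare_digest(
        expected.encode("utf-8"), received_signature.encode("utf-8")
    )


if __name__ == "__main__":
    body = b'{"event": "test"}'
    signature = sign_payload("example-secret", body)
    print(verify_signature("example-secret", body, signature))
    print(verify_signature("example-secret", body + b" ", signature))
```

The signature should be computed over the raw bytes of the request body as received, before any JSON parsing, since re-serializing the payload can change whitespace or key order and invalidate the digest.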