* fix: refactor modal API key configuration for better separation of concerns
- Split generic MODAL_API_KEY into service-specific keys:
- TRANSCRIPT_API_KEY for transcription service
- DIARIZATION_API_KEY for diarization service
- TRANSLATE_API_KEY for translation service
- Remove deprecated *_MODAL_API_KEY settings
- Add proper validation to ensure URLs are set when using modal processors
- Update README with new configuration format
BREAKING CHANGE: Configuration keys have changed. Update your .env file:
- TRANSCRIPT_MODAL_API_KEY → TRANSCRIPT_API_KEY
- LLM_MODAL_API_KEY → (removed, use TRANSCRIPT_API_KEY)
- Add DIARIZATION_API_KEY and TRANSLATE_API_KEY if using those services
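For illustration, a minimal sketch of the service-specific keys with the URL validation mentioned above, assuming a pydantic-settings style Settings class; the URL and backend field names here are assumptions, not necessarily the repo's actual names:

```python
# Hedged sketch only: mirrors the keys listed above, actual settings module may differ.
from pydantic import model_validator
from pydantic_settings import BaseSettings


class Settings(BaseSettings):
    # One key per service instead of a shared MODAL_API_KEY
    TRANSCRIPT_API_KEY: str | None = None
    DIARIZATION_API_KEY: str | None = None
    TRANSLATE_API_KEY: str | None = None

    # Endpoint URLs for the Modal deployments (assumed field names)
    TRANSCRIPT_URL: str | None = None
    DIARIZATION_URL: str | None = None

    # Backend selection (assumed field names)
    TRANSCRIPT_BACKEND: str = "modal"
    DIARIZATION_BACKEND: str = "modal"

    @model_validator(mode="after")
    def check_modal_urls(self):
        # Validation described above: URLs must be set when using modal processors
        if self.TRANSCRIPT_BACKEND == "modal" and not self.TRANSCRIPT_URL:
            raise ValueError("TRANSCRIPT_URL must be set when TRANSCRIPT_BACKEND=modal")
        if self.DIARIZATION_BACKEND == "modal" and not self.DIARIZATION_URL:
            raise ValueError("DIARIZATION_URL must be set when DIARIZATION_BACKEND=modal")
        return self
```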
* fix: update Modal backend configuration to use service-specific API keys
- Changed from generic MODAL_API_KEY to service-specific keys:
- TRANSCRIPT_MODAL_API_KEY for transcription
- DIARIZATION_MODAL_API_KEY for diarization
- TRANSLATION_MODAL_API_KEY for translation
- Updated audio_transcript_modal.py and audio_diarization_modal.py to use modal_api_key parameter
- Updated documentation in README.md, CLAUDE.md, and env.example
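A hedged sketch of how a Modal-backed processor could accept the modal_api_key parameter and forward it to its endpoint; the class shape, method name, and bearer-header convention are assumptions for illustration only:

```python
# Illustrative only: not the repo's actual audio_transcript_modal.py code.
import httpx


class AudioTranscriptModalProcessor:
    def __init__(self, url: str, modal_api_key: str | None = None):
        self.url = url
        self.modal_api_key = modal_api_key

    async def transcribe(self, audio_bytes: bytes) -> dict:
        headers = {}
        if self.modal_api_key:
            # Assumed convention: pass the service-specific key as a bearer token
            headers["Authorization"] = f"Bearer {self.modal_api_key}"
        async with httpx.AsyncClient() as client:
            response = await client.post(self.url, content=audio_bytes, headers=headers)
            response.raise_for_status()
            return response.json()
```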
* feat: implement auto/modal pattern for translation processor
- Created TranscriptTranslatorAutoProcessor following the same pattern as transcript/diarization
- Created TranscriptTranslatorModalProcessor with TRANSLATION_MODAL_API_KEY support
- Added TRANSLATION_BACKEND setting (defaults to "modal")
- Updated all imports to use TranscriptTranslatorAutoProcessor instead of TranscriptTranslatorProcessor
- Updated env.example with TRANSLATION_BACKEND and TRANSLATION_MODAL_API_KEY
- Updated test to expect TranscriptTranslatorModalProcessor name
- All tests passing
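A minimal sketch of what the auto/modal dispatch could look like, using the settings names listed above; import paths and the exact dispatch mechanism are assumptions:

```python
# Hedged sketch of the auto pattern; the real AutoProcessor may dispatch differently.
from reflector.processors.transcript_translator_modal import (  # assumed import path
    TranscriptTranslatorModalProcessor,
)
from reflector.settings import settings  # assumed import path


class TranscriptTranslatorAutoProcessor:
    """Instantiate the concrete translator selected by TRANSLATION_BACKEND."""

    def __new__(cls, *args, **kwargs):
        backend = settings.TRANSLATION_BACKEND  # defaults to "modal"
        if backend == "modal":
            return TranscriptTranslatorModalProcessor(
                *args,
                modal_api_key=settings.TRANSLATION_MODAL_API_KEY,
                **kwargs,
            )
        raise ValueError(f"Unsupported TRANSLATION_BACKEND: {backend}")
```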
* refactor: simplify transcript_translator base class to match other processors
- Moved all implementation from base class to modal processor
- Base class now only defines abstract _translate method
- Follows the same minimal pattern as audio_diarization and audio_transcript base classes
- Updated test mock to use _translate instead of get_translation
- All tests passing
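Illustrative sketch of the simplified base class and a passthrough test double overriding _translate; exact signatures are assumptions:

```python
from abc import ABC, abstractmethod


class TranscriptTranslatorProcessor(ABC):
    """Minimal base: only the abstract _translate hook, like the other base classes."""

    @abstractmethod
    async def _translate(self, text: str, target_language: str) -> str:
        ...


class PassthroughTranslator(TranscriptTranslatorProcessor):
    """Hypothetical test double in the spirit of the mock-to-passthrough change."""

    async def _translate(self, text: str, target_language: str) -> str:
        return text  # return the input unchanged
```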
* chore: clean up settings and improve type annotations
- Remove deprecated generic API key variables from settings
- Add comments to group Modal-specific settings
- Improve type annotations for modal_api_key parameters
* fix: typing
* fix: pass API key to OpenAI
* test: fix rtc test failing due to transcript changes
It also correctly sets up the database with SQLite, in case our configuration
is set to PostgreSQL.
* ci: deactivate translation backend by default
* test: fix modal->mock
* refactor: implement Igor's review feedback, rename mock to passthrough
* build: move to uv
* build: add packages declaration
* build: move to python 3.12, as sentencepiece does not work on 3.13
* ci: remove pre-commit check, will be done in another branch.
* ci: fix checkout step name
* ci: update lock and dockerfile
* test: remove event_loop, not needed in python 3.12
* test: updated test due to av returning AudioFrame with 4096 samples instead of 1024
* build: prevent using fastapi cli, because there is no way to set a default port
I don't want to pass --port 1250 every time, so I went back to the previous
approach. I deactivated auto-reload for production.
* ci: remove main.py
* test: fix quirk with httpx
This features a new Modal endpoint, and a completely new way to build the
summary.
## SummaryBuilder
The summary builder is based on a conversational model, where an exchange
between the model and the user takes place. This allows including more
context and better respecting the rules.
It requires an endpoint exposing an OpenAI-like chat completions API
(/v1/chat/completions).
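For illustration, a minimal sketch of such a conversational exchange against an OpenAI-compatible chat completions endpoint; the base URL, model id, and prompt handling are placeholders, not the project's actual values:

```python
from openai import AsyncOpenAI

# Placeholder endpoint and key; the real values come from configuration.
client = AsyncOpenAI(base_url="https://example-vllm-endpoint/v1", api_key="dummy")


async def ask(messages: list[dict], content: str) -> str:
    """One conversational turn: send a user message, keep the reply in the history."""
    messages.append({"role": "user", "content": content})
    response = await client.chat.completions.create(
        model="NousResearch/Hermes-3-Llama-3.1-8B",  # assumed model id
        messages=messages,
    )
    reply = response.choices[0].message.content
    messages.append({"role": "assistant", "content": reply})
    return reply
```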
## vLLM Hermes3
Unlike the previous deployment, this one uses vLLM, which provides an
OpenAI-like completions endpoint out of the box. It can also handle guided
JSON generation, so jsonformer is not needed. That said, the model is quite
good at following a JSON schema when asked in the prompt.
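A hedged sketch of what guided JSON generation looks like through vLLM's OpenAI-compatible API (its guided_json extension); the schema, model id, and endpoint are placeholders:

```python
from openai import AsyncOpenAI

client = AsyncOpenAI(base_url="https://example-vllm-endpoint/v1", api_key="dummy")

# Placeholder schema for the structured output.
SUBJECTS_SCHEMA = {
    "type": "object",
    "properties": {
        "participants": {"type": "array", "items": {"type": "string"}},
        "key_subjects": {"type": "array", "items": {"type": "string"}},
    },
    "required": ["participants", "key_subjects"],
}


async def extract_subjects(transcript: str) -> str:
    response = await client.chat.completions.create(
        model="NousResearch/Hermes-3-Llama-3.1-8B",  # assumed model id
        messages=[
            {"role": "user", "content": f"List participants and key subjects:\n{transcript}"}
        ],
        extra_body={"guided_json": SUBJECTS_SCHEMA},  # vLLM-specific extension
    )
    return response.choices[0].message.content
```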
## Conversion of long/short into summary builder
The builder identifies participants, finds key subjects, gets a summary for
each, then produces a quick recap.
The quick recap is used as the short_summary, while the markdown including
the quick recap + key subjects + summaries is used for the
long_summary.
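A hypothetical sketch of that assembly; field names and markdown layout are illustrative only:

```python
def build_summaries(quick_recap: str, subjects: dict[str, str]) -> tuple[str, str]:
    """Return (short_summary, long_summary) as described above."""
    short_summary = quick_recap
    parts = ["# Quick recap", quick_recap, "", "# Key subjects"]
    for subject, summary in subjects.items():
        parts += [f"## {subject}", summary, ""]
    long_summary = "\n".join(parts)
    return short_summary, long_summary
```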
This is why the NextJS component has to be updated, to correctly style
h1 and keep the newlines of the markdown.
* sso: first pass for integrating SSO
still have an issue with refreshing
maybe customize the login page, or avoid it completely
make 100% sure to understand how server/client sessions are working
need to test with different configuration options (feature flags and
requireLogin)
* sso: correctly handle refresh token, with pro-active refresh
Going with interceptors means extra calls to reflector on 401.
We would then need to circle back to the NextJS backend to update the JWT and
session, then retry the failed request.
I preferred to go proactive, and ensure the session AND JWT are always
up to date.
A minute before the expiration, we'll try to refresh it. useEffect() in
NextJS cannot be asynchronous, so we cannot wait for the token to be
refreshed.
Every 20s, starting a minute before the expiration (so 3x max in total), we'll
try to renew. When the accessToken is renewed, the session is updated and
dispatched up to the client, which updates useApi().
Therefore, no component is left with an incorrect token.
* fixes: issue with missing key on react-select-search because the default value is undefined
* sso: fixes login/logout button, and avoids showing the "login with authentik" page when clicking
* sso: ensure /transcripts/new is not behind a protected page, and feature-flagged pages are honored
* sso: fixes user sub->id
* fixes: remove old unused layout
* fixes: set default NEXT_PUBLIC_SITE_URL as localhost
* fixes: removing fief again due to merge with main
* sso: ensure session is always ready before doing any action
* sso: add migration from fief to JWT on the server, only from the transcripts list
* fixes: user tests
* fixes: compilation issues