feat: use llamaindex everywhere (#525)

* feat: use llamaindex for transcript final title too

* refactor: removed llm backend, replaced with one single class+llamaindex

* refactor: self-review

* fix: typing

* fix: tests

* refactor: extract clean_title and add tests

* test: fix

* test: remove ensure_casing/nltk

* fix: tiny mistake
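
The core of this change is the second bullet: the per-backend LLM implementations are replaced by one llamaindex-backed class. As a rough illustration only (not the code in this PR), the sketch below shows how such a wrapper could be built from the new `LLM_URL`, `LLM_API_KEY`, `LLM_MODEL`, and `LLM_CONTEXT_WINDOW` settings that the `.env` diff adds, assuming a recent llama-index release that ships the `OpenAILike` client.

```python
# Hypothetical sketch, not the PR's implementation: one llama-index client
# configured from the env vars introduced in this commit.
import os

from llama_index.core.llms import LLM
from llama_index.llms.openai_like import OpenAILike


def build_llm() -> LLM:
    """Build a single OpenAI-compatible client instead of per-backend classes."""
    return OpenAILike(
        model=os.environ.get("LLM_MODEL", "microsoft/phi-4"),
        api_base=os.environ["LLM_URL"],
        api_key=os.environ.get("LLM_API_KEY", "sk-"),
        context_window=int(os.environ.get("LLM_CONTEXT_WINDOW", "16000")),
        is_chat_model=True,
    )


if __name__ == "__main__":
    llm = build_llm()
    print(llm.complete("Suggest a short title for this transcript: ...").text)
```

Standardizing on one OpenAI-compatible endpoint is what lets the separate modal / openai / GPT4All switches in the old `.env` (removed in the hunk below) go away.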
committed by GitHub on 2025-08-01 12:13:00 -06:00
parent 1878834ce6
commit 28ac031ff6
25 changed files with 284 additions and 1539 deletions
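
The "extract clean_title and add tests" bullet pulls title post-processing out into its own helper; that code is not part of the hunk shown below, so the following is only a hypothetical sketch of what such a helper and a pytest-style test could look like, with names and behavior assumed rather than taken from the PR.

```python
# Hypothetical sketch of a clean_title helper plus a test.
# The real implementation in the PR is not shown in this diff.
import re


def clean_title(raw: str) -> str:
    """Strip quotes, extra whitespace, and trailing punctuation from an LLM-generated title."""
    title = raw.strip().strip('"').strip("'")
    title = re.sub(r"\s+", " ", title)
    return title.rstrip(".")


def test_clean_title_strips_quotes_and_whitespace():
    assert clean_title(' "Weekly  sync notes." ') == "Weekly sync notes"
```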


@@ -46,38 +46,11 @@ TRANSLATE_URL=https://monadical-sas--reflector-translator-web.modal.run
## llm backend implementation
## =======================================================
## Using serverless modal.com (require reflector-gpu-modal deployed)
LLM_BACKEND=modal
LLM_URL=https://monadical-sas--reflector-llm-web.modal.run
LLM_MODAL_API_KEY=
ZEPHYR_LLM_URL=https://monadical-sas--reflector-llm-zephyr-web.modal.run
## Using OpenAI
#LLM_BACKEND=openai
#LLM_OPENAI_KEY=xxx
#LLM_OPENAI_MODEL=gpt-3.5-turbo
## Using GPT4ALL
#LLM_BACKEND=openai
#LLM_URL=http://localhost:4891/v1/completions
#LLM_OPENAI_MODEL="GPT4All Falcon"
## Default LLM MODEL NAME
#DEFAULT_LLM=lmsys/vicuna-13b-v1.5
## Cache directory to store models
CACHE_DIR=data
## =======================================================
## Summary LLM configuration
## =======================================================
## Context size for summary generation (tokens)
SUMMARY_LLM_CONTEXT_SIZE_TOKENS=16000
SUMMARY_LLM_URL=
SUMMARY_LLM_API_KEY=sk-
SUMMARY_MODEL=
# LLM_MODEL=microsoft/phi-4
LLM_CONTEXT_WINDOW=16000
LLM_URL=
LLM_API_KEY=sk-
## =======================================================
## Diarization