12 Commits

Author SHA1 Message Date
Igor Loskutov
009590c080 feat: search frontend (#551)
* feat: better highlight

* feat(search): add long_summary to search vector for improved search results

- Update search vector to include long_summary with weight B (between title A and webvtt C)
- Modify SearchController to fetch long_summary and prioritize its snippets
- Generate snippets from long_summary first (max 2), then from webvtt for remaining slots
- Add comprehensive tests for long_summary search functionality
- Create migration to update search_vector_en column in PostgreSQL

This improves search quality by including summarized content which often contains
key topics and themes that may not be explicitly mentioned in the transcript.

* fix: address code review feedback for search enhancements

- Fix test file inconsistencies by removing references to non-existent model fields
  - Comment out tests for unimplemented features (room_ids, status filters, date ranges)
  - Update tests to only use currently available fields (room_id singular, no room_name/processing_status)
  - Mark future functionality tests with @pytest.mark.skip

- Make snippet counts configurable
  - Add LONG_SUMMARY_MAX_SNIPPETS constant (default: 2)
  - Replace hardcoded value with configurable constant

- Improve error handling consistency in WebVTT parsing
  - Use different log levels for different error types (debug for malformed, warning for decode, error for unexpected)
  - Add catch-all exception handler for unexpected errors
  - Include stack trace for critical errors

All existing tests pass with these changes.

* fix: correct datetime test to include required duration field

* feat: better highlight

* feat: search room names

* feat: acknowledge deleted room

* feat: search filters fix and rank removal

* chore: minor refactoring

* feat: better matches frontend

* chore: self-review (vibe)

* chore: self-review WIP

* chore: self-review WIP

* chore: self-review WIP

* chore: self-review WIP

* chore: self-review WIP

* chore: self-review WIP

* chore: self-review WIP

* remove swc (vibe)

* search url query sync (vibe)

* search url query sync (vibe)

* better casts and cap while

* PR review + simplify frontend hook

* pr: remove search db timeouts

* cleanup tests

* tests cleanup

* frontend cleanup

* index declarations

* refactor frontend (self-review)

* fix search pagination

* clear "x" for search input

* pagination max pages fix

* chore: cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* cleanup

* lockfile

* pr review
2025-08-20 20:56:45 -04:00
Igor Loskutov
27b43d85ab feat: Diarization cli (#509)
* diarisation cli

* feat: s3 upload for modal diarisation cli call

* chore: cleanup

* chore: s3 cleanup improvement

* chore: lint

* chore: cleanup

* chore: cleanup

* chore: cleanup

* chore: cleanup
2025-07-25 16:24:06 -04:00
7f53398926 Mount server dir 2025-02-28 13:00:22 +01:00
b1527ad7b3 Process recordings uploaded to s3 2024-07-29 16:37:11 +02:00
Koper
9df6ab4429 Added feature for "sendToZulip", fixed visual issue, .gitignore for db dump files 2023-12-05 20:17:25 +07:00
Koper
827c9cdf13 Remove .mp3 from .gitignore
See 2a5fbed867

It was making the tests fail
2023-10-12 14:34:34 +01:00
b43bd00fc0 server: fixes wav not saved correctly and mp3 generation invalid if started from /tmp from another device 2023-08-17 16:49:22 +02:00
857505124f server: implement data persistence with database
Using databases + sqlite/postgresql depending of what you want.
Use DATABASE_URL to configure

Closes #70
2023-08-15 17:40:26 +02:00
Mathieu Virbel
509840cb4c processors: Introduce processors implementation
Each processor is standalone, with define INPUT/OUTPUT.
Processor can be threaded or not (can be extensible later)
TODO: Pipeline that automatically connect all processors, flush and clean data

To test: python -m reflector.processors tests/records/test_mathieu_hello.wav

```
Transcript: [00:00.500]:  Hi there, everyone.
Transcript: [00:02.700]:  Today, I want to share my incredible experience.
Transcript: [00:05.461]:  with Reflector, a cutineage product that revolutionizes audio processing.
Transcript: [00:10.922]:  With Refector, I can easily convert any audio into accurate transcription.
Transcript: [00:16.493]:  serving me hours of tedious manual work.
```

This is not a good transcript, but not the purpose here.
2023-07-28 20:08:33 +02:00
Mathieu Virbel
833a206110 test: add missing wav 2023-07-27 18:30:49 +02:00
Gokul Mohanarangan
2184bc3252 split into config / secrets 2023-07-27 13:29:35 +05:30
Koper
c0400b4232 Moved all server files to server/ 2023-07-26 15:13:46 +07:00