Skip to content

NkaLight/Note_Pilot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

321 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Note Pilot an "LLM-powered" study dashboard.

E2E Tests

Description:

Note pilot is a study dashboard focused on helping users learn faster and more efficiently using LLMs in a reliable manner. Unlike regular chatbot sessions, lecture pdfs are used to generate study content based on those specific lectures such as flash-cards, exam style questions, summaries etc.

Tech Stack

Category Tools
Framework Next JSNodeJSReact
Language TypeScript
Styling TailwindCSS
Database Postgres Postgres
Deployment Vercel

Live Deployment

https://note-pilot-nu.vercel.app

Core Features

  • AI study content generation from uploaded lecture notes
  • Secure authentication with dual token system (JWT + opaque token)
  • Password reset with one-time cryptographic tokens via Mailjet
  • E2E tested with Playwright against a containerized PostgreSQL.

Testing

  • E2E: Playwright with containerized database via Docker
  • Unit: Jest

Architecture

The application consists of a number of moving parts:

  • Next.js application hosted on Vercel
  • Python microservice hosted on HuggingFace Space
    • Python microservice performs the pdf-to-text extraction(the fastest of all the libs), chunking and vectorization on upload pdf and user query. Before I used AWS ECS so I can say I used AWS but then AWS charged me $0.2 last month so I switched to this free service.
  • Embedding model hosted on the google servers AI-studio
    • Vectorizes the chunks.
  • LLM reflex agent hosted on OpenRouter.(OpenRouter picks and chooses whichever model to use)
    • Consumes the built prompt with all the necessary context and returns text data to be rendered by Next.js application.

The application is designed to be as agnostic as possible as to which LLM is used as for my use case the value of how good the model performs is more tied to prompting techniques and RAG strategies.

Current major limitation

I have setup configs for getting a simple RAG setup working which you can test on the application hosted here. Whilst I have moved from naive context fetching directly from user input to Hypothetical Document Embeddings I still lack an empirical evaluation framework. Upstream adjustments (like switching to HyDE, decreasing/increasing vector dimensions, switching to dynamic chunking based on lecture sections) are all currently guided by theoretical benchmarks from literature rather than deterministic, project-specific metrics.

The GOAL now is to integrate an evaluation framework (such as RAGAS, TruLens or establish my own) to quantitatively measure context precision, context recall, and faithfulness against user queries.

About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors