Note Pilot an "LLM-powered" study dashboard.

Description:

Note pilot is a study dashboard focused on helping users learn faster and more efficiently using LLMs in a reliable manner. Unlike regular chatbot sessions, lecture pdfs are used to generate study content based on those specific lectures such as flash-cards, exam style questions, summaries etc.

Tech Stack

Category	Tools
Framework
Language
Styling
Database
Deployment

Live Deployment

https://note-pilot-nu.vercel.app

Core Features

AI study content generation from uploaded lecture notes
Secure authentication with dual token system (JWT + opaque token)
Password reset with one-time cryptographic tokens via Mailjet
E2E tested with Playwright against a containerized PostgreSQL.

Testing

E2E: Playwright with containerized database via Docker
Unit: Jest

Architecture

The application consists of a number of moving parts:

Next.js application hosted on Vercel
Python microservice hosted on HuggingFace Space
- Python microservice performs the pdf-to-text extraction(the fastest of all the libs), chunking and vectorization on upload pdf and user query. Before I used AWS ECS so I can say I used AWS but then AWS charged me $0.2 last month so I switched to this free service.
Embedding model hosted on the google servers AI-studio
- Vectorizes the chunks.
LLM reflex agent hosted on OpenRouter.(OpenRouter picks and chooses whichever model to use)
- Consumes the built prompt with all the necessary context and returns text data to be rendered by Next.js application.

The application is designed to be as agnostic as possible as to which LLM is used as for my use case the value of how good the model performs is more tied to prompting techniques and RAG strategies.

Current major limitation

I have setup configs for getting a simple RAG setup working which you can test on the application hosted here. Whilst I have moved from naive context fetching directly from user input to Hypothetical Document Embeddings I still lack an empirical evaluation framework. Upstream adjustments (like switching to HyDE, decreasing/increasing vector dimensions, switching to dynamic chunking based on lecture sections) are all currently guided by theoretical benchmarks from literature rather than deterministic, project-specific metrics.

The GOAL now is to integrate an evaluation framework (such as RAGAS, TruLens or establish my own) to quantitatively measure context precision, context recall, and faithfulness against user queries.

Name		Name	Last commit message	Last commit date
Latest commit History 321 Commits
.github/workflows		.github/workflows
.vscode		.vscode
postgres_schema		postgres_schema
prisma		prisma
public/icons		public/icons
src		src
tests		tests
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
README.md		README.md
deployment-readiness-report.md		deployment-readiness-report.md
docker-compose.test.yml		docker-compose.test.yml
eslint.config.mjs		eslint.config.mjs
jest.config.ts		jest.config.ts
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
playwright.config.ts		playwright.config.ts
postcss.config.mjs		postcss.config.mjs
static-analysis-report.md		static-analysis-report.md
statistical_report.md		statistical_report.md
tailwind.config.ts		tailwind.config.ts
tsconfig.json		tsconfig.json
tsconfig.test.json		tsconfig.test.json
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Note Pilot an "LLM-powered" study dashboard.

Description:

Tech Stack

Live Deployment

Core Features

Testing

Architecture

Current major limitation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Note Pilot an "LLM-powered" study dashboard.

Description:

Tech Stack

Live Deployment

Core Features

Testing

Architecture

Current major limitation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages