Skip to content
View AD-Styles's full-sized avatar
๐Ÿ˜ƒ
Focusing
๐Ÿ˜ƒ
Focusing

Block or report AD-Styles

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
AD-Styles/README.md

์•ˆ๋…•ํ•˜์„ธ์š”๐Ÿ‘‹
์›๋ฆฌ๋ฅผ ํŒŒ๊ณ ๋“ค์–ด ์‹ค์ „์— ์ ์šฉํ•˜๋Š” 'AI ํ…Œํฌ ์ธ์žฌ' ๊น€๋„์œค ์ž…๋‹ˆ๋‹ค!

๋‹จ์ˆœํžˆ ๋งŒ๋“ค์–ด์ง„ ๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์„ ๋„˜์–ด, "๋ชจ๋ธ ๋‚ด๋ถ€์˜ ํ•ต์‹ฌ ๋กœ์ง๊ณผ ์ˆ˜ํ•™์  ์›๋ฆฌ๋ฅผ ์ดํ•ดํ•ด์•ผ๋งŒ ์‹ค๋ฌด์˜ ์—ฃ์ง€ ์ผ€์ด์Šค(Edge Case)๋ฅผ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ๋‹ค" ๊ณ  ๋ฏฟ์Šต๋‹ˆ๋‹ค. ๋…ผ๋ฌธ์„ ์ง์ ‘ ๊ตฌํ˜„ํ•˜๋ฉฐ ๋‹ค์ง„ '๊ธฐ์ดˆ ์ฒด๋ ฅ' ์œ„์—, ๋ฐ‘๋ฐ”๋‹ฅ(From Scratch) โ†’ ์ „์ดํ•™์Šต/ํŒŒ์ธํŠœ๋‹ โ†’ SOTA ๋ชจ๋ธ ์ตœ์ ํ™” ๋ฐ ์„œ๋น™(Triton/TensorRT) ๊นŒ์ง€ ์ด์–ด์ง€๋Š” ์—”๋“œํˆฌ์—”๋“œ AI ํŒŒ์ดํ”„๋ผ์ธ์„ ์„ค๊ณ„ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

ํ•ต์‹ฌ 3๋Œ€ ํ‚ค์›Œ๋“œ โ€” From Scratch (์›๋ฆฌ ๊ฒ€์ฆ) ยท Production & Edge (Triton ยท RAPIDS ยท Jetson) ยท Multi-Modal (Vision ยท Language ยท Voice ยท Sensor)


๐Ÿ› ๏ธ Tech Stack & Skills

  • Foundations (๋…ผ๋ฌธ ์ง์ ‘ ๊ตฌํ˜„): PyTorch, NumPy only โ€” Transformer, ResNet, GPT, VAE, GAN, Diffusion, CLIP, Mini-LLaVA (VLM)
  • LLM & NLP: LangChain, LangGraph, Gemini 2.0 Flash, OpenAI API, Qwen2.5, EXAONE-3.5, Hugging Face Transformers, KoGPT2, KLUE-BERT, Sentence-Transformers
  • LLM Fine-Tuning & Optimization: Unsloth, QLoRA (4-bit NF4), GGUF, PEFT, Pydantic-Structured Output
  • Computer Vision: PyTorch, Torchvision, YOLOv5, YOLOv8, U-Net, FCN, SMP (segmentation_models_pytorch), ResNet, GANs, OpenCV
  • RAG & Vector DB: ChromaDB, FAISS, RecursiveCharacterTextSplitter, LCEL, Multi-Collection Routing
  • Reinforcement Learning: Q-Learning, DQN, Experience Replay, Frame Stacking, Gymnasium
  • Data Engineering & Acceleration: NVIDIA RAPIDS (cuDF / cuGraph), Dask, CUDA (Numba), Parquet, NetworkX
  • MLOps & Deployment: NVIDIA Triton Inference Server, TensorRT, ONNX, llama.cpp, Dynamic Batching, Hugging Face Spaces, Gradio, Streamlit
  • Edge AI & Full-Stack: NVIDIA Jetson, ROS 2, FastAPI, Flutter

๐Ÿš€ ๋Œ€ํ‘œ ํ”„๋กœ์ ํŠธ (Representative Projects)

1๏ธโƒฃ Foundations โ€” ๋…ผ๋ฌธ / ํ•ต์‹ฌ ์•Œ๊ณ ๋ฆฌ์ฆ˜ From Scratch

"๋ผ์ด๋ธŒ๋Ÿฌ๋ฆฌ ํ•œ ์ค„๋กœ ๋๋‚˜๋Š” ์ฝ”๋“œ๋Š” ๋ˆ„๊ตฌ๋‚˜ ์ž‘์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค." โ€” ๊ทธ๋ž˜์„œ ์ €๋Š” ๋ชจ๋ธ์˜ ํ•ต์‹ฌ ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ PyTorch / NumPy ๋งŒ์œผ๋กœ ์ง์ ‘ ๊ตฌํ˜„ํ•˜๋ฉฐ "์™œ ์ด ๊ตฌ์กฐ์ธ๊ฐ€"๋ฅผ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ๋Š” ๊นŠ์ด๋ฅผ ์Œ“์•˜์Šต๋‹ˆ๋‹ค.

  • โญ vlm-from-scratch-v4 โ€” v3 ๊ฐ€ ์ •๋Ÿ‰ ์ž…์ฆํ•œ "vision encoder ํฌ๊ธฐ โ‰  VLM ๋Šฅ๋ ฅ, LLM ์ด ์ง„์งœ ๋ณ‘๋ชฉ" ๊ฒฐ๋ก ์„ ์ •๋ฉด ๋ŒํŒŒํ•œ 4์„ธ๋Œ€ ๋ฐ˜๋ณต. v3 ๊ฐ€ ์ถ”๋ก  wrapper 5์ข…์œผ๋กœ ์šฐํšŒ ํ–ˆ๋˜ 0.5B LLM ํ•œ๊ณ„๋ฅผ, v4 ๋Š” ์–ธ์–ด ๋ชจ๋ธ ์ž์ฒด๋ฅผ Qwen2.5-1.5B-Instruct ๋กœ ๊ต์ฒด (0.5B โ†’ 1.5B, 3๋ฐฐ) ํ•ด ์ •๊ณต๋ฒ•์œผ๋กœ ํ•ด๊ฒฐ. CLIP-ViT-B/32 (frozen) + ํ•™์Šตํ˜• 2-layer MLP projector (768โ†’1536, GELU) + 4-bit NF4 QLoRA ๊ตฌ์กฐ๋ฅผ double quantization ์œผ๋กœ ์••์ถ•(fp32 6 GB โ†’ 0.9 GB)ํ•ด ๋‹จ์ผ 8 GB consumer GPU ์—์„œ 2-stage ํ•™์Šต (projector alignment loss 5.0โ†’1.98 / instruction tuning loss 3.65โ†’1.01). Qwen2.5 ๋‚ด์žฅ <|image_pad|> ํ† ํฐ์„ ์žฌ์‚ฌ์šฉํ•ด ์‹ ๊ทœ ํ† ํฐ ์—†์ด v3 ์˜ 1 GB adapter bloat ๋ฅผ ์›์ฒœ ์ฐจ๋‹จ, custom _merge ํ•จ์ˆ˜๋กœ ์ž„๋ฒ ๋”ฉ splice ์ง์ ‘ ๊ตฌํ˜„. VQAv2 36.7% โ†’ 56.8%, POPE 50.0% โ†’ 71.8% (yes-F1 0.735 ยท prediction bias 0.568 ยท refusal rate 0.000), first-token entropy ๊ธฐ๋ฐ˜ OOD ํƒ์ง€ ROC AUC 0.971 ์™€ OOD abstention layer ๋กœ ํ™˜๊ฐ ๊ฒฝ๊ณ . ๊ฒ€์ฆ ๊ฒŒ์ดํŠธ๋ฅผ ๋ฐฐํฌ ์ด์ „ ์œผ๋กœ ์ด๋™, Stage 2 ๋Š” VQAv2ยทLocalizedNarrativesยทA-OKVQAยทKoLLaVA(ํ•œ๊ตญ์–ด) 46K ํ˜ผํ•ฉ ๋ฐ์ดํ„ฐ.
  • vlm-from-scratch-v3 โ€” v2 ์˜ 3๊ฐ€์ง€ ํ•œ๊ณ„ (ํ•œ๊ตญ์–ด catastrophic forgetting ยท OOD ํ™˜๊ฐ ยท 1GB adapter) ๋ฅผ ๋ชจ๋‘ ํ•ด๊ฒฐํ•œ ์ฐจ์„ธ๋Œ€ ๋ฐ˜๋ณต. ์ถ”๋ก  wrapper 5์ข… (CLIP yes/no ๊ฒŒ์ดํŒ… ยท CLIP ์ƒ‰์ƒ ๋ถ„๋ฅ˜ ยท ์ถœ๋ ฅ ํ›„์ฒ˜๋ฆฌ ยท m2m100 ํ•œโ†”์˜ ๋ฒˆ์—ญ ยท OOD ๊ฐ์ง€ layer) ์œผ๋กœ 0.5B LLM ํ•œ๊ณ„๋ฅผ ์šฐํšŒ. POPE 50% โ†’ 53.33% (untuned) / 70% (tuned), LoRA adapter 1045 MB โ†’ 8.28 MB (โˆ’99.21%, greedy ์ถœ๋ ฅ bit-๋‹จ์œ„ ๋™์ผ). ViT-L/14 ablation ์‹คํŒจ์—์„œ "vision encoder ํฌ๊ธฐ โ‰  VLM ๋Šฅ๋ ฅ โ€” LLM ์ด ์ง„์งœ ๋ณ‘๋ชฉ" ์ •๋Ÿ‰ ์ž…์ฆ. HF Spaces Live Demo + gradio_client API + Playwright Chromium ์ž๋™ํ™”๋กœ 3์ค‘ ์™ธ๋ถ€ ๊ฒ€์ฆ.
  • vlm-from-scratch (Mini-LLaVA v1โ†’v2) โ€” v3 ์˜ ์ถœ๋ฐœ์ ์ด ๋œ baseline. CLIP-from-scratch + GPT-from-scratch ์˜ ๋นŒ๋”ฉ ๋ธ”๋ก์„ ์‹ค๋ฌด ์Šค์ผ€์ผ๋กœ ์กฐ๋ฆฝํ•œ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ LLM. HuggingFace LlavaForConditionalGeneration ๋ฏธ์‚ฌ์šฉ, <image> ํ† ํฐ splice / projector / LoRA adapter ํ†ตํ•ฉ ์ง์ ‘ ๊ตฌํ˜„. v1 (Stage 1 alignment) ํ•œ๊ณ„ ์ •๋Ÿ‰ ๋ถ„์„ โ†’ v2 (Stage 2 LoRA + ๊ท ํ˜• instruction ๋ฐ์ดํ„ฐ) ๋กœ ๊ฐœ์„ ํ•˜๋Š” ๋‘ ์ฐจ๋ก€ ๋ฐ˜๋ณต ์‚ฌ์ดํด ์ „์ฒด ๊ธฐ๋ก. ์˜๋ฌธ ๊ฐ•์•„์ง€ VQA 5๋ฌธํ•ญ 4/5 ์ •ํ™•, ํ•œ๊ตญ์–ด์—์„œ catastrophic forgetting ์ •๋Ÿ‰ ์ž…์ฆ, ํ”ผ์นด์ธ„(OOD)์—์„œ ์ฒด๊ณ„์  ์˜ค๋ฅ˜ ํŒจํ„ด ๋ถ„์„ ๊นŒ์ง€ ๋ชจ๋ธ ํ•œ๊ณ„์˜ ์†”์งํ•œ ํ•ด๋ถ€.
  • โญ transformer-from-scratch โ€” ใ€ŽAttention Is All You Needใ€ ๋…ผ๋ฌธ ์žฌํ˜„. nn.Transformer / HuggingFace ๋ฏธ์‚ฌ์šฉ, Scaled Dot-Product Attention ยท Multi-Head ยท Sinusoidal Positional Encoding ยท Encoder-Decoder ๋ฅผ ํ…์„œ ์—ฐ์‚ฐ๋งŒ์œผ๋กœ ๊ตฌํ˜„. ํ† ์ด ํƒœ์Šคํฌ์—์„œ Val Acc 98.4%, Attention Heatmap์˜ anti-diagonal ํŒจํ„ด์œผ๋กœ ํ•™์Šต ์›๋ฆฌ ๊ฒ€์ฆ.
  • gpt-from-scratch โ€” nanoGPT ์˜๊ฐ์˜ Decoder-only Transformer (10.79M params). Q/K/V ๋ถ„ํ• ๊นŒ์ง€ ์†์ž‘์„ฑ, ์ž์ฒด Char-level Tokenizer (vocab 65), Tiny Shakespeare ํ•™์Šต ํ›„ Greedy / Temperature / Top-k ์ƒ˜ํ”Œ๋ง ๋น„๊ต.
  • resnet-from-scratch โ€” Skip Connection์ด Degradation Problem์„ ์ •๋ง ํ•ด๊ฒฐํ•˜๋Š”๊ฐ€? ๋ฅผ ๊ฒ€์ฆ. ๋™์ผ ๊นŠ์ดยทํŒŒ๋ผ๋ฏธํ„ฐ์˜ Plain-20 vs ResNet-20 ๋น„๊ต ์‹คํ—˜์—์„œ out + shortcut(x) ๋‹จ ํ•œ ์ค„์˜ ํšจ๊ณผ๋ฅผ +2.19%p (89.16% vs 86.97%) ๋กœ ์ •๋Ÿ‰ํ™”.
  • diffusion-models-from-scratch โ€” U-Net + Sinusoidal Time Embedding + Classifier-Free Guidance ๋ฅผ ํ•œ ํŒŒ์ผ์—. Forward/Reverse ํ™•์‚ฐ ๊ณผ์ •๊ณผ Bernoulli Context Mask๊นŒ์ง€ ์ง์ ‘ ๊ตฌํ˜„.
  • vae-from-scratch โ€” Reparameterization Trick ยท ELBO (Reconstruction + ฮฒยทKL) ์ง์ ‘ ๊ตฌํ˜„. 16์ฐจ์› Latent Space์—์„œ Latent Traversal๊ณผ Interpolation์œผ๋กœ ์ฐจ์›๋ณ„ ์˜๋ฏธ ๋ถ„์„.
  • gan-from-scratch โ€” DCGAN (ConvTranspose Generator + Conv Discriminator) ์œผ๋กœ Minimax ๊ฒŒ์ž„ ์ง์ ‘ ์ž‘์„ฑ. Mode Collapse ๊ฒ€์ฆ, D Accuracy 0.5 ๊ท ํ˜• ๋ชจ๋‹ˆํ„ฐ๋ง, VAE vs GAN (Blurry vs Sharp) ๋™์ผ ์กฐ๊ฑด ๋น„๊ต.
  • clip-from-scratch โ€” ์ด์ „ From-Scratch ๋ถ€ํ’ˆ(ResNet-20 + Transformer)์„ ์ง์ ‘ ์กฐ๋ฆฝํ•œ Multi-Modal ์‹œ์Šคํ…œ. Symmetric InfoNCE Loss + Learnable Temperature ฯ„๋กœ ํ•™์Šต, ํ•™์Šต ์–‘์‹ 64.4% vs Zero-shot 64.6% โ€” ์ž„๋ฒ ๋”ฉ ๊ณต๊ฐ„ ์ •๋ ฌ(Alignment) ์„ฑ๊ณต.
  • object-detection-fundamentals โ€” IoU ยท NMS ยท ๋ฉ€ํ‹ฐํƒœ์Šคํฌ(์ขŒํ‘œ ํšŒ๊ท€ + ๋ถ„๋ฅ˜) Loss ๋ฐธ๋Ÿฐ์‹ฑ ์„ NumPy๋กœ ์ง์ ‘ ๊ตฌํ˜„. ResNet18 Backbone ์œ„์— ํ”ฝ์…€ ์ขŒํ‘œ๊ณ„๋ถ€ํ„ฐ ์†์œผ๋กœ.
  • image-segmentation-from-scratch โ€” 3๋‹จ๊ณ„ ์‹ฌํ™” ํ•™์Šต (FCN8s ๋…ผ๋ฌธ ์žฌํ˜„ โ†’ U-Net ์ง์ ‘ ์„ค๊ณ„ โ†’ SMP ์‹ค๋ฌด). ๋‹จ์ˆœ ์ฝ”๋“œ ์žฌํ˜„์ด ์•„๋‹Œ "์™œ ์ด ๊ตฌ์กฐ์ธ๊ฐ€"๋ฅผ ์„ค๋ช…ํ•  ์ˆ˜ ์žˆ๋Š” ๊นŠ์ด.

2๏ธโƒฃ NLP & LLM โ€” ์ง€๋Šฅํ˜• ์‹œ์Šคํ…œ ์„ค๊ณ„ / ํŒŒ์ธํŠœ๋‹ / ์„œ๋น™

LLM ์‘์šฉ & RAG ์‹œ์Šคํ…œ

  • langchain-rag-enterprise-chatbot โ€” 5์ข… ์‚ฌ๋‚ด ๋ฌธ์„œ(TXT/JSON/JSONL/CSV/PDF) ๋ฅผ ๋ฉ€ํ‹ฐ ์ปฌ๋ ‰์…˜ ChromaDB๋กœ ๋ถ„๋ฆฌ ์ €์žฅํ•˜์—ฌ ์ž์—ฐ์–ด ๋ผ์šฐํŒ… ๊ธฐ๋ฐ˜ RAG ์ฑ—๋ด‡ ๊ตฌ์ถ•. JSON์€ ํ†ต์งธ๋กœ ๋ณด์กด, ํ…์ŠคํŠธ๋Š” ์ฒญํฌ ๋ถ„ํ•  ๋“ฑ ํ˜•์‹๋ณ„ ์ „๋žต ์ฐจ๋ณ„ํ™”.
  • โญ langchain-production-chatbot โ€” LangGraph + Pydantic Structured Output ์œผ๋กœ ๋ถ„๋ฅ˜ยท๋ฉ”๋ชจ๋ฆฌยท๋ผ์šฐํŒ…์„ ํ•œ ํ˜ธ์ถœ์—. InMemorySaver thread ๊ฒฉ๋ฆฌ + SummarizationMiddleware (4000 ํ† ํฐ ์ž„๊ณ„์น˜) ๋กœ ํ† ํฐ ํญ์ฃผ ๋ฐฉ์ง€.
  • langchain-agent-tool-integration โ€” LLM์ด ์™ธ๋ถ€ ์‹œ์Šคํ…œ(์‹œ๊ฐ„ ์กฐํšŒ / ์›น ์Šคํฌ๋ž˜ํ•‘ / SQL)์— ์ง์ ‘ ์ ‘๊ทผํ•˜๋Š” 3์ข… Tool ํ†ตํ•ฉ Agent. ์‹œ์Šคํ…œ ํ”„๋กฌํ”„ํŠธ๋กœ ์ฝ๊ธฐ ์ „์šฉ SQL ๊ฐ•์ œ ๋“ฑ ์•ˆ์ „ ์„ค๊ณ„.
  • โญ rag-chatbot โ€” LangChain + Chroma + Gemini 2.0 Flash ๋กœ ๋Œ€ํ™” ๋ฉ”๋ชจ๋ฆฌ ์œ ์ง€ RAG ์ฑ—๋ด‡ ๊ตฌ์ถ•, Gradio UI๋กœ Hugging Face Spaces ๋ฐฐํฌ๊นŒ์ง€ ์—”๋“œํˆฌ์—”๋“œ ์™„์„ฑ.
  • nlp-portfolio โ€” ์ˆ˜ํ•™์  ๊ธฐ์ดˆ(NumPy Self-Attention) โ†’ ๋ถ„๋ฅ˜(BERT/Logistic Regression) โ†’ ์˜๋ฏธ ๊ฒ€์ƒ‰(Multilingual-MiniLM) 3๋‹จ๊ณ„ ์ผ๊ด€ ํŒŒ์ดํ”„๋ผ์ธ. Error Analysis ์ค‘์‹ฌ์˜ Data-Centric ์ ‘๊ทผ.

LLM ํŒŒ์ธํŠœ๋‹ & ํšจ์œจํ™”

  • unsloth-qlora-finetuning โ€” Llama-3 8B๋ฅผ T4 16GB ํ•œ ์žฅ์œผ๋กœ ํŒŒ์ธํŠœ๋‹. 4-bit NF4 ์–‘์žํ™” + LoRA๋กœ ์ „์ฒด ํŒŒ๋ผ๋ฏธํ„ฐ์˜ 0.08% ๋ฏธ๋งŒ๋งŒ ํ•™์Šต, VRAM 60% ์ ˆ๊ฐ / ํ›ˆ๋ จ ์†๋„ 2.6๋ฐฐ ๊ฐ€์†.
  • โญ kogpt2-korean-finetuning โ€” KoGPT2๋ฅผ NSMC ์˜ํ™” ๋ฆฌ๋ทฐ ๋„๋ฉ”์ธ์œผ๋กœ ํŒŒ์ธํŠœ๋‹. PreTrainedTokenizerFast ์ธ๋ฑ์Šค ๋ฐ€๋ฆผ ๋ฌธ์ œ๋ฅผ ๊ทผ๋ณธ ๋””๋ฒ„๊น…, Top-P ยท Temperature ยท Repetition Penalty ๋””์ฝ”๋”ฉ ์ „๋žต ๋น„๊ต.
  • nlp-bert-finetuning โ€” Self-Attention โ†’ BERT โ†’ ์ „์ด ํ•™์Šต([CLS] Token Pooling)๊นŒ์ง€ ๋‹จ๊ณ„๋ณ„ ํ•™์Šต.

LLM Serving (Production)

  • nlp-triton-deployment โ€” BERT๋ฅผ PyTorch โ†’ ONNX โ†’ NVIDIA Triton Inference Server ๋กœ ๋ฐฐํฌ. config.pbtxt์˜ dims:[-1] ๋™์  ์ถ• ์ง€์› + Dynamic Batching + ๋™์  ํŒจ๋”ฉ ์œผ๋กœ ์ง€์—ฐ ์‹œ๊ฐ„ 45ms โ†’ 12.5ms (โ–ผ72%) / ์ฒ˜๋ฆฌ๋Ÿ‰ 22 โ†’ 145 TPS (โ–ฒ6.6๋ฐฐ) ๋‹ฌ์„ฑ.

NLP ๊ธฐ์ดˆ ๋‹ค์ง€๊ธฐ

  • nlp-foundations โ€” Attention ๋ฉ”์ปค๋‹ˆ์ฆ˜์„ NumPy๋กœ ์ง์ ‘ ๊ตฌํ˜„ + KLUE-BERT ํŒŒ์ธํŠœ๋‹ (NSMC ์ •ํ™•๋„ 50% โ†’ 89%) + Masked LM์˜ ์‚ฌํšŒ์  ํŽธํ–ฅ์„ฑ ๋ถ„์„.
  • nlp-preprocessing-foundation โ€” ํ…์ŠคํŠธ ์ •์ œยท์ •๊ทœํ™”ยทํ† ํฐํ™”ยทTF-IDFยทRNN/LSTM ๊นŒ์ง€ OOP ๋ชจ๋“ˆํ™”.
  • nlp-text-classification โ€” ์˜ํ™” ๋ฆฌ๋ทฐ(์ด์ง„) + ๊ตญ๋ฏผ์ฒญ์›(17๊ฐœ ๋‹ค์ค‘ ๋ถ„๋ฅ˜) ํŒŒ์ดํ”„๋ผ์ธ. ๋ฐ์ดํ„ฐ ๋ถˆ๊ท ํ˜•ยท๋ฐ˜์–ด๋ฒ• ํ•œ๊ณ„ ๋ถ„์„.
  • nlp-semantic-search โ€” Sentence-Transformers + Cosine Similarity ์˜๋ฏธ ๊ฒ€์ƒ‰ ์‹œ์Šคํ…œ.

3๏ธโƒฃ Computer Vision โ€” ๊ธฐ์ดˆ ์›๋ฆฌ๋ถ€ํ„ฐ ์‹ค์‹œ๊ฐ„ ํƒ์ง€๊นŒ์ง€

  • yolov5-pothole-detector โ€” object-detection-fundamentals ์˜ ์›๋ฆฌ ์œ„์— YOLOv5s ๋‹จ์ผ ํด๋ž˜์Šค(nc=1) ์ปค์Šคํ…€ ํ•™์Šต ์œผ๋กœ ์‹ค์ œ ๋„๋กœ ์˜์ƒ์—์„œ ํฌํŠธํ™€ ํƒ์ง€. mAP@0.5 0.85+ ์˜ ์‹ค๋ฌด ์„œ๋น„์Šค ์ˆ˜์ค€ ํŒŒ์ดํ”„๋ผ์ธ ์™„์„ฑ.
  • multimodal-ai-sensor-fusion โ€” RGB + LiDAR ์„ผ์„œ ๊ฒฐํ•ฉ. Early / Late / Intermediate 3๊ฐ€์ง€ ์œตํ•ฉ ์•„ํ‚คํ…์ฒ˜ ๋น„๊ต + CLIP ์Šคํƒ€์ผ NT-Xent ๋Œ€์กฐํ•™์Šต. ๋‹จ์ผ ๋ชจ๋ธ 92.7% โ†’ ์œตํ•ฉ ๋ชจ๋ธ 100% ์ธ์‹๋ฅ , ์œ ์‚ฌ๋„ ํ–‰๋ ฌ 95% ๊ฐ์†Œ๋กœ ์ •๋ ฌ(Alignment) ์ž…์ฆ.
  • pytorch-image-classification โ€” ํ…์„œ ์กฐ์ž‘ โ†’ MLP/CNN/VGG ์ง์ ‘ ์„ค๊ณ„ โ†’ ResNet ์ „์ดํ•™์Šต๊นŒ์ง€ ์ด๋ฏธ์ง€ ๋ถ„๋ฅ˜ ์ „ ๊ณผ์ •. OOM ๋ฌธ์ œ ํ•ด๊ฒฐ๋กœ ํ•˜๋“œ์›จ์–ด ํŒŒ์ดํ”„๋ผ์ธ ์—”์ง€๋‹ˆ์–ด๋ง ์—ญ๋Ÿ‰ ํ™•๋ณด.
  • โญ resnet-transfer-learning-cifar10 โ€” ResNet50 vs ResNet101 ํšจ์œจ์„ฑ ๋น„๊ต, Stage A/B ์ „์ดํ•™์Šต ์ „๋žต ๋ถ„์„, ImageNet Pretrained Weights์˜ Skip Connection ํšจ๊ณผ ์‹คํ—˜์  ๊ฒ€์ฆ.

4๏ธโƒฃ Reinforcement Learning โ€” ์ž์œจ ์—์ด์ „ํŠธ ์„ค๊ณ„

  • โญ car-racing-dqn โ€” CarRacing-v2 ์—์„œ 96x96 ํ”ฝ์…€ โ†’ CNN-DQN ์ง์ ‘ ํ•™์Šต. Frame Stacking (4 frames) ยท Experience Replay ยท Target Network ๋กœ ํ•™์Šต ์•ˆ์ •ํ™”, Hugging Face ๋ฐฐํฌ ์™„๋ฃŒ.
  • rl-optimization-benchmark โ€” ๋™์ผํ•œ Q-Learning ์•Œ๊ณ ๋ฆฌ์ฆ˜์œผ๋กœ Taxi โ†’ CliffWalking โ†’ Blackjack ๋„์žฅ๊นจ๊ธฐ. 500๊ฐœ ์ƒํƒœ๋ถ€ํ„ฐ ํŠœํ”Œ ์ƒํƒœ๊นŒ์ง€ ์ž๋ฃŒ๊ตฌ์กฐ ํ™•์žฅ์œผ๋กœ ๋ฒ”์šฉ ์„ค๊ณ„ ๋Šฅ๋ ฅ ์ž…์ฆ.
  • rl-q-learning โ€” FrozenLake์—์„œ ํฌ์†Œ ๋ณด์ƒ์˜ ํ•œ๊ณ„๋ฅผ ๊ทน๋ณตํ•˜๋Š” ๋ฐ€์ง‘ ๋ณด์ƒ(Dense Reward) ์„ค๊ณ„ + Epsilon-Greedy + ์—์ด์ „ํŠธ-ํ™˜๊ฒฝ ๋ถ„๋ฆฌ OOP ์•„ํ‚คํ…์ฒ˜.

5๏ธโƒฃ Data Engineering & Domain Application โ€” GPU ๊ฐ€์† ๋ฐ์ดํ„ฐ ํŒŒ์ดํ”„๋ผ์ธ / ๋„๋ฉ”์ธ ํŠนํ™” ๋ถ„์„

  • rapids-dask-pipeline โ€” ๋Œ€๊ทœ๋ชจ ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ ๋ณ‘๋ชฉ ํ•ด์†Œ๋ฅผ ์œ„ํ•ด NVIDIA RAPIDS cuDF + Dask ๊ฒฐํ•ฉ. CSV โ†’ Parquet, CPU โ†’ GPU ๋กœ ์ €์žฅ ํฌ๋งท๊ณผ ์—ฐ์‚ฐ ์—”์ง„์„ ํ•จ๊ป˜ ์ตœ์ ํ™”, MapReduce DAG ์‹œ๊ฐํ™”๋กœ Lazy Evaluation ํšจ์œจ์„ฑ ์ž…์ฆ.
  • road-network-graph-analytics โ€” NVIDIA cuGraph ๋กœ ์˜๊ตญ ๋„๋กœ๋ง (1,225 ๋…ธ๋“œ / 2,622 ์—ฃ์ง€) ๊ทธ๋ž˜ํ”„ ๋ถ„์„. Dijkstra SSSP + 5์ข… ์ค‘์‹ฌ์„ฑ (Betweenness ยท Eigenvector ยท PageRank ยท Katz ยท Degree) ๋น„๊ต๋กœ ์‹ค๋ฌด ์ธ์‚ฌ์ดํŠธ ๋„์ถœ.
  • skhynix-stock-analysis โ€” ๋ฐ˜๋„์ฒด ๋„๋ฉ”์ธ ์ง€์‹์„ ๊ฒฐํ•ฉํ•œ SKํ•˜์ด๋‹‰์Šค ์ฃผ๊ฐ€ ์˜ˆ์ธก. Bidirectional LSTM + ๊ธฐ์ˆ ์  ์ง€ํ‘œ (MA/RSI/Bollinger/MACD) + Huber Loss, ๋ฐ์ดํ„ฐ ๋ถ„ํฌ ๋ณ€ํ™”์— ๋Œ€ํ•œ ๋ชจ๋ธ ํ•œ๊ณ„์˜ ๊ฐ๊ด€์  ์ธ์ •.

6๏ธโƒฃ Team Project โ€” ํŒ€ ํ˜‘์—…์œผ๋กœ ์™„์„ฑํ•œ ์˜จ๋””๋ฐ”์ด์Šค ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ AI ์‹œ์Šคํ…œ

  • โญ MIND-CARE-Conversational-ChatBot โ€” ๋…๊ฑฐ๋…ธ์ธ ์ผ€์–ด์šฉ ์˜จ๋””๋ฐ”์ด์Šค HRI ์‹œ์Šคํ…œ '๋งˆ์Œ๋Œ๋ด„'. ํด๋ผ์šฐ๋“œ ์˜์กด ์—†์ด NVIDIA Jetson AGX Xavier ํ•œ ๋Œ€ ์—์„œ ์Œ์„ฑยท์–ธ์–ดยท๋น„์ „ ์ถ”๋ก ์„ ์ „๋ถ€ ๋กœ์ปฌ ์‹คํ–‰ํ•˜๋Š” ํ”„๋กœ๋•์…˜๊ธ‰ ์‹œ์Šคํ…œ. ROS 2 ๊ธฐ๋ฐ˜ 4๊ฐœ ๋А์Šจํ•œ ๊ฒฐํ•ฉ ์„œ๋ธŒ์‹œ์Šคํ…œ (์Œ์„ฑ๋Œ€ํ™” STTโ†’LLMโ†’TTS ยท ๋น„์ „ ยท ์‘๊ธ‰ ์ƒํƒœ๋จธ์‹  ยท WebSocket API ๊ฒŒ์ดํŠธ์›จ์ด) ์œผ๋กœ ๋ชจ๋“ˆํ™”. LLM ์€ EXAONE-3.5-7.8B-Instruct ๋ฅผ GGUF ์–‘์žํ™” (Q3_K_M ์†๋„ / Q4_K_M ํ’ˆ์งˆ) + llama.cpp ๋ถ€๋ถ„ GPU offload, ์˜๋ฃŒ ์ง€์‹์€ ChromaDB + ์„œ์šธ์•„์‚ฐ๋ณ‘์› ์งˆํ™˜ ๋ฐฑ๊ณผ ๊ธฐ๋ฐ˜ RAG ๋กœ ์‚ฌ์‹ค ๊ทผ๊ฑฐ ์ œ๊ณต. ๋‚™์ƒ ๊ฐ์ง€๋Š” YOLOv8n-pose (TensorRT FP16) ๊ณจ๊ฒฉ ์ถ”์ถœ ํ›„ frame-level + ์‹œ๊ฐ„์  ํ™•์ธ 2-stage ๊ฒ€์ฆ ์œผ๋กœ ์•‰๊ธฐยท์ˆ™์ด๊ธฐ ์˜คํƒ์„ ์–ต์ œ โ€” Recall 0.77 / Precision 0.68 (URFDD). '๊ฒฝ๋ณด ์ „ ์งˆ๋ฌธ(query-before-alert)' ๋กœ์ง ยท ์ง„๋‹จ/ํˆฌ์•ฝ ๊ธˆ์ง€ ํ”„๋กฌํ”„ํŠธ ๋“ฑ false negative ๋ฐฉ์ง€๋ฅผ ์šฐ์„ ํ•œ ์•ˆ์ „ ์„ค๊ณ„, GPIOยทFCMยทSMS ๋‹ค์ค‘ ์ฑ„๋„ ๋ณดํ˜ธ์ž ์•Œ๋ฆผ๊ณผ Flutter ๋ชจ๋ฐ”์ผ ์•ฑ๊นŒ์ง€ ์—”๋“œํˆฌ์—”๋“œ ์™„์„ฑ.

  • โญ Fashion-King-Virtual-Fitting โ€” 360ยฐ ๊ฐ€์ƒ ํ”ผํŒ…๊ณผ AI ์ƒ‰์ƒ ์ถ”์ฒœ์„ ๊ฒฐํ•ฉํ•œ ์›น ํŒจ์…˜ ์‹œ๋ฎฌ๋ ˆ์ดํ„ฐ '๋‚ด์ผ์€ ํŒจ์…˜์™•'. ์ฒดํ˜• ์Šฌ๋ผ์ด๋”๋กœ ๋งŒ๋“  3D ๋งˆ๋„คํ‚น์— ์˜ท์„ ์ž…ํ˜€ 360ยฐ๋กœ ๋Œ๋ ค๋ณด๊ณ , ์–ผ๊ตด ์‚ฌ์ง„ ํ•œ ์žฅ์œผ๋กœ ํผ์Šค๋„ ์ปฌ๋Ÿฌ๋ฅผ ์ง„๋‹จํ•ด ์–ด์šธ๋ฆฌ๋Š” ์ฝ”๋”” ์ƒ‰๊นŒ์ง€ ์ถ”์ฒœํ•˜๋Š” ์—”๋“œํˆฌ์—”๋“œ ์‹œ์Šคํ…œ. ํ”„๋ก ํŠธ๋Š” Next.js 14 + Three.js / React Three Fiber๋กœ ์ฒดํ˜• 7-ํŒŒ๋ผ๋ฏธํ„ฐ ๋งˆ๋„คํ‚นยท์˜๋ฅ˜ GLB ์Šคํ‚ค๋‹ยทOrbitControls 360ยฐ ํšŒ์ „์„ ์‹ค์‹œ๊ฐ„ ๋ Œ๋”๋งํ•˜๊ณ , ์˜ท ์‚ฌ์ง„โ†’๊ฐ€์ƒ ์ฐฉ์šฉ์€ IDM-VTON ํ•ฉ์„ฑ ํŒŒ์ดํ”„๋ผ์ธ์œผ๋กœ ์—ฐ๋™. ํ•ต์‹ฌ ๋‹ด๋‹น ๋ชจ๋“ˆ์ธ 2D ์ƒ‰์ƒ ์ถ”์ฒœ ์‹œ์Šคํ…œ์€ ResNet18 ๋ฉ€ํ‹ฐํƒœ์Šคํฌ CNN์ด ์–ผ๊ตด์—์„œ ์›œ์ฟจยท๋ช…๋„ยท์„ ๋ช…๋„ 3์ถ•์„ ํšŒ๊ท€ํ•ด ์‚ฌ๊ณ„์ ˆ 12ํƒ€์ž…์œผ๋กœ ๋ถ„๋ฅ˜ โ€” ์‹œ์ฆŒ ์ •ํ™•๋„ 90.4% / ํƒ€์ž… 79.5% (์ถ• MAE 0.14~0.27). ๋ผ๋ฒจ์€ FairFace ์ฝ”ํผ์Šค์— SegFormer ํ”ผ๋ถ€ยท๋ชจ๋ฐœ ์„ธ๊ทธ๋ฉ˜ํ…Œ์ด์…˜ + 2-pass ๋ถ€ํŠธ์ŠคํŠธ๋žฉ์œผ๋กœ ์ธ๊ตฌํ†ต๊ณ„ ํŽธํ–ฅ(ํ”ผ๋ถ€ ํ™ฉ์ƒ‰๋„ยท๋ช…๋„)์„ ์ฝ”ํผ์Šค ํ†ต๊ณ„๋กœ ๋ณด์ •ํ•ด ์ž๋™ ์ƒ์„ฑํ•˜๊ณ , ์ƒ‰์ด ๊ณง ์‹ ํ˜ธ์ด๋ฏ€๋กœ ColorJitter ์ƒ‰ ์ฆ๊ฐ•์„ ๋ฐฐ์ œ(๊ธฐํ•˜ ์ฆ๊ฐ•๋งŒ)ํ•ด ๋ผ๋ฒจ ์˜ค์—ผ์„ ์ฐจ๋‹จ. ๋ฐฐํฌ๋Š” torch ์—†์ด ONNX Runtime๋งŒ์œผ๋กœ ์ถ”๋ก ํ•˜๋ฉฐ mediapipe 1.25ร— ์–ผ๊ตด ํฌ๋กญ์œผ๋กœ ํ•™์Šต-์ถ”๋ก  ๋ถ„ํฌ๋ฅผ ์ผ์น˜์‹œ์ผœ ์ •ํ™•๋„ ์ €ํ•˜๋ฅผ ๋ฐฉ์ง€. ์ถ”์ฒœ์€ CIELab ฮ”E76 ์ง€๊ฐ ๊ฑฐ๋ฆฌ๋กœ ์‹œ์ฆŒ ํŒ”๋ ˆํŠธ์™€ ์˜๋ฅ˜ ์ƒ‰์„ ๋น„๊ตยท์ •๋ ฌ(ฮ”E<12 ์ž˜์–ด์šธ๋ฆผ / <28 ๋ฌด๋‚œ), FastAPI /personal-color ์—”๋“œํฌ์ธํŠธ์™€ React ๊ฒฐ๊ณผ ํŒจ๋„๊นŒ์ง€ ์—ฐ๋™ ์™„์„ฑ.


๐Ÿ“ฌ Contact

Pinned Loading

  1. car-racing-dqn car-racing-dqn Public

    "CNN-DQN Autonomous Driving Agent for CarRacing" / "CNN-DQN ๊ธฐ๋ฐ˜ CarRacing ์ž์œจ์ฃผํ–‰ ์—์ด์ „ํŠธ"

    Python 1

  2. rag-chatbot rag-chatbot Public

    "Conversational RAG Chatbot: Paper analysis with LangChain & Gemini 2.0 Flash, deployed on Hugging Face Spaces." / "๋Œ€ํ™”ํ˜• RAG ์ฑ—๋ด‡: LangChain ๋ฐ Gemini 2.0 Flash๋ฅผ ํ™œ์šฉํ•œ ๋…ผ๋ฌธ ๋ถ„์„ ๋ฐ Hugging Face Spaces ๋ฐฐํฌ"

    Python 1

  3. kogpt2-korean-finetuning kogpt2-korean-finetuning Public

    "Optimizing Korean Text Generation via KoGPT-2 Fine-tuning and Decoding Strategy Analysis" / "KoGPT-2 ํ•œ๊ตญ์–ด ํŒŒ์ธํŠœ๋‹๊ณผ ๋””์ฝ”๋”ฉ ์ „๋žต ๋ถ„์„์„ ํ†ตํ•œ ๋ฌธ์žฅ ์ƒ์„ฑ ์ตœ์ ํ™”"

    Python 1

  4. vlm-from-scratch-v4 vlm-from-scratch-v4 Public

    Mini-LLaVA v4 - CLIP-ViT + Qwen2.5-1.5B QLoRA multimodal LLM from scratch (final of the v1-v4 series)

    Python 1

  5. MIND-CARE-Conversational-ChatBot MIND-CARE-Conversational-ChatBot Public

    On-device HRI care system for seniors living alone โ€” Korean voice dialogue, medical RAG, fall detection & emergency alerts, all on a single NVIDIA Jetson (ROS 2). / NVIDIA Jetson ๊ธฐ๋ฐ˜ ์˜จ๋””๋ฐ”์ด์Šค ๋Œ๋ด„ ๋กœ๋ด‡ โ€” ์Œโ€ฆ

    Python 2

  6. Fashion-King-Virtual-Fitting Fashion-King-Virtual-Fitting Public

    360 virtual fitting & personal color recommendation AI system

    TypeScript 1