Deep Learning for Natural Language Processing (131307379A)

Deep Learning for Natural Language Processing (131307379A)#

Overview#

In recent years, natural language processing (NLP) research has undergone a massive transformation. The emergence of large language models (LLMs) has dramatically improved the ability to generate and understand text, revolutionizing various application domains such as translation, question answering, and summarization. In 2024-2025, multimodal LLMs like GPT-5 and Gemini 2.5 Pro that can simultaneously process text, images, and audio have emerged, further expanding the scope of applications. Particularly noteworthy is the emergence of new architectures beyond Transformer. For example, Mamba, a state space model (SSM), can efficiently process up to millions of tokens with linear O(n) complexity, while RWKV can process conversational messages at 10x or more lower cost than existing methods in real-time.

This course reflects these latest developments to provide hands-on deep learning-based NLP techniques. Students first learn core tool utilization methods such as PyTorch and Hugging Face usage, then directly experience fine-tuning of Transformer-based models and latest SSM architectures, prompt engineering, retrieval-augmented generation (RAG), reinforcement learning from human feedback (RLHF), and agent framework implementation. Additionally, we cover latest parameter-efficient fine-tuning (PEFT) techniques (WaveFT, DoRA, VB-LoRA, etc.) and advanced RAG architectures (HippoRAG, GraphRAG), and practice cutting-edge concepts such as multimodal LLMs and ultra-long context processing. Finally, through team projects, students integrate learned content to implement complete models and applications that solve real problems.

This course is designed for third-year undergraduate level and assumes completion of prerequisite course Language Models and Natural Language Processing (131107967A). Through team projects, students challenge real problem-solving using Korean corpora, and in the final project phase, we provide opportunities to work with industry datasets and receive feedback from industry experts, considering industry-academia collaboration.

Learning Objectives#

  • Understand the role and limitations of large language models in modern NLP and utilize related tools such as PyTorch and Hugging Face.

  • Understand the principles and trade-offs of State Space Models (e.g., Mamba, RWKV) along with latest architectures.

  • Apply fine-tuning to pre-trained models or latest parameter-efficient fine-tuning methods like WaveFT, DoRA, VB-LoRA.

  • Learn methods to systematically optimize prompts using prompt engineering techniques and DSPy framework.

  • Understand the evolution of evaluation metrics (e.g., G-Eval, LiveCodeBench, etc.) and the importance of human evaluation, and learn latest alternatives to RLHF such as DPO (Direct Preference Optimization).

  • Design and implement advanced RAG (Retrieval-Augmented Generation) architectures like HippoRAG, GraphRAG and hybrid search strategies.

  • Understand AI regulatory frameworks like EU AI Act and acquire methodologies for implementing responsible AI systems.

  • Track latest research trends to discuss multimodal LLMs, small language models (SLM), state space models (SSM), multi-agent systems, mixture of experts (MoE), and other diverse latest technologies.

  • Understand the characteristics and challenges of Korean NLP and develop application capabilities through hands-on practice using Korean corpora.

  • Strengthen collaboration and practical problem-solving capabilities through team projects and gain project experience connected to industry.

Course Schedule#

Week

Main Topics and Keywords

Key Hands-on/Assignments

1

Transformer and Next-Generation Architectures
• Self-Attention Mechanism and Limitations
• Mamba (SSM), RWKV, Jamba

Transformer Component Implementation
Mamba vs Transformer Performance Comparison
Architecture Complexity Analysis

2

PyTorch 2.x and Latest Deep Learning Frameworks
• torch.compile Compiler Revolution
• FlashAttention-3 Hardware Acceleration
• AI Agent Frameworks

torch.compile Performance Optimization
FlashAttention-3 Implementation
AI Agent Framework Comparison

3

Modern PEFT Techniques for Efficient Fine-tuning
• LoRA, DoRA, QLoRA
• Advanced PEFT Techniques

PEFT Method Comparison Experiment
LoRA/DoRA/QLoRA Performance Evaluation
Memory Efficiency Analysis

4

Advanced Prompt Techniques and Optimization
• Prompt Engineering Fundamentals
• Self-Consistency, Tree of Thoughts
• DSPy Framework

DSPy-based Automatic Prompt Optimization
Self-Consistency Implementation
Tree of Thoughts Problem Solving

5

LLM Evaluation Paradigms and Benchmarks
• Evaluation Paradigm Evolution
• LLM-as-a-Judge (GPTScore, G-Eval, FLASK)
• Specialized and Domain-specific Benchmarks

G-Eval Implementation
Benchmark Comparison Experiment
Evaluation Bias Analysis

6

Multimodal NLP Advancements
• Vision-Language Models (LLaVA, MiniGPT-4, Qwen-2.5-Omni)
• Visual Reasoning (QVQ-Max)
• Speech Integration

Multimodal QA Application Development
Vision-Language Model Comparison
End-to-end Multimodal System

7

Ultra-Long Context Processing and Efficient Inference
• Context Window Revolution (1M+ tokens)
• Attention Mechanism Optimization
• LongRoPE and RAG Integration

FlashAttention-3 Integration
Long Context Processing Comparison
Performance Analysis

8

Core Review and Latest Trends
• Architecture Review
• Latest Model Trends (GPT-5, Gemini 2.5 Pro, Claude 4.1)
• Industry Applications

Comprehensive Review
Model Comparison
Industry Case Analysis

9

Advanced RAG Systems – HippoRAG, GraphRAG, Hybrid Search Strategies

Assignment 3: Building Korean Enterprise Search System based on GraphRAG

10

Innovation in Alignment Techniques – DPO, Constitutional AI, Process Reward Models

Comparison Practice between DPO and Existing RLHF Techniques

11

Production Agent Systems – CrewAI, Mirascope, Type-Safety Development

Multi-agent Orchestration Implementation

12

AI Regulation and Responsible AI – EU AI Act, Differential Privacy, Federated Learning

Assignment for Designing Regulation-Compliant AI Systems

13

Ontology and AI – Modeling Reality and Operating it with AI
• Data Science to Decision Science
• Semantic Ontology, GraphRAG
• Kinetic Ontology, Closed-Loop Systems

Semantic Ontology Modeling
GraphRAG Implementation
Closed-Loop Simulation

14

Final Project Development and MLOps

Team Prototype Implementation and Feedback Sessions (Industry Mentor Participation)

15

Final Project Presentations and Comprehensive Evaluation

Team Presentations, Course Content Summary and Future Prospects Discussion

Table of Contents#