Zzong's Notes

❯

❯

❯

reading list

2026년 6월 14일2 min read

LLM

2401.02412 LLM Augmented LLMs: Expanding Capabilities through Composition
- Related Reddit: Expanding Capabilities through Composition (CALM) : r/LocalLLaMA
2312.15166 SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
GitHub - pratyushasharma/laser: The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
- Simultaneously Enhance Performance and Reduce LLM Size with no Additional Training - LASER by Microsoft : r/LocalLLaMA
mixtral: 2401.04088\ Mixtral of Experts
Phi model is free ! microsoft/phi-2 · Hugging Face
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
GitHub - pratyushasharma/laser: The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Paper page - LLaMA Beyond English: An Empirical Study on Language Capability Transfer
GitHub - microsoft/TransformerCompression: For releasing code related to compression methods for transformers, accompanying our publications
Evolving New Foundation Models: Unleashing the Power of Automating Model Development

A.1) QA

Paper page - ChatQA: Building GPT-4 Level Conversational QA Models

A.2) MoE

Paper page - Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM
Paper page - Soaring from 4K to 400K: Extending LLM’s Context with Activation Beacon
Mixture of Experts Explained
mlabonne/phixtral-4x2_8 · Hugging Face
Mixture of Experts for Clowns (at a Circus)

B) RAG

대규모 언어 모델을 위한 검색-증강 생성(RAG) 기술 현황 - 2/2편 - 읽을거리&정보공유 - 파이토치 한국 사용자 모임
Long-Context Retrieval Models with Monarch Mixer · Hazy Research
RAFT: RAG 기법을 활용한 LLM 검색 증강형 미세조정(RAG + FineTuning) - 읽을거리&정보공유 - 파이토치 한국 사용자 모임

C) Code

skypilot/llm/codellama at master · skypilot-org/skypilot · GitHub

D) Benchmark

EQ-Bench Leaderboard

E) LLM Task

[2402.10171] Data Engineering for Scaling Language Models to 128K Context

F) CPU

LLaMA Now Goes Faster on CPUs

G) Related

H) References

Daily Papers - Hugging Face

함께 보면 좋은 글

paper review

논문의 퀄리티 측정법 뻔하지 않은 결과에는 합리적이고 디테일한 설명이 필요하다. novelty 가 부족하면 안된다. 기존 방식이 가진 이슈를 제기하고, 이를 해결할 수 있는 방안을 제시해야한다. C) References 2401.02412.pdf .

BEQUE - Large Language Model based Long-tail Query Rewriting in Taobao Search

한줄 요약 LLM 기반 3-stage fine-tuning으로 Long-tail Query의 semantic gap을 해결하여 Taobao 검색에서 GMV +0.4%, few-recall query GMV +18.66% 달성.

SIRIP

SIRIP: SIGIR Symposium on IR in Practice (Industry Track) Z.1) SIRIP (Industry Track) 개요 행사 정보 48회 SIGIR 컨퍼런스 (2025년 7월 13일–18일, 이탈리아 파두아 개최) 내에서 진행된 Industry Track입니다.

PASS-GLM

polynomial approximate sufficient statistics for scalable Bayesian GLM inference paper link Abstract GLM 학습을 위한 새로운 approach 를 제안.

SIGIR 2025

Accepted Papers SIGIR 2025, Padua, 13-18 July | Accepted Papers B) Bloomberg의 AI 정보 검색 연구 발표 (SIGIR 2025) Bloomberg’s AI Engineers Publish 3 Information Retrieval Research...

Making contextual decisions with low technical debt

paper link: arxiv.org/abs/1606.03966 Contextual Bandit pipeline 을 구성할 때 어떻게 하면 기술 부채가 적어지는지에 대한 내용을 다루는 것 같음.

MIPO

해당 논문은 대규모 언어 모델(LLM)을 인간의 선호도에 맞게 미세 조정하는 DPO(Direct Preference Optimization) 방식의 한계를 지적하고, 이를 개선한 MIPO(Modulated Intervention Preference Optimization) 라는 새로운 방법을 제안합니다.

LLaVA

핵심 요약 LLaVA (Large Language and Vision Assistant): Vision Encoder + LLM을 연결하여 이미지를 이해하고 자연어로 대화할 수 있는 멀티모달 LLM.

Self-Rewarding Language Models

Self-Rewarding Language Models Self-Rewarding Language Models, where the language model itself is used via LLM-as-a-Judge prompting to provide its own rewards during training.

Do not Stop Pretraining - Adapt Language Models to Domains and Tasks

Do not Stop Pretraining - Adapt Language Models to Domains and Tasks B) Related C) References LangCon 2023 - 특정 도메인에 맞는 언어모델은 어떻게 만들까? .

LLM
A.1) QA
A.2) MoE
B) RAG
C) Code
D) Benchmark
E) LLM Task
F) CPU
G) Related
H) References