Julio's dev
🏷️ Tags
πŸ’» Profile
profile_image
Hoyoun Jung (Julio)
AI researcher
If you can measure it, you can improve it.
πŸ”Ž Search
🏷️ Tags
πŸ“‚ All Posts
😎 Daily

Deepseek-v3: 671B MoE λͺ¨λΈ, μš°λ¦¬κ°€ λ”°λΌμž‘μ„ 수 μžˆμ„κΉŒ?

Jan 31, 2025

NLP
πŸ“„ Paper

[Snapshot] 2 OLMo 2 Furious (`24.12)

Jan 9, 2025

NLP
πŸ“„ Paper

[Snapshot] Byte Latent Transformer: Patches Scale Better Than Tokens(`24.12)

Jan 5, 2025

NLP
πŸ“„ Paper

[Snapshot] Liquid: Language Models are Scalable Multi-modal Generators (`24.12)

Dec 30, 2024

NLP
CV
πŸ“„ Paper

[Snapshot] Nemotron-CC: Transforming Common Crawl into a Refined Long-Horizon Pretraining Dataset (`24.12)

Dec 30, 2024

NLP
πŸ“„ Paper

[Snapshot] RedStone: Curating General, Code, Math, and QA Data for Large Language Models (`24.12)

Dec 30, 2024

NLP
πŸ“„ Paper

[Snapshot] Densing Law of LLMs (`24. 12)

Dec 30, 2024

NLP
πŸ“„ Paper

[Snapshot] Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement (`24.12)

Dec 30, 2024

NLP
πŸ“„ Paper

[Snapshot] Training Bilingual LMs with Data Constraints in the Targeted Language (`24.11)

Dec 30, 2024

NLP
πŸ“„ Paper

[Snapshot] granite-3.0-language-models (`24.10)

Dec 30, 2024

NLP
πŸ“„ Paper

[Snapshot] TxT360: A Top-Quality LLM Pre-training Dataset Requires the Perfect Blend (`24.10)

Dec 30, 2024

NLP
πŸ“„ Paper

[Snapshot] Qwen2.5 Technical Report (`24.12)

Dec 30, 2024

NLP
πŸ“„ Paper

[Review] DeepSeek-V3 Technical Report (`24.12)

Dec 30, 2024

NLP
πŸ“„ Paper

[Review] Analysing The Impact of Sequence Composition on Language Model Pre-Training (`24. 2)

Sep 5, 2024

NLP
πŸ“„ Paper

[Snapshot] Specific versus General Principles for Constitutional AI (`23. 10)

Aug 30, 2024

NLP
πŸ“„ Paper

[Snapshot] Does your data spark joy? Performance gains from domain upsampling at the end of training (`24.06)

Aug 30, 2024

NLP
πŸ“„ Paper

[Snapshot] Instruction-tuned Language Models are Better Knowledge Learners(`24.02)

Aug 30, 2024

NLP
πŸ“„ Paper

[Snapshot] Building and better understanding vision-language models: insights and future directions (`24.08)

Aug 29, 2024

NLP
CV
πŸ“„ Paper

[Review] LLM Pruning and Distillation in Practice: The Minitron Approach (`24.08)

Aug 25, 2024

NLP
πŸ“„ Paper

[Snapshot] Multi-modal preference alignment remedies regression of visual instruction tuning on language model (`24.02)

Aug 21, 2024

NLP
CV
πŸ“„ Paper

[Snapshot] HELPSTEER: Multi-attribute Helpfulness Dataset for STEERLM(`23.11)

Aug 19, 2024

NLP
πŸ“„ Paper

[Snapshot] SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF( `23.10)

Aug 19, 2024

NLP
πŸ“„ Paper

[Snapshot] Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models(`24.08)

Aug 16, 2024

NLP
πŸ“„ Paper

[Review] Concept-skill Transferability-based Data Selection for Large Vision-Language Models(`24.06)

Jul 13, 2024

NLP
CV
πŸ“„ Paper

[Review] Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection(`24.02)

Jul 13, 2024

NLP
CV
πŸ“„ Paper

[Snapshot] InFoBench: Evaluating Instruction Following Ability in Large Language Models(`24.01)

Jun 11, 2024

NLP
πŸ“„ Paper

[Review] Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance(`24.03)

Jun 11, 2024

NLP
πŸ“„ Paper

[Review] DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining(`23.05)

Jun 11, 2024

NLP
πŸ“„ Paper

[Review] Data Mixing Made Efficient- A Bivariate Scaling Law for Language Model Pretraining(`24.05)

Jun 11, 2024

NLP
πŸ“„ Paper

[Snapshot] Rethinking Overlooked Aspects in Vision-Language Models (`24.05)

May 24, 2024

NLP
CV
πŸ“„ Paper

[Review] Open Vocabulary Obejct Detection (OwlVit)

Apr 26, 2024

CV
πŸ“„ Paper

[Snapshot] Self-Supervised Visual Preference Alignment (`24. 04)

Apr 22, 2024

NLP
CV
πŸ“„ Paper

[Snapshot] Towards Multimodal In-Context Learning for Vision & Language Models (`24. 04)

Apr 19, 2024

NLP
CV
πŸ“„ Paper

[Review] Efficiently Programming Large Language Models using SGLang (`23. 12)

Apr 15, 2024

NLP
πŸ“„ Paper

[Snapshot] Tree of Thoughts: Deliberate Problem Solving with Large Language Models (`23. 12)

Apr 15, 2024

NLP
πŸ“„ Paper

[Snapshot] Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models (`24. 03)

Mar 31, 2024

NLP
CV
πŸ“„ Paper

[Review] Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings (`24. 03)

Mar 29, 2024

NLP
πŸ“„ Paper

[Review] ROFORMER: ENHANCED TRANSFORMER WITH ROTARY POSITION EMBEDDING (21. 04)

Mar 22, 2024

NLP
πŸ“„ Paper

[Review] Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models (`23. 05)

Mar 11, 2024

NLP
CV
πŸ“„ Paper

[Snapshot] Balancing Enhancement, Harmlessness, and General Capabilities- Enhancing Conversational LLMs with Direct RLHF (`24. 03)

Mar 7, 2024

NLP
πŸ“„ Paper

[Snapshot] CLoVe- Encoding Compositional Language in Contrastive Vision-Language Models (`24. 02)

Mar 5, 2024

NLP
CV
πŸ“„ Paper

[Snapshot] The False Promise of Imitating Proprietary LLMs (`23. 05)

Mar 5, 2024

NLP
πŸ“„ Paper

[Snapshot] SUGARCREPE- Fixing Hackable Benchmarks for Vision-Language Compositionality (`23. 06)

Mar 4, 2024

NLP
CV
πŸ“„ Paper

[Snapshot] Learning or Self-aligning-Rethinking Instruction Fine-tuning (`24. 3)

Mar 2, 2024

NLP
πŸ“„ Paper

[Review] ShareGPT4V - Improving Large Multi-Modal Models with Better Captions (`23. 11)

Feb 29, 2024

NLP
CV
πŸ“„ Paper

[Review] Patching open-vocabulary models by interpolating weights (`22. 10)

Feb 14, 2024

CV
πŸ“„ Paper

[Review] Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond (`23. 8)

Feb 2, 2024

NLP
CV
πŸ’» Projects
PromptCompressor: Enjoy 25% Savings on GPT API Without Losing Quality!

PromptCompressor: Enjoy 25% Savings on GPT API Without Losing Quality!

Oct 10, 2023

Web service that offers a cost-effective way to use language models like ChatGPT.

NLP
πŸ“„ Paper
Discrete Prompt Compression with Reinforcement Learning

Discrete Prompt Compression with Reinforcement Learning

Sep 21, 2023

A paper on the methodology to compress the length of a prompt using RL

RL
NLP
πŸ’» Projects
Kaggle: Lux AI Season2 Competition Review

Kaggle: Lux AI Season2 Competition Review

May 5, 2023

Review of the Kaggle Lux Season2 competition

Competition
πŸ’» Projects
Game AI Summer School 2022 Review (2) - Game Jam

Game AI Summer School 2022 Review (2) - Game Jam

Sep 16, 2022

Experience at Game AI School: Part 2. Game AI Jam

Competition
😎 Daily
Game AI Summer School 2022 Review (1) - Lectures

Game AI Summer School 2022 Review (1) - Lectures

Sep 6, 2022

Experience at Game AI School: Part 1. Lecture

Daily
πŸ’» Projects
Kaggle: Kore2022 Competition Review

Kaggle: Kore2022 Competition Review

Aug 26, 2022

Review of the Kaggle Kore 2022 competition

Competition
πŸ’» Profile
Hoyoun Jung (Julio)
AI researcher
If you can measure it, you can improve it.
🌟 Service
πŸ’¬ Contact
github
instagram
email
linkedin