WAON Collection WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models • 4 items • Updated Mar 2 • 3
Marco-MoE Collection A suite of multilingual MoE models with highly-sparse architectures • 5 items • Updated 7 days ago • 14
Article Welcome Gemma 4: Frontier multimodal intelligence on device • 13 days ago • 846
Sarashina2.2 Collection Large Language Models developed by SB Intuitions. Pretrained and instruction-tuned models are available in three sizes: 0.5B, 1B, and 3B. • 6 items • Updated Mar 5, 2025 • 9
Constructing Synthetic Instruction Datasets for Improving Reasoning in Domain-Specific LLMs: A Case Study in the Japanese Financial Domain Paper • 2603.01353 • Published Mar 2 • 3
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 27 days ago • 66
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing Paper • 2603.09877 • Published Mar 10 • 48
DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering Paper • 2512.00773 • Published Nov 30, 2025 • 1
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model Paper • 2602.21818 • Published Feb 25 • 55
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 11 days ago • 146