smart glasses reading list
Human-inspired Perspectives: A Survey on AI Long-term Memory
Paper
• 2411.00489
• Published
• 1
Multimodal Fusion with LLMs for Engagement Prediction in Natural Conversation
Paper
• 2409.09135
• Published
• 2
Reading Recognition in the Wild
Paper
• 2505.24848
• Published
• 1
EgoLife: Towards Egocentric Life Assistant
Paper
• 2503.03803
• Published
• 46
AIMI: Leveraging Future Knowledge and Personalization in Sparse Event Forecasting for Treatment Adherence
Paper
• 2503.16091
• Published
• 1
LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval
Paper
• 2505.15269
• Published
• 1
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
Paper
• 2504.16030
• Published
• 36
Cooperative Face Liveness Detection from Optical Flow
Paper
• 2508.10786
• Published
CLIPC8: Face liveness detection algorithm based on image-text pairs and contrastive learning
Paper
• 2311.17583
• Published
Camera-Driven Representation Learning for Unsupervised Domain Adaptive Person Re-identification
Paper
• 2308.11901
• Published
LiveStar: Live Streaming Assistant for Real-World Online Video Understanding
Paper
• 2511.05299
• Published
• 2
YOLO-World: Real-Time Open-Vocabulary Object Detection
Paper
• 2401.17270
• Published
• 43
YOLO-TS: Real-Time Traffic Sign Detection with Enhanced Accuracy Using Optimized Receptive Fields and Anchor-Free Fusion
Paper
• 2410.17144
• Published
YOLOE: Real-Time Seeing Anything
Paper
• 2503.07465
• Published
• 16
MediaPipe Hands: On-device Real-time Hand Tracking
Paper
• 2006.10214
• Published
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
Paper
• 2407.05712
• Published
Sharing emotions at scale: The Vent dataset
Paper
• 1901.04856
• Published
Natural Language Processing for Cognitive Analysis of Emotions
Paper
• 2210.05296
• Published
• 1
How you feelin'? Learning Emotions and Mental States in Movie Scenes
Paper
• 2304.05634
• Published
A Brain Wave Encodes a Thousand Tokens: Modeling Inter-Cortical Neural Interactions for Effective EEG-based Emotion Recognition
Paper
• 2511.13954
• Published
• 5
EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition
Paper
• 2505.20033
• Published
• 4
Beyond Emotion Recognition: A Multi-Turn Multimodal Emotion Understanding and Reasoning Benchmark
Paper
• 2508.16859
• Published
"Only ChatGPT gets me": An Empirical Analysis of GPT versus other Large Language Models for Emotion Detection in Text
Paper
• 2503.04831
• Published
• 1
OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition
Paper
• 2410.01495
• Published
Don't Judge Before You CLIP: A Unified Approach for Perceptual Tasks
Paper
• 2503.13260
• Published
• 2
Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health Biomarkers Estimation
Paper
• 2508.17924
• Published
• 14
R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate
Paper
• 2410.15851
• Published
rPPG-Toolbox: Deep Remote PPG Toolbox
Paper
• 2210.00716
• Published
RPGBENCH: Evaluating Large Language Models as Role-Playing Game Engines
Paper
• 2502.00595
• Published
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Paper
• 2505.02847
• Published
• 29
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI
Paper
• 2205.14727
• Published
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems
Paper
• 2505.18943
• Published
• 25
VIBE: Can a VLM Read the Room?
Paper
• 2506.11162
• Published
BlazePose: On-device Real-time Body Pose tracking
Paper
• 2006.10204
• Published
QBitOpt: Fast and Accurate Bitwidth Reallocation during Training
Paper
• 2307.04535
• Published
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
Paper
• 2409.08513
• Published
• 14
FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space
Paper
• 2405.01828
• Published
• 1
QuickSRNet: Plain Single-Image Super-Resolution Architecture for Faster Inference on Mobile Platforms
Paper
• 2303.04336
• Published
Real-Time Neural Light Field on Mobile Devices
Paper
• 2212.08057
• Published
ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions
Paper
• 2505.14668
• Published
Computational Life: How Well-formed, Self-replicating Programs Emerge from Simple Interaction
Paper
• 2406.19108
• Published
Synheart Emotion: Privacy-Preserving On-Device Emotion Recognition from Biosignals
Paper
• 2511.06231
• Published
• 1
EgoPet: Egomotion and Interaction Data from an Animal's Perspective
Paper
• 2404.09991
• Published
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions
Paper
• 2510.08211
• Published
• 22
Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos
Paper
• 2403.06351
• Published
SELF-PERCEPT: Introspection Improves Large Language Models' Detection of Multi-Person Mental Manipulation in Conversations
Paper
• 2505.20679
• Published
LALM: Long-Term Action Anticipation with Language Models
Paper
• 2311.17944
• Published
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
Paper
• 2509.08494
• Published
• 3
EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild
Paper
• 2502.14892
• Published
• 6
Decision-Oriented Dialogue for Human-AI Collaboration
Paper
• 2305.20076
• Published
AI for Service: Proactive Assistance with AI Glasses
Paper
• 2510.14359
• Published
• 77
COPILOT: Human-Environment Collision Prediction and Localization from Egocentric Videos
Paper
• 2210.01781
• Published
TeleEgo: Benchmarking Egocentric AI Assistants in the Wild
Paper
• 2510.23981
• Published
Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance
Paper
• 2512.13238
• Published
• 1
In the Eye of MLLM: Benchmarking Egocentric Video Intent Understanding with Gaze-Guided Prompting
Paper
• 2509.07447
• Published
• 1
Proactive Hearing Assistants that Isolate Egocentric Conversations
Paper
• 2511.11473
• Published
• 8
AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding
Paper
• 2406.13807
• Published
LifelongMemory: Leveraging LLMs for Answering Queries in Egocentric Videos
Paper
• 2312.05269
• Published
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos
Paper
• 2506.05904
• Published
• 2
Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views
Paper
• 2010.01191
• Published
EgoM2P: Egocentric Multimodal Multitask Pretraining
Paper
• 2506.07886
• Published
• 1
EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT
Paper
• 2510.23569
• Published
• 3
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Paper
• 2412.21080
• Published
MM-Ego: Towards Building Egocentric Multimodal LLMs
Paper
• 2410.07177
• Published
• 22
Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation
Paper
• 2305.03907
• Published
• 1
Project Aria: A New Tool for Egocentric Multi-Modal AI Research
Paper
• 2308.13561
• Published
EgoMe: Follow Me via Egocentric View in Real World
Paper
• 2501.19061
• Published
Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective
Paper
• 2410.06195
• Published
State Your Intention to Steer Your Attention: An AI Assistant for Intentional Digital Living
Paper
• 2510.14513
• Published
Mixed-Session Conversation with Egocentric Memory
Paper
• 2410.02503
• Published
• 8
Multi-Advisor Reinforcement Learning
Paper
• 1704.00756
• Published
• 1
EgoPrivacy: What Your First-Person Camera Says About You?
Paper
• 2506.12258
• Published
• 3
Can Vision-Language Models Think from a First-Person Perspective?
Paper
• 2311.15596
• Published
• 3
Multimodal Distillation for Egocentric Action Recognition
Paper
• 2307.07483
• Published
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
Paper
• 2502.07411
• Published
AssistantX: An LLM-Powered Proactive Assistant in Collaborative Human-Populated Environment
Paper
• 2409.17655
• Published
EgoVLM: Policy Optimization for Egocentric Video Understanding
Paper
• 2506.03097
• Published
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding
Paper
• 2501.00358
• Published
Aligning VLM Assistants with Personalized Situated Cognition
Paper
• 2506.00930
• Published
• 2
ProPerSim: Developing Proactive and Personalized AI Assistants through User-Assistant Simulation
Paper
• 2509.21730
• Published
HAPRec: Hybrid Activity and Plan Recognizer
Paper
• 2004.13482
• Published
SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World
Paper
• 2412.07472
• Published
Game-theoretic LLM: Agent Workflow for Negotiation Games
Paper
• 2411.05990
• Published
• 8
GRIM: GRaph-based Interactive narrative visualization for gaMes
Paper
• 2311.09213
• Published
• 13
GTAlign: Game-Theoretic Alignment of LLM Assistants for Mutual Welfare
Paper
• 2510.08872
• Published
• 4
Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents
Paper
• 2510.23691
• Published
• 54
Game Theory with Simulation in the Presence of Unpredictable Randomisation
Paper
• 2410.14311
• Published
A Survey on Large Language Model-Based Game Agents
Paper
• 2404.02039
• Published
Persuasion for Good: Towards a Personalized Persuasive Dialogue System for Social Good
Paper
• 1906.06725
• Published
• 1
Make an Offer They Can't Refuse: Grounding Bayesian Persuasion in Real-World Dialogues without Pre-Commitment
Paper
• 2510.13387
• Published
Persuasion at Play: Understanding Misinformation Dynamics in Demographic-Aware Human-LLM Interactions
Paper
• 2503.02038
• Published
Monopoly Deal: A Benchmark Environment for Bounded One-Sided Response Games
Paper
• 2510.25080
• Published
• 2
Context versus Prior Knowledge in Language Models
Paper
• 2404.04633
• Published
• 5
Sotopia-RL: Reward Design for Social Intelligence
Paper
• 2508.03905
• Published
• 23
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning
Paper
• 2305.13660
• Published
The Persuasive Power of Large Language Models
Paper
• 2312.15523
• Published
PRINCIPLES: Synthetic Strategy Memory for Proactive Dialogue Agents
Paper
• 2509.17459
• Published
ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind
Paper
• 2505.22961
• Published
• 8
Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication
Paper
• 2502.08896
• Published
Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation
Paper
• 2305.10361
• Published
• 1
Language of Persuasion and Misrepresentation in Business Communication: A Textual Detection Approach
Paper
• 2508.09935
• Published
Persuasion Should be Double-Blind: A Multi-Domain Dialogue Dataset With Faithfulness Based on Causal Theory of Mind
Paper
• 2502.21297
• Published