Paper Released: MusiXQA

MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models. I’m excited to share our new paper, MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language M...

Jun 26, 2025 Research, Multimodal Large Language Models

Berklee Online Certificate

I’ve earned my Songwriting certificate from Berklee Online via Coursera.

Apr 30, 2025 Music, Education

Ayaka (绫华) Assist is Online!!!

I’ve deployed a chatbot that imitates the personality of Kamisato Ayaka from the game Genshin Impact. You can chat with her below and in the sidebar to the left. Give it a try! She would be delighted...

Apr 18, 2025 Social, News

I am a Doctor!!!

Thrilled to share that I’ve successfully passed my dissertation defense! Update on 07/02/2025: I have received my official diploma!!! Here is the official certified PDF: And perm download link...

Mar 23, 2025 Social, News

Paper Released: SV-RAG

SV-RAG: LoRA-Contextualizing Adaptation of MLLMs for Long Document Understanding. My Adobe internship work has been accepted as a conference paper at ICLR 2025: “SV-RAG: LoRA-Contextualizing Adaptat...

Jan 21, 2025 Research, Multimodal Large Language Models

Spark: Bird Flock Simulation

I am a TA for CSE-587: Data Intensive Computing in the Fall 2024 semester. While preparing assignment problems, I built a bird flock simulation using PySpark.

Oct 28, 2024 Code, plot

Paper Released: TextLap

Our paper TextLap: Customizing Language Models for Text-to-Layout Planning has been accepted to EMNLP 2024 and is available on arXiv.

Oct 10, 2024 Research, spatial planning

E-Attending ISMIR 2024

I’m exploring AI for music as a hobby and will be attending the 25th International Society for Music Information Retrieval Conference (ISMIR 2024) remotely to gain insights into the latest research trends and em...

Sep 25, 2024 Social, Event

Paper Released: MMR Benchmark

We built the MMR (Multi-Modal Reading) Benchmark for evaluating the reading ability of large multimodal models. The MMR Benchmark paper and code are released and currently available on arXiv.

Aug 28, 2024 Research, Multimodal Large Language Models

arXiv with aux file

For some unknown reason, LaTeX occasionally fails to find citations even when they are explicitly listed using \bibitem. This issue can often be resolved by compiling the document twice locally: th...

Jul 31, 2024 Code, latex
