Skip to content
View pjlintw's full-sized avatar
🦦
🦦

Block or report pjlintw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pjlintw/README.md

👋 Hi there! I'm Pin-Jie, and I'm a PhD student at the LLMs Lab at Virginia Tech.
Personal Weibsite / X / Google Scholar 

Research

My research focuses on building capable LLMs and agent systems with less data and minimal supervision, through synthetic data, fine-tuning transfer, and PEFT. I explore methods that improve the efficiency of model updating (e.g., SFT, DPO, RLVR) while effectively blending diverse skills, including safety, tool use, coding, multilingual, and instruction-following. My overarching goal is to build modular, reusable systems in which the intelligent system continually learns and evolves over time. Currently, it covers the areas:

  • Efficient and continual model development. I explore strategies to make LLM updates faster and more reusable. In particular, I study how alignment capabilities can be transferred through model weights, enabling reuse across distinct model versions and eliciting the CoT behavior in latent space (e.g., fine-tuning transfer with diff vectors; Master Key Hypothesis).

  • Continually evolving agents. Building LLM agents that evolve skills and effectively extract useful signals from raw trajectories for long-horizon tasks.

  • Model harnessing. Designing systems that effectively orchestrate LLM agents with tools, memory, and feedback to improve system reasoning.

Pinned Loading

  1. finetuning-transfer finetuning-transfer Public

    Efficient Model Development through Fine-tuning Transfer

    Python 9 1

  2. Meta-BERT Meta-BERT Public

    Meta-BERT: Learning to Learn fast For Low-Resource Text Classification

    Python 19 4

  3. NNLM NNLM Public

    Implementation of "A Neural Probabilistic Language Model" by Yoshua Bengio et al. - Tensorflow

    Python 11 1

  4. integrated-gradients integrated-gradients Public

    Implementation for Conditional Text GANs and Analysis with Integrated Gradients

    HTML

  5. intermediate-task-selection intermediate-task-selection Public

    Official Code for "Exploring Task Selection For Intermediate Task Transfer Learning"

    Python

  6. minigrad minigrad Public

    Jupyter Notebook