Tag: english | Yingfa Chen

(EREN) Robust and Scalable Model Editing for Large Language Models

2024-03-14

525 words, 3 min

Paper

llm research ai paper english arxiv knowledge emnlp model editing in-context-learning serac rome eren mend

Interpreting a Maze-Solving Network

I can't believe I haven't read this until now. This is mind-provoking, and the result is an important step towards understanding neural networks.

2023-10-07

2024-01-11

56 words, 1 min

Thoughts

llm english representation-engineering activation-engineering interpretability rl alignment maze

Activation Addition (ActAdd)

Paper

TLDR: Propose ActAdd, a method for controlling model behavior during inference by modifying activations with a bias term that is learned from a pair of prompt.

Summary:

Propose ActAdd, a method for controlling model behavior by modifying activations at inference time.
Steering vectors are computed by taking the activation differences that result from pairs of prompts. The vectors are added as bias during inference.
ActAdd provides control over high-level properties of the output, and preserves off-target model performance, and requires little computational and implementational costs.

2023-10-07

2024-01-11

709 words, 4 min

Paper Note

llm english ai-alignment gpt activation-modification adaptation model-editing representation-engineering fine-tuning parameter-efficient-tuning

Safety and Ethical Concerns of Large Language Models

I will be holding a seminar at ModelBest (面壁智能) in Sep 20, 2023 in Beijing, Haidian, 科技园. The seminar will be in Chinese, and it's called "大模型安全与伦理问题" (translation: Safety and Ethical Concerns of Large Language Models). Below is a list of references.

2023-09-19

2024-01-10

635 words, 3 min

Thoughts

llm research english machine-learning life ai-alignment eren ethics safety 面壁智能 modelbest tutorial chatgpt claude 三体

CFDBench: A Comprehensive Benchmark for Machine Learning Methods in Fluid Dynamics

Code | Paper (on hold by ArXiv) | Paper (preprints.org) | 知乎

I did this work with my girlfriend, whose research direction is computational fluid dynamics (CFD). We observed that there are numerous research works in applying deep learning (DL) to solve CFD problems. E.g., Pangu-Weather have shown that DL methods can not only be more accurate than the best numerical methods, but can also be multiple magnitudes faster.

2023-09-16

2024-01-11

312 words, 1 min

Research

research paper cfd dataset 00 english pinn fno physics machine-learning deep-learning deeponet ai4science

Some Binary Search

2023-09-14

2024-01-11

157 words, 1 min

Test

english algorithm binary-search rust python c++ test code