Configuration Parsing Warning:In adapter_config.json: "peft.task_type" must be a string

stat214-lab3-bert-lora-r4-maxlen256

LoRA adapter for bert-base-uncased, fine-tuned on transcripts from the Huth Lab fMRI story-listening dataset for the Stat 214 (Spring 2026) final project at UC Berkeley.

The adapter is used to extract context-aware word embeddings that are then fed into a per-voxel ridge regression to predict whole-brain BOLD signal from spoken-story stimuli.

Configuration

Hyperparameter Value
Base model bert-base-uncased
LoRA rank r 4
LoRA alpha 8
LoRA dropout 0.1
Target modules query, value
Training objective Masked Language Modeling (MLM, 15%)
Training stories 86 (Huth Lab podcast transcripts)
MLM max sequence length 256
Epochs 3
Optimizer AdamW, lr=2e-4
Batch size 16
Final MLM training loss

Encoding-model performance

After extracting per-word embeddings from this adapter (using ±10 word context windows + Lanczos downsampling + 4 TR delays) and fitting per-voxel ridge regression on Subjects 2 and 3:

Subject Mean CC Top 5% CC Top 1% CC Top-1 voxel
Subject 2 0.0643 0.2143 0.2906 0.4736
Subject 3 0.0660 0.2176 0.3043 0.5159

(See full project repository for ridge weights, evaluation code, and SHAP / LIME word-importance analyses.)

Loading the adapter

from transformers import BertForMaskedLM, BertTokenizerFast
from peft import PeftModel

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
base = BertForMaskedLM.from_pretrained("bert-base-uncased")
model = PeftModel.from_pretrained(base, "RheaTinghe/stat214-lab3-bert-lora-r4-maxlen256")
model.eval()

# Extract per-word embeddings via ±10 word context windows
# (see scripts/run_bert_pretrained.py in the project repo for the
# complete extraction pipeline)

Citation

@misc{stat214lab3,
  author = {Galloro, Drew and Wang, Ruihang and Khothsombath, Benjamin and Zhang, Rhea},
  title  = {Stat 214 Lab 3: BERT-LoRA encoding model for fMRI},
  year   = {2026},
  note   = {UC Berkeley Spring 2026},
}
Downloads last month
16
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for RheaTinghe/stat214-lab3-bert-lora-r4-maxlen256

Adapter
(131)
this model