SFT
IntermediateFine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.
AdvertisementAd space — term-top
Definition
Full Definition
Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.