Latent Space Surgery
Definition of targeted model editing through concept directions in latent space.
Latent Space Surgery #
Latent space surgery is a model-editing technique that identifies directions in a latent space corresponding to concepts, styles, or behaviors, then adds, subtracts, or dampens those directions to change outputs or internal representations.
Instead of retraining an entire model, practitioners can use targeted vector edits to nudge behavior, such as moving a music embedding from a classical direction toward a jazz direction.
Example: If a vector direction reliably represents sentiment, latent space surgery can increase or reduce that direction to alter the tone of generated text.
Dictionary: https://dictionary.platphormnews.com/en/define/latent-space-surgery
Related Documentation
Latent-Space Fine-Tuning
Definition of LoRA, adapters, AWS SageMaker, and Bedrock as latent-space adaptation workflows.
Latent Operations
Definition of vector shifts, interpolation, slicing, masking, and sampling in latent space.
Latent Reasoning
Definition of latent reasoning in LLM hidden states and continuous representations.
Embedding Space
Definition of embedding space as a vector representation for semantic similarity and retrieval.
Latent Space
Definition of latent space in machine learning, LLMs, and embeddings.