What is transfer learning and how does it apply to NLP models?
Transfer learning applies pre-trained model knowledge (general language understanding) to new tasks, requiring less data and compute than training from scratch.
Why this answer
Transfer learning in NLP involves taking a large pre-trained model (like BERT or GPT) that has been trained on a massive corpus and fine-tuning it on a smaller, task-specific dataset. This approach dramatically reduces the amount of labeled data and training time needed, while leveraging the linguistic knowledge already captured by the base model. In Azure, services like Azure Cognitive Service for Language use transfer learning to provide high-accuracy custom models with minimal training data.
Exam trap
The trap here is that candidates confuse the general idea of 'transferring' something (data, code, or location) with the specific machine learning concept of transferring learned knowledge from a pre-trained model to a new task.
How to eliminate wrong answers
Option A is wrong because moving a model between Azure regions is a deployment or migration operation, not a learning technique; it has nothing to do with reusing learned representations for a new task. Option C is wrong because transferring labeled data between projects is a data management activity, not a model training methodology; transfer learning specifically refers to transferring learned weights or features, not raw data. Option D is wrong because translating ML models between programming languages is a code conversion or interoperability concern, unrelated to the core concept of leveraging pre-trained knowledge for a new but related task.