Инструкция по точной настройке FLAN T5 XL с помощью Amazon SageMaker Jumpstart

Переиздано Платоном

Читают: 0

Генеративный ИИ переживает период ошеломляющего роста. Постоянно выпускаются все более функциональные базовые модели, причем большие языковые модели (LLM) являются одним из наиболее заметных классов моделей. LLM — это модели, состоящие из миллиардов параметров, обученных на обширных корпусах текстов, до сотен миллиардов или даже триллионов токенов. Эти модели оказались чрезвычайно эффективными для широкого спектра текстовых задач, от ответов на вопросы до анализа настроений.

Сила LLM заключается в их способности учиться и обобщать обширные и разнообразные обучающие данные. Начальное обучение этих моделей выполняется с различными целями, контролируемыми, неконтролируемыми или гибридными. Завершение текста или вменение — одна из наиболее распространенных задач без учителя: по фрагменту текста модель учится точно предсказывать, что будет дальше (например, предсказывать следующее предложение). Модели также можно обучать под наблюдением, используя помеченные данные для выполнения набора задач (например, является ли этот обзор фильма положительным, отрицательным или нейтральным). Независимо от того, обучена ли модель автозавершению текста или какой-либо другой задаче, часто это не та задача, для которой клиенты хотят использовать модель.

Чтобы повысить производительность предварительно обученного LLM для конкретной задачи, мы можем настроить модель, используя примеры целевой задачи в процессе, известном как инструкция тонкой настройки. Тонкая настройка инструкций использует набор помеченных примеров в виде пар {приглашение, ответ} для дальнейшего обучения предварительно обученной модели адекватному прогнозированию ответа на запрос. Этот процесс изменяет веса модели.

В этом посте описывается, как выполнить точную настройку инструкций LLM, а именно FLAN T5 XL, с помощью Быстрый запуск Amazon SageMaker. Мы покажем, как это сделать с помощью пользовательского интерфейса Jumpstart и блокнота в Студия Amazon SageMaker. Вы можете найти сопроводительная записная книжка в amazon-sagemaker-примеры Репозиторий GitHub.

Обзор решения

Целевая задача в этом посте состоит в том, чтобы, учитывая фрагмент текста в подсказке, вернуть вопросы, которые связаны с текстом, но на которые нельзя ответить на основе содержащейся в нем информации. Это полезная задача для выявления отсутствующей информации в описании или определения того, требуется ли ответ на запрос дополнительной информации.

Модели FLAN T5 представляют собой инструкции, точно настроенные для широкого круга задач, чтобы повысить производительность этих моделей при выполнении многих распространенных задач[1]. Дополнительная точная настройка инструкций для конкретной задачи клиента может еще больше повысить точность этих моделей, особенно если целевая задача ранее не использовалась для обучения модели FLAN T5, как в случае с нашей задачей.

В нашем примере задачи мы заинтересованы в создании релевантных, но оставшихся без ответа вопросов. С этой целью мы используем подмножество версии 2 Стэнфордского набора данных для ответов на вопросы (SQuAD2.0) [2] для точной настройки модели. Этот набор данных содержит вопросы, заданные комментаторами-людьми в ряде статей Википедии. Помимо вопросов с ответами, SQuAD2.0 содержит около 50,000 XNUMX вопросов без ответов. Такие вопросы правдоподобны, но на них нельзя напрямую ответить из содержания статей. Мы используем только вопросы без ответа. Наши данные структурированы в виде файла JSON Lines, где каждая строка содержит контекст и вопрос.

Скриншот нескольких записей набора данных SQuADv2.

Предпосылки

Для начала все, что вам нужно, — это учетная запись AWS, в которой вы можете использовать Studio. Вам потребуется создать профиль пользователя для Studio, если у вас его еще нет.

Точная настройка FLAN-T5 с помощью пользовательского интерфейса Jumpstart

Чтобы точно настроить модель с помощью пользовательского интерфейса Jumpstart, выполните следующие шаги:

На консоли SageMaker откройте Studio.
Под SageMaker Jumpstart на панели навигации выберите Модели, ноутбуки, решения.

Вы увидите список базовых моделей, в том числе FLAN T5 XL, который помечен как настраиваемый.

Выберите Посмотреть модель.

Пользовательский интерфейс JumpStart с FLAN-T5 XL.

Под Источник данных, вы можете указать путь к своим обучающим данным. Источник данных, используемых в этом посте, предоставляется по умолчанию.
Вы можете оставить значение по умолчанию для конфигурации развертывания (включая тип экземпляра), безопасности и гиперпараметров, но вам следует увеличить количество эпох как минимум до трех, чтобы получить хорошие результаты.
Выберите Train для обучения модели.

Интерфейс поезда JumpStart для модели FLAN-T5 XL.

Вы можете отслеживать статус задания обучения в пользовательском интерфейсе.

Пользовательский интерфейс Jumpstart для текущего обучения.

По завершении обучения (в нашем случае примерно через 53 минуты) выберите Развертывание для развертывания отлаженной модели.

Обучение пользовательскому интерфейсу JumpStart завершено.

После создания конечной точки (несколько минут) вы можете открыть записную книжку и приступить к использованию настроенной модели.

Точная настройка FLAN-T5 с помощью ноутбука Python

В нашем примере блокнота показано, как использовать Jumpstart и SageMaker для программной точной настройки и развертывания модели FLAN T5 XL. Его можно запустить в Studio или локально.

В этом разделе мы сначала рассмотрим некоторые общие настройки. Затем вы настраиваете модель, используя наборы данных SQuADv2. Затем вы развертываете предварительно обученную версию модели за конечной точкой SageMaker и делаете то же самое с точно настроенной моделью. Наконец, вы можете запросить конечные точки и сравнить качество выходных данных предварительно обученной и точно настроенной модели. Вы обнаружите, что результат тонко настроенной модели имеет гораздо более высокое качество.

Настроить предварительные условия

Начните с установки и обновления необходимых пакетов. Перезапустите ядро после запуска следующего кода:

!pip install nest-asyncio==1.5.5 --quiet
!pip install ipywidgets==8.0.4 --quiet
!pip install --upgrade sagemaker --quiet

Затем получите роль выполнения, связанную с текущим экземпляром записной книжки:

import boto3
import sagemaker
# Get current region, role, and default bucket
aws_region = boto3.Session().region_name
aws_role = sagemaker.session.Session().get_caller_identity_arn()
output_bucket = sagemaker.Session().default_bucket()
# This will be useful for printing
newline, bold, unbold = "n", "33[1m", "33[0m"
print(f"{bold}aws_region:{unbold} {aws_region}")
print(f"{bold}aws_role:{unbold} {aws_role}")
print(f"{bold}output_bucket:{unbold} {output_bucket}"

Вы можете определить удобное раскрывающееся меню, в котором будут перечислены размеры моделей, доступные для тонкой настройки:

import IPython
from ipywidgets import Dropdown
from sagemaker.jumpstart.filters import And
from sagemaker.jumpstart.notebook_utils import list_jumpstart_models
# Default model choice
model_id = "huggingface-text2text-flan-t5-xl"
# Identify FLAN T5 models that support fine-tuning
filter_value = And( "task == text2text", "framework == huggingface", "training_supported == true"
)
model_list = [m for m in list_jumpstart_models(filter=filter_value) if "flan-t5" in m]
# Display the model IDs in a dropdown, for user to select
dropdown = Dropdown(
value=model_id,
options=model_list,
description="FLAN T5 models available for fine-tuning:",
style={"description_width": "initial"},
layout={"width": "max-content"},
)
display(IPython.display.Markdown("### Select a pre-trained model from the dropdown below"))
display(dropdown)

Jumpstart автоматически извлекает соответствующие типы экземпляров обучения и логического вывода для выбранной вами модели:

from sagemaker.instance_types import retrieve_default
model_id, model_version = dropdown.value, "*"
# Instance types for training and inference
training_instance_type = retrieve_default(
model_id=model_id, model_version=model_version, scope="training"
)
inference_instance_type = retrieve_default(
model_id=model_id, model_version=model_version, scope="inference"
)
print(f"{bold}model_id:{unbold} {model_id}")
print(f"{bold}training_instance_type:{unbold} {training_instance_type}")
print(f"{bold}inference_instance_type:{unbold} {inference_instance_type}") If you have chosen the FLAN T5 XL, you will see the following output: model_id: huggingface-text2text-flan-t5-xl training_instance_type: ml.p3.16xlarge inference_instance_type: ml.g5.2xlarge

Теперь вы готовы приступить к тонкой настройке.

Переобучите модель на наборе данных тонкой настройки

После завершения настройки выполните следующие шаги:

Используйте следующий код для получения URI необходимых артефактов:

from sagemaker import image_uris, model_uris, script_uris
# Training instance will use this image
train_image_uri = image_uris.retrieve(
region=aws_region,
framework=None,  # automatically inferred from model_id
model_id=model_id,
model_version=model_version,
image_scope="training",
instance_type=training_instance_type,
)
# Pre-trained model
train_model_uri = model_uris.retrieve(
model_id=model_id, model_version=model_version, model_scope="training"
)
# Script to execute on the training instance
train_script_uri = script_uris.retrieve(
model_id=model_id, model_version=model_version, script_scope="training"
)
print(f"{bold}image uri:{unbold} {train_image_uri}")
print(f"{bold}model uri:{unbold} {train_model_uri}")
print(f"{bold}script uri:{unbold} {train_script_uri}")

Данные обучения находятся в общедоступном Простой сервис хранения Amazon (Amazon S3) ведро.

Используйте следующий код, чтобы указать местоположение данных и настроить выходное местоположение в сегменте вашей учетной записи:

from sagemaker.s3 import S3Downloader # We will use the train split of SQuAD2.0
original_data_file = "train-v2.0.json" # The data was mirrored in the following bucket
original_data_location = f"s3://sagemaker-sample-files/datasets/text/squad2.0/{original_data_file}"
S3Downloader.download(original_data_location, ".")

Исходные данные не в формате, соответствующем задаче, под которую вы настраиваете модель, поэтому вы можете переформатировать ее:

import json local_data_file = "task-data.jsonl"  # any name with .jsonl extension with open(original_data_file) as f:
data = json.load(f) with open(local_data_file, "w") as f:
for article in data["data"]:
for paragraph in article["paragraphs"]:
# iterate over questions for a given paragraph
for qas in paragraph["qas"]:
if qas["is_impossible"]:
# the question is relevant, but cannot be answered
example = {"context": paragraph["context"], "question": qas["question"]}
json.dump(example, f)
f.write("n") template = { "prompt": "Ask a question which is related to the following text, but cannot be answered based on the text. Text: {context}", "completion": "{question}",
}
with open("template.json", "w") as f:
json.dump(template, f) from sagemaker.s3 import S3Uploader train_data_location = f"s3://{output_bucket}/train_data"
S3Uploader.upload(local_data_file, train_data_location)
S3Uploader.upload("template.json", train_data_location)
print(f"{bold}training data:{unbold} {train_data_location}")

Теперь вы можете определить некоторые гиперпараметры для обучения:

from sagemaker import hyperparameters # Retrieve the default hyper-parameters for fine-tuning the model
hyperparameters = hyperparameters.retrieve_default(model_id=model_id, model_version=model_version) # We will override some default hyperparameters with custom values
hyperparameters["epochs"] = "3"
# TODO
# hyperparameters["max_input_length"] = "300"  # data inputs will be truncated at this length
# hyperparameters["max_output_length"] = "40"  # data outputs will be truncated at this length
# hyperparameters["generation_max_length"] = "40"  # max length of generated output
print(hyperparameters)

Теперь вы готовы запустить задание обучения:

from sagemaker.estimator import Estimator
from sagemaker.utils import name_from_base model_name = "-".join(model_id.split("-")[2:])  # get the most informative part of ID
training_job_name = name_from_base(f"js-demo-{model_name}-{hyperparameters['epochs']}")
print(f"{bold}job name:{unbold} {training_job_name}") training_metric_definitions = [
{"Name": "val_loss", "Regex": "'eval_loss': ([0-9.]+)"},
{"Name": "train_loss", "Regex": "'loss': ([0-9.]+)"},
{"Name": "epoch", "Regex": "'epoch': ([0-9.]+)"},
] # Create SageMaker Estimator instance
sm_estimator = Estimator(
role=aws_role,
image_uri=train_image_uri,
model_uri=train_model_uri,
source_dir=train_script_uri,
entry_point="transfer_learning.py",
instance_count=1,
instance_type=training_instance_type,
volume_size=300,
max_run=360000,
hyperparameters=hyperparameters,
output_path=output_location,
metric_definitions=training_metric_definitions,
) # Launch a SageMaker training job over data located in the given S3 path
# Training jobs can take hours, it is recommended to set wait=False,
# and monitor job status through SageMaker console
sm_estimator.fit({"training": train_data_location}, job_name=training_job_name, wait=False)

В зависимости от размера данных точной настройки и выбранной модели точная настройка может занять до нескольких часов.

Вы можете отслеживать показатели производительности, такие как потери при обучении и проверке, с помощью Amazon CloudWatch во время обучения. Для удобства вы также можете получить самый последний снимок метрик, запустив следующий код:

from sagemaker import TrainingJobAnalytics # This can be called while the job is still running
df = TrainingJobAnalytics(training_job_name=training_job_name).dataframe()
df.head(10) model uri: s3://sagemaker-us-west-2-802376408542/avkan/training-huggingface-text2text-huggingface-text2text-flan-t5-xl-repack.tar.gz
job name: jumpstart-demo-xl-3-2023-04-06-08-16-42-738
INFO:sagemaker:Creating training-job with name: jumpstart-demo-xl-3-2023-04-06-08-16-42-738

Когда обучение будет завершено, у вас будет точно настроенная модель в model_uri. Давайте использовать его!

Вы можете создать две конечные точки вывода: одну для исходной предварительно обученной модели и одну для точной модели. Это позволяет сравнить выходные данные обеих версий модели. На следующем шаге вы развертываете конечную точку вывода для предварительно обученной модели. Затем вы развертываете конечную точку для отлаженной модели.

Разверните предварительно обученную модель

Начнем с развертывания предварительно обученной модели, которая извлекает URI образа Docker для логического вывода. Это базовое изображение контейнера Hugging Face. Используйте следующий код:

from sagemaker import image_uris # Retrieve the inference docker image URI. This is the base HuggingFace container image
deploy_image_uri = image_uris.retrieve(
region=None,
framework=None,  # automatically inferred from model_id
model_id=model_id,
model_version=model_version,
image_scope="inference",
instance_type=inference_instance_type,
)

Теперь вы можете создать конечную точку и развернуть предварительно обученную модель. Обратите внимание, что вам необходимо передать класс Predictor при развертывании модели через класс Model, чтобы иметь возможность выполнять вывод через SageMaker API. См. следующий код:

from sagemaker import model_uris, script_uris
from sagemaker.model import Model
from sagemaker.predictor import Predictor
from sagemaker.utils import name_from_base # Retrieve the URI of the pre-trained model
pre_trained_model_uri = model_uris.retrieve(
model_id=model_id, model_version=model_version, model_scope="inference"
) pre_trained_name = name_from_base(f"jumpstart-demo-pre-trained-{model_id}") # Create the SageMaker model instance of the pre-trained model
if ("small" in model_id) or ("base" in model_id):
deploy_source_uri = script_uris.retrieve(
model_id=model_id, model_version=model_version, script_scope="inference"
)
pre_trained_model = Model(
image_uri=deploy_image_uri,
source_dir=deploy_source_uri,
entry_point="inference.py",
model_data=pre_trained_model_uri,
role=aws_role,
predictor_cls=Predictor,
name=pre_trained_name,
)
else:
# For those large models, we already repack the inference script and model
# artifacts for you, so the `source_dir` argument to Model is not required.
pre_trained_model = Model(
image_uri=deploy_image_uri,
model_data=pre_trained_model_uri,
role=aws_role,
predictor_cls=Predictor,
name=pre_trained_name,
) print(f"{bold}image URI:{unbold}{newline} {deploy_image_uri}")
print(f"{bold}model URI:{unbold}{newline} {pre_trained_model_uri}")
print("Deploying an endpoint ...") # Deploy the pre-trained model. Note that we need to pass Predictor class when we deploy model
# through Model class, for being able to run inference through the SageMaker API
pre_trained_predictor = pre_trained_model.deploy(
initial_instance_count=1,
instance_type=inference_instance_type,
predictor_cls=Predictor,
endpoint_name=pre_trained_name,
)
print(f"{newline}Deployed an endpoint {pre_trained_name}")

Создание конечной точки и развертывание модели может занять несколько минут, после чего ваша конечная точка готова принимать вызовы логического вывода.

Разверните доработанную модель

Давайте развернем точно настроенную модель на ее собственной конечной точке. Процесс почти идентичен тому, который мы использовали ранее для предварительно обученной модели. Единственная разница в том, что мы используем точно настроенное имя модели и URI:

from sagemaker.model import Model
from sagemaker.predictor import Predictor
from sagemaker.utils import name_from_base fine_tuned_name = name_from_base(f"jumpstart-demo-fine-tuned-{model_id}")
fine_tuned_model_uri = f"{output_location}{training_job_name}/output/model.tar.gz" # Create the SageMaker model instance of the fine-tuned model
fine_tuned_model = Model(
image_uri=deploy_image_uri,
model_data=fine_tuned_model_uri,
role=aws_role,
predictor_cls=Predictor,
name=fine_tuned_name,
) print(f"{bold}image URI:{unbold}{newline} {deploy_image_uri}")
print(f"{bold}model URI:{unbold}{newline} {fine_tuned_model_uri}")
print("Deploying an endpoint ...") # Deploy the fine-tuned model.
fine_tuned_predictor = fine_tuned_model.deploy(
initial_instance_count=1,
instance_type=inference_instance_type,
predictor_cls=Predictor,
endpoint_name=fine_tuned_name,
)
print(f"{newline}Deployed an endpoint {fine_tuned_name}")

Когда этот процесс завершен, как предварительно обученные, так и точно настроенные модели развертываются за их собственными конечными точками. Давайте сравним их выходы.

Генерация выходных данных и сравнение результатов

Определите некоторые служебные функции для запроса конечной точки и анализа ответа:

import boto3
import json # Parameters of (output) text generation. A great introduction to generation
# parameters can be found at https://huggingface.co/blog/how-to-generate
parameters = { "max_length": 40,  # restrict the length of the generated text "num_return_sequences": 5,  # we will inspect several model outputs "num_beams": 10,  # use beam search
} # Helper functions for running inference queries
def query_endpoint_with_json_payload(payload, endpoint_name):
encoded_json = json.dumps(payload).encode("utf-8")
client = boto3.client("runtime.sagemaker")
response = client.invoke_endpoint(
EndpointName=endpoint_name, ContentType="application/json", Body=encoded_json
)
return response def parse_response_multiple_texts(query_response):
model_predictions = json.loads(query_response["Body"].read())
generated_text = model_predictions["generated_texts"]
return generated_text def generate_questions(endpoint_name, text):
expanded_prompt = prompt.replace("{context}", text)
payload = {"text_inputs": expanded_prompt, **parameters}
query_response = query_endpoint_with_json_payload(payload, endpoint_name=endpoint_name)
generated_texts = parse_response_multiple_texts(query_response)
for i, generated_text in enumerate(generated_texts):
print(f"Response {i}: {generated_text}{newline}")

В следующем фрагменте кода мы определяем приглашение и тестовые данные. Описывает нашу целевую задачу, которая состоит в том, чтобы генерировать вопросы, связанные с предоставленным текстом, но на которые нельзя ответить на его основе.

Тестовые данные состоят из трех разных абзацев, один из которых посвящен австралийскому городу Аделаиде из первые два абзаца этой страницы Википедии, один относительно Магазин эластичных блоков Amazon (Amazon EBS) из Документация по Amazon EBS, и один из Amazon Comprehend из Документация по Amazon Comprehend. Мы ожидаем, что модель выявит вопросы, связанные с этими абзацами, но на которые нельзя ответить с помощью представленной в них информации.

prompt = "Ask a question which is related to the following text, but cannot be answered based on the text. Text: {context}" test_paragraphs = [ """
Adelaide is the capital city of South Australia, the state's largest city and the fifth-most populous city in Australia. "Adelaide" may refer to either Greater Adelaide (including the Adelaide Hills) or the Adelaide city centre.
The demonym Adelaidean is used to denote the city and the residents of Adelaide. The Traditional Owners of the Adelaide
region are the Kaurna people. The area of the city centre and surrounding parklands is called Tarndanya in the Kaurna language. Adelaide is situated on the Adelaide Plains north of the Fleurieu Peninsula, between the Gulf St Vincent in the west and
the Mount Lofty Ranges in the east. Its metropolitan area extends 20 km (12 mi) from the coast to the foothills of
the Mount Lofty Ranges, and stretches 96 km (60 mi) from Gawler in the north to Sellicks Beach in the south. """, """
Amazon Elastic Block Store (Amazon EBS) provides block level storage volumes for use with EC2 instances. EBS volumes behave like raw, unformatted block devices. You can mount these volumes as devices on your instances. EBS volumes that are attached to an instance are exposed as storage volumes that persist independently from the life of the instance. You can create a file system on top of these volumes, or use them in any way you would use a block device (such as a hard drive). You can dynamically change the configuration of a volume attached to an instance. We recommend Amazon EBS for data that must be quickly accessible and requires long-term persistence. EBS volumes are particularly well-suited for use as the primary storage for file systems, databases, or for any applications that require fine granular updates and access to raw, unformatted, block-level storage. Amazon EBS is well suited to both database-style applications that rely on random reads and writes, and to throughput-intensive applications that perform long, continuous reads and writes. """, """
Amazon Comprehend uses natural language processing (NLP) to extract insights about the content of documents. It develops insights by recognizing the entities, key phrases, language, sentiments, and other common elements in a document. Use Amazon Comprehend to create new products based on understanding the structure of documents. For example, using Amazon Comprehend you can search social networking feeds for mentions of products or scan an entire document repository for key phrases. You can access Amazon Comprehend document analysis capabilities using the Amazon Comprehend console or using the Amazon Comprehend APIs. You can run real-time analysis for small workloads or you can start asynchronous analysis jobs for large document sets. You can use the pre-trained models that Amazon Comprehend provides, or you can train your own custom models for classification and entity recognition. All of the Amazon Comprehend features accept UTF-8 text documents as the input. In addition, custom classification and custom entity recognition accept image files, PDF files, and Word files as input. Amazon Comprehend can examine and analyze documents in a variety of languages, depending on the specific feature. For more information, see Languages supported in Amazon Comprehend. Amazon Comprehend's Dominant language capability can examine documents and determine the dominant language for a far wider selection of languages. """
]

Теперь вы можете протестировать конечные точки, используя примеры статей.

print(f"{bold}Prompt:{unbold} {repr(prompt)}")
for paragraph in test_paragraphs:
print("-" * 80)
print(paragraph)
print("-" * 80)
print(f"{bold}pre-trained{unbold}")
generate_questions(pre_trained_name, paragraph)
print(f"{bold}fine-tuned{unbold}")
generate_questions(fine_tuned_name, paragraph)

Данные испытаний: Аделаида

Мы используем следующий контекст:

delaide is the capital city of South Australia, the state's largest city and the fifth-most populous city in Australia. "Adelaide" may refer to either Greater Adelaide (including the Adelaide Hills) or the Adelaide city centre.
The demonym Adelaidean is used to denote the city and the residents of Adelaide. The Traditional Owners of the Adelaide
region are the Kaurna people. The area of the city centre and surrounding parklands is called Tarndanya in the Kaurna language. Adelaide is situated on the Adelaide Plains north of the Fleurieu Peninsula, between the Gulf St Vincent in the west and
the Mount Lofty Ranges in the east. Its metropolitan area extends 20 km (12 mi) from the coast to the foothills of
the Mount Lofty Ranges, and stretches 96 km (60 mi) from Gawler in the north to Sellicks Beach in the south.

Ответ предварительно обученной модели выглядит следующим образом:

Response 0: What is the area of the city centre and surrounding parklands called in the Kaurna language?
Response 1: What is the area of the city centre and surrounding parklands is called Tarndanya in the Kaurna language?
Response 2: What is the area of the city centre and surrounding parklands called in Kaurna?
Response 3: What is the capital city of South Australia?
Response 4: What is the area of the city centre and surrounding parklands known as in the Kaurna language?

Точно настроенные ответы модели следующие:

Response 0: What is the second most populous city in Australia?
Response 1: What is the fourth most populous city in Australia?
Response 2: What is the population of Gawler?
Response 3: What is the largest city in Australia?
Response 4: What is the fifth most populous city in the world?

Тестовые данные: Amazon EBS

Мы используем следующий контекст:

Amazon Elastic Block Store (Amazon EBS) provides block level storage volumes for use with EC2 instances. EBS volumes behave like raw, unformatted block devices. You can mount these volumes as devices on your instances. EBS volumes that are attached to an instance are exposed as storage volumes that persist independently from the life of the instance. You can create a file system on top of these volumes, or use them in any way you would use a block device (such as a hard drive). You can dynamically change the configuration of a volume attached to an instance. We recommend Amazon EBS for data that must be quickly accessible and requires long-term persistence. EBS volumes are particularly well-suited for use as the primary storage for file systems, databases, or for any applications that require fine granular updates and access to raw, unformatted, block-level storage. Amazon EBS is well suited to both database-style applications that rely on random reads and writes, and to throughput-intensive applications that perform long, continuous reads and writes.

Ответы предварительно обученной модели следующие:

esponse 0: What is the difference between Amazon EBS and Amazon Elastic Block Store (Amazon EBS)?
Response 1: What is the difference between Amazon EBS and Amazon Elastic Block Store?
Response 2: What is the difference between Amazon EBS and Amazon Simple Storage Service (Amazon S3)?
Response 3: What is Amazon Elastic Block Store (Amazon EBS)?
Response 4: What is the difference between Amazon EBS and a hard drive?

Точно настроенные ответы модели следующие:

Response 0: What type of applications are not well suited to Amazon EBS?
Response 1: What behaves like formatted block devices?
Response 2: What type of applications are not suited to Amazon EBS?
Response 3: What type of applications are not well suited for Amazon EBS?
Response 4: What type of applications are not suited for Amazon EBS?

Тестовые данные: Amazon Comprehend

Мы используем следующий контекст:

Amazon Comprehend uses natural language processing (NLP) to extract insights about the content of documents. It develops insights by recognizing the entities, key phrases, language, sentiments, and other common elements in a document. Use Amazon Comprehend to create new products based on understanding the structure of documents. For example, using Amazon Comprehend you can search social networking feeds for mentions of products or scan an entire document repository for key phrases. You can access Amazon Comprehend document analysis capabilities using the Amazon Comprehend console or using the Amazon Comprehend APIs. You can run real-time analysis for small workloads or you can start asynchronous analysis jobs for large document sets. You can use the pre-trained models that Amazon Comprehend provides, or you can train your own custom models for classification and entity recognition. All of the Amazon Comprehend features accept UTF-8 text documents as the input. In addition, custom classification and custom entity recognition accept image files, PDF files, and Word files as input. Amazon Comprehend can examine and analyze documents in a variety of languages, depending on the specific feature. For more information, see Languages supported in Amazon Comprehend. Amazon Comprehend's Dominant language capability can examine documents and determine the dominant language for a far wider selection of languages.

Ответы предварительно обученной модели следующие:

Response 0: What does Amazon Comprehend use to extract insights about the content of documents?
Response 1: How does Amazon Comprehend extract insights about the content of documents?
Response 2: What does Amazon Comprehend use to develop insights about the content of documents?
Response 3: How does Amazon Comprehend develop insights about the content of documents?
Response 4: What does Amazon Comprehend use to extract insights about the content of a document?

Точно настроенные ответы модели следующие:

Response 0: What does Amazon Comprehend use to extract insights about the structure of documents?
Response 1: How does Amazon Comprehend recognize sentiments in a document?
Response 2: What does Amazon Comprehend use to extract insights about the content of social networking feeds?
Response 3: What does Amazon Comprehend use to extract insights about the content of documents?
Response 4: What type of files does Amazon Comprehend reject as input?

Разница в качестве вывода между предварительно обученной моделью и точно настроенной моделью разительна. Вопросы, представленные в тонко настроенной модели, затрагивают более широкий круг тем. Это систематически значимые вопросы, что не всегда верно для предварительно обученной модели, как показано на примере Amazon EBS.

Хотя это не является формальной и систематической оценкой, ясно, что процесс тонкой настройки улучшил качество ответов модели на эту задачу.

Убирать

Наконец, не забудьте очистить и удалить конечные точки:

# Delete resources
pre_trained_predictor.delete_model()
pre_trained_predictor.delete_endpoint()
fine_tuned_predictor.delete_model()
fine_tuned_predictor.delete_endpoint()

Заключение

В этом посте мы показали, как использовать тонкую настройку инструкций для моделей FLAN T5 с помощью пользовательского интерфейса Jumpstart или ноутбука Jupyter, работающего в Studio. Мы предоставили код, объясняющий, как переобучить модель, используя данные для целевой задачи, и развернуть точно настроенную модель за конечной точкой. Целевой задачей в этом посте было определить вопросы, которые относятся к фрагменту текста, предоставленному во входных данных, но на которые нельзя ответить на основе информации, представленной в этом тексте. Мы продемонстрировали, что модель, точно настроенная для этой конкретной задачи, дает лучшие результаты, чем предварительно обученная модель.

Теперь, когда вы знаете, как настроить модель с помощью Jumpstart, вы можете создавать мощные модели, адаптированные для вашего приложения. Соберите некоторые данные для вашего варианта использования, загрузите их в Amazon S3 и используйте пользовательский интерфейс Studio или ноутбук для настройки модели FLAN T5!