Hunyuan API 모델 가이드: TurboS Latest, Lite, Translation, 3D Engine 선택법

TurboS Latest, Hunyuan-Lite, Hunyuan Translation, Hunyuan 3D Engine은 하나의 Hunyuan 모델을 네 단계로 나눈 이름이 아니다. hunyuan-turbos-latest는 텍스트/채팅 route, Hunyuan-Lite는 저비용 텍스트 검증 route, Hunyuan Translation은 번역 workflow, Hunyuan 3D Engine은 비동기 3D 에셋 생성 pipeline으로 분리해서 판단해야 한다.

모델명, endpoint, billing row, quota, region, 번역 지원, 3D task behavior는 Tencent Cloud docs와 콘솔이 공식 사실을 소유한다. 호환 제공자 페이지는 해당 제공자의 endpoint, 가격, 제한, 로그, 지원 책임만 증명한다. latest, Lite 무료 상태, translation language coverage, 3D output formats, TokenHub migration note, 지역, provider support는 배포 전에 현재 공식 문서로 다시 확인한다.

작업	먼저 볼 Hunyuan route	배포 전 공식 확인 항목
일반 텍스트, 채팅, 요약, 코드 보조, OpenAI-compatible client	Tencent Cloud OpenAI-compatible chat/completions의 `hunyuan-turbos-latest`	model overview, compatible examples, console availability, Tencent-specific parameters
저비용 smoke test, 간단한 분류/추출, 내부 prompt 검증	Hunyuan-Lite	현재 무료/가격 row, 기능 경계, output ceiling, account state
번역이 주 작업인 pipeline	Hunyuan Translation / `hunyuan-translation`	model overview, ChatTranslations API, supported languages, segment limits, billing row
text/image/sketch to 3D asset	Hunyuan 3D Engine	Tencent HY 3D docs, async submit/query, output formats, quota, storage/review
third-party compatible Hunyuan access	제공자 계약만 기준으로 보는 route	alias, endpoint, 가격, rate limits, logs, retry billing, support boundary

빠른 route board

안전한 결정 순서는 작업을 먼저 보고 모델명을 나중에 보는 것이다. 이름부터 보면 hunyuan-turbos-latest를 전체 Hunyuan 제품군의 최신 대표로 오해하거나, Hunyuan-Lite를 모든 production default로 쓰거나, Hunyuan Translation을 일반 chat prompt로 처리하거나, Hunyuan 3D Engine을 text token 모델처럼 설계할 수 있다.

Tencent Cloud Hunyuan product overview는 text/translation family의 첫 지도다. hunyuan-turbos-latest, hunyuan-lite, hunyuan-translation은 서로 다른 row에 있고 input/output limits와 intended use가 다르다. Hunyuan에는 multimodal, image, video, 3D까지 포함되므로 모든 이름을 하나의 성능 순위로 놓으면 implementation risk가 커진다.

Route	첫 use case	피해야 할 해석
TurboS Latest	general text/chat, reasoning, summary, coding, OpenAI-compatible text workflow	모든 Hunyuan 제품군에서 영구적으로 최신/최강이라는 claim
Hunyuan-Lite	cheap text smoke tests, simple extraction, classification, prompt evaluation	품질 민감한 customer-facing production default
Hunyuan Translation	translation-first pipeline, bilingual document, support messages, subtitle	번역만 필요한 작업을 일반 chat prompt로 대체
Hunyuan 3D Engine	text/image/sketch to 3D asset	chat/completions model 또는 text-token billing으로 처리

Tencent Cloud 공식 계약과 호환 제공자 경로의 Hunyuan 사실 소유권 경계도

공식 사실 소유자 분리

Tencent Cloud docs에는 Hunyuan capabilities가 TokenHub 쪽으로 이동한다는 platform note도 있다. 새 기능, 신규 구매, 신규 account, 신규 region을 다룰 때는 이 메모가 implementation boundary가 된다. 기존 console path가 작동한다는 사실만으로 새 배포의 route, billing, quota가 같다고 볼 수 없다.

확인할 사실	source family	중요한 이유
model name, version update, input/output window, feature notes	Tencent Cloud Hunyuan product overview	`latest`, Lite, Translation, 3D를 각각 다른 row로 읽게 한다
base URL, API key, chat/completions path, sample `hunyuan-turbos-latest` call	Tencent Cloud OpenAI-compatible examples	OpenAI-style client shape와 Tencent extension을 확인한다
free resources, prepaid/postpaid, token billing rows, Lite status	Tencent Cloud billing overview	무료/유료 표현을 현재 official billing row에 묶는다
translation-specific API behavior	Tencent Cloud ChatTranslations docs	번역 route를 일반 chat route와 섞지 않는다
3D auth, SecretId/SecretKey, async job APIs, result retrieval	Tencent HY 3D Global docs	3D Engine을 text/chat assumptions와 분리한다

호환 제공자 페이지는 provider endpoint, account model, price, rate limits, logs, retries, support, model alias를 확인할 때 유용하다. 하지만 Tencent official pricing, quota, region behavior, translation coverage, 3D output format, migration note를 바꾸는 근거는 아니다.

`hunyuan-turbos-latest`: 텍스트/채팅 route

일반 text/chat workload이고 OpenAI-compatible client shape가 중요하다면 hunyuan-turbos-latest가 첫 후보가 된다. Tencent Cloud compatible-interface examples는 base URL을 https://api.hunyuan.cloud.tencent.com/v1로 보여주고, chat/completions sample에서 model: "hunyuan-turbos-latest"를 사용한다.

최소 Python 형태는 다음과 같다.

hljs python
import os
from openai import OpenAI

client = OpenAI(
    api_key=os.environ["HUNYUAN_API_KEY"],
    base_url="https://api.hunyuan.cloud.tencent.com/v1",
)

response = client.chat.completions.create(
    model="hunyuan-turbos-latest",
    messages=[
        {"role": "user", "content": "릴리스 노트를 세 가지 핵심 bullet로 요약해 주세요."}
    ],
    extra_body={"enable_enhancement": True},
)

print(response.choices[0].message.content)

compatible은 identical이 아니다. Tencent는 OpenAI-compatible parameters와 Tencent-specific parameters를 함께 설명한다. production migration 전에는 같은 payload로 streaming, enhancement flags, function calling, SDK retries, logging, timeout, error body, cost reporting을 테스트한다. endpoint 모양이 익숙해도 billing과 failure behavior는 별도 검증이 필요하다.

latest도 제한해서 써야 한다. hunyuan-turbos-latest는 text/chat family 안의 TurboS Latest alias이지, translation, 3D, multimodal, image, future TokenHub route까지 포함한 전체 Hunyuan의 영구 최신 보장은 아니다.

Hunyuan-Lite: 저비용 텍스트 검증

Hunyuan-Lite는 cost-sensitive text work와 early integration tests에 적합하다. Tencent Cloud pricing page는 2026년 5월 8일 확인 시 Hunyuan-Lite를 free to use로 표시했다. 이 표현은 현재 official billing row에 묶어야 하며, model overview의 Lite row와 함께 읽어 output boundary와 feature fit을 확인한다.

Lite를 universal production default로 두지 않는다. 적합한 첫 용도는 smoke tests, simple extraction, classification, internal tools, early prompt evaluation이다. stronger reasoning, knowledge enhancement, long answer, customer-facing output, complex code, higher output limit가 필요하면 같은 test set을 TurboS나 다른 official text row에서도 실행한다.

판단 축	Lite가 맞는 경우	다른 route를 함께 볼 경우
cost	작은 low-risk 검증이 많다	retry와 human rework가 비용을 키운다
quality	label, extraction, short summary, internal draft	customer-visible copy, high-stakes answer, complex code
features	basic text call이면 충분하다	enhanced search, special parameters, larger output가 필요하다
external promise	customer copy에 무료라고 쓰지 않는다	budget, SLA, public docs에 가격을 써야 한다

무료라는 말은 date-bounded로 유지한다. billing page, console setting, region, account status, service migration이 바뀌면 오래된 screenshot이나 provider page로 production budget을 방어할 수 없다.

Hunyuan Translation: 번역 전용 route

Hunyuan Translation은 독립 route로 봐야 한다. Tencent는 hunyuan-translation을 translation model/API로 설명하고, translation-first workload는 chat과 다른 검증 기준을 가진다. source language, target language, segment length, glossary handling, format retention, fallback behavior, throughput, accepted translation cost를 확인한다.

제품 문구 번역, 문서 localization, support-message conversion, subtitle preparation, multilingual QA, 다수의 short segment를 처리하는 pipeline에서는 ChatTranslations route를 우선한다. chat completions는 번역이 broader reasoning의 일부일 때 적합하다. 예를 들어 customer message를 읽고 intent를 분류하고 답변을 만든 뒤 번역하는 흐름은 chat route가 자연스럽다.

확인 항목	생략하면 생기는 문제
supported source/target languages	데모에 나온 언어를 공식 지원으로 오해한다
segment length and format	문서, 자막, JSON fields는 chunking 방식이 다르다
terminology and names	translation API도 glossary QA가 필요하다
retry and fallback	batch translation에는 failure policy가 필요하다
billing row	translation cost와 chat cost를 섞게 된다

Hunyuan 3D Engine: 비동기 3D 에셋 pipeline

Hunyuan 3D Engine은 text model variant가 아니다. Tencent는 text, image, sketch에서 3D asset을 생성하는 platform으로 설명하고, Tencent HY 3D Global docs는 별도 product family에서 auth, task submission, result retrieval을 다룬다.

API shape가 다르면 architecture도 달라진다. professional 3D API는 asynchronous flow다. task를 submit하고 JobId를 받은 뒤 JobId로 result를 query한다. queue design, retry policy, status storage, timeout, asset storage, preview UI, moderation review, customer-facing failure message가 필요하다.

작업에서 Hunyuan chat, Lite, Translation, 3D 비동기 API로 이어지는 선택 흐름도

production concern	3D route implication
prompt or image input	prompt, image requirements, user rights를 submit 전에 확인한다
async job	JobId 저장, polling/scheduled query, timeout과 retry rule을 설계한다
output formats	GLB, OBJ, STL, FBX, preview를 약속하기 전 current docs를 본다
billing and quota	Tencent HY 3D billing/free-resource pages를 사용하고 text-token row로 계산하지 않는다
review workflow	generated asset에는 preview, cleanup, export, rights review가 필요할 수 있다

첫 구현 테스트는 3D에 대한 설명을 chat으로 받는 것이 아니라 작은 3D task를 submit/query하는 것이다. auth, JobId, query interval, output file, failure body, storage path, user-visible status를 확인한다.

호환 제공자 route

호환 제공자는 OpenAI-compatible endpoint, centralized billing, logs, support 같은 운영 편의를 줄 수 있다. 그래도 계약은 별도다. provider page가 증명하는 것은 provider route, price, rate limits, logs, retry behavior, support responsibility, data contract이지 Tencent official facts가 아니다.

production 사용 전에는 다음을 확인한다.

exact model alias와 Tencent official model row의 mapping.
endpoint compatibility와 unsupported parameters.
billing unit, minimum charge, retry billing, failed-request behavior.
data handling, log retention, privacy, support responsibility.
Tencent official update 이후 provider route가 뒤처지거나 고정되거나 renamed될 경우의 fallback.

기본 route decision에는 provider recommendation이 필요하지 않다. 먼저 official Tencent lane을 확정하고, 이후 provider convenience가 deployment friction을 실제로 줄이는지 판단한다.

production recheck checklist

code, customer copy, budget에 route를 고정하기 전 같은 작은 job set을 사용한다. TurboS Latest에는 text/chat prompt, Lite에는 cheap text sample, Translation에는 translation segment, 3D Engine에는 small async 3D task를 실행한다. 목표는 leaderboard가 아니라 각 route가 자기 일을 끝낼 수 있는지 확인하는 것이다.

Hunyuan 모델명, 가격, 쿼터, 리전, 3D 출력 형식을 프로덕션 전에 확인하는 체크리스트

쓰고 싶은 claim	사용 전 재확인
`hunyuan-turbos-latest` for text/chat	current model overview, compatible sample, console availability, `latest` meaning
Hunyuan-Lite is free	current Tencent pricing row, account state, service activation, region, feature caveats
Hunyuan Translation coverage	current model row, ChatTranslations docs, language list, segment limits, billing
Hunyuan 3D output format	current Tencent HY 3D docs, API version, output formats, free resources, quota, storage
provider availability	provider docs, billing, limits, support, data contract; Tencent official facts로 옮기지 않는다

현재 확인할 수 없는 내용은 약속처럼 쓰지 않는다. unlimited speed, permanent free, no-ban, refund guarantee, fixed SLA, all-region availability 같은 말은 정확한 현재 근거가 있을 때만 사용한다.

자주 묻는 질문

`hunyuan-turbos-latest`는 무엇인가?

Tencent Cloud OpenAI-compatible chat/completions examples에 나오는 Hunyuan text/chat model alias다. TurboS Latest text route로 다루되 전체 Hunyuan product family의 영구 최신 보장으로 쓰지 않는다.

Hunyuan-Lite는 무료인가?

Tencent pricing page는 2026년 5월 8일 확인 시 Hunyuan-Lite를 free to use로 표시했다. 외부 문서나 production budget에는 official pricing page, account state, region, console behavior를 다시 확인한 뒤 쓴다.

번역은 Hunyuan Translation과 chat model 중 무엇을 쓰나?

번역이 주 작업이면 Hunyuan Translation을 쓴다. 번역이 broader reasoning이나 conversation workflow의 일부라면 chat completions를 검토한다. translation route가 budget, monitoring, language QA를 설명하기 쉽다.

Hunyuan 3D Engine은 TurboS 같은 API model인가?

아니다. Hunyuan 3D Engine은 async 3D asset workflow다. task submit, JobId storage, result query, retry, preview, review, storage가 필요하다.

provider page가 Tencent official pricing을 증명할 수 있나?

아니다. provider page는 provider route와 commercial terms를 증명한다. Tencent Cloud docs와 console이 Tencent official model names, pricing, quota, API behavior를 소유한다.

어느 route부터 테스트하나?

작업에서 시작한다. text/chat은 TurboS Latest, cheap text checks는 Lite, translation은 Hunyuan Translation, 3D assets는 Hunyuan 3D Engine이다. provider-compatible route는 official lane이 분명해진 뒤 비교한다.

Hunyuan API 모델 가이드: TurboS Latest, Lite, Translation, 3D Engine 선택법

목차

Hunyuan API 모델 가이드: TurboS Latest, Lite, Translation, 3D Engine 선택법

빠른 route board

공식 사실 소유자 분리

`hunyuan-turbos-latest`: 텍스트/채팅 route

Hunyuan-Lite: 저비용 텍스트 검증

Hunyuan Translation: 번역 전용 route

Hunyuan 3D Engine: 비동기 3D 에셋 pipeline

호환 제공자 route

production recheck checklist