Full Covariance State Duration Modeling for HMM-Based Speech Synthesis

Lu, Heng; Wu, Yi Jian; Tokuda, Keiichi; Dai, Li Rong; Wang, Ren hua

doi:http://dx.doi.org/10.1109/ICASSP.2009.4960513

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

{"_buckets": {"deposit": "8dfc4278-ba5f-4346-921d-adb65725abea"}, "_deposit": {"created_by": 3, "id": "3453", "owners": [3], "pid": {"revision_id": 0, "type": "depid", "value": "3453"}, "status": "published"}, "_oai": {"id": "oai:nitech.repo.nii.ac.jp:00003453", "sets": ["31"]}, "author_link": ["9159", "9160", "9156", "464", "462"], "item_5_biblio_info_5": {"attribute_name": "書誌情報", "attribute_value_mlt": [{"bibliographicIssueDates": {"bibliographicIssueDate": "2009-04-24", "bibliographicIssueDateType": "Issued"}, "bibliographicPageEnd": "4036", "bibliographicPageStart": "4033", "bibliographic_titles": [{"bibliographic_title": "ICASSP 2009. IEEE International Conference on Acoustics, Speech and Signal Processing, 2009."}]}]}, "item_5_description_16": {"attribute_name": "フォーマット", "attribute_value_mlt": [{"subitem_description": "application/pdf", "subitem_description_type": "Other"}]}, "item_5_description_4": {"attribute_name": "内容記述", "attribute_value_mlt": [{"subitem_description": "This paper proposes a state duration modeling method using full covariance  matrix for HMM-based speech synthesis. In this method, a full covariance matrix instead of the conventional diagonal covariance  matrix is adopted in the multi-dimensional Gaussian distribution to model the state duration of each context-dependent phoneme. At synthesis stage, the state  durations are predicted using the clustered context-dependent distributions with full  covariance matrices. Experimental results show that the synthesized speech using full-covariance state duration models is more natural than the conventional method when we change the speaking rate of synthesized speech.", "subitem_description_type": "Other"}]}, "item_5_full_name_3": {"attribute_name": "著者別名", "attribute_value_mlt": [{"affiliations": [{"affiliationNames": [{"affiliationName": "", "lang": "ja"}], "nameIdentifiers": [{"nameIdentifier": "", "nameIdentifierScheme": "ISNI", "nameIdentifierURI": "http://www.isni.org/isni/"}]}], "familyNames": [{"familyName": "Tokuda", "familyNameLang": "en"}, {"familyName": "徳田", "familyNameLang": "ja"}, {"familyName": "トクダ", "familyNameLang": "ja-Kana"}], "givenNames": [{"givenName": "Keiichi", "givenNameLang": "en"}, {"givenName": "恵一", "givenNameLang": "ja"}, {"givenName": "ケイイチ", "givenNameLang": "ja-Kana"}], "nameIdentifiers": [{"nameIdentifier": "464", "nameIdentifierScheme": "WEKO"}, {"nameIdentifier": "1000020217483", "nameIdentifierScheme": "NRID", "nameIdentifierURI": "http://rns.nii.ac.jp/nr/1000020217483"}], "names": [{"name": "Tokuda, Keiichi", "nameLang": "en"}, {"name": "徳田, 恵一", "nameLang": "ja"}, {"name": "トクダ, ケイイチ", "nameLang": "ja-Kana"}]}]}, "item_5_publisher_6": {"attribute_name": "出版者", "attribute_value_mlt": [{"subitem_publisher": "Institute of Electrical and Electronics Engineers"}]}, "item_5_relation_10": {"attribute_name": "DOI", "attribute_value_mlt": [{"subitem_relation_name": [{"subitem_relation_name_text": "10.1109/ICASSP.2009.4960513"}], "subitem_relation_type": "isIdenticalTo", "subitem_relation_type_id": {"subitem_relation_type_id_text": "http://dx.doi.org/10.1109/ICASSP.2009.4960513", "subitem_relation_type_select": "DOI"}}]}, "item_5_version_type_13": {"attribute_name": "著者版フラグ", "attribute_value_mlt": [{"subitem_version_resource": "http://purl.org/coar/version/c_970fb48d4fbd8a85", "subitem_version_type": "VoR"}]}, "item_creator": {"attribute_name": "著者", "attribute_type": "creator", "attribute_value_mlt": [{"creatorNames": [{"creatorName": "Lu, Heng", "creatorNameLang": "en"}], "nameIdentifiers": [{"nameIdentifier": "9156", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Wu, Yi Jian", "creatorNameLang": "en"}, {"creatorName": "Tokuda, Keiichi", "creatorNameLang": "en"}], "nameIdentifiers": [{"nameIdentifier": "462", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Dai, Li Rong", "creatorNameLang": "en"}], "nameIdentifiers": [{"nameIdentifier": "9159", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "Wang, Ren hua", "creatorNameLang": "en"}], "nameIdentifiers": [{"nameIdentifier": "9160", "nameIdentifierScheme": "WEKO"}]}]}, "item_files": {"attribute_name": "ファイル情報", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_date", "date": [{"dateType": "Available", "dateValue": "2017-01-17"}], "displaytype": "detail", "download_preview_message": "", "file_order": 0, "filename": "ICASSP2009_lu_heng.pdf", "filesize": [{"value": "207.3 kB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensefree": "c2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.", "licensetype": "license_free", "mimetype": "application/pdf", "size": 207300.0, "url": {"label": "本文_fulltext", "url": "https://nitech.repo.nii.ac.jp/record/3453/files/ICASSP2009_lu_heng.pdf"}, "version_id": "32aa4995-7592-4b57-b71e-b1910b3727df"}]}, "item_language": {"attribute_name": "言語", "attribute_value_mlt": [{"subitem_language": "eng"}]}, "item_resource_type": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"resourcetype": "conference paper", "resourceuri": "http://purl.org/coar/resource_type/c_5794"}]}, "item_title": "Full Covariance State Duration Modeling for HMM-Based Speech Synthesis", "item_titles": {"attribute_name": "タイトル", "attribute_value_mlt": [{"subitem_title": "Full Covariance State Duration Modeling for HMM-Based Speech Synthesis", "subitem_title_language": "en"}]}, "item_type_id": "5", "owner": "3", "path": ["31"], "permalink_uri": "https://nitech.repo.nii.ac.jp/records/3453", "pubdate": {"attribute_name": "公開日", "attribute_value": "2012-11-07"}, "publish_date": "2012-11-07", "publish_status": "0", "recid": "3453", "relation": {}, "relation_version_is_last": true, "title": ["Full Covariance State Duration Modeling for HMM-Based Speech Synthesis"], "weko_shared_id": 3}

Full Covariance State Duration Modeling for HMM-Based Speech Synthesis

https://nitech.repo.nii.ac.jp/records/3453

名前 / ファイル	ライセンス	アクション
本文_fulltext (207.3 kB)	c2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Item type

会議発表論文 / Conference Paper(1)

公開日

2012-11-07

タイトル

言語

タイトル

Full Covariance State Duration Modeling for HMM-Based Speech Synthesis

言語

eng

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

著者

Lu, Heng
Wu, Yi Jian
Dai, Li Rong
Wang, Ren hua

著者別名

言語

ja-Kana

姓名

Tokuda, Keiichi

言語

姓名

徳田, 恵一

言語

姓名

トクダ, ケイイチ

言語

ja-Kana

書誌情報

ICASSP 2009. IEEE International Conference on Acoustics, Speech and Signal Processing, 2009.

p. 4033-4036, 発行日 2009-04-24

出版者

Institute of Electrical and Electronics Engineers

著者版フラグ

出版タイプ

VoR

出版タイプResource

http://purl.org/coar/version/c_970fb48d4fbd8a85

DOI

Versions

Ver.1

2023-05-15 13:48:46.174681

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

Full Covariance State Duration Modeling for HMM-Based Speech Synthesis

× Lu, Heng

× Wu, Yi Jian

× Dai, Li Rong

× Wang, Ren hua

Versions

Share

Cite as

エクスポート