AniDB Definition:Romanisation

Revision as of 07:47, 26 August 2014 by Benu (talk | contribs) (→‎Counters)
The information on this page is incomplete and may not be of much use.
If you can, please help by adding to it.

The information on this page is provided as guidelines on the use of romanised Japanese (rōmaji) in AniDB. Please be aware that this is not an exact science: there are many viable solutions to the same problem. When submitting change requests on romanised titles, however, users are expected to adhere to the 'house style' of the database. When there is contention over a particular issue, this page will provide both alternatives. External links to Wikipedia are provided throughout for ideas and terms that might be unfamiliar.

What romanised titles are for

  • Primarily, to provide a transcription of the Japanese title that is aurally recognisable and readable by a user with little or no knowledge of the language. Using Roman script obviously targets speakers of European languages, but as these constitute a majority of AniDB users, this is a fair restriction.
  • Secondary purposes include enabling rough pronunciation of titles, providing an alternative method of searching for a Japanese title, assisting novices in reading unfamiliar words, and clarification of the reading of a particular word or phrase where it might be ambiguous.

What romanised titles aren't for

  • There is no requirement to be able to reconstruct the original title from romanised form. With three distinct scripts plus Roman, a wide range of homophones, and typographic intricacies such as furigana usage, this is beyond the scope of a 26 letter alphabet. In all cases the Japanese title should be presented as well; a romanised form is in no way a replacement for it.
  • Furthermore, the romanisation need not be a lossless transliteration of Japanese spelling. Though less so than English, Japanese pronunciation deviates somewhat from the phonemic spelling. As the aim is to provide an aurally recognisable transcription, it is more important to reflect the sound than the exact spelling.
  • Romanised titles do not need to provide a basis for correct Japanese collation of titles. This is a technical problem that would be better handled correctly through its own system, and would interfere with the primary purpose of the romanisation.
  • Romanisations need not have an 'official' status. Though both the Japanese makers and international licensees might provide a romanised title, this is irrelevant to a transcription of the Japanese title - except arguably in the case of names.

Hepburn romanisation

The Hepburn romanisation system was devised for a Japanese–English dictionary, published in 1867. Despite having no official status, variations of it are used for a vast majority of transcriptions, both inside and outside Japan. Unlike the two other main romanisation schemes, it concentrates on representing Japanese phonology rather than the underlying spelling.

Table of kana romanisation

Each mora represented in the kana spelling of a Japanese word can be transcribed into Roman letters according to the table below, with a few special cases that are listed in the following sections. The hiragana is on the left, katakana is on the right.

Table adapted from the Wikipedia article on Hepburn. Obsolete kana are shown in red.

a i u e o (ya) (yu) (yo)
ka ki ku ke ko きゃ kya キャ きゅ kyu キュ きょ kyo キョ
sa shi su se so しゃ sha シャ しゅ shu シュ しょ sho ショ
ta chi tsu te to ちゃ cha チャ ちゅ chu チュ ちょ cho チョ
na ni nu ne no にゃ nya ニャ にゅ nyu ニュ にょ nyo ニョ
ha hi fu he ho ひゃ hya ヒャ ひゅ hyu ヒュ ひょ hyo ヒョ
ma mi mu me mo みゃ mya ミャ みゅ myu ミュ みょ myo ミョ
ya yu yo
ra ri ru re ro りゃ rya リャ りゅ ryu リュ りょ ryo リョ
wa wi we wo
n
ga gi gu ge go ぎゃ gya ギャ ぎゅ gyu ギュ ぎょ gyo ギョ
za ji zu ze zo じゃ ja ジャ じゅ ju ジュ じょ jo ジョ
da (ji) (zu) de do ぢゃ (ja) ヂャ ぢゅ (ju) ヂュ ぢょ (jo) ヂョ
ba bi bu be bo びゃ bya ビャ びゅ byu ビュ びょ byo ビョ
pa pi pu pe po ぴゃ pya ピャ ぴゅ pyu ピュ ぴょ pyo ピョ
Extended Katakana - These are used mainly to represent the sounds in words in other languages. Most of these are not formally standardized and some are very rarely used.
ye イェ
wi ウィ we ウェ wo ウォ
va ヷ vi ヸ ve ヹ vo ヺ
va ヴァ vi ヴィ vu ヴ ve ヴェ vo ヴォ
she シェ
je ジェ
ti ティ tu トゥ che チェ tyu テュ
di ディ du ドゥ dyu デュ
tsa ツァ tsi ツィ tse ツェ tso ツォ
fa ファ fi フィ fe フェ fo フォ fyu フュ
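
As a rough illustration of how the table is applied, the following Python sketch looks up each mora, trying two-kana combinations such as きゃ before single kana. The names `MONOGRAPHS`, `DIGRAPHS` and `romanise_kana` are made up for this example, only part of the table is included, and the special cases described in the following sections are not handled.

```python
# Partial copy of the table above; extend with the remaining rows as needed.
MONOGRAPHS = {
    "あ": "a", "い": "i", "う": "u", "え": "e", "お": "o",
    "か": "ka", "き": "ki", "く": "ku", "け": "ke", "こ": "ko",
    "さ": "sa", "し": "shi", "す": "su", "せ": "se", "そ": "so",
    "た": "ta", "ち": "chi", "つ": "tsu", "て": "te", "と": "to",
    "は": "ha", "ひ": "hi", "ふ": "fu", "へ": "he", "ほ": "ho",
    "ん": "n",
}
DIGRAPHS = {
    "きゃ": "kya", "きゅ": "kyu", "きょ": "kyo",
    "しゃ": "sha", "しゅ": "shu", "しょ": "sho",
    "ちゃ": "cha", "ちゅ": "chu", "ちょ": "cho",
}


def romanise_kana(text: str) -> str:
    """Transcribe hiragana mora by mora; unknown characters pass through."""
    result = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in DIGRAPHS:
            # Two-kana combinations take priority over single kana.
            result.append(DIGRAPHS[pair])
            i += 2
        else:
            result.append(MONOGRAPHS.get(text[i], text[i]))
            i += 1
    return "".join(result)
```

With this partial table, `romanise_kana("ちょこ")` gives `choko` and `romanise_kana("しゃしん")` gives `shashin`.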

Special cases

Hepburn also has a few extra rules to deal with particular cases. The AniDB house style adheres to the ones below.

The particle spelling rules exist to reflect modern Japanese pronunciation. Note that there are other features that Hepburn does not attempt to reflect, for instance the frequent dropping of the vowel /u/ (です is only pronounced 'desu' by kids), largely because there's no easy rule that could always be applied. The 'small tsu' rules reflect the fact that it is used in two rather different ways, and the syllabic n case deals with the problem that a transcription might otherwise be ambiguous in a few cases.

Particle は as wa

Intro to は by Tae Kim

This rule is accepted by practically everyone and is generally only ignored in error.

When used as a particle, transcribe は as 'wa' rather than 'ha'.

  • Better represents the pronunciation
  • Common practice everywhere

Particle へ as e

Intro to へ by Tae Kim

Sometimes contested, as romanisations that ignore this rule are somewhat more common. Use 'e' in preference, but if adding an anime title where 'he' is sometimes used, add that alternative as a synonym.

When used as a particle, transcribe へ as 'e' rather than 'he'.

  • Better represents the pronunciation.
  • Established Hepburn rule, and widespread usage by those who follow transcription rules strictly.
  • Titles will save one character per へ particle.

Transcribing へ as 'he', even when particle.

  • One less rule to remember.
  • Common practice amongst fansubbers.
  • Some titles including the particle へ are generally called by names romanised with 'e' by fans.
Kita e: ~Diamond Dust Drops~ (北へ。 ~Diamond Dust Drops~), a title that is also particularly resistant to more sensible punctuation.
  • There's not much difference between pronunciation of /he/ and /e/.

Particle を as o

Intro to を by Tae Kim

When used as a particle, transcribe を as 'o' rather than 'wo'. (You won't ever see it used in a normal word, so this means always, although exceptions can occur in names.)

  • Better represents the pronunciation.
  • Established Hepburn rule, and widespread usage by those who follow transcription rules strictly.

Transcribing を as 'wo', even when particle.

  • One less rule to remember.
  • Common practice amongst fansubbers.
  • Many titles including the particle を are generally called by names romanised with 'wo' by fans.
Full Moon wo Sagashite (満月(フルムーン)をさがして)
Mimi wo Sumaseba (耳をすませば)
Ace wo Nerae! (エースをねらえ!)

Discussion: 2004.06 (old forum) / 2004.06 (old forum) / 2005.07 (old forum) (warning: profanity)
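
The three particle rules can be sketched as a lookup applied only to tokens already identified as particles. Deciding whether a kana is a particle needs grammatical knowledge, so this Python sketch assumes the caller supplies that flag. `PARTICLE_READINGS` and `romanise_tokens` are hypothetical names, and the 'e' and 'o' readings follow the established Hepburn rule rather than the contested alternatives.

```python
PARTICLE_READINGS = {"は": "wa", "へ": "e", "を": "o"}


def romanise_tokens(tokens):
    """tokens: iterable of (kana, plain_romaji, is_particle) triples."""
    words = []
    for kana, plain_romaji, is_particle in tokens:
        if is_particle and kana in PARTICLE_READINGS:
            # Particle reading replaces the table reading (ha/he/wo).
            words.append(PARTICLE_READINGS[kana])
        else:
            words.append(plain_romaji)
    return " ".join(words)


title = [("みみ", "Mimi", False), ("を", "wo", True), ("すませば", "Sumaseba", False)]
print(romanise_tokens(title))  # Mimi o Sumaseba
```

A は that is part of an ordinary word, such as はな, is passed with `is_particle=False` and keeps its table reading.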

っ when geminate consonant

Really a very simple rule, complicated by one particular case. When っ indicates a stop, the easy way to show that in the Roman alphabet is with a doubled consonant. However, for っち/っちゃ/っちゅ/っちょ the cluster tch is probably a better transcription than cch (which is also confused by its use in Italian). Which one is used tends to come down to individual words, which makes applying a general rule very difficult.

When part of a word, transcribe っ by doubling the following consonant, except っち as 'tchi' and similar.

  • May better represent pronunciation.
  • Some common words are best known with a 'tch' transcription.
Anime TV de Hakken! Tamagotchi (アニメ TVで発見!! たまごっち) The toys are best known as Tamagotchi, the spelling 'tamagocchi' not used.
Touch (タッチ) (tatchi) is a pun on たっちゃん Tat[suya]-chan, but the っちゃん ending can be used for any name.

When part of a word, always transcribe っ by doubling the following consonant.

  • One less rule to remember.
  • Some common words are best known with a 'cch' transcription.
Futari Ecchi (ふたりエッチ) Ecchi has been borrowed back into English, and almost always spelt with the 'cch' - though this particular title is arguably just 'Futari H'.
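
A minimal sketch of the first option (double the consonant, except 'tch' before chi/cha/chu/cho), written in Python. It works on the romanisation of the mora that follows っ. The function name `romanise_sokuon` is an assumption of this example, and word-by-word exceptions such as Ecchi are not modelled.

```python
def romanise_sokuon(following: str) -> str:
    """Return the romanisation of っ plus the mora romanised as `following`."""
    if not following or following[0] in "aiueo":
        raise ValueError("expected a consonant-initial mora after っ")
    if following.startswith("ch"):
        return "t" + following       # っち -> tchi, っちゃ -> tcha
    return following[0] + following  # っぴ -> ppi, っつ -> ttsu


print("tamago" + romanise_sokuon("chi"))   # tamagotchi
print("i" + romanise_sokuon("pi") + "ki")  # ippiki
```

Switching to the second option (always double) would only mean removing the `ch` branch.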

っ when exclamation

Commonly either given as an exclamation mark or just dropped, the former is preferable.

Transcribe っ at the end of a word as '!', unless followed by one anyway, in which case drop.

  • The っ as surprise/intonation marker is broadly equivalent to an exclamation mark.
  • Semantically, !!, っ! and ! are all equivalent; typography isn't important for transcriptions.
AA! Megami-sama! (ああっ女神さまっ)
Tsuruhime Ja! (つる姫じゃーっ!)

Discard っ at the end of a word in transcription.

  • Not exactly crucial, is it. Is it? (punctuation joke, sorry)

Transcribe っ at the end of a word as a trailing 'h'.

  • More appropriate for some endings than others, so it is context-dependent: 'ah' is sensible, 'ih' is just odd.
  • Potential for confusion with the habit of transcribing long vowels with an h, 'oh' could be おう or おっ.
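
The preferred option (a final っ becomes '!', and is dropped when an exclamation mark already follows) might be sketched as below. The helper name `final_mark` is hypothetical, and the sketch only decides which punctuation to append to the romanised word. It does not romanise the word itself.

```python
SOKUON = ("っ", "ッ")


def final_mark(kana_word: str) -> str:
    """Return the punctuation to append to the romanised form of kana_word."""
    stripped = kana_word.rstrip("!！")
    had_exclamation = len(stripped) != len(kana_word)
    # A trailing っ, an existing '!', or both all collapse to a single '!'.
    if had_exclamation or stripped.endswith(SOKUON):
        return "!"
    return ""


print("Megami-sama" + final_mark("さまっ"))  # Megami-sama!
```

For つる姫じゃーっ!, where っ is already followed by an exclamation mark, the result is still a single '!'.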

ん before vowels as n'

Accepted.
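
The rule is stated here without further explanation, but it is what produces forms such as Gin'iro and Kin'iro in the 色 lists on this page: without the apostrophe, ぎんいろ (gi-n-i-ro) could be misread as gi-ni-ro. A small Python sketch, assuming the word has already been split into romanised morae (the name `join_morae` is invented for this example):

```python
VOWELS = set("aiueo")


def join_morae(morae):
    """Join romanised morae, marking syllabic n before a vowel with an apostrophe."""
    pieces = []
    for i, mora in enumerate(morae):
        pieces.append(mora)
        following = morae[i + 1] if i + 1 < len(morae) else ""
        if mora == "n" and following[:1] in VOWELS:
            pieces.append("'")
    return "".join(pieces)


print(join_morae(["gi", "n", "i", "ro"]))  # gin'iro
print(join_morae(["sa", "n", "ni", "n"]))  # sannin
```

The syllabic n is left as 'n' in every other position, including before b and p as in Sanbiki.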

Deviations from Hepburn

Note: these are rules in Hepburn that the AniDB house style does not follow, for the reasons given.

Macron usage for long vowels

Not accepted.

ん before labial consonants as m

Not accepted.

Loanwords in Japanese

The description is missing or severely incomplete.
If you can, please help by explaining it.

Spell in original language where possible

The description is missing or severely incomplete.
If you can, please help by explaining it.

What to do with wasei eigo terms

The description is missing or severely incomplete.
If you can, please help by explaining it.

What to do with names and invented terms

The description is missing or severely incomplete.
If you can, please help by explaining it.

Other orthography issues

The description is missing or severely incomplete.
If you can, please help by explaining it.

Anything that doesn't fit into the above major categories.

Capitalisation

Use an initial capital letter for each word (単語).

See the Capitalisation guide.

Spacing

Separate each word (単語 (Tango)) and particle (助詞 (Joshi)).

Exception: (question) particle か = ka

When ka is used as an indicator for a question (most of the time at the end of a sentence), it is attached to the verb.
Example: 私の家へ行きますか - Watashi no Uchi e Ikimasuka

When ka is used to indicate a choice in the middle of a sentence, which includes a noun, it is split off.
Example: コーヒーか茶か - Kohi ka Cha ka

隊 (tai)

Separate 隊 (Tai, "Group"), except when it's actually part of another word (e.g. 軍隊 (Guntai, "Army/Troops")). Don't hyphenate.

Example:

少女隊 - Shoujo Tai (Shoujo Tai, a Japanese girl band from the 1980s)

号 (gou)

Separate 号 (Gou, "Vessel" / "Ship" / "Issue" / [...]), except when it's actually part of another word. Don't hyphenate.

Example:

ベザン・ブラック号 - Bezan Black Gou (Bezan Black Gou, character/ship from One Piece)

Honorifics

(Temporary rule, not final.) Honorifics that are not standalone words are attached with '-'. Otherwise, just separate them with a space.

e.g.
AA! Megami-sama
but
Arete Hime

たち (tachi)

Always split from the associated word and attach with '-'.

e.g.
Elf o Karu Mono-tachi
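
The honorific and たち rules can be read as one decision: bound suffixes are hyphenated in lower case, standalone words are spaced and capitalised. A hedged Python sketch follows. The set `BOUND_SUFFIXES` contains only the suffixes that appear in the examples on this page, and both it and `attach_suffix` are names invented here.

```python
# Suffixes treated as non-standalone; only those shown in the examples above.
BOUND_SUFFIXES = {"sama", "tachi"}


def attach_suffix(word: str, suffix: str) -> str:
    """Attach a suffix with '-' if it is bound, otherwise as a separate word."""
    suffix = suffix.lower()
    if suffix in BOUND_SUFFIXES:
        return f"{word}-{suffix}"
    return f"{word} {suffix.capitalize()}"


print(attach_suffix("Megami", "sama"))  # Megami-sama
print(attach_suffix("Arete", "hime"))   # Arete Hime
print(attach_suffix("Mono", "tachi"))   # Mono-tachi
```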

色 (iro)

Dictionary Words

Real Japanese compounds that can be found in a dictionary are treated as one word on AniDB:

藍色 - Aiiro ("Indigo Blue")
茜色 - Akaneiro ("Madder Red")
赤色 - Akairo ("Red")
薔薇色/バラ色/ばら色 - Barairo ("Rose Coloured")
橙色 - Daidaiiro ("Orange")
艶色 - Enshoku ("Charming / Wonderful Colour")
銀色 - Gin'iro ("Silver Coloured")
灰色 - Haiiro ("Grey")
緋色 - Hiiro, Hishoku ("Scarlet", "Cardinal")
黄色い - Kiiroi ("Yellow")
金いろ - Kin'iro ("Golden")
金色 - Konjiki ("Golden")
水色 - Mizuiro ("Light Blue")
桃色 - Momoiro ("Pink", "Rosy")
七色/なないろ - Nanairo (describes the seven colours of the rainbow)
音色 - Neiro ("Tone colour", "tone (quality)")
瑠璃色 - Ruriiro ("Azure")
緑色 - Ryokushoku/Midoriiro ("Green")
桜色/サクライロ/さくらいろ - Sakurairo ("Pink", "Cherry Blossom Coloured")
真珠色 - Shinjuiro ("Pearl Grey")
秋色 - Shuushoku ("Autumn/Fall Scenery")
空色/ソライロ - Sorairo ("Sky Coloured")
鴇色 - Tokiiro ("Pale Pink", "Pale Rose")

Non-Dictionary Words

Compounds with 色 that can't be found in a Japanese dictionary are separated with "-":

雨色 - Ame-iro ("Rain Coloured")
あなた色 - Anata-iro ("You-Coloured")
朝色 - Asa-iro ("Morning Coloured")
不思議色 - Fushigi-iro ("Mysterious Coloured")
グンジョ色 - Gunjo-iro (群青, Gunjou: Ultramarine)
初色 - Hatsu-iro ("First (Time) Coloured")
枯れ葉色 - Kareha-iro ("Colour of Dead/Dry Leaves")
君色 - Kimi-iro ("You-Coloured")
恋色 - Koi-iro ("Love Coloured")
ココロいろ - Kokoro-iro (Kokoro means Heart, but can also be Soul, thus it's either "Heart Coloured" or "Soul Coloured")
マーブル色 - Marble-iro ("Marble Coloured")
モーブ色 - Mauve-iro ("Mauve Coloured")
みらいいろ - Mirai-iro ("Future-Coloured")
紫水晶色 - Murasakisuishou-iro ("Amethyst Coloured")
ナミダイロ - Namida-iro ("Tear Coloured")
夏色 - Natsu-iro ("Summer Coloured")
ニビイロ - Nibi-iro ("Nibi Coloured")
虹色 - Niji-iro ("Rainbow Coloured")
オレンジ色 - Orange-iro ("Orange (Coloured)")
セピア色 - Sepia-iro ("Sepia Coloured")
修羅色 - Shura-iro ("Fighting/Battle Coloured")
ときめき色 - Tokimeki-iro (ときめく, tokimeku: to throb, to flutter, to palpitate; send your throb-coloured things to us!)
夢色 - Yume-iro ("Dream Coloured")
百合色 - Yuri-iro ("Lily Coloured")
ユウヤケイロ - Yuuyake-iro ("Sunset Coloured")

Special Guests

A list of anime specific "colours" that are purely fictional:

ラメ色 - Lum-iro (Lum, a character from Urusei Yatsura)
トモカネいろ - Tomokane-iro (Tomokane, a character from GA: Geijutsuka Art Design Class)
ノダミキいろ - Noda Miki-iro (Noda Miki, a character from GA: Geijutsuka Art Design Class)
ナミコいろ - Namiko-iro (Namiko, a character from GA: Geijutsuka Art Design Class)
キョージュいろ - Kyoju-iro (Kyoju, nickname for a character from GA: Geijutsuka Art Design Class)
キサラギいろ - Kisaragi-iro (Kisaragi, a character from GA: Geijutsuka Art Design Class)

This list is of course not complete, and new colours will be added as they are encountered.
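
The split between dictionary and non-dictionary 色 compounds amounts to a lookup. The Python sketch below uses a hypothetical set `DICTIONARY_STEMS` holding a handful of stems from the dictionary list above. It only covers compounds read with -iro (not readings such as Enshoku or Konjiki), and it adds the apostrophe after a final n as in Gin'iro.

```python
# Small sample of stems from the dictionary list; not the complete list.
DICTIONARY_STEMS = {"Ai", "Akane", "Gin", "Hai", "Kin", "Mizu", "Momo", "Sakura", "Sora"}


def romanise_iro(stem: str) -> str:
    """Join a romanised stem with 'iro' according to the dictionary rule."""
    if stem not in DICTIONARY_STEMS:
        return f"{stem}-iro"   # non-dictionary compound: hyphenate
    if stem.endswith("n"):
        return f"{stem}'iro"   # syllabic n before a vowel
    return f"{stem}iro"        # dictionary compound: one word


print(romanise_iro("Mizu"))  # Mizuiro
print(romanise_iro("Gin"))   # Gin'iro
print(romanise_iro("Yume"))  # Yume-iro
```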

Counters

When making a compound between a number and one of the many, many, many Japanese counters, the two words merge into one.

Here you will find a list of some useful counters and how they should be transcribed on AniDB.

Counting in General

# Kanji Hiragana Transcription
1 一つ ひとつ Hitotsu
2 二つ ふたつ Futatsu
3 三つ みっつ Mittsu
4 四つ よっつ Yottsu
5 五つ いつつ Itsutsu
6 六つ むつ Mutsu
7 七つ ななつ Nanatsu
8 八つ やっつ Yattsu
9 九つ ここのつ Kokonotsu
10 十 とお Too

People (Counter: 人)

# Kanji Hiragana Transcription
1 一人 ひとり Hitori
2 二人 ふたり Futari
3 三人 さんにん Sannin
4 四人 よにん Yonin
5 五人 ごにん Gonin
6 六人 ろくにん Rokunin
7 七人 ななにん / しちにん Nananin / Shichinin
8 八人 はちにん Hachinin
9 九人 きゅうにん Kyuunin
10 十人 じゅうにん Juunin

Small Animals (up until the size of a dog) (Counter: 匹)

# Kanji Hiragana Transcription
1 一匹 いっぴき Ippiki
2 二匹 にひき Nihiki
3 三匹 さんびき Sanbiki
4 四匹 よんひき Yonhiki
5 五匹 ごひき Gohiki
6 六匹 ろっぴき Roppiki
7 七匹 ななひき / しちひき Nanahiki / Shichihiki
8 八匹 はっぴき Happiki
9 九匹 きゅうひき Kyuuhiki
10 十匹 じゅっぴき Juppiki

Large Animals (Counter: 頭)

# Kanji Hiragana Transcription
1 一頭 いっとう Ittou
2 二頭 にとう Nitou
3 三頭 さんとう Santou
4 四頭 よんとう Yontou
5 五頭 ごとう Gotou
6 六頭 ろくとう Rokutou
7 七頭 ななとう Nanatou
8 八頭 はっとう / はちとう Hattou / Hachitou
9 九頭 きゅうとう Kyuutou
10 十頭 じゅっとう Juttou
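
Because the readings change irregularly (Ippiki, Sanbiki, Roppiki), a lookup table is the simplest way to model the counter rule. The Python sketch below copies a few rows from the tables above. `COUNTER_READINGS` and `counter_readings` are hypothetical names, and the data is deliberately incomplete.

```python
# A few rows from the tables above; each value lists the accepted readings.
COUNTER_READINGS = {
    "人": {1: ("Hitori",), 2: ("Futari",), 3: ("Sannin",), 7: ("Nananin", "Shichinin")},
    "匹": {1: ("Ippiki",), 3: ("Sanbiki",), 6: ("Roppiki",), 7: ("Nanahiki", "Shichihiki")},
    "頭": {1: ("Ittou",), 8: ("Hattou", "Hachitou"), 10: ("Juttou",)},
}


def counter_readings(number: int, counter: str):
    """Return the one-word transcriptions for number + counter, or () if not tabulated."""
    return COUNTER_READINGS.get(counter, {}).get(number, ())


print(counter_readings(3, "匹"))  # ('Sanbiki',)
print(counter_readings(7, "人"))  # ('Nananin', 'Shichinin')
```

Each result is a single merged word, in line with the rule that number and counter are not separated.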

Punctuation

The description is missing or severely incomplete.
If you can, please help by explaining it.

Practical guide

The description is missing or severely incomplete.
If you can, please help by explaining it.