AniDB Definition:Romanisation

Revision as of 18:16, 25 July 2010 by Nwa (talk | contribs)
The information on this page is incomplete and may not be of much use.
If you can, please help by adding to it.

The information on this page is provided as guidelines on the use of romanised Japanese (rōmaji) in AniDB. Please be aware that this is not an exact science, there are many viable solutions to the same problem, though when submitting change requests on romanised titles users are expected to adhere to the 'house style' of the database. When there is contention over a particular issue, this page will provide both alternatives. External links to Wikipedia are provided throughout for ideas and terms that might be unfamiliar.

What romanised titles are for

  • Primarily, to provide a transcription of the Japanese title that is aurally recognisable and readable by a user with little or no knowledge of the language. In using Roman script, this is obviously targeted at speakers of European languages, however as this constitutes a majority of the population of AniDB users, this is a fair restriction.
  • Secondary purposes include enabling rough pronunciation of titles, providing an alternative method of searching for a Japanese title, assisting novices in reading unfamiliar words, and clarification of the reading of a particular word or phrase where it might be ambiguous.

What romanised titles aren't for

  • There is no requirement to be able to reconstruct the original title from romanised form. With three distinct scripts plus Roman, a wide range of homophones, and typographic intricacies such as furigana usage, this is beyond the scope of a 26 letter alphabet. In all cases the Japanese title should be presented as well, a romanised form is in no way a replacement for this.
  • Further more, the romanisation need not be a lossless transliteration of Japanese spelling. Though less so than English, Japanese pronunciation deviates somewhat from the phonemic spelling. As the aim is to provide an aurally recognisable transcription, it is more important to better reflect the sound than exact spelling.
  • Romanised titles do not need to provide a basis for correct Japanese collation of titles. This is a technical problem that would be better handled correctly through its own system, and would interfere with the primary purpose of the romanisation.
  • Romanisations need not have an 'official' status. Though both the Japanese makers and international licensees might provide a romanised title, this is irrelevant to a transcription of the Japanese title - except arguably in the case of names.

Hepburn romanisation

The Hepburn romanisation system was devised for a Japanese–English dictionary, published in 1867. Despite having no official status, variations of it are used for a vast majority of transcriptions, both inside and outside Japan. Unlike the two other main romanisation schemes, it concentrates of representing Japanese phonology rather than the underlying spelling.

Table of kana romanisation

Each mora represented in the kana spelling of a Japanese word can be transcribed into Roman letters according to the table below, with a few special cases that are listed in the following sections. The hiragana is on the left, katakana is on the right.

Table adapted from wikipedia article on Hepburn. Obsolete kana are shown in red.

a i u e o (ya) (yu) (yo)
ka ki ku ke ko きゃ kya キャ きゅ kyu キュ きょ kyo キョ
sa shi su se so しゃ sha シャ しゅ shu シュ しょ sho ショ
ta chi tsu te to ちゃ cha チャ ちゅ chu チュ ちょ cho チョ
na ni nu ne no にゃ nya ニャ にゅ nyu ニュ にょ nyo ニョ
ha hi fu he ho ひゃ hya ヒャ ひゅ hyu ヒュ ひょ hyo ヒョ
ma mi mu me mo みゃ mya ミャ みゅ myu ミュ みょ myo ミョ
ya yu yo
ra ri ru re ro りゃ rya リャ りゅ ryu リュ りょ ryo リョ
wa wi we wo
n
ga gi gu ge go ぎゃ gya ギャ ぎゅ gyu ギュ ぎょ gyo ギョ
za ji zu ze zo じゃ ja ジャ じゅ ju ジュ じょ jo ジョ
da (ji) (zu) de do ぢゃ (ja) ヂャ ぢゅ (ju) ヂュ ぢょ (jo) ヂョ
ba bi bu be bo びゃ bya ビャ びゅ byu ビュ びょ byo ビョ
pa pi pu pe po ぴゃ pya ピャ ぴゅ pyu ピュ ぴょ pyo ピョ
Extended Katakana - These are used mainly to represent the sounds in words in other languages. Most of these are not formally standardized and some are very rarely used.
ye イェ
wi ウィ we ウェ wo ウォ
va ヷ vi ヸ ve ヹ vo ヺ
va ヴァ vi ヴィ vu ヴ ve ヴェ vo ヴォ
she シェ
je ジェ
ti ティ tu トゥ che チェ tyu テュ
di ディ du ドゥ dyu デュ
tsa ツァ tsi ツィ tse ツェ tso ツォ
fa ファ fi フィ fe フェ fo フォ fyu フュ

Special cases

Hepburn also has a few extra rules to deal with particular cases, the ones below the AniDB house style adheres to.

The particle spelling rules exist to reflect modern Japanese pronunciation, note there are other features that Hepburn does not attempt to reflect, for instance the frequent dropping of the vowel /u/ (です is only pronounced 'desu' by kids), largely because there's no easy rule that could always be applied. The 'small tsu' rules reflect the fact it used in two rather different ways, and the syllabic n case is to deal with the problem that transcription might be ambiguous in a few cases.

Particle は as wa

Intro to は by Tae Kim

This rule is basically accepted by everyone, generally only ignored in error.

When used as a particle, transcribe は as 'wa' rather than 'ha'

  • Better represents the pronunciation
  • Common practice everywhere

Particle へ as e

Intro to へ by Tae Kim

Sometimes contested, as romanisations that ignore this rule are somewhat more common. Use 'e' in preference, but if adding an anime title where 'he' is sometimes used, add that alternative as a synonym.

When used as a particle, transcribe へ as 'e' rather than 'he'.

  • Better represents the pronunciation.
  • Established Hepburn rule, and widespread usage by those who follow transcription rules strictly.
  • Titles will save one character per へ particle.

Always transcribe へ as 'he', even when particle.

  • One less rule to remember.
  • Common practice amongst fansubbers.
  • Some titles including the particle へ are generally called by names romanised with 'e' by fans.
Kita e: ~Diamond Dust Drops~ (北へ。 ~Diamond Dust Drops~) also particularly resistant to using more sensible punctuation.
  • There's not much difference between pronunciation of /he/ and /e/.

Particle を as o

Intro to を by Tae Kim

Often contested, many romanisations in the wild will use 'wo'. Feel free to use either, but do not make change requests that just change the particle... If adding as an anime title, include a synonym with the alternative romanisation.

When used as a particle (you won't ever see it used in a normal word, so this means always (except when you do, as in names, or old words sometimes written in kana ... (like this? http://ja.wikipedia.org/wiki/ちみもりを also, use the talk page, you tards))), transcribe を as 'o' rather than 'wo'.

  • Better represents the pronunciation.
  • Established Hepburn rule, and widespread usage by those who follow transcription rules strictly.

Always transcribe を as 'wo', even when particle.

  • One less rule to remember.
  • Common practice amongst fansubbers.
  • Many titles including the particle を are generally called by names romanised with 'wo' by fans.
Full Moon wo Sagashite (満月(フルムーン)をさがして)
Mimi wo Sumaseba (耳をすませば)
Ace wo Nerae! (エースをねらえ!)

Discussion: 2004.06 (old forum) / 2004.06 (old forum) / 2005.07 (old forum) (warning: profanity)

っ when geminate consonant

Really a very simple rule, complicated by one particular case. When っ is indicating a stop, the easy way to show that in the Roman alphabet is with a doubled consonant. However for っち/っちゃ/っちゅ/っちょ the cluster tch is a probably a better transcription than cch (which is also confused by use in Italian) - but which is used tends to come down to individual words, which makes applying a general rule very difficult.

When part of a word, transcribe っ by doubling the following consonant, except っち as 'tchi' and similar.

  • May better represent pronunciation.
  • Some common words are best known with a 'tch' transcription.
Anime TV de Hakken! Tamagotchi (アニメ TVで発見!! たまごっち) The toys are best known as Tamagotchi, the spelling 'tamagocchi' not used.
Touch (タッチ) (tatchi) is a pun on たっちゃん Tat[suya]-chan, but the っちゃん ending can be used for any name.

When part of a word, always transcribe っ by doubling the following consonant.

  • One less rule to remember.
  • Some common words are best known with a 'cch' transcription.
Futari Ecchi (ふたりエッチ) Ecchi has been borrowed back into English, and almost always spelt with the 'cch' - though this particular title is arguably just 'Futari H'.

っ when exclamation

Commonly either given as an exclamation mark or just dropped, the former is preferable.

Transcribe っ at the end of a word as '!', unless followed by one anyway, in which case drop.

  • The っ as surprise/intonation marker is broadly equivalent to an exclamation mark.
  • !! is っ! is ! semanticly, typography isn't important for transcriptions.
AA! Megami-sama! (ああっ女神さまっ)
Tsuruhime Ja! (つる姫じゃーっ!)

Discard っ at the end of a word in transcription.

  • Not exactly crucial, is it. Is it? (punctuation joke, sorry)

Transcribe っ at the end of a word as a trailing 'h'.

  • More appropriate for some endings than others, is context dependant, 'ah' is sensible, 'ih' is just odd.
  • Potential for confusion with the habit of transcribing long vowels with an h, 'oh' could be おう or おっ.

ん before vowels as n'

Accepted.

Deviations from Hepburn

Note These are rules in Hepburn that the AniDB house style does not follow, for the reasons given.

Macron usage for long vowels

Not accepted.

ん before labial consonants as m

Not accepted.

Loanwords in Japanese

The description is missing or severely incomplete.
If you can, please help by explaining it.

Spell in original language where possible

The description is missing or severely incomplete.
If you can, please help by explaining it.

What to do with wasei eigo terms

The description is missing or severely incomplete.
If you can, please help by explaining it.

What to do with names and invented terms

The description is missing or severely incomplete.
If you can, please help by explaining it.

Other orthography issues

The description is missing or severely incomplete.
If you can, please help by explaining it.

Spacing

Separate each 単語 and 助詞.

Separate 隊 except when part of another word (e.g. 軍隊). Don't hyphenate.

Honorifics

(temp rule not final) Honorifics that are no standalone words are added with '-'. Otherwise just separate them by spacing.

i.e.
AA! Megami-sama
but
Arete Hime

たち (tachi)

Always split from the associated word and set '-'. i.e.
Elf o Karu Mono-tachi

色 (iro)

Capitalisation

Use an initial capital letter for 単語.

Punctuation

The description is missing or severely incomplete.
If you can, please help by explaining it.

Practical guide

The description is missing or severely incomplete.
If you can, please help by explaining it.
MediaWiki spam blocked by CleanTalk.
MediaWiki spam blocked by CleanTalk.