The Morrison Collection
Morrison Texts

Table of Contents

Verse and Poetry



Vernacular Short Stories



Cantonese Opera



Notes on the Transcription of Texts

  1. All texts are transcribed as accurately as possible, retaining fidelity to the form of the original character wherever possible. This means that simplified and vulgar form characters are not normalised if the character is encoded in the CJK Unified Ideographs basic block (U4E00-U9FAF) of Unicode 3.1. If you only have a Big5 font, then some characters will not be displayable.
  2. Characters which are not encoded in Unicode are represented by 〇, with the normalised form of the character following in tortoise shell brackets (e.g. 〇〔眾〕 for the common vulgar form of the character 眾 which written like the character 中 with a horizontal stroke above and a slash below).
  3. Where a character appears to be a corruption or transcription error for another character, the supposed reading is given in tortoise shell brackets following the actual character (e.g. 樵〔瞧〕 where from context the character 樵 should be read as 瞧). If the suggested reading is uncertain, it is marked with a question mark. Where such corruptions are frequent, and there is no doubt as to the correct reading, the correct reading is silently substituted for the corrupt character (e.g. in 李奇嘆監 the phrase 姐丈 is frequently written as 姐文 – in these cases 文 has been silently amended to 丈).
  4. Where a character cannot be made out or is missing due to physical damage to the text, the missing character is represented by □. If an educated guess can be made as to what the missing character should be (from context or from what can be made out of the character), it is given in tortoise shell brackets and marked with a question mark (e.g. □〔稟?〕 where from context the missing character is assumed to be 稟).
  5. Where a character seems to have been accidentally omitted, the supposed missing character is given in solid lenticular brackets (e.g. 百姓之歡【心】 indicates that the character 心 is missing from the actual text, but is assumed from context to have been present in the original text).
  6. Square black ink blots (墨丁) that replace a lost character in the original text are represented by ■.