Tuesday, 17 October 2017

Chromatic fonts and Proofreading

A useful application of chromatic (or polychromatic) fonts is proofreading of text, an especially valuable technique for complex scripts such as Egyptian hieroglyphic, CJK writing and mathematical notation.

I've not discovered much on the web about chromatic proofreading possibly because chromatic fonts are relatively new and not as well supported in software as might be expected. For instance macOS contained very limited polychromatic font support until the 10.13 "High Sierra" release last month. Windows Notepad still lacks polychromatic font support (as of the Windows 10 "Fall Creators Update" released today) although Window 10 in general has excellent polychromatic support. Chrome on Android 7.1 appears to be still missing polychromatic support and I've not had the opportunity to try Android 8.0.

The basic principles of chromatic proofreading are obvious. In the case of Egyptian Hieroglyphs, some signs are predominantly used as phonograms, others as ideograms. Some signs could be easily confused with others. Some hieroglyph forms exist but are rarely encountered in historical documents. And so forth. Use of colour to highlight certain of these characteristics can aid identification of patterns and possible transcription errors. I've found use of a chromatic font a more practical tool than using markup for chromatic highlighting for proofreading purposes.

I've used up to 18 colours in experiments with polychromatic proofreading of hieroglyphic. I've also experimented with context-specific colours in fonts and alternate colour palettes.  The key issue is usability and I've found some techniques that work well in specialist software can be over-complex when used in plain text editor software or work processors.

The simplest system I've found useful uses a font with four colours as follows:
  1. Hieroglyphs predominantly encountered as phonograms remain black, as normal text.
  2. Numeric hieroglyphs are displayed as red.
  3. Ideograms commonly used as generic determinatives are displayed as green.
  4. Other ideograms/determinatives are displayed as blue.
Even the novice will be aware that some hieroglyphs fulfil more than one role so its important that knowledge is applied when interpreting this simplified scheme.

An example of the technique applied to a Unicode-encoded text is available at Tale of the Eloquent Peasant encoded in UMdC. This example uses a webfont version of my experimental Aaron UMdC Phonetic Alpha font. An installable version of the font is available for download at Hieroglyphs Everywhere Fonts Project for anyone who'd like to try out the method. I've not got around to document how to use polychromatic fonts yet so less technical readers may want to wait until some tutorial information is available.

Bob Richmond

Monday, 16 October 2017

Hieroglyphic fonts for Unicode

I've compiled a list of Unicode hieroglyphic fonts at https://github.com/HieroglyphsEverywhere/Fonts/blob/master/HieroglyphicFontList.md. 'Official' download links are provided where applicable.

All these fonts contain the 1071 hieroglyphs introduced in Unicode 5.2 (2009). Certain fonts such as Aegyptus and Abydos contain hieroglyphs that are not yet in the Unicode standard.

My new Aaron Series fonts are intended to help establish new techniques for working with hieroglyphic in Unicode. They can also help with development of the next generation of Unicode hieroglyphic fonts that feature quadrat shaping.

If you know of other Unicode hieroglyph fonts to add to the list please let me know and I can update the list.

Bob Richmond

Wednesday, 31 May 2017

Unicode Technical Committee (UTC) documents about Egyptian Hieroglyphic (May 2017)

There are several updates and additions to the UTC document register concerning Ancient Egyptian since my previous post on the topic Unicode Technical Committee (UTC) documents about Egyptian Hieroglyphic (March 2017). Hieroglyphic was discussed during the UTC meeting (May 8-12, 2017) and some recommendations made: see L2/17-103 UTC #151 Minutes.

The place for questions, discussion and suggestions is the Egyptian Hieroglyphs in the UCS mailing list (see Informatique et Égyptologie, Cambridge, 2016).

Recent documents

  • L2/17-076 Revised proposal for the encoding of an Egyptological YOD and Ugaritic characters; Michel Suignard; 2017-05-09. This replaces the earlier L2/17-076 submitted in March as Proposal for the Encoding of an Egyptological YOD. The names of the YOD upper and lower case characters to be LATIN CAPITAL LETTER I WITH SPIRITUS LENIS and LATIN SMALL LETTER I WITH SPIRITUS LENIS. This appear to be generally accepted so we can be cautiously optimistic that a future version of Unicode will resolve this long-standing anomaly.
  • L2/17-122R A method for encoding Egyptian quadrats in Unicode (revised); Andrew Glass, et al; 2017-05-16. Minor revision to clarify some technical points following UTC meeting. See my earlier post A method for encoding Egyptian quadrats in Unicode for an overview.
  • L2/17/153 Recommendations to UTC #151 May 2017 on Script Proposals; Deborah Anderson; 2017-05-07. Summary of the Script ad hoc group discussion on hieroglyphic, recommending UTC discuss the L2/17-122 quadrat proposal.
  • L2/17-171 Future Additions to ISO/IEC 10646 (May 2017); UTC/Deborah Anderson; 2-17-05-17. Additions requested include the Yod (as above) and six Hieroglyphic format control characters characters from L2/17-122 (4 corner and 2 bracket-like controls but not the overlay aka stack control at this time) in addition to the two format controls recommended in January 2016.
Current status of Ancient Egyptian for future Unicode

No decisions appear imminent on hieroglyph repertoire expansion in Unicode but Michel Suignard is making progress on his candidate database.

UTC is now recommending an 8 control character set for representing hieroglyph quadrats in Unicode and this will be running through the standardisation approval pipeline. All those interested in digital hieroglyphic will want to study what is proposed and the degree to which it meets their needs. The controls are:

13430 EGYPTIAN HIEROGLYPH VERTICAL JOINER
13431 EGYPTIAN HIEROGLYPH HORIZONTAL JOINER
13432 EGYPTIAN HIEROGLYPH INSERT TOP START
13433 EGYPTIAN HIEROGLYPH INSERT BOTTOM START
13434 EGYPTIAN HIEROGLYPH INSERT TOP END
13435 EGYPTIAN HIEROGLYPH INSERT BOTTOM END
13437 EGYPTIAN HIEROGLYPH SEGMENT START
13438 EGYPTIAN HIEROGLYPH SEGMENT END

UTC is recommending two new characters for the Egyptological YOD. Namely

A7BC LATIN CAPITAL LETTER I WITH SPIRITUS LENIS
A7BD LATIN SMALL LETTER I WITH SPIRITUS LENIS

There is a considerable amount of work needed to produce working implementations of the control system and guidelines for usage. There is a case for additional controls. I hope to deal with some of the issues in further blog posts an welcome any suggestions or feedback.

Bob Richmond

Thursday, 27 April 2017

A method for encoding Egyptian quadrats in Unicode

A new document 'A method for encoding Egyptian quadrats in Unicode' is now available from the UTC document register as L2/17-122 [pdf]. The system described takes into account discussions last July during the Informatique et Égyptologie Cambridge meeting and afterwards about extensions to Unicode plain text support to handle vertical hieroglyphic and various complex forms of quadrat structure.

The place for questions, discussion and suggestions is the Egyptian Hieroglyphs in the UCS mailing list (see Informatique et Égyptologie, Cambridge, 2016).

L2/17-122 contains a feasibility report based on three prototype OpenType font developments (Glass, Nederhof, and Richmond) which I hope goes a long way to alleviate concerns raised last year by Egyptologists about the viability of flexible hieroglyphic font implementations in Unicode.

L2/17-122 identifies 9 controls as follows:

Basic quadrat structures

EGYPTIAN HIEROGLYPH VERTICAL JOINER
EGYPTIAN HIEROGLYPH HORIZONTAL JOINER

These two controls were proposed in L2/16-018 (January 2016). They are similar to the original Manuel de Codage (MdC85) ':' and '*' controls.

EGYPTIAN HIEROGLYPH SEGMENT START
EGYPTIAN HIEROGLYPH SEGMENT END

These two controls operate in a similar way to MdC85 brackets '(' and ')'.

L2/17-122 does not contain structure extensions such as the group joiners suggested in L2/16-214 [pdf] to simplify encoding of quadrats in vertical text and tall quadrats in horizontal text. Therefore for most applications the basic quadrat structures of L2/17-122 are encoded as exact equivalents to those of MdC85 (itself derived from the Buurman 1976 model).

However, there are subtle differences from MdC85, most importantly (i) L2/17-122 has more clearly defined control behaviour and (ii) quadrat appearance is determined by a font (or equivalent) so there is more flexibility in handling issues such as hieroglyph sizing, kerning, etc. in plain text implementations.

Hieroglyph combinations

EGYPTIAN HIEROGLYPH STACK MIDDLE

This control overlays one hieroglyph on top of another - a direct equivalent of the MdC88 '#' control (encoded as '##' in JSesh).

EGYPTIAN HIEROGLYPH INSERT TOP START
EGYPTIAN HIEROGLYPH INSERT BOTTOM START
EGYPTIAN HIEROGLYPH INSERT TOP END
EGYPTIAN HIEROGLYPH INSERT BOTTOM END

These four geometrical ligature controls are proposed in place of the L2/16-018 EGYPTIAN HIEROGLYPH LIGATURE JOINER (which was based on an abstract ligature model for non-grid quadrat elements). This set of four ligature controls originates from a consensus formed at the I&E 2016 meeting that four 'corner control' ligatures are sufficient to meet anticipated plain text ligature needs of corpus projects such as Ramses and TLA and that the Egyptologists present preferred geometrical to abstract ligatures. This is a new approach to ligatures although they link fairly well to usage of the original MdC ''&''ligature and MdC extensions familiar to JSesh users.

Bob Richmond

Digital Encoding of Egyptian Hieroglyphic: Origins

Updated 2017-05-17.

I thought it might be useful to summarise some of the background to digital hieroglyphic to help  inform discussion about representations of hieroglyphic writing in Unicode.

This post deals with the early years. Information is thin on the ground so I'd be delighted to learn about any material, unpublished or unpublished, that survives from this formative period.

Apparently, use of computers for hieroglyphic goes back to the 1960s when computers and printing peripherals were hugely expensive and inaccessible to most people except a lucky few. However it was not until the early 1980s that the emergence of personal computer technology started to bring digital techniques and practical tools to Egyptologists and others.

The first Informatique et Égyptology 'round table' meeting (Paris, 26-28 June, 1984) was pivotal in shaping the first generation of digital hieroglyphic that has been used for the last 30 years. Fortunately, the proceedings of the meetings were published in 1985 (although unfortunately and ironically not yet available in digital format) and this post is mostly based on that publication. I'll summarise some papers from I&E 1984 relevant to encoding.

COMPUTER PRINTING OF HIEROGLYPHS AT THE UNIVERSITY PRESS OXFORD (T. G. H. James) describes the replacement of the traditional metal type system at Oxford University Press (used for typesetting the Gardiner font from 1927 to 1983) by a Monotype LaserComp photo type setter adapted for hieroglyphic. The final publication to use the original hot metal font was J.E.A. Vol. 69 (1983).
Typesetting instructions for OUP workflow
This paper gives an insight into older typesetting practices as well as the short-lived LaserComp technology soon to be superceded by desktop publishing on personal computers.

INFORMATIQUE APPLIQUEE A L'EGYPTOLOGIE (Dirk van der Plas) gives a snapshot of his experiences with the Buurman GLYPH program and the practical situation for those printing hieroglyph texts on a budget in 1984 including comparative costs of authography and typesetting.

A PROGRAMM SYSTEM FOR THE EDITION OF TEXT (especially hieroglyphic printing) (Norbert Stief) describes hieroglyph plotting software written in Fortran 77 running on an IBM 370 mainframe driving a CALCOMP plotter. The system was developed at University of Bonn and appears to be what is sometimes later known as the PLOTTEXT system. Mnemonics are used as alternatives to alphanumeric Gardiner codes for encoding purposes in a similar way to Buurman (1976). The 'Bonn Ziechenliste' font catalogue (an extension of the Egyptian Grammar sign list) is given here.

NEW HARDWARE--NEW SOFTWARE (Leonard H. Lesko) gives a short summary of hardware used in his latest setup at Brown University and his earlier hieroglyphic printing workflow at Berkeley during 1973-1982 using the SCRIBE program. The Berkeley system was notable for its use in creating A Dictionary of Late Egyptian 1982-), what I understand to be the first substantial hieroglyphic dictionary to use digital encoding and printing techniques. Not mentioned here is the use of quadrat structure patterns for SCRIBE (rather than a control scheme such as that used by Buurman). Just to prove there's nothing new under the sun, one of the systems I initially considered for Unicode encoding in early 2015 used a similar pattern system although at the time I was unaware of the Lesko work from 40 years ago.

PRINTING OF EGYPTIAN HIEROGLYPHS BY MEANS OF A COMPUTER (Jan Buurman; Astronomer and hobby-Egyptologist). Buurman gives a history of his system which began as a hobby project in 1969. A sketch of the system was first published as The Composing of Hieroglyphic Texts by means of a Computer in Göttinger Miszellen 19 (1976). As far as I'm aware GM19 contains the earliest publication of what would become the basic MdC controls for quadrat structures.
Quadrat structure notation from " The Composing of Hieroglyphic Texts by means of a Computer" (1976)
GM19 reports that the first texts output (in 1971) took an average of 0.2 seconds per hieroglyph to process and 1.3 seconds to plot. The first version of GLYPH was written in Algol 60, running on a CDC Cyber 73 mainframe.

I understand that Buurman showed a video of the plotter in action to the meeting. Far more fun I expect than the laser printers we have grown used to. I'd love to see a video of  hieroglyphs being drawn by a plotter! I am grateful to Hans van den Berg for kindly providing two links showing output from a 1980s HP ColorPro plotter:  'wild bull hunt scarab' of Amenhotep III (https://www.youtube.com/watch?v=8z9aCclxV0U) and part of the Tale of Sinhue (https://www.youtube.com/watch?v=zyp68emXMZM).

RESOLUTION. The meeting resolved to create a standard system for encoding hieroglyphs to be made available before the Fourth International Congress of Egyptology in August 1985. A committee chosen to work on the manual consisted of Jan Buurman, Nicolas Grimal, Michael Hainsworth, Norbert Stief, Robert Vergnieux and Dirk van der Plas.
Proceedings of Informatique et Egyptology 1984 (Paris, 1995). Page 225.
This proposed system was to become known as Manuel de Codage (MdC85) which I aim to summarise in a subsequent post.

Bob Richmond

Monday, 27 March 2017

Unicode Technical Committee (UTC) documents about Egyptian Hieroglyphic (March 2017)

Updated 2017-04-04.

My January post Unicode Technical Committee (UTC) documents on Egyptian Hieroglyphic 2016 summarised relevant submissions to the UTC Document register last year. Additions so far for 2017 are:

January to March 2017
  • L2/16-210r  A system of control characters for Ancient Egyptian hieroglyphic text (updated version); Mark-Jan Nederhof and others; 2017-01-21. Warning of possible confusion. Despite the L2/16-210r document code this January update is substantially changed from the L2/16-210 document and not a minor revision (the earlier document contains alternative ideas and additional material). The original L2/16-210 dated 2016-07-25 is still available as http://www.unicode.org/L2/L2016/16210-egyptian-control.pdf ).
  • L2/17-073 New draft for the encoding of an extended Egyptian Hieroglyphs repertoire (Hieroglyphica based); Michel Suignard; 2017-03-19. An associated spreadsheet http://www.unicode.org/L2/L2017/17073-n4788-hieroglyphs.xlsx is also available for download. A database snapshot is also available at http://www.unicode.org/L2/L2017/17073-n4788-database-add.pdfThis draft contains substantial changes from its predecessor.
  • L2/17-076. Proposal for the encoding of an Egyptological YOD. Michel Suignard; 2017-03-24. A new attempt to sort out the remaining transliteration character.

If I've missed any other published documents relating to hieroglyphic in Unicode or any points came up at the UTC 150 meeting in January please let me know.



The next UTC meetings are in early May.

Bob Richmond


Monday, 27 February 2017

Irregular Hieroglyph clusters

I am compiling a document on irregular Egyptian Hieroglyph clusters (aka quadrats) and would like to hear of any publications relating to this topic. Documenting what is required is an essential precursor to expanding the capability of Unicode plain text orthography beyond regular cluster support.

Examples in this blog post are all taken from the 18th Dynasty Tomb of Rekhmire (TT100).

Here are some examples of the regular clusters that account for the vast majority of writing from ancient sources:

In addition to these simple grid-like arrangements, there are various conventional forms of cluster such as:
The majority of these conventional clusters follow the bird and cobra patterns of non-grid arrangement.

Irregular clusters can be seen when the scribe/artist attempts to squeeze more hieroglyphs into the available space, avoid empty space, or achieve some special visual effect.
In many cases in Unicode text it is satisfactory to transcribe irregular clusters into an equivalent more regular form as has often been done in established practice. For instance the first irregular form in the illustration could be written in traditional MdC as (a:t)*H-A28 with no great significance for transcription purposes. Likewise the last example can use t:t:W10 ignoring the slope.

The goal of future developments of the Unicode writing system is to enable rendering of clusters inherent to the writing system. This principle can only be made precise on the basis of well-attested data about irregular clusters in order to be crystal clear what is to be accomplished, Not all arrangements of hieroglyphs discovered in ancient writings are inherent and count as plain text. Through detailed illustration it should be possible to reach a consensus among Egyptologists that an extended Unicode system meets the needs of the subject.

In the near future I hope to produce draft documentation for comment. Meanwhile I'm interested in hearing of any relevant work that has been done already on hieroglyphic orthography and any examples of irregular quadrats that may be relevant.

Thanks
Bob Richmond

Monday, 30 January 2017

JSesh sign placement

A technical note by Serge Rosmorduc About JSesh 6 sign placement is now available on the JSesh website via jsesh.qenherkhopeshef.org/en/node/3112. The document sketches some aspects of how JSesh organises hieroglyphs into clusters/quadrats for rendering. Some JSesh users may find this informative.

The note does not discuss when to use special features for sign placement so I'll give some practical guidance here.

In common with other first generation hieroglyphic editing software, the primary purpose of JSesh is to generate images of hieroglyphic text, typically for inclusion in word processing documents as illustrations. The JSesh sign placement extensions over basic MdC (Manuel de Codage) focus on this application. It is possible to get away with an inelegant or incorrect transcription so long as the image looks ok although hacking solutions can prove to be fools gold so avoid where at all possible.

A secondary, and increasingly useful, application is to use JSesh to encode hieroglyphic data in an MdC (Manuel de Codage) style format intended for processing in other software applications such as databases and other MdC-like editors. Here, elements of JSesh sign placement may be unsupported, irrelevant, or misleading. Incorrect or inefficient transcriptions are unacceptable.

This means the golden rule is to keep a JSesh encoding as simple as possible and only use the more complex features when essential.

For instance ntt is normally written MdC n:t*t. However technically it could be written in JSesh as n:(t*t) or n{{0,10,110}}**t{{0,800,98}}**t{{600,800,99}} even though these alternative forms make no apparent sense and should be avoided.

In order of complexity (least first) you should try to transcribe a quadrat using:

1. Regular '*' and ':' operators (preferred option).
2. Ligature system (if regular quadrat doesn't work)
3. Absolute positioning (only if all else fails, e.g. ligature doesn't work).

In some cases you may find a need for brackets. This is ok but keep usage to the minimum of what is essential.

When it seems absolute positioning is essential for a JSesh transcription, you may want to consider inserting MdC comments at the the top of the JSesh data file. For instance:

++JSESH6: anx\R30{{0,357,51}}**G5{{194,0,97}} is used because anx\R30^^^G5 renders the ankh too small.+s
++JSESH6: R7{{0,612,55}}**bA{{101,0,98}}**Z1{{953,69,79}} is used because R7^^^bA&&&Z1 default scaling of R7 is unsatisfactory (although R7\50^^^bA&&&Z1 is not bad). Note in some MdC systems R7&bA&Z1 looks fine.+s

I personally use this commenting approach with JSesh data so I know what to do if improvements appear in a later release of JSesh or I want to create generic MdC or Unicode data.

One pernicious use of absolute positioning I've observed is 'new signs from old'. For instance combine A50 ('noble') with S45 ('flagellum') instead of using the preformed A51 sign. Always use a preformed sign when available in the Hieroglyphica/JSesh set.

The current MdC analysis for Unicode Repertoire Extensions web app is not designed to process absolute positioning but provides an easy way to highlight absolutes and ligatures used in MdC-coded data. I find it useful to help spot JSesh transcription errors.

A related topic is conversion between Unicode and JSesh data. The most important consideration related to this process is that new Unicode fonts define sign placement in the font itself  (allowing the user to work in many generic applications from Word Processors to Web Browsers). Each font can take its own approach to sign placement. A topic I hope to return to fairly soon.

Bob Richmond

Wednesday, 25 January 2017

150 Years of Hieroglyphic in Type: Anniversary of the first typeset Egyptian dictionary

2017 marks the 150th anniversary of the first printing of a typeset dictionary and grammar of Egyptian Hieroglyphic, written by Samuel Birch (at the time employed by the British Museum as head of the Egyptian and Assyrian Department).

The dictionary and grammar took the form of several hundred pages of extensive additions by Birch to Volume V of the (1867) English translation Egypt's place in Universal History of Aegyptens Stelle in der Weltgeschichte (1845-) by C. C. J. Baron Bunsen. Birch and Bunson collaborated ten years earlier on additions to the English edition of Volume I (translated by Charles Cotterel; Longmans, 1848) notably concerning the writing system (including Appendix II - a hieroglyphic sign list).

About the typesetting process, Birch writes in his preface to Volume V:

The hieroglyphic used in this volume has been cast by Mr. Branston from designs drawn by Mr. Joseph Bonomi. It is the sole hieroglyphical fount in this country, and its importance can only be sufficiently appreciated from the consideration that Messrs. Longman have fulfilled, at a heavy cost, a task only undertaken abroad by foreign governments.

The advantage of this type to the present volume cannot be too highly appreciated, as it has rendered it practicable to print the Egyptian Dictionary, the Grammar, and the Chrestomathy in a form which renders the study of the hieroglyphs accessible both to the student and general enquirer. The Dictionary is the only one hitherto printed in this country, nor has any hieroglyphical dictionary appeared elsewhere, except that of Champollion, published in 1841, which contained only a few of the principal words.  Its phonetic arrangement will, it is hoped, render it particularly easy of consultation. It has been a great labour to compile and print it, and the execution of it has been a task of many years.

A feature of the Bonomi typeface is the use of mostly solid filled in hieroglyphs. There is no attempt to detail most signs as is popular practice for the majority of hieroglyphic fonts later and nowadays.


Hieroglyphic typesetting was new technology in the mid-Nineteenth century and many later publications followed, using a variety of fonts, The Lepsius/Theinhardt font is probably the best known from that era.

Modern digital hieroglyphic, both the first generation MdC approach and the new Unicode/Opentype based systems, have characteristics that can be traced back though the early works by Birch and Lepsius. The corpus of 150 years of hieroglyphic content in type has proved very useful in devising new techniques and designing new fonts.

Some developments don't happen overnight. The goal to make hieroglyphic available to the student and general enquirer expressed by Birch has advanced gradually over 150 years with many modern publications available fairly inexpensively in print. Yet in terms of the potential of modern technology, there is much more to be done.

Sadly, the Birch publication never made it to the best seller lists of the day.

E. A. Budge dedicated his Dictionary of Hieroglyphs (1920) to the memory of Samuel Birch (whose Ancient Egyptian courses he attended and later worked with for a couple of years at the British Museum). In the introduction to his dictionary, Budge writes:

... it is quite impossible to hide the fact that the inclusion of Birch's Egyptian Dictionary in the fifth volume of the English translation was a great misfortune for the Dictionary itself and for the beginner in Egyptology for whom the work was primarily intended. There was an interval of seven years between the publication of the fourth and fifth volumes of the English translation of Aegyptens Stelle in der Weltgeschichte, arid there seems to be no doubt that public interest in Bunsen's scheme of chronology drooped when its author died in 1860, the year which saw the appearance of the fourth volume, and was practically dead when the fifth volume was published in 1867. According to Birch, the volume fell " flat," and its editor and publishers were greatly disappointed. Whether the edition was a small one or not I have no evidence to show, but it was certainly the fact that for some reason or other copies of the volume were difficult to get in the early "seventies." It was said at the time that the publishers, being dissatisfied with the sales, had "disposed" of the sheets of a large number of copies The natural result was that when people found out that the volume contained Birch's Dictionary and Grammar and Chrestomathy the copies that found their way into the market fetched relatively very high prices, or at all events prices which effectively placed the book beyond the reach of the ordinary student. When I attended Birch's Egyptian classes in 1875-76 and needed the book urgently, I was obliged to trace each page of it on a separate sheet of tracing paper, omitting the references, and when these sheets were bound I used them for some years with great benefit. Moreover, the volume of  the English translation of Bunsen's work formed a veritable tomb for Birch's Dictionary. The title-page of it sets forth quite clearly that the "Historical Investigation" was by Bunsen, and that it was translated from the German by Charles H. Cottrell, Esq., M.A., and that it contains "Additions by Samuel Birch, LL.D." But who could possibly imagine from this last remark that Birch's contribution was 594 pages, i.e., nearly three-quarters of the whole volume, or that his contribution included an Egyptian Dictionary, the first ever published arranged on phonetic principles (!), and containing about 4,500 entries of Egyptian words, and names of gods and places, with references and translations, and an Egyptian Grammar and Chrestomathy? Or, again, take the case of the student who wants to consult these works and who, hearing that copies of them are to be seen in the British Museum Library, goes to the Reading Room to see them. He turns up the entry Birch, Samuel, LL.D., of the British Museum, in the Great Catalogue, but fails to find any mention of the Dictionary of Hieroglyphics or Grammar and Chrestomathy, because they are not mentioned in any one of the columns of names of the other books and papers which Birch wrote. All that he will find connecting Birch with an Egyptian Dictionary is the entry, " Sketch of a Hieroglyphical Dictionary, London, 1838," and unless he receives further instruction he will conclude that the " Sketch" published in 1838 is useless to him, and that Birch's Egyptian Dictionary never appeared.

An account from which several lessons can be learned.


Bob Richmond


Monday, 16 January 2017

Establishing the next Expansion of the Egyptian Hieroglyph Repertoire in Unicode

As I noted in my previous post, 2016 saw progress in identifying methodologies along with lists of candidate hieroglyphs for inclusion in an expansion to the 1071 already available in Unicode since 2009, I expect a formal proposal for an initial expansion set to emerge this year, probably for release in the Unicode 11 (2018) or Unicode 12 (2019) timescale.

Note: announcements and discussions on Ancient Egyptian and Unicode now take place on the "Egyptian Hieroglyphs in the UCS" mailing list (see blog post Informatique et Égyptologie - Cambridge - 2016 for link and comments).

Also, it is now possible to check your Manuel de Codage (MdC) documents (from JSesh etc.) online for hieroglyphs currently missing in Unicode. See my October post MdC analysis for Unicode Repertoire Extensions. I hope to keep this up to date during the next few months as Unicode proposals firm up.

Finally. There is much potential for hieroglyphic already in Unicode now the script has become better supported in recent years by advance in font technology, web browsers, word processors and so on. I hope to see substantial progress this year. As far as the repertoire is concerned I personally use a systematic way of working with hieroglyphs not yet in Unicode and hope to publish details and resources about this system when I'm comfortable it works well alongside expected developments of the formal standard though 2019. Meanwhile I'd be pleased to hear from others working with expanded repertoire in Unicode so we don't duplicate work. Thanks.

Bob Richmond

Sunday, 8 January 2017

Unicode Technical Committee (UTC) documents on Egyptian Hieroglyphic 2016

I thought it would be useful to compile a list of documents relating to Ancient Egyptian in the 2016 Unicode Technical Committee (UTC) document registry. I've added a few notes for explanation.

UTC meets every three months so I've included minutes of the meetings where Egyptian has been on the agenda. Only one formal proposal was made during the year but there were various drafts, discussion documents and comments about the hieroglyphic writing system and the hieroglyph repertoire.

Following the I&E Cambridge 2016 meeting in July, it was agreed that discussions on Ancient Egyptian and Unicode would take place on the "Egyptian Hieroglyphs in the UCS" mailing list (see blog post Informatique et Égyptologie - Cambridge - 2016). If you are interested in following progress or feel you have something to contribute to digital hieroglyphic developments at any level I recommend you join the mailing list.

I also hope to continue to track developments on this blog as the pieces of the puzzle come together this year.


January to March
  • L2/16-018 Proposal to encode three control characters for Egyptian Hieroglyphs (revised); Bob Richmond, Andrew Glass; 2016-01-27
  • L2/16-028 Preliminary draft of the extended Egyptian Hieroglyphs repertoire; Michel Suignard; 2016-01-21
  • L2/16-037 Recommendations to UTC #146 January 2016 on Script Proposals; Deborah Anderson, et al; 2016-01-24
  • L2/16-004 UTC #146 Minutes; Lisa Moore; 2016-02-01
Summary. The L2/16-018 proposal was a clarification of L2/15-123 "Proposal to encode three control characters for Egyptian Hieroglyphs (revised); Bob Richmond; 2015-05-04" addressing all feedback received to date. This proposal was put out to ballot as a UTC recommendation from meeting #146.

April to June
  • L2/16-079 Preliminary draft for the encoding of an extended Egyptian Hieroglyphs repertoire; Michel Suignard; 2016-04-20
  • L2/16-090 Comments on three control characters for Egyptian Hieroglyphs; Mark-Jan Nederhof, Vinodh Rajan with additional comments by Richter et al (TLA project) and a note by Polis and Rosmorduc (Ramses Project). 2016-04-25.
  • L2/16-104 Observations: L2/16-090 [Egyptian]; Bob Richmond; 2016-05-02
  • L2/16-121 UTC #147 Minutes; Lisa Moore; 2016-05-20
Summary. Glass, Nederhof, Richmond participated by phone in UTC discussion on Egyptian repertoire and controls at UTC #147. Detailed discussion continued at I&E Cambridge 2016 in July.

July to September
  • L2/16-177 A comprehensive system of control characters for Ancient Egyptian hieroglyphic text (preliminary version); Mark-Jan Nederhof; 2016-06-30
  • L2/16-199 A suggestion ... for Ancient Egyptian; William Overington; 2016-07-21
  • L2/16-210 A system of control characters for Ancient Egyptian hieroglyphic text; Mark-Jan Nederhof, et al; 2016-07-25
  • L2/16-204 Recently Closed Action Items (since 2016-05-05); Rick McGowan; 2016-07-25
  • L2/16-214 An Extension to the three control characters for Egyptian Hieroglyphs and some additional remarks; Bob Richmond; 2016-08-01
  • L2/16-216 Recommendations to UTC #148 August 2016 on Script Proposals; Deborah Anderson, et al; 2016-08-01
  • L2/16-218 Brief Report from Cambridge meeting of Egyptologists and Update; Deborah Anderson; 2016-08-01
  • L2/16-227 The Universal Hieroglyphic Writing System: Consensus and possible compromise; Bob Richmond; 2016-08-04
  • L2/16-231 Proposal for Ancient Egyptian encoding in Unicode; Serge Rosmorduc, et al; 2016-08-04
  • L2/16-233 Addendum to: A system of control characters for Ancient Egyptian hieroglyphic text; Mark-Jan Nederhof, et al; 2016-08-05
  • L2/16-203 UTC #148 Minutes; Lisa Moore; 2016-08-18
  • L2/16-250 Preliminary proposal to encode Möller's Egyptian Hieroglyphs in the SMP (WG2 N4741); Michael Everson; 2016-09-14
  • L2/16-251 Preliminary Mapping table of Möller's Egyptian Hieroglyphs (WG2 N4742 ; see also associated spreadsheet); Deborah Anderson; 2016-09-12
  • L2/16-257 Source analysis of an extended Egyptian Hieroglyphs repertoire (Hieroglyphica); Michel Suignard; 2016-09-20
Summary. Discussions at I&E Cambridge 2016 through to UTC#148 yielded various suggestions for extending the L2/16-018 Proposal, notably considering rare quadrat structures and extending scope to address vertical writing issues along with associated tall quadrat orthography in horizontal writing. At UTC#148 there was consensus on use of the EGYPTIAN HIEROGLYPH HORIZONTAL JOINER and EGYPTIAN HIEROGLYPH VERTICAL JOINER proposed controls but that the EGYPTIAN HIEROGLYPH LIGATURE JOINER should be suspended in favour of investigating more elaborate  schemes. Monograms and other more complex arrangements not well-suited to controls to be treated as part of hieroglyph repertoire development.

October to December
  • L2/16-298 Draft additional repertoire for ISO/IEC 10646:2016 (5th ed.) Amendment 1.2 (WG2 N4770); Michel Suignard; 2016-10-27
  • L2/16-307 Towards an Expansion of the Unicode Hieroglyph repertoire; Bob Richmond; 2016-10-28
  • L2/16-326 Recently Closed Action Items (since 2016-07-25); Rick McGowan; 2016-11-06
  • L2/16-342 Recommendations to UTC #149 November 2016 on Script Proposals; Deborah Anderson, et al 2016-11-07.
  • L2/16-325 UTC #149 Minutes; Lisa Moore; 2016-11-18
Summary. L2/16-298 draft allocates a block 13430-1343F Egyptian Hieroglyphs Format Controls with two characters defined: U+13430 EGYPTIAN HIEROGLYPH HORIZONTAL JOINER and U+13431 EGYPTIAN HIEROGLYPH VERTICAL JOINER and as things stand these are to be part of the repertoire of Unicode 10.0 (Summer 2017). Other repertoire and writing system issues are still being investigated and I expect activity to continue at 2016 levels during this year.

Bob Richmond