Since the rewritten HarfBuzz is shaping up fast and getting lots of Buzz these days, I get asked the same question again and again: "Will HarfBuzz replace Pango?" This post tries to answer that.
Short answer: No, not at all! Pango is here to stay. It will change, but only get better.
Long answer:
Pango provides two levels of API: A low-level and a high-level.
Low level API: What I can the "three pillars of pango":
pango_itemize()
: Breaks text into runs that each have the same font, Unicode script, language, direction, and other characteristics.pango_shape()
: Shapes a single run of text, given the font, script, language, direction, and other properties. Shaping means converting Unicode text to positioned glyphs.pango_break()
: Does line breaking and other text segmentation (cursor positions, cluster boundaries, word boundaries, and sentence boundaries).
High-level API: Pango's high-level API consists of the PangoLayout object, aka "here's a piece of text render it in this box I don't care what you do."
Of these, HarfBuzz only does shaping. That is,
hb_shape()
is functionally equivalent to pango_shape()
.API implications: Here is how moving to HarfBuzz affects the Pango API:
- Everything in
pango-ot.h
will be deprecated and be a thin wrapper aroundhb-ot.h
. This is already done in theharfbuzz-ng-external
branch of Pango. - There will be new API in Pango, perhaps in
pango-hb.h
to help extracting various HarfBuzz structures from their Pango equivalents. pango_shape()
will be a thin wrapper aroundhb_shape()
(read below).
Pango Modules:
pango_shape()
calls into Pango shaper modules to get the actual shaping done. There are two kinds Pango shaper modules depending on what they do (the API is the same, so Pango doesn't differentiate between the two classes):- Bridge modules: The basic-win32.c, basic-atsui.c modules call into another, platform native, shaping system to get the work done. The external (not integrated in Pango yet) modules basic-graphite.c and basic-m17n.c also do the same for the SIL Graphite and m17n shaping libraries.
- On Linux, since there currently is no native shaping engine, Pango has multiple shaping modules, one per script, to do the actual shaping (arabic-fc, syriac-fc, indic-fc, thai-fc, ..., and basic-fc for all the non-complex scripts).
hb_shape()
. That's indeed what the basic-fc.c in the harfbuzz-ng-external
does.Later on, when we add support for native win32, CoreText, Graphite, and m17n to HarfBuzz, all those other modules will also be replaced by HarfBuzz-calling equivalents.
Which one to use: Pango or HarfBuzz? Depends.
PangoLayout is designed to be the 'render this text in this box I don't care how' kind of API. That's a perfect fit for GUI toolkits like GTK+, but not suitable for lots of other uses, for example:
- Web browsers
- Word processors
- Designer tools
- Font design tools
- Terminal emulators
- Batch document processors
- TeX engines
So, each of those kinds of applications need to assess the pros and cons of using Pango vs using HarBuzz and providing all the other bits themselves. For example, HarfBuzz doesn't provide:
- An itemizer
- A Unicode Bidirection Algorithm implementation
- A Unicode Line Breaking implementation
- Glyph rasterization
- Glyph metrics information
- etc
That said, Firefox will use HarfBuzz as soon as it's ready (there are patches circulating around). Google is using old HarfBuzz for their Webkit and will port to the new one. I'm also attending the Webkit-GTK hackfest in December to port that to the new HarfBuzz. We'll work towards sharing the HarfBuzz-dealing code among Webkit backends.
This is already a long post. Let me finish now. Hope I made it a tiny bit more clear.