Section 22 Internationalization

View Source for section

    <section xml:id="section-internationalization" label="section-internationalization">
      <title>Internationalization</title>
      <idx>internationalization</idx>
      <p>
        Supporting a multitude of characters
        across many languages and many output formats can be a challenge.
        One of our goals is to make this much easier for authors.
        Fortunately,
        the Unicode standard has brought improvements over the 7-bit ASCII standard of old.
      </p>
      <paragraphs>
        <title>Unicode Characters for HTML Output</title>
        <p>
          First, we discuss HTML output.
          If you include Unicode
              <idx>Unicode</idx>
          characters in your <pretext/> source,
          they should survive just fine <foreign xml:lang="fr-FR">en route</foreign> to a web browser or e-reader.
          Here are the caveats for HTML output:
          <ul>
            <li>
              <p>
                So that you can continue to get the best results with print and PDF output,
                use available empty elements for obscure characters,
                even if targeting HTML output,
                before resorting to a Unicode character.
                For example,
                use <tage>copyright</tage> for the copyright symbol in text before resorting to the Unicode character <c>U+00A9</c>.
                It is a bit more work,
                but you will get better results with other conversions,
                even if you initially are only fascinated by <init>HTML</init>.
              </p>
            </li>
            <li>
              <p>
                How you actually enter Unicode characters into your source file is dependent on your editor and operating system,
                and is therefore outside the scope of our documentation.
                You can cut-and-paste characters and text from the source of our examples for initial testing and experimentation.
              </p>
            </li>
            <li>
              <p>
                Always, always identify your source as having Unicode characters by including the incantation
                <cd>&lt;?xml version="1.0" encoding="UTF-8" ?&gt;</cd>
                as the first line of your source file.
                (You <em>may</em> be able to accurately cut-and-paste this version here.
                But if the copy has non-standard characters in it,
                go back to the top of this source file for a copy.)
              </p>
            </li>
            <li>
              <p>
                Alan Wood’s
                <url href="http://www.alanwood.net/unicode/unicode_samples.html" visual="www.alanwood.net/unicode/unicode_samples.html">Unicode Resources</url>
                has a plethora of samples of various groups of Unicode characters.
                If you, or your readers, are
                <q>missing</q>
                characters in a web browser,
                this is a good place to start testing the local setup.
              </p>
            </li>
          </ul>
        </p>
      </paragraphs>
      <paragraphs>
        <title>Characters in <latex/>, PDF, print</title>
        <p>
          The situation for <latex/> is a bit more complicated,
          since <tex/> pre-dates Unicode's widespread adoption.
        </p>
        <p>
          This sample article is intended to work well,
          out-of-the-box, for authors just starting with <pretext/>.
          So we only include here examples that we know are likely to convert to <init>PDF</init> without any errors.
          For more extensive examples and experiments,
          we provide the sample document <c>examples/fonts/fonts-and-characters.xml</c>,
          so be aware of that example as you look to see what is possible.
        </p>
        <p>
          Similarly, you should be able to process this sample article successfully with various <latex/> engines.
          We test regularly with <c>pdflatex</c> and <c>xelatex</c> and provide online sample PDF output of this document processed by <c>pdflatex</c>.
          In principle,
          you should be able to use <c>latex</c> (to produce a DVI), and possibly other (unsupported) engines,
          such as <c>lualatex</c>.
        </p>
        <p>
          Once you get beyond the Latin alphabet,
          with accents common in Western Europe and the Western Hemisphere,
          you will almost assuredly need to restrict your attention to producing <init>PDF</init> output with the <c>xelatex</c> engine.
          This is discussed and tested in <c>examples/fonts/fonts-and-characters.xml</c>.
        </p>
      </paragraphs>
      <paragraphs>
        <title>Basic Latin, <c>U+0000</c><ndash/><c>U+007F</c></title>
        <p>
          Unicode assigns each character a numbered code point,
          and these numbers are typically expressed in hexadecimal (base 16) notation.
          An encoding such as UTF-8 then stores each code point as one or more 8-bit bytes.
          A single byte has 256 possible values, and the first 128 code points
          (hex <c>00</c> to <c>7F</c>)
          are the
          <q>usual</q>
          Latin characters with some values used as control codes.
          <c>U+0000</c> to <c>U+001F</c> are control codes and not used here.
          <c>U+007F</c> is also a control code and so is excluded,
          while <c>U+0020</c> is a space,
          so appears invisible in the table.
          The remaining 95 characters, counting the space, are the most basic,
          and will all render using <c>pdflatex</c> or <c>xelatex</c> with no special setup
          (and will render easily in HTML).
          In the source we have authored each character by its escaped version using its Unicode number
          (in hexadecimal).
          So, for example,
          capital-B is authored as <c>&amp;#x0042;</c>.
        </p>
<!-- ex &#x0000; - &#x001F; controls -->
<!-- ex &#x007F; control             -->
<!-- ex &#x0020; space, not visible  -->
        <table>
          <title>Basic Latin, Regular</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>002_</c></cell>
              <cell> </cell>
              <cell>!</cell>
              <cell>"</cell>
              <cell>#</cell>
              <cell>$</cell>
              <cell>%</cell>
              <cell>&amp;</cell>
              <cell>'</cell>
              <cell>(</cell>
              <cell>)</cell>
              <cell>*</cell>
              <cell>+</cell>
              <cell>,</cell>
              <cell>-</cell>
              <cell>.</cell>
              <cell>/</cell>
            </row>
            <row>
              <cell><c>003_</c></cell>
              <cell>0</cell>
              <cell>1</cell>
              <cell>2</cell>
              <cell>3</cell>
              <cell>4</cell>
              <cell>5</cell>
              <cell>6</cell>
              <cell>7</cell>
              <cell>8</cell>
              <cell>9</cell>
              <cell>:</cell>
              <cell>;</cell>
              <cell>&lt;</cell>
              <cell>=</cell>
              <cell>&gt;</cell>
              <cell>?</cell>
            </row>
            <row>
              <cell><c>004_</c></cell>
              <cell>@</cell>
              <cell>A</cell>
              <cell>B</cell>
              <cell>C</cell>
              <cell>D</cell>
              <cell>E</cell>
              <cell>F</cell>
              <cell>G</cell>
              <cell>H</cell>
              <cell>I</cell>
              <cell>J</cell>
              <cell>K</cell>
              <cell>L</cell>
              <cell>M</cell>
              <cell>N</cell>
              <cell>O</cell>
            </row>
            <row>
              <cell><c>005_</c></cell>
              <cell>P</cell>
              <cell>Q</cell>
              <cell>R</cell>
              <cell>S</cell>
              <cell>T</cell>
              <cell>U</cell>
              <cell>V</cell>
              <cell>W</cell>
              <cell>X</cell>
              <cell>Y</cell>
              <cell>Z</cell>
              <cell>[</cell>
              <cell>\</cell>
              <cell>]</cell>
              <cell>^</cell>
              <cell>_</cell>
            </row>
            <row>
              <cell><c>006_</c></cell>
              <cell>`</cell>
              <cell>a</cell>
              <cell>b</cell>
              <cell>c</cell>
              <cell>d</cell>
              <cell>e</cell>
              <cell>f</cell>
              <cell>g</cell>
              <cell>h</cell>
              <cell>i</cell>
              <cell>j</cell>
              <cell>k</cell>
              <cell>l</cell>
              <cell>m</cell>
              <cell>n</cell>
              <cell>o</cell>
            </row>
            <row>
              <cell><c>007_</c></cell>
              <cell>p</cell>
              <cell>q</cell>
              <cell>r</cell>
              <cell>s</cell>
              <cell>t</cell>
              <cell>u</cell>
              <cell>v</cell>
              <cell>w</cell>
              <cell>x</cell>
              <cell>y</cell>
              <cell>z</cell>
              <cell>{</cell>
              <cell>|</cell>
              <cell>}</cell>
              <cell>~</cell>
              <cell/>
            </row>
          </tabular>
        </table>
      </paragraphs>
      <paragraphs>
        <title>Latin-1 Supplement, <c>U+0080</c><ndash/><c>U+00FF</c></title>
        <p>
          Now we are interested in the next 128 code points
          (hex <c>80</c> to <c>FF</c>).
          The first 32 are again control codes, <c>U+00A0</c> is a non-breaking space
          and so is invisible, and <c>U+00AD</c> is a soft hyphen
          (which we have not implemented and so is excluded).
          We have taken care to see that the remainder will render using <c>pdflatex</c> or <c>xelatex</c> with no special setup
          (and HTML).
          In the source we have authored each character by its escaped version using its Unicode number
          (in hexadecimal).
          So, for example,
          a copyright symbol is authored as <c>&amp;#x00A9;</c>.
        </p>
<!-- ex &#x0080; - &#x009F; controls -->
<!-- ex &#x00A0; non-breaking space  -->
<!-- ex &#x00AD; soft hyphen         -->
        <table>
          <title>Latin-1 Supplement, Regular</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>00A_</c></cell>
              <cell> </cell>
              <cell>¡</cell>
              <cell>¢</cell>
              <cell>£</cell>
              <cell>¤</cell>
              <cell>¥</cell>
              <cell>¦</cell>
              <cell>§</cell>
              <cell>¨</cell>
              <cell>©</cell>
              <cell>ª</cell>
              <cell>«</cell>
              <cell>¬</cell>
              <cell/>
              <cell>®</cell>
              <cell>¯</cell>
            </row>
            <row>
              <cell><c>00B_</c></cell>
              <cell>°</cell>
              <cell>±</cell>
              <cell>²</cell>
              <cell>³</cell>
              <cell>´</cell>
              <cell>µ</cell>
              <cell>¶</cell>
              <cell>·</cell>
              <cell>¸</cell>
              <cell>¹</cell>
              <cell>º</cell>
              <cell>»</cell>
              <cell>¼</cell>
              <cell>½</cell>
              <cell>¾</cell>
              <cell>¿</cell>
            </row>
            <row>
              <cell><c>00C_</c></cell>
              <cell>À</cell>
              <cell>Á</cell>
              <cell>Â</cell>
              <cell>Ã</cell>
              <cell>Ä</cell>
              <cell>Å</cell>
              <cell>Æ</cell>
              <cell>Ç</cell>
              <cell>È</cell>
              <cell>É</cell>
              <cell>Ê</cell>
              <cell>Ë</cell>
              <cell>Ì</cell>
              <cell>Í</cell>
              <cell>Î</cell>
              <cell>Ï</cell>
            </row>
            <row>
              <cell><c>00D_</c></cell>
              <cell>Ð</cell>
              <cell>Ñ</cell>
              <cell>Ò</cell>
              <cell>Ó</cell>
              <cell>Ô</cell>
              <cell>Õ</cell>
              <cell>Ö</cell>
              <cell>×</cell>
              <cell>Ø</cell>
              <cell>Ù</cell>
              <cell>Ú</cell>
              <cell>Û</cell>
              <cell>Ü</cell>
              <cell>Ý</cell>
              <cell>Þ</cell>
              <cell>ß</cell>
            </row>
            <row>
              <cell><c>00E_</c></cell>
              <cell>à</cell>
              <cell>á</cell>
              <cell>â</cell>
              <cell>ã</cell>
              <cell>ä</cell>
              <cell>å</cell>
              <cell>æ</cell>
              <cell>ç</cell>
              <cell>è</cell>
              <cell>é</cell>
              <cell>ê</cell>
              <cell>ë</cell>
              <cell>ì</cell>
              <cell>í</cell>
              <cell>î</cell>
              <cell>ï</cell>
            </row>
            <row>
              <cell><c>00F_</c></cell>
              <cell>ð</cell>
              <cell>ñ</cell>
              <cell>ò</cell>
              <cell>ó</cell>
              <cell>ô</cell>
              <cell>õ</cell>
              <cell>ö</cell>
              <cell>÷</cell>
              <cell>ø</cell>
              <cell>ù</cell>
              <cell>ú</cell>
              <cell>û</cell>
              <cell>ü</cell>
              <cell>ý</cell>
              <cell>þ</cell>
              <cell>ÿ</cell>
            </row>
          </tabular>
        </table>
      </paragraphs>
      <paragraphs>
        <title>Monospace, Basic Latin and Latin-1 Supplement, <c>U+0000</c><ndash/><c>U+00FF</c></title>
        <p>
          A monospace font is critical for samples of keyboard input and to distinguish exact technical input from running commentary.
          We list here all of the reasonable characters from the first 256 Unicode code points.
          (We skip the same 65 control characters from above, and the soft hyphen.)
          These should all render fine in HTML and when processed with <c>xelatex</c>;
          however, our focus for PDF output in this sample article is on what is possible when processing with <c>pdflatex</c>.
          First, characters from <c>U+0000</c><ndash/><c>U+007F</c>.
        </p>
<!-- ex &#x0000; - &#x001F; controls -->
<!-- ex &#x007F; control             -->
        <table>
          <title>Basic Latin, Monospace</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>002_</c></cell>
              <cell><c> </c></cell>
              <cell><c>!</c></cell>
              <cell><c>"</c></cell>
              <cell><c>#</c></cell>
              <cell><c>$</c></cell>
              <cell><c>%</c></cell>
              <cell><c>&amp;</c></cell>
              <cell><c>'</c></cell>
              <cell><c>(</c></cell>
              <cell><c>)</c></cell>
              <cell><c>*</c></cell>
              <cell><c>+</c></cell>
              <cell><c>,</c></cell>
              <cell><c>-</c></cell>
              <cell><c>.</c></cell>
              <cell><c>/</c></cell>
            </row>
            <row>
              <cell><c>003_</c></cell>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>:</c></cell>
              <cell><c>;</c></cell>
              <cell><c>&lt;</c></cell>
              <cell><c>=</c></cell>
              <cell><c>&gt;</c></cell>
              <cell><c>?</c></cell>
            </row>
            <row>
              <cell><c>004_</c></cell>
              <cell><c>@</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
              <cell><c>G</c></cell>
              <cell><c>H</c></cell>
              <cell><c>I</c></cell>
              <cell><c>J</c></cell>
              <cell><c>K</c></cell>
              <cell><c>L</c></cell>
              <cell><c>M</c></cell>
              <cell><c>N</c></cell>
              <cell><c>O</c></cell>
            </row>
            <row>
              <cell><c>005_</c></cell>
              <cell><c>P</c></cell>
              <cell><c>Q</c></cell>
              <cell><c>R</c></cell>
              <cell><c>S</c></cell>
              <cell><c>T</c></cell>
              <cell><c>U</c></cell>
              <cell><c>V</c></cell>
              <cell><c>W</c></cell>
              <cell><c>X</c></cell>
              <cell><c>Y</c></cell>
              <cell><c>Z</c></cell>
              <cell><c>[</c></cell>
              <cell><c>\</c></cell>
              <cell><c>]</c></cell>
              <cell><c>^</c></cell>
              <cell><c>_</c></cell>
            </row>
            <row>
              <cell><c>006_</c></cell>
              <cell><c>`</c></cell>
              <cell><c>a</c></cell>
              <cell><c>b</c></cell>
              <cell><c>c</c></cell>
              <cell><c>d</c></cell>
              <cell><c>e</c></cell>
              <cell><c>f</c></cell>
              <cell><c>g</c></cell>
              <cell><c>h</c></cell>
              <cell><c>i</c></cell>
              <cell><c>j</c></cell>
              <cell><c>k</c></cell>
              <cell><c>l</c></cell>
              <cell><c>m</c></cell>
              <cell><c>n</c></cell>
              <cell><c>o</c></cell>
            </row>
            <row>
              <cell><c>007_</c></cell>
              <cell><c>p</c></cell>
              <cell><c>q</c></cell>
              <cell><c>r</c></cell>
              <cell><c>s</c></cell>
              <cell><c>t</c></cell>
              <cell><c>u</c></cell>
              <cell><c>v</c></cell>
              <cell><c>w</c></cell>
              <cell><c>x</c></cell>
              <cell><c>y</c></cell>
              <cell><c>z</c></cell>
              <cell><c>{</c></cell>
              <cell><c>|</c></cell>
              <cell><c>}</c></cell>
              <cell><c>~</c></cell>
              <cell/>
            </row>
          </tabular>
        </table>
        <p>
          Note that the single and double quotes are upright and dumb,
          not curly and smart:
          <c>' " ' " ' "</c>.
          And a backtick is a backtick:
          <c>` ` `</c>.
          The zero is distinguished from the capital
          <q>oh</q>: <c>0 O 0 O 0 O</c>.
          And the numeral one is slightly different from the lower-case
          <q>ell</q>: <c>1 l 1 l 1 l</c>.
          The hyphen should be short and not expanded into some other kind of dash:
          <c>- - -</c>.
          These characters should all cut/paste out of a PDF into a text editor with no conversion to other characters.
        </p>
        <p>
          Now the remaining characters from <c>U+0080</c><ndash/><c>U+00FF</c>.
          The <c>program</c> tag is implemented in <latex/> via the <c>listings</c> package,
          and these characters require ad-hoc replacements for processing by <c>pdflatex</c>.
          (You can see the replacements in the preamble of the <latex/> source for this document.)
          In certain situations, the replacement mechanism provided by the <c>listings</c> package will cause the characters below to produce a <latex/> compilation error when they are processed by <c>pdflatex</c> inside a table cell
          (situations we have avoided in the table below).
          The only workaround in this case is to switch to <c>xelatex</c>.
        </p>
<!-- ex &#x0080; - &#x009F; controls -->
<!-- ex &#x00A0; non-breaking space  -->
<!-- ex &#x00AD; soft hyphen         -->
        <table>
          <title>Latin-1 Supplement, Monospace</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>00A_</c></cell>
              <cell/>
              <cell><c>¡</c></cell>
              <cell><c>¢</c></cell>
              <cell><c>£</c></cell>
              <cell><c>¤</c></cell>
              <cell><c>¥</c></cell>
              <cell><c>¦</c></cell>
              <cell><c>§</c></cell>
              <cell><c>¨</c></cell>
              <cell><c>©</c></cell>
              <cell><c>ª</c></cell>
              <cell><c>«</c></cell>
              <cell><c>¬</c></cell>
              <cell/>
              <cell><c>®</c></cell>
              <cell><c>¯</c></cell>
            </row>
            <row>
              <cell><c>00B_</c></cell>
              <cell><c>°</c></cell>
              <cell><c>±</c></cell>
              <cell><c>²</c></cell>
              <cell><c>³</c></cell>
              <cell><c>´</c></cell>
              <cell><c>µ</c></cell>
              <cell><c>¶</c></cell>
              <cell><c>·</c></cell>
              <cell><c>¸</c></cell>
              <cell><c>¹</c></cell>
              <cell><c>º</c></cell>
              <cell><c>»</c></cell>
              <cell><c>¼</c></cell>
              <cell><c>½</c></cell>
              <cell><c>¾</c></cell>
              <cell><c>¿</c></cell>
            </row>
            <row>
              <cell><c>00C_</c></cell>
              <cell><c>À</c></cell>
              <cell><c>Á</c></cell>
              <cell><c>Â</c></cell>
              <cell><c>Ã</c></cell>
              <cell><c>Ä</c></cell>
              <cell><c>Å</c></cell>
              <cell><c>Æ</c></cell>
              <cell><c>Ç</c></cell>
              <cell><c>È</c></cell>
              <cell><c>É</c></cell>
              <cell><c>Ê</c></cell>
              <cell><c>Ë</c></cell>
              <cell><c>Ì</c></cell>
              <cell><c>Í</c></cell>
              <cell><c>Î</c></cell>
              <cell><c>Ï</c></cell>
            </row>
            <row>
              <cell><c>00D_</c></cell>
              <cell><c>Ð</c></cell>
              <cell><c>Ñ</c></cell>
              <cell><c>Ò</c></cell>
              <cell><c>Ó</c></cell>
              <cell><c>Ô</c></cell>
              <cell><c>Õ</c></cell>
              <cell><c>Ö</c></cell>
              <cell><c>×</c></cell>
              <cell><c>Ø</c></cell>
              <cell><c>Ù</c></cell>
              <cell><c>Ú</c></cell>
              <cell><c>Û</c></cell>
              <cell><c>Ü</c></cell>
              <cell><c>Ý</c></cell>
              <cell><c>Þ</c></cell>
              <cell><c>ß</c></cell>
            </row>
            <row>
              <cell><c>00E_</c></cell>
              <cell><c>à</c></cell>
              <cell><c>á</c></cell>
              <cell><c>â</c></cell>
              <cell><c>ã</c></cell>
              <cell><c>ä</c></cell>
              <cell><c>å</c></cell>
              <cell><c>æ</c></cell>
              <cell><c>ç</c></cell>
              <cell><c>è</c></cell>
              <cell><c>é</c></cell>
              <cell><c>ê</c></cell>
              <cell><c>ë</c></cell>
              <cell><c>ì</c></cell>
              <cell><c>í</c></cell>
              <cell><c>î</c></cell>
              <cell><c>ï</c></cell>
            </row>
            <row>
              <cell><c>00F_</c></cell>
              <cell><c>ð</c></cell>
              <cell><c>ñ</c></cell>
              <cell><c>ò</c></cell>
              <cell><c>ó</c></cell>
              <cell><c>ô</c></cell>
              <cell><c>õ</c></cell>
              <cell><c>ö</c></cell>
              <cell><c>÷</c></cell>
              <cell><c>ø</c></cell>
              <cell><c>ù</c></cell>
              <cell><c>ú</c></cell>
              <cell><c>û</c></cell>
              <cell><c>ü</c></cell>
              <cell><c>ý</c></cell>
              <cell><c>þ</c></cell>
              <cell><c>ÿ</c></cell>
            </row>
          </tabular>
        </table>
        <p>
          The <c>pre</c> tag is implemented in <latex/> with the <c>fancyvrb</c> package.
          You can compare results here with the table above:
          each line here corresponds to a row above.
        </p>
<!-- These two raise errors in both of the following -->
<!-- ex &#x00A0; non-breaking space              -->
<!-- ex &#x00AD; soft hyphen                     -->
<pre>
<cline>  ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯</cline>
<cline>° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿</cline>
<cline>À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï</cline>
<cline>Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß</cline>
<cline>à á â ã ä å æ ç è é ê ë ì í î ï</cline>
<cline>ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ</cline>
</pre>
        <p>
          The <c>console</c> tag is also implemented with <c>fancyvrb</c>,
          with adjustments for the input lines.
          It will not look like it,
          but these are six such inputs, with results similar to those above,
          but now in bold.
        </p>
<console margins="0%" prompt="">
<input>  ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯</input>
<input>° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿</input>
<input>À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï</input>
<input>Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß</input>
<input>à á â ã ä å æ ç è é ê ë ì í î ï</input>
<input>ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ</input>
</console>
        <p>
          We take care to render the <c>U+0080</c><ndash/><c>U+00FF</c> characters in Sage cells.
          This would allow some flexibility in comments and strings employed.
          The following is just a test of these characters in the <c>input</c> and <c>output</c> of a <c>sage</c> element.
          This is not functional code.
        </p>
<sage doctest="not tested">
<input>
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
à á â ã ä å æ ç è é ê ë ì í î ï
ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
</input>
<output>
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
à á â ã ä å æ ç è é ê ë ì í î ï
ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
</output>
</sage>
      </paragraphs>
      <p>
        The table below has a single column,
        and each cell of the table has a string of 10 characters inside a <c>c</c> element.
        It is meant to test if the font is monospace in this situation.
      </p>
      <table>
        <title>Alignment Test</title>
        <tabular>
          <row>
            <cell><c>0123456789</c></cell>
          </row>
          <row>
            <cell><c>9876543210</c></cell>
          </row>
          <row>
            <cell><c>iiiiiiiiii</c></cell>
          </row>
          <row>
            <cell><c>mmmmmmmmmm</c></cell>
          </row>
        </tabular>
      </table>
      <p>
        Again, more examples and more thorough explanations can be found in the sample:
        <c>examples/fonts/fonts-and-characters.xml</c>.
        Be aware that the nature of the more advanced sample is that it will likely produce many errors when processed with <c>pdflatex</c>.
        Adding <c>-interaction batchmode</c> or <c>-interaction nonstopmode</c> to the <c>pdflatex</c> command line will sometimes be less painful than acknowledging each error.
        The more advanced sample will perform well when processed with <c>xelatex</c>.
      </p>
    </section>
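
The counts and escapes quoted in the Basic Latin and Latin-1 Supplement discussions in the source above can be checked with a short script. This is only a sketch in Python, which is not part of PreTeXt; the helper names `is_control` and `hex_escape` are hypothetical and chosen just for this illustration.

```python
import unicodedata


def is_control(cp: int) -> bool:
    """True when the code point is in Unicode general category Cc (control)."""
    return unicodedata.category(chr(cp)) == "Cc"


def hex_escape(ch: str) -> str:
    """Numeric character reference in the four-digit hexadecimal style."""
    return f"&#x{ord(ch):04X};"


# Basic Latin is U+0000 to U+007F; together with Latin-1 Supplement
# the first 256 code points run from U+0000 to U+00FF.
basic_visible = [cp for cp in range(0x0000, 0x0080) if not is_control(cp)]
controls = [cp for cp in range(0x0000, 0x0100) if is_control(cp)]

print(len(basic_visible))  # 95, counting the space at U+0020
print(len(controls))       # 65: U+0000-U+001F, U+007F, U+0080-U+009F
print(hex_escape("B"), hex_escape("\u00a9"))  # &#x0042; &#x00A9;
```

The soft hyphen (`U+00AD`) and the non-breaking space (`U+00A0`) are not control characters in this sense, which is why the text mentions them separately.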

Supporting a multitude of characters across many languages and many output formats can be a challenge. One of our goals is to make this much easier for authors. Fortunately, the Unicode standard has brought improvements over the 7-bit ASCII standard of old.

Unicode Characters for HTML Output.


First, we discuss HTML output. If you include Unicode characters in your PreTeXt source, they should survive just fine en route to a web browser or e-reader. Here are the caveats for HTML output:

So that you can continue to get the best results with print and PDF output, use available empty elements for obscure characters, even if targeting HTML output, before resorting to a Unicode character. For example, use <copyright/> for the copyright symbol in text before resorting to the Unicode character U+00A9. It is a bit more work, but you will get better results with other conversions, even if you initially are only fascinated by HTML.

How you actually enter Unicode characters into your source file is dependent on your editor and operating system, and is therefore outside the scope of our documentation. You can cut-and-paste characters and text from the source of our examples for initial testing and experimentation.

Always, always identify your source as having Unicode characters by including the incantation
```
<?xml version="1.0" encoding="UTF-8" ?>
```
as the first line of your source file. (You may be able to accurately cut-and-paste this version here. But if the copy has non-standard characters in it, go back to the top of this source file for a copy.)

Alan Wood’s Unicode Resources (www.alanwood.net/unicode/unicode_samples.html) has a plethora of samples of various groups of Unicode characters. If you, or your readers, are “missing” characters in a web browser, this is a good place to start testing the local setup.
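
The declaration check described in the list above can be sketched in Python. This is only an illustration and not part of PreTeXt: the name `declares_utf8` is hypothetical, the pattern accepts only the double-quoted form shown above, and a byte-order mark in front of the declaration is treated as a failure.

```python
import re

# Hypothetical helper (not part of PreTeXt): matches the double-quoted
# declaration form shown above and captures the declared encoding.
DECLARATION = re.compile(r'<\?xml\s+version="1\.\d"\s+encoding="([^"]+)"\s*\?>')


def declares_utf8(source: bytes) -> bool:
    """Return True if the bytes decode as UTF-8 and open with a UTF-8 XML declaration."""
    try:
        text = source.decode("utf-8")
    except UnicodeDecodeError:
        return False  # the bytes are not valid UTF-8 at all
    match = DECLARATION.match(text)
    return match is not None and match.group(1).upper() == "UTF-8"


# A tiny in-memory stand-in for a source file.
sample = '<?xml version="1.0" encoding="UTF-8" ?>\n<pretext>©</pretext>\n'.encode("utf-8")
print(declares_utf8(sample))
print(declares_utf8(b"<pretext/>"))
```

Running a check like this over the first bytes of a source file is a quick way to catch a missing declaration, or a file saved in another encoding, before any conversion is attempted.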

Characters in LaTeX, PDF, print.

View Source for paragraphs

<paragraphs>
  <title>Characters in <latex/>, PDF, print</title>
  <p>
    The situation for <latex/> is a bit more complicated,
    since <tex/> pre-dates Unicode's widespread adoption.
  </p>
  <p>
    This sample article is intended to work well,
    out-of-the-box, for authors just starting with <pretext/>.
    So we only include here examples that we know are likely to convert to <init>PDF</init> without any errors.
    For more extensive examples and experiments,
    we provide the sample document <c>examples/fonts/fonts-and-characters.xml</c>,
    so be aware of that example as you look to see what is possible.
  </p>
  <p>
    Similarly, you should be able to process this sample article successfully with various <latex/> engines.
    We test regularly with <c>pdflatex</c> and <c>xelatex</c> and provide online sample PDF output of this document processed by <c>pdflatex</c>.
    In principle,
    you should be able to use <c>latex</c> (to produce a DVI), and possibly other (unsupported) engines,
    such as <c>lualatex</c>.
  </p>
  <p>
    Once you get beyond the Latin alphabet,
    with accents common in Western Europe and the Western Hemisphere,
    you will almost assuredly need to restrict your attention to producing <init>PDF</init> output with the <c>xelatex</c> engine.
    This is discussed and tested in <c>examples/fonts/fonts-and-characters.xml</c>.
  </p>
</paragraphs>

The situation for LaTeX is a bit more complicated, since TeX pre-dates Unicode’s widespread adoption.

This sample article is intended to work well, out-of-the-box, for authors just starting with PreTeXt. So we only include here examples that we know are likely to convert to PDF without any errors. For more extensive examples and experiments, we provide the sample document examples/fonts/fonts-and-characters.xml, so be aware of that example as you look to see what is possible.

Similarly, you should be able to process this sample article successfully with various LaTeX engines. We test regularly with pdflatex and xelatex and provide online sample PDF output of this document processed by pdflatex. In principle, you should be able to use latex (to produce a DVI), and possibly other (unsupported) engines, such as lualatex.

Once you get beyond the Latin alphabet, with accents common in Western Europe and the Western Hemisphere, you will almost assuredly need to restrict your attention to producing PDF output with the xelatex engine. This is discussed and tested in examples/fonts/fonts-and-characters.xml.
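
As a rough illustration of the point above, the following Python sketch flags text that strays beyond the first 256 code points. Both the `U+00FF` cutoff and the name `suggest_engine` are assumptions made for this example. The real boundary depends on the fonts and packages in use, so treat the result as a hint rather than a rule.

```python
def suggest_engine(text: str, limit: int = 0x00FF) -> str:
    """Suggest "xelatex" when any character lies beyond `limit`, else "pdflatex".

    The default limit (end of the Latin-1 Supplement block) is an assumption
    for illustration, not a rule taken from PreTeXt.
    """
    return "xelatex" if any(ord(ch) > limit for ch in text) else "pdflatex"


print(suggest_engine("café © 2024"))
print(suggest_engine("Ελληνικά"))
```

The first call stays within Basic Latin and the Latin-1 Supplement. The second contains Greek letters, which lie well past the cutoff.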

Basic Latin, `U+0000`–`U+007F`.

View Source for paragraphs

      <paragraphs>
        <title>Basic Latin, <c>U+0000</c><ndash/><c>U+007F</c></title>
        <p>
          Unicode assigns each character a numeric code point,
          typically written in hexadecimal (base 16) notation,
          and an encoding such as UTF-8 stores a code point in one or more 8-bit bytes.
          A single byte allows 256 values, and the first 128
          (hex <c>00</c> to <c>7F</c>)
          are the
          <q>usual</q>
          Latin characters, with some values used as control codes.
          The 95 characters that remain once the control codes are excluded are the most basic,
          and all render using <c>pdflatex</c> or <c>xelatex</c> with no special setup
          (and render easily in HTML).
          <c>U+0000</c> to <c>U+001F</c> are control codes and not used here.
          <c>U+007F</c> is also a control code and so is excluded,
          while <c>U+0020</c> is a space,
          so it appears invisible in the table.
          In the source we have authored each character by its escaped version using its Unicode number
          (in hexadecimal).
          So, for example,
          capital-B is authored as <c>&amp;#x0042;</c>.
        </p>
<!-- ex &#x0000; - &#x001F; controls -->
<!-- ex &#x007F; control             -->
<!-- ex &#x0020; space, not visible  -->
        <table>
          <title>Basic Latin, Regular</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>002_</c></cell>
              <cell> </cell>
              <cell>!</cell>
              <cell>"</cell>
              <cell>#</cell>
              <cell>$</cell>
              <cell>%</cell>
              <cell>&amp;</cell>
              <cell>'</cell>
              <cell>(</cell>
              <cell>)</cell>
              <cell>*</cell>
              <cell>+</cell>
              <cell>,</cell>
              <cell>-</cell>
              <cell>.</cell>
              <cell>/</cell>
            </row>
            <row>
              <cell><c>003_</c></cell>
              <cell>0</cell>
              <cell>1</cell>
              <cell>2</cell>
              <cell>3</cell>
              <cell>4</cell>
              <cell>5</cell>
              <cell>6</cell>
              <cell>7</cell>
              <cell>8</cell>
              <cell>9</cell>
              <cell>:</cell>
              <cell>;</cell>
              <cell>&lt;</cell>
              <cell>=</cell>
              <cell>&gt;</cell>
              <cell>?</cell>
            </row>
            <row>
              <cell><c>004_</c></cell>
              <cell>@</cell>
              <cell>A</cell>
              <cell>B</cell>
              <cell>C</cell>
              <cell>D</cell>
              <cell>E</cell>
              <cell>F</cell>
              <cell>G</cell>
              <cell>H</cell>
              <cell>I</cell>
              <cell>J</cell>
              <cell>K</cell>
              <cell>L</cell>
              <cell>M</cell>
              <cell>N</cell>
              <cell>O</cell>
            </row>
            <row>
              <cell><c>005_</c></cell>
              <cell>P</cell>
              <cell>Q</cell>
              <cell>R</cell>
              <cell>S</cell>
              <cell>T</cell>
              <cell>U</cell>
              <cell>V</cell>
              <cell>W</cell>
              <cell>X</cell>
              <cell>Y</cell>
              <cell>Z</cell>
              <cell>[</cell>
              <cell>\</cell>
              <cell>]</cell>
              <cell>^</cell>
              <cell>_</cell>
            </row>
            <row>
              <cell><c>006_</c></cell>
              <cell>`</cell>
              <cell>a</cell>
              <cell>b</cell>
              <cell>c</cell>
              <cell>d</cell>
              <cell>e</cell>
              <cell>f</cell>
              <cell>g</cell>
              <cell>h</cell>
              <cell>i</cell>
              <cell>j</cell>
              <cell>k</cell>
              <cell>l</cell>
              <cell>m</cell>
              <cell>n</cell>
              <cell>o</cell>
            </row>
            <row>
              <cell><c>007_</c></cell>
              <cell>p</cell>
              <cell>q</cell>
              <cell>r</cell>
              <cell>s</cell>
              <cell>t</cell>
              <cell>u</cell>
              <cell>v</cell>
              <cell>w</cell>
              <cell>x</cell>
              <cell>y</cell>
              <cell>z</cell>
              <cell>{</cell>
              <cell>|</cell>
              <cell>}</cell>
              <cell>~</cell>
              <cell/>
            </row>
          </tabular>
        </table>
      </paragraphs>

Unicode assigns each character a numeric code point, typically written in hexadecimal (base 16) notation, and an encoding such as UTF-8 stores a code point in one or more 8-bit bytes. A single byte allows 256 values, and the first 128 (hex 00 to 7F) are the “usual” Latin characters, with some values used as control codes. The 95 characters that remain once the control codes are excluded are the most basic, and all render using pdflatex or xelatex with no special setup (and render easily in HTML). U+0000 to U+001F are control codes and not used here. U+007F is also a control code and so is excluded, while U+0020 is a space, so it appears invisible in the table. In the source we have authored each character by its escaped version using its Unicode number (in hexadecimal). So, for example, capital-B is authored as `&#x0042;`.
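
To make the escaped form concrete, here is a small Python sketch that lets a standard XML parser resolve a character reference and then counts the non-control characters of the block. The helper name `describe` is hypothetical. Only the standard-library `xml.etree.ElementTree` module is used.

```python
import xml.etree.ElementTree as ET


def describe(reference: str) -> tuple:
    """Resolve one character reference; report (character, code point, UTF-8 byte count)."""
    # Wrap the reference in a throwaway element so a real XML parser resolves it.
    char = ET.fromstring(f"<c>{reference}</c>").text
    return char, f"U+{ord(char):04X}", len(char.encode("utf-8"))


# Code points U+0020 through U+007E: everything in Basic Latin that is not a control code.
printable = [chr(cp) for cp in range(0x20, 0x7F)]

print(describe("&#x0042;"))
print(len(printable))
```

The list `printable` starts with the space and ends with the tilde, which matches the six table rows `002_` through `007_` with the final cell left empty.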

Table 22.1. Basic Latin, Regular

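| | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 002_ | | ! | " | # | $ | % | & | ’ | ( | ) | * | + | , | - | . | / |
| 003_ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | ; | < | = | > | ? |
| 004_ | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
| 005_ | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
| 006_ | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
| 007_ | p | q | r | s | t | u | v | w | x | y | z | { | \| | } | ~ | |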

Latin-1 Supplement, `U+0080`–`U+00FF`.

View Source for paragraphs

      <paragraphs>
        <title>Latin-1 Supplement, <c>U+0080</c><ndash/><c>U+00FF</c></title>
        <p>
          Now we are interested in the next 128 code points
          (hex <c>80</c> to <c>FF</c>).
          The first 32 are again control codes, and <c>U+00A0</c> is a non-breaking space,
          so it is invisible, while <c>U+00AD</c> is a soft hyphen
          (which we have not implemented and so is excluded).
          We have taken care to see that the remainder will render using <c>pdflatex</c> or <c>xelatex</c> with no special setup
          (and in HTML).
          In the source we have authored each character by its escaped version using its Unicode number
          (in hexadecimal).
          So, for example,
          a copyright symbol is authored as <c>&amp;#x00A9;</c>.
        </p>
<!-- ex &#x0080; - &#x009F; controls -->
<!-- ex &#x00A0; non-breaking space  -->
<!-- ex &#x00AD; soft hyphen         -->
        <table>
          <title>Latin-1 Supplement, Regular</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>00A_</c></cell>
              <cell> </cell>
              <cell>¡</cell>
              <cell>¢</cell>
              <cell>£</cell>
              <cell>¤</cell>
              <cell>¥</cell>
              <cell>¦</cell>
              <cell>§</cell>
              <cell>¨</cell>
              <cell>©</cell>
              <cell>ª</cell>
              <cell>«</cell>
              <cell>¬</cell>
              <cell/>
              <cell>®</cell>
              <cell>¯</cell>
            </row>
            <row>
              <cell><c>00B_</c></cell>
              <cell>°</cell>
              <cell>±</cell>
              <cell>²</cell>
              <cell>³</cell>
              <cell>´</cell>
              <cell>µ</cell>
              <cell>¶</cell>
              <cell>·</cell>
              <cell>¸</cell>
              <cell>¹</cell>
              <cell>º</cell>
              <cell>»</cell>
              <cell>¼</cell>
              <cell>½</cell>
              <cell>¾</cell>
              <cell>¿</cell>
            </row>
            <row>
              <cell><c>00C_</c></cell>
              <cell>À</cell>
              <cell>Á</cell>
              <cell>Â</cell>
              <cell>Ã</cell>
              <cell>Ä</cell>
              <cell>Å</cell>
              <cell>Æ</cell>
              <cell>Ç</cell>
              <cell>È</cell>
              <cell>É</cell>
              <cell>Ê</cell>
              <cell>Ë</cell>
              <cell>Ì</cell>
              <cell>Í</cell>
              <cell>Î</cell>
              <cell>Ï</cell>
            </row>
            <row>
              <cell><c>00D_</c></cell>
              <cell>Ð</cell>
              <cell>Ñ</cell>
              <cell>Ò</cell>
              <cell>Ó</cell>
              <cell>Ô</cell>
              <cell>Õ</cell>
              <cell>Ö</cell>
              <cell>×</cell>
              <cell>Ø</cell>
              <cell>Ù</cell>
              <cell>Ú</cell>
              <cell>Û</cell>
              <cell>Ü</cell>
              <cell>Ý</cell>
              <cell>Þ</cell>
              <cell>ß</cell>
            </row>
            <row>
              <cell><c>00E_</c></cell>
              <cell>à</cell>
              <cell>á</cell>
              <cell>â</cell>
              <cell>ã</cell>
              <cell>ä</cell>
              <cell>å</cell>
              <cell>æ</cell>
              <cell>ç</cell>
              <cell>è</cell>
              <cell>é</cell>
              <cell>ê</cell>
              <cell>ë</cell>
              <cell>ì</cell>
              <cell>í</cell>
              <cell>î</cell>
              <cell>ï</cell>
            </row>
            <row>
              <cell><c>00F_</c></cell>
              <cell>ð</cell>
              <cell>ñ</cell>
              <cell>ò</cell>
              <cell>ó</cell>
              <cell>ô</cell>
              <cell>õ</cell>
              <cell>ö</cell>
              <cell>÷</cell>
              <cell>ø</cell>
              <cell>ù</cell>
              <cell>ú</cell>
              <cell>û</cell>
              <cell>ü</cell>
              <cell>ý</cell>
              <cell>þ</cell>
              <cell>ÿ</cell>
            </row>
          </tabular>
        </table>
      </paragraphs>

Now we are interested in the next 128 code points (hex 80 to FF). The first 32 are again control codes, and U+00A0 is a non-breaking space, so it is invisible, while U+00AD is a soft hyphen (which we have not implemented and so is excluded). We have taken care to see that the remainder will render using pdflatex or xelatex with no special setup (and in HTML). In the source we have authored each character by its escaped version using its Unicode number (in hexadecimal). So, for example, a copyright symbol is authored as `&#x00A9;`.
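
The exclusions described above can be reproduced with Python's standard `unicodedata` module. In this sketch the function name `visible_latin1_supplement` is hypothetical. The three skipped general categories correspond to the control codes (`Cc`), the non-breaking space (`Zs`), and the soft hyphen (`Cf`).

```python
import unicodedata


def visible_latin1_supplement() -> list:
    """List the characters of U+0080 through U+00FF that are kept in the table."""
    # Cc = control codes, Zs = space separators, Cf = format characters.
    skipped = {"Cc", "Zs", "Cf"}
    return [
        chr(cp)
        for cp in range(0x80, 0x100)
        if unicodedata.category(chr(cp)) not in skipped
    ]


visible = visible_latin1_supplement()
print(len(visible))
print("©".encode("utf-8"))
```

The second `print` shows that a character from this block occupies two bytes once the source file is saved as UTF-8, even though its code point fits in a single byte.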

Table 22.2. Latin-1 Supplement, Regular

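| | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 00A_ | | ¡ | ¢ | £ | ¤ | ¥ | ¦ | § | ¨ | © | ª | « | ¬ | | ® | ¯ |
| 00B_ | ° | ± | ² | ³ | ´ | µ | ¶ | · | ¸ | ¹ | º | » | ¼ | ½ | ¾ | ¿ |
| 00C_ | À | Á | Â | Ã | Ä | Å | Æ | Ç | È | É | Ê | Ë | Ì | Í | Î | Ï |
| 00D_ | Ð | Ñ | Ò | Ó | Ô | Õ | Ö | × | Ø | Ù | Ú | Û | Ü | Ý | Þ | ß |
| 00E_ | à | á | â | ã | ä | å | æ | ç | è | é | ê | ë | ì | í | î | ï |
| 00F_ | ð | ñ | ò | ó | ô | õ | ö | ÷ | ø | ù | ú | û | ü | ý | þ | ÿ |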

Monospace, Basic Latin and Latin-1 Supplement, `U+0000`–`U+00FF`.

View Source for paragraphs

      <paragraphs>
        <title>Monospace, Basic Latin and Latin-1 Supplement, <c>U+0000</c><ndash/><c>U+00FF</c></title>
        <p>
          A monospace font is critical for samples of keyboard input and to distinguish exact technical input from running commentary.
          We list here all of the reasonable characters from the first 256 Unicode code points.
          (We skip the same 65 control characters from above, and the soft hyphen.)
          These should all render fine in HTML and when processed with <c>xelatex</c>;
          however, our focus with this sample article for PDF output is on what is possible when processed with <c>pdflatex</c>.
          First, characters from <c>U+0000</c><ndash/><c>U+007F</c>.
        </p>
<!-- ex &#x0000; - &#x001F; controls -->
<!-- ex &#x007F; control             -->
        <table>
          <title>Basic Latin, Monospace</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>002_</c></cell>
              <cell><c> </c></cell>
              <cell><c>!</c></cell>
              <cell><c>"</c></cell>
              <cell><c>#</c></cell>
              <cell><c>$</c></cell>
              <cell><c>%</c></cell>
              <cell><c>&amp;</c></cell>
              <cell><c>'</c></cell>
              <cell><c>(</c></cell>
              <cell><c>)</c></cell>
              <cell><c>*</c></cell>
              <cell><c>+</c></cell>
              <cell><c>,</c></cell>
              <cell><c>-</c></cell>
              <cell><c>.</c></cell>
              <cell><c>/</c></cell>
            </row>
            <row>
              <cell><c>003_</c></cell>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>:</c></cell>
              <cell><c>;</c></cell>
              <cell><c>&lt;</c></cell>
              <cell><c>=</c></cell>
              <cell><c>&gt;</c></cell>
              <cell><c>?</c></cell>
            </row>
            <row>
              <cell><c>004_</c></cell>
              <cell><c>@</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
              <cell><c>G</c></cell>
              <cell><c>H</c></cell>
              <cell><c>I</c></cell>
              <cell><c>J</c></cell>
              <cell><c>K</c></cell>
              <cell><c>L</c></cell>
              <cell><c>M</c></cell>
              <cell><c>N</c></cell>
              <cell><c>O</c></cell>
            </row>
            <row>
              <cell><c>005_</c></cell>
              <cell><c>P</c></cell>
              <cell><c>Q</c></cell>
              <cell><c>R</c></cell>
              <cell><c>S</c></cell>
              <cell><c>T</c></cell>
              <cell><c>U</c></cell>
              <cell><c>V</c></cell>
              <cell><c>W</c></cell>
              <cell><c>X</c></cell>
              <cell><c>Y</c></cell>
              <cell><c>Z</c></cell>
              <cell><c>[</c></cell>
              <cell><c>\</c></cell>
              <cell><c>]</c></cell>
              <cell><c>^</c></cell>
              <cell><c>_</c></cell>
            </row>
            <row>
              <cell><c>006_</c></cell>
              <cell><c>`</c></cell>
              <cell><c>a</c></cell>
              <cell><c>b</c></cell>
              <cell><c>c</c></cell>
              <cell><c>d</c></cell>
              <cell><c>e</c></cell>
              <cell><c>f</c></cell>
              <cell><c>g</c></cell>
              <cell><c>h</c></cell>
              <cell><c>i</c></cell>
              <cell><c>j</c></cell>
              <cell><c>k</c></cell>
              <cell><c>l</c></cell>
              <cell><c>m</c></cell>
              <cell><c>n</c></cell>
              <cell><c>o</c></cell>
            </row>
            <row>
              <cell><c>007_</c></cell>
              <cell><c>p</c></cell>
              <cell><c>q</c></cell>
              <cell><c>r</c></cell>
              <cell><c>s</c></cell>
              <cell><c>t</c></cell>
              <cell><c>u</c></cell>
              <cell><c>v</c></cell>
              <cell><c>w</c></cell>
              <cell><c>x</c></cell>
              <cell><c>y</c></cell>
              <cell><c>z</c></cell>
              <cell><c>{</c></cell>
              <cell><c>|</c></cell>
              <cell><c>}</c></cell>
              <cell><c>~</c></cell>
              <cell/>
            </row>
          </tabular>
        </table>
        <p>
          Note that the single and double quotes are upright and dumb,
          not curly and smart:
          <c>' " ' " ' "</c>.
          And a backtick is a backtick:
          <c>` ` `</c>.
          The zero is distinguished from the capital
          <q>oh</q>: <c>0 O 0 O 0 O</c>.
          And the numeral one is slightly different from the lower-case
          <q>ell</q>: <c>1 l 1 l 1 l</c>.
          The hyphen should be short and not expanded into some other kind of dash:
          <c>- - -</c>.
          These characters should all cut/paste out of a PDF into a text editor with no conversion to other characters.
        </p>
        <p>
          Now the remaining characters from <c>U+0080</c><ndash/><c>U+00FF</c>.
          The <c>program</c> tag is implemented in <latex/> via the <c>listings</c> package and these characters require ad-hoc replacements for processing by <c>pdflatex</c>. (You can see the replacements in the preamble of the <latex/> source for this document.) The replacement mechanism provided by the <c>listings</c> package will cause the characters below to produce a <latex/> compilation error if they are processed by <c>pdflatex</c> while inside a table cell in certain situations
          (which we have avoided in the table below).
          The only workaround in this case is to switch to <c>xelatex</c>.
        </p>
<!-- ex &#x0080; - &#x009F; controls -->
<!-- ex &#x00A0; non-breaking space  -->
<!-- ex &#x00AD; soft hyphen         -->
        <table>
          <title>Latin-1 Supplement, Monospace</title>
          <tabular>
            <row>
              <cell/>
              <cell><c>0</c></cell>
              <cell><c>1</c></cell>
              <cell><c>2</c></cell>
              <cell><c>3</c></cell>
              <cell><c>4</c></cell>
              <cell><c>5</c></cell>
              <cell><c>6</c></cell>
              <cell><c>7</c></cell>
              <cell><c>8</c></cell>
              <cell><c>9</c></cell>
              <cell><c>A</c></cell>
              <cell><c>B</c></cell>
              <cell><c>C</c></cell>
              <cell><c>D</c></cell>
              <cell><c>E</c></cell>
              <cell><c>F</c></cell>
            </row>
            <row>
              <cell><c>00A_</c></cell>
              <cell/>
              <cell><c>¡</c></cell>
              <cell><c>¢</c></cell>
              <cell><c>£</c></cell>
              <cell><c>¤</c></cell>
              <cell><c>¥</c></cell>
              <cell><c>¦</c></cell>
              <cell><c>§</c></cell>
              <cell><c>¨</c></cell>
              <cell><c>©</c></cell>
              <cell><c>ª</c></cell>
              <cell><c>«</c></cell>
              <cell><c>¬</c></cell>
              <cell/>
              <cell><c>®</c></cell>
              <cell><c>¯</c></cell>
            </row>
            <row>
              <cell><c>00B_</c></cell>
              <cell><c>°</c></cell>
              <cell><c>±</c></cell>
              <cell><c>²</c></cell>
              <cell><c>³</c></cell>
              <cell><c>´</c></cell>
              <cell><c>µ</c></cell>
              <cell><c>¶</c></cell>
              <cell><c>·</c></cell>
              <cell><c>¸</c></cell>
              <cell><c>¹</c></cell>
              <cell><c>º</c></cell>
              <cell><c>»</c></cell>
              <cell><c>¼</c></cell>
              <cell><c>½</c></cell>
              <cell><c>¾</c></cell>
              <cell><c>¿</c></cell>
            </row>
            <row>
              <cell><c>00C_</c></cell>
              <cell><c>À</c></cell>
              <cell><c>Á</c></cell>
              <cell><c>Â</c></cell>
              <cell><c>Ã</c></cell>
              <cell><c>Ä</c></cell>
              <cell><c>Å</c></cell>
              <cell><c>Æ</c></cell>
              <cell><c>Ç</c></cell>
              <cell><c>È</c></cell>
              <cell><c>É</c></cell>
              <cell><c>Ê</c></cell>
              <cell><c>Ë</c></cell>
              <cell><c>Ì</c></cell>
              <cell><c>Í</c></cell>
              <cell><c>Î</c></cell>
              <cell><c>Ï</c></cell>
            </row>
            <row>
              <cell><c>00D_</c></cell>
              <cell><c>Ð</c></cell>
              <cell><c>Ñ</c></cell>
              <cell><c>Ò</c></cell>
              <cell><c>Ó</c></cell>
              <cell><c>Ô</c></cell>
              <cell><c>Õ</c></cell>
              <cell><c>Ö</c></cell>
              <cell><c>×</c></cell>
              <cell><c>Ø</c></cell>
              <cell><c>Ù</c></cell>
              <cell><c>Ú</c></cell>
              <cell><c>Û</c></cell>
              <cell><c>Ü</c></cell>
              <cell><c>Ý</c></cell>
              <cell><c>Þ</c></cell>
              <cell><c>ß</c></cell>
            </row>
            <row>
              <cell><c>00E_</c></cell>
              <cell><c>à</c></cell>
              <cell><c>á</c></cell>
              <cell><c>â</c></cell>
              <cell><c>ã</c></cell>
              <cell><c>ä</c></cell>
              <cell><c>å</c></cell>
              <cell><c>æ</c></cell>
              <cell><c>ç</c></cell>
              <cell><c>è</c></cell>
              <cell><c>é</c></cell>
              <cell><c>ê</c></cell>
              <cell><c>ë</c></cell>
              <cell><c>ì</c></cell>
              <cell><c>í</c></cell>
              <cell><c>î</c></cell>
              <cell><c>ï</c></cell>
            </row>
            <row>
              <cell><c>00F_</c></cell>
              <cell><c>ð</c></cell>
              <cell><c>ñ</c></cell>
              <cell><c>ò</c></cell>
              <cell><c>ó</c></cell>
              <cell><c>ô</c></cell>
              <cell><c>õ</c></cell>
              <cell><c>ö</c></cell>
              <cell><c>÷</c></cell>
              <cell><c>ø</c></cell>
              <cell><c>ù</c></cell>
              <cell><c>ú</c></cell>
              <cell><c>û</c></cell>
              <cell><c>ü</c></cell>
              <cell><c>ý</c></cell>
              <cell><c>þ</c></cell>
              <cell><c>ÿ</c></cell>
            </row>
          </tabular>
        </table>
        <p>
          The <c>pre</c> tag is implemented in <latex/> with the <c>fancyvrb</c> package.
          You can compare results here with the table above:
          each line here corresponds to a row above.
        </p>
<!-- These two raise errors in both of following -->
<!-- ex &#x00A0; non-breaking space              -->
<!-- ex &#x00AD; soft hyphen                     -->
<pre>
<cline>  ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯</cline>
<cline>° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿</cline>
<cline>À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï</cline>
<cline>Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß</cline>
<cline>à á â ã ä å æ ç è é ê ë ì í î ï</cline>
<cline>ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ</cline>
</pre>
        <p>
          The <c>console</c> tag is also implemented with <c>fancyvrb</c>,
          with adjustments for the input lines.
          It will not look like it,
          but these are six such inputs, with results similar to those above,
          but now in bold.
        </p>
<console margins="0%" prompt="">
<input>  ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯</input>
<input>° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿</input>
<input>À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï</input>
<input>Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß</input>
<input>à á â ã ä å æ ç è é ê ë ì í î ï</input>
<input>ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ</input>
</console>
        <p>
          We take care to render the <c>U+0080</c><ndash/><c>U+00FF</c> characters in Sage cells.
          This would allow some flexibility in comments and strings employed.
          The following is just a test of these characters in the <c>input</c> and <c>output</c> of a <c>sage</c> element.
          This is not functional code.
        </p>
<sage doctest="not tested">
<input>
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
à á â ã ä å æ ç è é ê ë ì í î ï
ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
</input>
<output>
¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
à á â ã ä å æ ç è é ê ë ì í î ï
ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ
</output>
</sage>
      </paragraphs>

A monospace font is critical for samples of keyboard input and for distinguishing exact technical input from running commentary. We list here all of the reasonable characters from the first 256 Unicode code points. (We skip the same 65 control characters from above, and the soft hyphen.) These should all render fine in HTML and when processed with xelatex; however, our focus with this sample article for PDF output is on what is possible when processed with pdflatex. First, characters from U+0000–U+007F.
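The count of skipped characters can be checked mechanically. The following is a minimal Python sketch, not part of PreTeXt: the names `is_control`, `CONTROL_POINTS`, and `table_rows` are our own, and we assume that "control character" means Unicode general category `Cc`. It counts the control characters among the first 256 code points and builds text rows labelled in the same style as the tables in this section.

```python
import unicodedata


def is_control(code_point: int) -> bool:
    """True when the code point is in Unicode general category Cc (control)."""
    return unicodedata.category(chr(code_point)) == "Cc"


SOFT_HYPHEN = 0x00AD

# Every control character among the first 256 code points.
CONTROL_POINTS = [cp for cp in range(0x100) if is_control(cp)]


def table_rows(start: int = 0x00, stop: int = 0x100) -> list[str]:
    """Build one text row per block of 16 code points, labelled like 002_."""
    rows = []
    for base in range(start, stop, 16):
        block = range(base, base + 16)
        if all(is_control(cp) for cp in block):
            continue  # nothing printable in this block, so no row is emitted
        # Control characters and the soft hyphen become a blank placeholder.
        cells = [
            " " if is_control(cp) or cp == SOFT_HYPHEN else chr(cp)
            for cp in block
        ]
        rows.append(f"{base >> 4:03X}_ " + " ".join(cells))
    return rows


if __name__ == "__main__":
    print(len(CONTROL_POINTS))
    for row in table_rows():
        print(row)
```

Under that assumption the sketch finds 65 control characters and emits twelve rows, `002_` through `007_` and `00A_` through `00F_`. The non-breaking space at `U+00A0` is not a control character, so the sketch emits it as-is.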


View Source for table

<table>
  <title>Basic Latin, Monospace</title>
  <tabular>
    <row>
      <cell/>
      <cell><c>0</c></cell>
      <cell><c>1</c></cell>
      <cell><c>2</c></cell>
      <cell><c>3</c></cell>
      <cell><c>4</c></cell>
      <cell><c>5</c></cell>
      <cell><c>6</c></cell>
      <cell><c>7</c></cell>
      <cell><c>8</c></cell>
      <cell><c>9</c></cell>
      <cell><c>A</c></cell>
      <cell><c>B</c></cell>
      <cell><c>C</c></cell>
      <cell><c>D</c></cell>
      <cell><c>E</c></cell>
      <cell><c>F</c></cell>
    </row>
    <row>
      <cell><c>002_</c></cell>
      <cell><c> </c></cell>
      <cell><c>!</c></cell>
      <cell><c>"</c></cell>
      <cell><c>#</c></cell>
      <cell><c>$</c></cell>
      <cell><c>%</c></cell>
      <cell><c>&amp;</c></cell>
      <cell><c>'</c></cell>
      <cell><c>(</c></cell>
      <cell><c>)</c></cell>
      <cell><c>*</c></cell>
      <cell><c>+</c></cell>
      <cell><c>,</c></cell>
      <cell><c>-</c></cell>
      <cell><c>.</c></cell>
      <cell><c>/</c></cell>
    </row>
    <row>
      <cell><c>003_</c></cell>
      <cell><c>0</c></cell>
      <cell><c>1</c></cell>
      <cell><c>2</c></cell>
      <cell><c>3</c></cell>
      <cell><c>4</c></cell>
      <cell><c>5</c></cell>
      <cell><c>6</c></cell>
      <cell><c>7</c></cell>
      <cell><c>8</c></cell>
      <cell><c>9</c></cell>
      <cell><c>:</c></cell>
      <cell><c>;</c></cell>
      <cell><c>&lt;</c></cell>
      <cell><c>=</c></cell>
      <cell><c>&gt;</c></cell>
      <cell><c>?</c></cell>
    </row>
    <row>
      <cell><c>004_</c></cell>
      <cell><c>@</c></cell>
      <cell><c>A</c></cell>
      <cell><c>B</c></cell>
      <cell><c>C</c></cell>
      <cell><c>D</c></cell>
      <cell><c>E</c></cell>
      <cell><c>F</c></cell>
      <cell><c>G</c></cell>
      <cell><c>H</c></cell>
      <cell><c>I</c></cell>
      <cell><c>J</c></cell>
      <cell><c>K</c></cell>
      <cell><c>L</c></cell>
      <cell><c>M</c></cell>
      <cell><c>N</c></cell>
      <cell><c>O</c></cell>
    </row>
    <row>
      <cell><c>005_</c></cell>
      <cell><c>P</c></cell>
      <cell><c>Q</c></cell>
      <cell><c>R</c></cell>
      <cell><c>S</c></cell>
      <cell><c>T</c></cell>
      <cell><c>U</c></cell>
      <cell><c>V</c></cell>
      <cell><c>W</c></cell>
      <cell><c>X</c></cell>
      <cell><c>Y</c></cell>
      <cell><c>Z</c></cell>
      <cell><c>[</c></cell>
      <cell><c>\</c></cell>
      <cell><c>]</c></cell>
      <cell><c>^</c></cell>
      <cell><c>_</c></cell>
    </row>
    <row>
      <cell><c>006_</c></cell>
      <cell><c>`</c></cell>
      <cell><c>a</c></cell>
      <cell><c>b</c></cell>
      <cell><c>c</c></cell>
      <cell><c>d</c></cell>
      <cell><c>e</c></cell>
      <cell><c>f</c></cell>
      <cell><c>g</c></cell>
      <cell><c>h</c></cell>
      <cell><c>i</c></cell>
      <cell><c>j</c></cell>
      <cell><c>k</c></cell>
      <cell><c>l</c></cell>
      <cell><c>m</c></cell>
      <cell><c>n</c></cell>
      <cell><c>o</c></cell>
    </row>
    <row>
      <cell><c>007_</c></cell>
      <cell><c>p</c></cell>
      <cell><c>q</c></cell>
      <cell><c>r</c></cell>
      <cell><c>s</c></cell>
      <cell><c>t</c></cell>
      <cell><c>u</c></cell>
      <cell><c>v</c></cell>
      <cell><c>w</c></cell>
      <cell><c>x</c></cell>
      <cell><c>y</c></cell>
      <cell><c>z</c></cell>
      <cell><c>{</c></cell>
      <cell><c>|</c></cell>
      <cell><c>}</c></cell>
      <cell><c>~</c></cell>
      <cell/>
    </row>
  </tabular>
</table>

Table 22.3. Basic Latin, Monospace


0 1 2 3 4 5 6 7 8 9 A B C D E F

002_ ! " # $ % & ' ( ) * + , - . /

003_ 0 1 2 3 4 5 6 7 8 9 : ; < = > ?

004_ @ A B C D E F G H I J K L M N O

005_ P Q R S T U V W X Y Z [ \ ] ^ _

006_ ` a b c d e f g h i j k l m n o

007_ p q r s t u v w x y z { | } ~

Note that the single and double quotes are upright and dumb, not curly and smart: ' " ' " ' ". And a backtick is a backtick: ` ` `. The zero is distinguished from the capital “oh”: 0 O 0 O 0 O. And the numeral one is slightly different from the lower-case “ell”: 1 l 1 l 1 l. The hyphen should be short and not expanded into some other kind of dash: - - -. These characters should all cut/paste out of a PDF into a text editor with no conversion to other characters.
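One way to test the cut/paste claim for the Basic Latin sample is to scan the pasted text for anything outside the printable range `U+0020`–`U+007E`. The following is a rough Python sketch under that assumption; the function name `find_converted` is hypothetical and is not part of any PreTeXt tooling.

```python
import unicodedata


def find_converted(text: str) -> list[tuple[int, str, str]]:
    """Report characters outside printable Basic Latin (U+0020-U+007E).

    Each finding is (index, character, Unicode name). Newlines and tabs
    are ignored, since they are ordinary in pasted text.
    """
    findings = []
    for index, char in enumerate(text):
        if char in "\n\t":
            continue
        if not 0x20 <= ord(char) <= 0x7E:
            findings.append((index, char, unicodedata.name(char, "UNKNOWN")))
    return findings
```

An empty result suggests the quotes, backticks, and hyphens survived the round trip unchanged; a curly quote or an en dash would be reported with its position and Unicode name.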


Now the remaining characters from U+0080–U+00FF. The program tag is implemented in LaTeX via the listings package, and these characters require ad-hoc replacements for processing by pdflatex. (You can see the replacements in the preamble of the LaTeX source for this document.) The replacement mechanism provided by the listings package will cause the characters below to produce a LaTeX compilation error if they are processed by pdflatex inside a table cell in certain situations (which we have avoided in the table below). The only workaround in this case is to switch to xelatex.


View Source for table

<table>
  <title>Latin-1 Supplement, Monospace</title>
  <tabular>
    <row>
      <cell/>
      <cell><c>0</c></cell>
      <cell><c>1</c></cell>
      <cell><c>2</c></cell>
      <cell><c>3</c></cell>
      <cell><c>4</c></cell>
      <cell><c>5</c></cell>
      <cell><c>6</c></cell>
      <cell><c>7</c></cell>
      <cell><c>8</c></cell>
      <cell><c>9</c></cell>
      <cell><c>A</c></cell>
      <cell><c>B</c></cell>
      <cell><c>C</c></cell>
      <cell><c>D</c></cell>
      <cell><c>E</c></cell>
      <cell><c>F</c></cell>
    </row>
    <row>
      <cell><c>00A_</c></cell>
      <cell/>
      <cell><c>¡</c></cell>
      <cell><c>¢</c></cell>
      <cell><c>£</c></cell>
      <cell><c>¤</c></cell>
      <cell><c>¥</c></cell>
      <cell><c>¦</c></cell>
      <cell><c>§</c></cell>
      <cell><c>¨</c></cell>
      <cell><c>©</c></cell>
      <cell><c>ª</c></cell>
      <cell><c>«</c></cell>
      <cell><c>¬</c></cell>
      <cell/>
      <cell><c>®</c></cell>
      <cell><c>¯</c></cell>
    </row>
    <row>
      <cell><c>00B_</c></cell>
      <cell><c>°</c></cell>
      <cell><c>±</c></cell>
      <cell><c>²</c></cell>
      <cell><c>³</c></cell>
      <cell><c>´</c></cell>
      <cell><c>µ</c></cell>
      <cell><c>¶</c></cell>
      <cell><c>·</c></cell>
      <cell><c>¸</c></cell>
      <cell><c>¹</c></cell>
      <cell><c>º</c></cell>
      <cell><c>»</c></cell>
      <cell><c>¼</c></cell>
      <cell><c>½</c></cell>
      <cell><c>¾</c></cell>
      <cell><c>¿</c></cell>
    </row>
    <row>
      <cell><c>00C_</c></cell>
      <cell><c>À</c></cell>
      <cell><c>Á</c></cell>
      <cell><c>Â</c></cell>
      <cell><c>Ã</c></cell>
      <cell><c>Ä</c></cell>
      <cell><c>Å</c></cell>
      <cell><c>Æ</c></cell>
      <cell><c>Ç</c></cell>
      <cell><c>È</c></cell>
      <cell><c>É</c></cell>
      <cell><c>Ê</c></cell>
      <cell><c>Ë</c></cell>
      <cell><c>Ì</c></cell>
      <cell><c>Í</c></cell>
      <cell><c>Î</c></cell>
      <cell><c>Ï</c></cell>
    </row>
    <row>
      <cell><c>00D_</c></cell>
      <cell><c>Ð</c></cell>
      <cell><c>Ñ</c></cell>
      <cell><c>Ò</c></cell>
      <cell><c>Ó</c></cell>
      <cell><c>Ô</c></cell>
      <cell><c>Õ</c></cell>
      <cell><c>Ö</c></cell>
      <cell><c>×</c></cell>
      <cell><c>Ø</c></cell>
      <cell><c>Ù</c></cell>
      <cell><c>Ú</c></cell>
      <cell><c>Û</c></cell>
      <cell><c>Ü</c></cell>
      <cell><c>Ý</c></cell>
      <cell><c>Þ</c></cell>
      <cell><c>ß</c></cell>
    </row>
    <row>
      <cell><c>00E_</c></cell>
      <cell><c>à</c></cell>
      <cell><c>á</c></cell>
      <cell><c>â</c></cell>
      <cell><c>ã</c></cell>
      <cell><c>ä</c></cell>
      <cell><c>å</c></cell>
      <cell><c>æ</c></cell>
      <cell><c>ç</c></cell>
      <cell><c>è</c></cell>
      <cell><c>é</c></cell>
      <cell><c>ê</c></cell>
      <cell><c>ë</c></cell>
      <cell><c>ì</c></cell>
      <cell><c>í</c></cell>
      <cell><c>î</c></cell>
      <cell><c>ï</c></cell>
    </row>
    <row>
      <cell><c>00F_</c></cell>
      <cell><c>ð</c></cell>
      <cell><c>ñ</c></cell>
      <cell><c>ò</c></cell>
      <cell><c>ó</c></cell>
      <cell><c>ô</c></cell>
      <cell><c>õ</c></cell>
      <cell><c>ö</c></cell>
      <cell><c>÷</c></cell>
      <cell><c>ø</c></cell>
      <cell><c>ù</c></cell>
      <cell><c>ú</c></cell>
      <cell><c>û</c></cell>
      <cell><c>ü</c></cell>
      <cell><c>ý</c></cell>
      <cell><c>þ</c></cell>
      <cell><c>ÿ</c></cell>
    </row>
  </tabular>
</table>

Table 22.4. Latin-1 Supplement, Monospace


0 1 2 3 4 5 6 7 8 9 A B C D E F

00A_ ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬ ® ¯

00B_ ° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿

00C_ À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï

00D_ Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß

00E_ à á â ã ä å æ ç è é ê ë ì í î ï

00F_ ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

The pre tag is implemented in LaTeX with the fancyvrb package. You can compare results here with the table above: each line here corresponds to a row above.


  ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
à á â ã ä å æ ç è é ê ë ì í î ï
ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

The console tag is also implemented with fancyvrb, with adjustments for the input lines. It will not look like it, but these are six such inputs, with results similar to those above, but now in bold.


View Source for console

<console margins="0%" prompt="">
<input>  ¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯</input>
<input>° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿</input>
<input>À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï</input>
<input>Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß</input>
<input>à á â ã ä å æ ç è é ê ë ì í î ï</input>
<input>ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ</input>
</console>

¡ ¢ £ ¤ ¥ ¦ § ¨ © ª « ¬   ® ¯
° ± ² ³ ´ µ ¶ · ¸ ¹ º » ¼ ½ ¾ ¿
À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï
Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
à á â ã ä å æ ç è é ê ë ì í î ï
ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ

We take care to render the U+0080–U+00FF characters in Sage cells. This would allow some flexibility in comments and strings employed. The following is just a test of these characters in the input and output of a sage element. This is not functional code.


The table below has a single column, and each cell of the table has a string of 10 characters inside a c element. It is meant to test if the font is monospace in this situation.


View Source for table

<table>
  <title>Alignment Test</title>
  <tabular>
    <row>
      <cell><c>0123456789</c></cell>
    </row>
    <row>
      <cell><c>9876543210</c></cell>
    </row>
    <row>
      <cell><c>iiiiiiiiii</c></cell>
    </row>
    <row>
      <cell><c>mmmmmmmmmm</c></cell>
    </row>
  </tabular>
</table>

Table 22.5. Alignment Test


0123456789

9876543210

iiiiiiiiii

mmmmmmmmmm

Again, more examples and more thorough explanations can be found in the sample: examples/fonts/fonts-and-characters.xml. Be aware that the more advanced sample, by its nature, will likely produce many errors when processed with pdflatex. Adding -interaction batchmode or -interaction nonstopmode to the pdflatex command line will sometimes be less painful than acknowledging each error. The more advanced sample will perform well when processed with xelatex.
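If the build is driven from a script, the interaction mode can be supplied when the command is assembled. The following is a small Python sketch with a hypothetical helper name, `pdflatex_command`; it only builds the argument list, using the two modes named above, and does not run anything itself.

```python
def pdflatex_command(tex_file: str, interaction: str = "batchmode") -> list[str]:
    """Assemble a pdflatex argument list with a non-interactive mode.

    Only the two modes mentioned in the text are accepted here; this is
    a restriction of the sketch, not of pdflatex.
    """
    allowed = {"batchmode", "nonstopmode"}
    if interaction not in allowed:
        raise ValueError(f"expected one of {sorted(allowed)}, got {interaction!r}")
    return ["pdflatex", "-interaction", interaction, tex_file]
```

The resulting list can be handed to `subprocess.run` from the Python standard library. The file name passed in is up to you, since the name of the generated LaTeX file depends on how the sample was converted.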

