123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145 |
- <?xml version="1.0" encoding="utf-8" ?>
- <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "https://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
- <!-- This file is generated by Nim. -->
- <html xmlns="https://www.w3.org/1999/xhtml" xml:lang="en" lang="en" data-theme="auto">
- <head>
- <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
- <meta name="viewport" content="width=device-width, initial-scale=1.0">
- <title>std/unidecode</title>
- <!-- Google fonts -->
- <link href='https://fonts.googleapis.com/css?family=Lato:400,600,900' rel='stylesheet' type='text/css'/>
- <link href='https://fonts.googleapis.com/css?family=Source+Code+Pro:400,500,600' rel='stylesheet' type='text/css'/>
- <!-- Favicon -->
- <link rel="shortcut icon" href=""/>
- <link rel="icon" type="image/png" sizes="32x32" href="">
- <!-- CSS -->
- <link rel="stylesheet" type="text/css" href="nimdoc.out.css?v=2.3.1">
- <!-- JS -->
- <script type="text/javascript" src="dochack.js?v=2.3.1"></script>
- </head>
- <body>
- <div class="document" id="documentId">
- <div class="container">
- <h1 class="title">std/unidecode</h1>
- <div class="row">
- <div class="three columns">
- <div class="theme-select-wrapper">
- <label for="theme-select">Theme: </label>
- <select id="theme-select" onchange="setTheme(this.value)">
- <option value="auto">🌗 Match OS</option>
- <option value="dark">🌑 Dark</option>
- <option value="light">🌕 Light</option>
- </select>
- </div>
- <div id="global-links">
- <ul class="simple-boot">
- <li><a href="manual.html">Manual</a></li>
- <li><a href="lib.html">Standard library</a></li>
- <li> <a id="indexLink" href="theindex.html">Index</a></li>
- <li><a href="compiler/theindex.html">Compiler docs</a></li>
- <li><a href="https://nim-lang.github.io/fusion/theindex.html">Fusion docs</a></li>
- <li><a href="https://nim-lang.github.io/Nim/">devel</a>, <a href="https://nim-lang.org/documentation.html">stable</a></li>
- </ul>
- </div>
- <div id="searchInputDiv">
- Search: <input type="search" id="searchInput"
- oninput="search()" />
- </div>
-
- <ul class="simple simple-toc" id="toc-list">
- <li>
- <a class="reference reference-toplevel" href="#6" id="56">Imports</a>
- </li>
- <li>
- <details open>
- <summary><a class="reference reference-toplevel" href="#12" id="62">Procs</a></summary>
- <ul class="simple simple-toc-section">
- <ul class="simple nested-toc-section">loadUnidecodeTable
- <li><a class="reference" href="#loadUnidecodeTable%2Cstring" title="loadUnidecodeTable(datafile = "unidecode.dat")">loadUnidecodeTable(datafile = "unidecode.dat")</a></li>
- </ul>
- <ul class="simple nested-toc-section">unidecode
- <li><a class="reference" href="#unidecode%2Cstring" title="unidecode(s: string): string">unidecode(s: string): string</a></li>
- </ul>
- </ul>
- </details>
- </li>
- </ul>
- </div>
- <div class="nine columns" id="content">
- <a href="https://github.com/nim-lang/Nim/tree/devel/lib/pure/unidecode/unidecode.nim#L1" class="link-seesrc" target="_blank">Source</a>
- <a href="https://github.com/nim-lang/Nim/edit/devel/lib/pure/unidecode/unidecode.nim#L1" class="link-seesrc" target="_blank" >Edit</a>
- <div id="tocRoot"></div>
-
- <p class="module-desc"><p>This module is based on Python's <a class="reference external" href="https://pypi.org/project/Unidecode/">Unidecode</a> module by Tomaz Solc, which in turn is based on the <a class="reference external" href="https://metacpan.org/pod/Text::Unidecode">Text::Unidecode</a> Perl module by Sean M. Burke.</p>
- <p>It provides a <a class="reference external" href="#unidecode,string">unidecode proc</a> that does Unicode to ASCII transliterations: It finds the sequence of ASCII characters that is the closest approximation to the Unicode string.</p>
- <p>For example, the closest to string "Äußerst" in ASCII is "Ausserst". Some information is lost in this transformation, of course, since several Unicode strings can be transformed to the same ASCII representation. So this is a strictly one-way transformation. However, a human reader will probably still be able to guess from the context, what the original string was.</p>
- <p>This module needs the data file <tt class="docutils literal"><span class="pre"><span class="Identifier">unidecode</span><span class="Operator">.</span><span class="Identifier">dat</span></span></tt> to work: This file is embedded as a resource into your application by default. You can also define the symbol <tt class="docutils literal"><span class="pre"><span class="Operator">--</span><span class="Identifier">define</span><span class="Punctuation">:</span><span class="Identifier">noUnidecodeTable</span></span></tt> during compile time and use the <a class="reference external" href="#loadUnidecodeTable">loadUnidecodeTable proc</a> to initialize this module.</p>
- </p>
- <div class="section" id="6">
- <h1><a class="toc-backref" href="#6">Imports</a></h1>
- <dl class="item">
- <a class="reference external" href="unicode.html">unicode</a>, <a class="reference external" href="strutils.html">strutils</a>
- </dl>
- </div>
- <div class="section" id="12">
- <h1><a class="toc-backref" href="#12">Procs</a></h1>
- <dl class="item">
- <div id="loadUnidecodeTable-procs-all">
- <div id="loadUnidecodeTable,string">
- <dt><pre><span class="Keyword">proc</span> <a href="#loadUnidecodeTable%2Cstring"><span class="Identifier">loadUnidecodeTable</span></a><span class="Other">(</span><span class="Identifier">datafile</span> <span class="Other">=</span> <span class="StringLit">"unidecode.dat"</span><span class="Other">)</span> {.<span><span class="Other pragmadots">...</span></span><span class="pragmawrap"><span class="Identifier">raises</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span> <span class="Identifier">tags</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span>
- <span class="Identifier">forbids</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span></span>.}</pre></dt>
- <dd>
-
- Loads the datafile that <a class="reference external" href="#unidecode,string">unidecode</a> needs to work. This is only required if the module was compiled with the <tt class="docutils literal"><span class="pre"><span class="Operator">--</span><span class="Identifier">define</span><span class="Punctuation">:</span><span class="Identifier">noUnidecodeTable</span></span></tt> switch. This needs to be called by the main thread before any thread can make a call to <tt class="docutils literal"><span class="pre"><span class="Identifier">unidecode</span></span></tt>.
- <a href="https://github.com/nim-lang/Nim/tree/devel/lib/pure/unidecode/unidecode.nim#L41" class="link-seesrc" target="_blank">Source</a>
- <a href="https://github.com/nim-lang/Nim/edit/devel/lib/pure/unidecode/unidecode.nim#L41" class="link-seesrc" target="_blank" >Edit</a>
- </dd>
- </div>
- </div>
- <div id="unidecode-procs-all">
- <div id="unidecode,string">
- <dt><pre><span class="Keyword">proc</span> <a href="#unidecode%2Cstring"><span class="Identifier">unidecode</span></a><span class="Other">(</span><span class="Identifier">s</span><span class="Other">:</span> <a href="system.html#string"><span class="Identifier">string</span></a><span class="Other">)</span><span class="Other">:</span> <a href="system.html#string"><span class="Identifier">string</span></a> {.<span><span class="Other pragmadots">...</span></span><span class="pragmawrap"><span class="Identifier">raises</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span> <span class="Identifier">tags</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span> <span class="Identifier">forbids</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span></span>.}</pre></dt>
- <dd>
-
- Finds the sequence of ASCII characters that is the closest approximation to the UTF-8 string <tt class="docutils literal"><span class="pre"><span class="Identifier">s</span></span></tt>.
- <p><strong class="examples_text">Example:</strong></p>
- <pre class="listing"><span class="Identifier">doAssert</span> <span class="Identifier">unidecode</span><span class="Punctuation">(</span><span class="StringLit">"北京"</span><span class="Punctuation">)</span> <span class="Operator">==</span> <span class="StringLit">"Bei Jing "</span>
- <span class="Identifier">doAssert</span> <span class="Identifier">unidecode</span><span class="Punctuation">(</span><span class="StringLit">"Äußerst"</span><span class="Punctuation">)</span> <span class="Operator">==</span> <span class="StringLit">"Ausserst"</span></pre>
- <a href="https://github.com/nim-lang/Nim/tree/devel/lib/pure/unidecode/unidecode.nim#L53" class="link-seesrc" target="_blank">Source</a>
- <a href="https://github.com/nim-lang/Nim/edit/devel/lib/pure/unidecode/unidecode.nim#L53" class="link-seesrc" target="_blank" >Edit</a>
- </dd>
- </div>
- </div>
- </dl>
- </div>
- </div>
- </div>
- <div class="twelve-columns footer">
- <span class="nim-sprite"></span>
- <br>
- <small style="color: var(--hint);">Made with Nim. Generated: 2025-03-19 07:24:17 UTC</small>
- </div>
- </div>
- </div>
- <script defer data-domain="nim-lang.org" src="https://plausible.io/js/plausible.js"></script>
-
- </body>
- </html>
|