<div dir="auto"><div><br><br><div class="gmail_quote gmail_quote_container"><div dir="ltr" class="gmail_attr">On Thu, Jan 2, 2025, 7:51 AM Douglas McIlroy <<a href="mailto:douglas.mcilroy@dartmouth.edu">douglas.mcilroy@dartmouth.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">I am not aware that the compressed dictionary was used for anything.<br>

Steve Johnson's first shell-script spelling-checker did make a pass<br>

over a dictionary, but not Webster's second, which would have caused<br>

lots of false negatives because it contains so many exotic small words<br>

that could result from typos.</blockquote></div></div><div dir="auto"><br></div><div dir="auto">Where did the Websters Second file come from? Did the labs give the public domain paper dictionary to the equivalent of a typing pool and had them enter it? It did it come from elsewhere? Or something else? How was it checked for accuracy?</div><div dir="auto"><br></div><div dir="auto">Warner</div><div dir="auto"><br></div><div dir="auto"><br></div><div dir="auto"><div class="gmail_quote gmail_quote_container"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">My production spell aggresively stripped<br>

affixes and used hashing and other coding tricks to keep its<br>

"dictionary" in the limited memory of a PDP-11. (The whole story is<br>

told in <a href="https://www.cs.dartmouth.edu/~doug/spell.pdf" rel="noreferrer noreferrer" target="_blank">https://www.cs.dartmouth.edu/~doug/spell.pdf</a> and insightfully<br>

described by Jon Bentley in<br>

<a href="https://dl.acm.org/doi/pdf/10.1145/3532.315102" rel="noreferrer noreferrer" target="_blank">https://dl.acm.org/doi/pdf/10.1145/3532.315102</a>.) When larger memory<br>

became available, these heroics were replaced by basic common-prefix<br>

coding patterned after Morris and Thompson, just as Arnold surmised.<br>

<br>

On Thu, Jan 2, 2025 at 7:41 AM <<a href="mailto:arnold@skeeve.com" target="_blank" rel="noreferrer">arnold@skeeve.com</a>> wrote:<br>

><br>

> Hi.<br>

><br>

> The paper on compressing the dictionary was interesting. In the day<br>

> of 20 meg disks, compressing a ~ 2.5 meg file down to ~ .5 meg is<br>

> a big savings.<br>

><br>

> Was the compressed dictionary put into use? I could imaging that<br>

> spell(1) at least would have needed some library routines to return<br>

> a stream of words from it.<br>

><br>

> Just wondering.  Thanks,<br>

><br>

> Arnold<br>

</blockquote></div></div></div>