<div dir="auto"><div><br><br><div class="gmail_quote gmail_quote_container"><div dir="ltr" class="gmail_attr">On Thu, Jan 2, 2025, 7:51 AM Douglas McIlroy <<a href="mailto:douglas.mcilroy@dartmouth.edu">douglas.mcilroy@dartmouth.edu</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">I am not aware that the compressed dictionary was used for anything.<br>
Steve Johnson's first shell-script spelling-checker did make a pass<br>
over a dictionary, but not Webster's second, which would have caused<br>
lots of false negatives because it contains so many exotic small words<br>
that could result from typos.</blockquote></div></div><div dir="auto"><br></div><div dir="auto">Where did the Websters Second file come from? Did the labs give the public domain paper dictionary to the equivalent of a typing pool and had them enter it? It did it come from elsewhere? Or something else? How was it checked for accuracy?</div><div dir="auto"><br></div><div dir="auto">Warner</div><div dir="auto"><br></div><div dir="auto"><br></div><div dir="auto"><div class="gmail_quote gmail_quote_container"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">My production spell aggresively stripped<br>
affixes and used hashing and other coding tricks to keep its<br>
"dictionary" in the limited memory of a PDP-11. (The whole story is<br>
told in <a href="https://www.cs.dartmouth.edu/~doug/spell.pdf" rel="noreferrer noreferrer" target="_blank">https://www.cs.dartmouth.edu/~doug/spell.pdf</a> and insightfully<br>
described by Jon Bentley in<br>
<a href="https://dl.acm.org/doi/pdf/10.1145/3532.315102" rel="noreferrer noreferrer" target="_blank">https://dl.acm.org/doi/pdf/10.1145/3532.315102</a>.) When larger memory<br>
became available, these heroics were replaced by basic common-prefix<br>
coding patterned after Morris and Thompson, just as Arnold surmised.<br>
<br>
On Thu, Jan 2, 2025 at 7:41 AM <<a href="mailto:arnold@skeeve.com" target="_blank" rel="noreferrer">arnold@skeeve.com</a>> wrote:<br>
><br>
> Hi.<br>
><br>
> The paper on compressing the dictionary was interesting. In the day<br>
> of 20 meg disks, compressing a ~ 2.5 meg file down to ~ .5 meg is<br>
> a big savings.<br>
><br>
> Was the compressed dictionary put into use? I could imaging that<br>
> spell(1) at least would have needed some library routines to return<br>
> a stream of words from it.<br>
><br>
> Just wondering. Thanks,<br>
><br>
> Arnold<br>
</blockquote></div></div></div>