<div dir="ltr"><div dir="ltr"><div><br></div></div><div class="gmail_quote gmail_quote_container"><div dir="ltr" class="gmail_attr">On Sun, Dec 29, 2024 at 3:37 PM Warren Toomey <<a href="mailto:wkt@tuhs.org">wkt@tuhs.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Sat, Dec 28, 2024 at 05:53:48PM -0900, Royce Williams wrote:<br>
> Someone I know is seeking the original version of an internal Bell Labs<br>
> memo from 1974 titled "Webster's Second on the Head of a Pin" by Morris<br>
> and Thompson. The topic appears to be related to improving the speed of<br>
> lookups or search. It's cited in a few papers as "Unpublished Technical<br>
> Memo, Bell Laboratories, Murray Hill, NJ 1974." All I can find online<br>
> is citations. Any leads appreciated!<br>
<br>
Doug McIlroy sent me a copy, it's now here:<br>
<br>
<a href="https://www.tuhs.org/Archive/Documentation/TechReports/Bell_Labs/PinheadWebster.pdf" rel="noreferrer" target="_blank">https://www.tuhs.org/Archive/Documentation/TechReports/Bell_Labs/PinheadWebster.pdf</a><br>
<br>
Thanks Doug!<br></blockquote><div><br></div><div>And many thanks from me and my colleague as well, Doug!</div><div><br></div><div>For future searchers, what follows is selected (unique) front matter from the memo, rewrapped slightly for Mailman width.</div><div><br></div><div><br></div><div>Title - Webster's Second on the Head of a Pin<br>Date - July 15, 1974<br>TM - 74-1271-13<br><br>Other keywords - words, text compression<br><br>Author Location Extension<br>Robert Morris MH 2C-524 3878<br>Ken Thompson MH 2C-523 2394<br><br>Charging case - 39199<br>Filing Case - 39199-11<br><br><br> ABSTRACT</div><div><br>We used the list of words from Webster's Second Unabridged Dictionary<br>(without definitions) as a test case for special purpose text <br>compression techniques.<br><br>We compressed it by a factor of 4.52 to 1.<br><br>The 234,932 words originally occupied 2,486,781 bytes and were<br>compressed into 549,388 bytes. The size of the decoding program is <br>1356 bytes.<br><br>The initial characters of a word that agreed with the initial<br>characters of the previous word were dropped and replaced by a code.<br>Common suffixes were also coded. Finally, a variable-length code was<br>used.<br></div><div><br></div><div><br></div><div>Pages Text 6 Other 0 Total 6</div><div><br></div><div><br></div><div>-- </div><div>Royce </div></div></div>