<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/">
   <channel>
      <title>Making Light :: The Implications are Staggering :: comments</title>
      <link>http://nielsenhayden.com/makinglight/archives/015331.html#comments </link>
      <description>Language, fraud, folly, truth, history, and knitting. Et cetera.</description>
      <language>en</language>
      <lastBuildDate>Wed, 07 Aug 2013 01:29:39 -0500</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=4.34-en</generator>
      
      <item>
      <title>The Implications are Staggering</title>
      <description>Apparently some models of Xerox photocopiers are substituting one number for another in photocopied documents. This isn't an OCR error,...</description>
      <content:encoded>Apparently some models of Xerox photocopiers are substituting one number for another in photocopied documents. This isn't an OCR error,...</content:encoded>
      <link>http://nielsenhayden.com/makinglight/archives/015331.html</link>
      </item>

      
      <item>
         <title>The Implications are Staggering -- comment #1 from Edd Vick</title>
         <description>comment from Edd Vick on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>I for one welcome our duplicating overlords.</p>

<p>While fearing for my life the next time I get a prescription...</p>]]>
	 &lt;p&gt;Posted August  7, 2013  1:29 AM by Edd Vick&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427229</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427229</guid>
         <pubDate>Wed, 07 Aug 2013 01:29:39 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #2 from Miramon</title>
         <description>comment from Miramon on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>It's a hardware bug!<br />
It's a software bug!<br />
It's two, two, two bugs in one!<br />
</p>]]>
	 &lt;p&gt;Posted August  7, 2013  1:32 AM by Miramon&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427233</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427233</guid>
         <pubDate>Wed, 07 Aug 2013 01:32:44 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #3 from Lawrence</title>
         <description>comment from Lawrence on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Well, damn, that really screws up one of the tests of parallel-world travel my characters used in "The Drifter."</p>]]>
	 &lt;p&gt;Posted August  7, 2013  1:44 AM by Lawrence&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427239</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427239</guid>
         <pubDate>Wed, 07 Aug 2013 01:44:42 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #4 from DriveBy</title>
         <description>comment from DriveBy on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>It's an image compression error.</p>

<p>Data compression algorithms can work, in part, by recognizing repetition in the data being compressed and storing only one copy of the repeated data. Where the input data is grainy, as is often the case with scanned print, "repetition" becomes a matter of judgment; insisting on pixel-perfect matches as the determinant of repetition would result in no matches, and no compression (at least by that part of the algorithm). So, in practice, the machine applies fudge factors to determine whether this grainy part of the image over here is close enough to that grainy part of the image over there to be called a duplicate.</p>

<p>When the fudge factors are set too liberally, similar-looking grainy parts can be mistaken for duplicates. When an erroneous match is made, one or the other of the grainy parts will be assigned as the master part and will be plugged into both places. This can happen to image parts containing text, especially if there is a common visual theme decorating the text. Apparently, the Xerox machines in question are susceptible to this problem primarily when the text in question contains numbers, but not when the text is alphabetical.</p>

<p><a href="http://www.dkriesel.com/en/blog/2013/0806_conference_call_with_xerox" rel="nofollow">There is an update on the situation</a> which includes words like JBIG2, and gets into the specifics of settings on Xerox machines.  There are fingers yet to be pointed, and still some question about who knew what when and why certain people didn't understand sooner what was happening.  Lawyers may at some point become involved. But at the lowest technical level, it's an image compression issue.<br />
</p>]]>
	 &lt;p&gt;Posted August  7, 2013  1:58 AM by DriveBy&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427251</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427251</guid>
         <pubDate>Wed, 07 Aug 2013 01:58:18 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #5 from Laura</title>
         <description>comment from Laura on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Add in that many Xerox machines have fax capability, and all now have a boatload of memory, and I see even more problems - industrial espionage by chipset in trusted hardware.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  2:02 AM by Laura&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427254</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427254</guid>
         <pubDate>Wed, 07 Aug 2013 02:02:05 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #6 from Bill Stewart</title>
         <description>comment from Bill Stewart on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>I'm sure their statement will be interesting, but I'm not going to believe any of the numbers in it...</p>]]>
	 &lt;p&gt;Posted August  7, 2013  2:19 AM by Bill Stewart&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427259</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427259</guid>
         <pubDate>Wed, 07 Aug 2013 02:19:20 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #7 from Bruce Cohen (Speaker to Managers)</title>
         <description>comment from Bruce Cohen (Speaker to Managers) on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Architects, contractors, airplane designers, and artillery units had all better be checking what model copier they're using, or things may start falling down, crashing, or blowing up unexpectedly.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  2:49 AM by Bruce Cohen (Speaker to Managers)&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427272</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427272</guid>
         <pubDate>Wed, 07 Aug 2013 02:49:58 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #8 from dcb</title>
         <description>comment from dcb on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>My local Metro (free paper associated with trains, Tube etc.) says that Xerox's "principal engineer, Francis Tse" is saying that the problem can be combatted by copying at higher resolution - which would make sense from what DriveBy @4 says.</p>

<p>In note that the large copier at work which I use occasionally to scan e.g. a book chapter is set at 200 x 200 as standard and I have to keep resetting it to 300 x 300 (for each document, if I'm copying several, which is a pain, 'cos it resets to the lower res automatically at the end of each document). I'm sure the lower standard setting is to save memory, but it makes for lousy copies, particularly if, as I tend to do, you print two pages to a side* (and double sided of course) to save paper.</p>

<p>*Note: two pages to a side works better with UK/European paper sizes, A4 etc. than with American paper sizes.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  4:19 AM by dcb&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427332</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427332</guid>
         <pubDate>Wed, 07 Aug 2013 04:19:39 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #9 from Rob Rusick</title>
         <description>comment from Rob Rusick on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>If I understood Mr. Kriesel's article, this is an issue when one uses a 'scan to PDF' feature of the copier, where the alterations are made in the saved PDF files.</p>

<p>I didn't see a claim that standard copying was affected.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  6:49 AM by Rob Rusick&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427465</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427465</guid>
         <pubDate>Wed, 07 Aug 2013 06:49:45 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #10 from Daniel Martin</title>
         <description>comment from Daniel Martin on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Specifically, from reading the updates it's a problem of using the quality setting "normal" when scanning to PDF. The three possible settings for quality are "normal", "higher", and "high"; the factory default for quality isn't the "normal" setting but because it doesn't warn "normal activates patch-based image compression (JBIG2), which will corrupt text" in big red letters, apparently many penny-wise users will change the default to "normal" to store their documents in less memory.</p>

<p>Memory/storage space is cheap. Efforts to conserve it almost always come around to bite you in the end.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  7:03 AM by Daniel Martin&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427473</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427473</guid>
         <pubDate>Wed, 07 Aug 2013 07:03:28 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #11 from Jim Macdonald</title>
         <description>comment from Jim Macdonald on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Another problem of photocopiers which store documents to disk in the process of making copies is that, when you eventually discard/resell/whatever the machine, copies of all documents you ever copied, however confidential, may still be on it and floating around out there somewhere.</p>

<p>See: <a href="http://business.ftc.gov/documents/bus43-copier-data-security" rel="nofollow">Copier Data Security: A Guide for Businesses</a></p>]]>
	 &lt;p&gt;Posted August  7, 2013  7:31 AM by Jim Macdonald&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427498</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427498</guid>
         <pubDate>Wed, 07 Aug 2013 07:31:38 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #12 from Nangleator</title>
         <description>comment from Nangleator on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>I'm sure only copiers can make electronic mistakes.  I'm sure any other conceivable electronic glitches are completely unimportant.  Even multiplied by, say, 314 million.  The population of the United States.</p>

<p>Insignificant.  Even if then multiplied by the number of phone calls and emails U.S. citizens make every day, all year long.</p>

<p>We can rely on our data, even in a court of law.  Even secret courts of law.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  9:22 AM by Nangleator&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427700</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427700</guid>
         <pubDate>Wed, 07 Aug 2013 09:22:31 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #13 from Remus Shepherd</title>
         <description>comment from Remus Shepherd on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>The big problem with this bug is that the 'scan to PDF' feature is intended for users who want a paperless office, which means they shred the original paper documents once they're done scanning them.</p>

<p>It would be interesting if civilization were to collapse over a software bug as trivial as this.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  9:32 AM by Remus Shepherd&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427708</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427708</guid>
         <pubDate>Wed, 07 Aug 2013 09:32:23 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #14 from P J Evans</title>
         <description>comment from P J Evans on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>The place I worked switched to Ricohs a few years back. In some ways, not an improvement over Xerox, and we swore a lot at them. (We were printing graphics, like PDFs reduced from 24x36 to 11x17. The old Xerox copies were readable.)</p>]]>
	 &lt;p&gt;Posted August  7, 2013  9:53 AM by P J Evans&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427724</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427724</guid>
         <pubDate>Wed, 07 Aug 2013 09:53:20 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #15 from Fragano Ledgister </title>
         <description>comment from Fragano Ledgister  on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Creative copying?</p>]]>
	 &lt;p&gt;Posted August  7, 2013 11:05 AM by Fragano Ledgister &lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427790</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427790</guid>
         <pubDate>Wed, 07 Aug 2013 11:05:50 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #16 from Stan</title>
         <description>comment from Stan on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>A knowledgable MeFi user commented on this problem:<br />
<a href="http://www.metafilter.com/130641/Cat-images-reportedly-unaffected#5125209" rel="nofollow">http://www.metafilter.com/130641/Cat-images-reportedly-unaffected#5125209</a></p>]]>
	 &lt;p&gt;Posted August  7, 2013 12:23 PM by Stan&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427859</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427859</guid>
         <pubDate>Wed, 07 Aug 2013 12:23:01 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #17 from Tom Whitmore</title>
         <description>comment from Tom Whitmore on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>The original photocopiers worked by making a direct master, reversed, of the original document on a photosensitive drum. Now, most copiers work by scanning the image and then printing it out. I wonder, what other sorts of errors are introduced by this change?</p>]]>
	 &lt;p&gt;Posted August  7, 2013 12:59 PM by Tom Whitmore&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427892</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427892</guid>
         <pubDate>Wed, 07 Aug 2013 12:59:22 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #18 from Bill Higgins-- Beam Jockey has been gnomed</title>
         <description>comment from Bill Higgins-- Beam Jockey has been gnomed on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Bruce Cohen writes in #7:</p>

<p><i>Architects, contractors, airplane designers, and artillery units had all better be checking what model copier they're using, or things may start falling down, crashing, or blowing up unexpectedly.</i></p>

<p>I don't believe you'll find many artillery units using Xerox copiers.  They tend to favor Canon.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  2:40 PM by Bill Higgins-- Beam Jockey has been gnomed&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427972</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427972</guid>
         <pubDate>Wed, 07 Aug 2013 14:40:38 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #19 from Jim Macdonald</title>
         <description>comment from Jim Macdonald on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Bill Higgins #18 --</p>

<p>Can't find your gnomed post.  Sorry about that.</p>

<p>-- JDM</p>]]>
	 &lt;p&gt;Posted August  7, 2013  3:02 PM by Jim Macdonald&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427981</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427981</guid>
         <pubDate>Wed, 07 Aug 2013 15:02:22 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #20 from Fragano Ledgister </title>
         <description>comment from Fragano Ledgister  on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Bill Higgins #18: That's Serge-level punditry.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  3:04 PM by Fragano Ledgister &lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1427986</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1427986</guid>
         <pubDate>Wed, 07 Aug 2013 15:04:32 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #21 from Ken Fletcher</title>
         <description>comment from Ken Fletcher on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>"It's not a bug; it's a feature!"</p>

<p>A useful optional setting to discourage clandestine copying of documents thick with numbers. Could maybe even track the individual copy machine, and when the copies were made.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  3:45 PM by Ken Fletcher&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428006</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428006</guid>
         <pubDate>Wed, 07 Aug 2013 15:45:32 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #22 from David Goldfarb</title>
         <description>comment from David Goldfarb on  7.Aug.13</description>
         <content:encoded><![CDATA[<p><strong>dcb @<a href="#1427332" rel="nofollow">8</a>:</strong> Moving between <em><strong>any</strong></em> two standard paper sizes is easier with European A/B series than with the stupid American letter/ledger/architectural etc. non-system.  I worked in a copy shop for two decades, and people were constantly wanting documents designed for letter size blown up to ledger or 24x36, or something larger reduced to letter, and it was always a pain because the aspect ratio was different.  Copy shop clerks in Europe have it so much easier: A4 to A3 is 141%, A4 to A5 is 77%, drop it on the glass and go.  Sigh.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  4:33 PM by David Goldfarb&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428035</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428035</guid>
         <pubDate>Wed, 07 Aug 2013 16:33:53 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #23 from albatross</title>
         <description>comment from albatross on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>The reason this seems like a big deal to me is that I am extremely skeptical that Xerox is the only copier/scanner on which this is a problem.  We have spent some years now trying to get rid of paper in as many places as possible, ranging from voting to e-receipts to e-prescriptions to MERS (electronic mortgage records).  And in all cases, there were cost and hassle savings up front.  But part of the cost is that the underlying paper documents go away, and we can easily end up with *nothing but* electronic records.  If those records are wrong--via glitch or misentry or tampering--there is nothing to check them against.</p>

<p>Our electronic technology is not reliable or secure enough to support this.  With a fallback paper record, you have something that neither a glitch nor an attacker can change, which can be used to decide what should be in the electronic records.  Without that, you trust the electronic records whether they deserve it or not.  </p>]]>
	 &lt;p&gt;Posted August  7, 2013  4:35 PM by albatross&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428037</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428037</guid>
         <pubDate>Wed, 07 Aug 2013 16:35:28 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #24 from Nancy Lebovitz</title>
         <description>comment from Nancy Lebovitz on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>The reason this seems like a big deal to me is that even if it's "just" two Xerox models (how long has this been going on?), it's probably enough to wreck a lot of subtly caused havoc. It wouldn't surprise me if it's enough to get people killed.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  4:47 PM by Nancy Lebovitz&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428046</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428046</guid>
         <pubDate>Wed, 07 Aug 2013 16:47:38 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #25 from eric</title>
         <description>comment from eric on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>I predict that, no matter the real cause, the users will be blamed. <br />
Where really, it's negligent use of a compression algorithm. </p>]]>
	 &lt;p&gt;Posted August  7, 2013  5:33 PM by eric&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428088</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428088</guid>
         <pubDate>Wed, 07 Aug 2013 17:33:52 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #26 from P J Evans</title>
         <description>comment from P J Evans on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>23<br />
The company I worked at was scanning (at a high resolution) all their old maps, and at a more reasonable resolutions all their other construction documents. Because they have to be able to track what was done until it's removed permanently, possibly 80 to 90 years.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  5:45 PM by P J Evans&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428097</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428097</guid>
         <pubDate>Wed, 07 Aug 2013 17:45:15 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #27 from Matthew Brown</title>
         <description>comment from Matthew Brown on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>Eric@25:</p>

<p>Yes, I agree.  This error falls outside of user expectations of a scanner or copier.  Expected failure modes include unreadable scans.  Readable-but-wrong scans fall outside of that.</p>

<p>I haven't seen anywhere mention if this is the normally configured compression mode of these devices, or whether it's an option the user has to set.  Regardless, though, this is not a user error, since it falls outside of reasonable expectations.  It'll just influence how many people have been bitten by this error.</p>

<p>Is it possible to determine if your scanned PDFs from one of these devices were scanned with the problematic compression option?  If not, nobody who's got documents scanned with these can trust them.</p>

<p>Otherwise, it'll be hard to know if one is among the affected users and thus one will generally have to assume your documents might be wrong.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  6:35 PM by Matthew Brown&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428127</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428127</guid>
         <pubDate>Wed, 07 Aug 2013 18:35:26 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #28 from Tom Whitmore</title>
         <description>comment from Tom Whitmore on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>It's not just scanned PDFs, Matthew Brown -- it's simple photocopies. Things that look like they're a printed picture of the original, only with these anomalies. The copier makes a digital file, then prints that file, rather than simply taking a picture and printing that -- and this error comes in when it makes the digital file.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  6:59 PM by Tom Whitmore&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428145</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428145</guid>
         <pubDate>Wed, 07 Aug 2013 18:59:16 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #29 from Matthew Brown</title>
         <description>comment from Matthew Brown on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>At least one of the linked articles I read mentioned that it's only the PDFs that are affected: is that inaccurate?</p>]]>
	 &lt;p&gt;Posted August  7, 2013  7:19 PM by Matthew Brown&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428151</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428151</guid>
         <pubDate>Wed, 07 Aug 2013 19:19:13 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #30 from Heather Rose Jones</title>
         <description>comment from Heather Rose Jones on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>In my mind I am running through all the sorts of documents scanned at my Place of Employment where the scan/copy is treated as equivalent to the original document. Documents like QC test results or Certificates of Analysis shipped with our products as proof that they meet specifications. Lots of numerals in pharmaceutical manufacturing documents. I don't believe there are any points where disposition decisions are made based on a scan/copy rather than an original document (or more often on purely electronic data). But there are certainly points where scanned/copied documents are used to document those decisions, which could present the <i>appearance</i> of non-conformance.</p>]]>
	 &lt;p&gt;Posted August  7, 2013  9:32 PM by Heather Rose Jones&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428226</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428226</guid>
         <pubDate>Wed, 07 Aug 2013 21:32:59 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #31 from lorax</title>
         <description>comment from lorax on  7.Aug.13</description>
         <content:encoded><![CDATA[<p>As I understand the nature of the issue from reading the linked article, there is no <i>a priori</i> reason why this shouldn't affect alphabetical characters as well as numerical ones in similar circumstances (different isolated blocks of characters occurring in different locations in a primarily non-textual document).  Things like names, for instance. </p>]]>
	 &lt;p&gt;Posted August  7, 2013  9:37 PM by lorax&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428232</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428232</guid>
         <pubDate>Wed, 07 Aug 2013 21:37:53 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #32 from Doug Burbidge</title>
         <description>comment from Doug Burbidge on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>David Goldfarb @22:</p>

<p>> A4 to A5 is 70.7%</p>

<p>FTFY.</p>

<p>Going up a size, of course, involves multiplying by √2 (i.e. 141%); going down a size involves dividing by √2 (i.e. multiplying by 70.7%).</p>

<p>Another annoying property of US paper is that there are several different weight scales, all of which are abbreviated to "pounds": bond, index, cover, etc.  The rest of the world has just one scale: gsm, or grams per square metre -- a calculation made simple when you know that an A0 sheet is one square metre, or 16 A4 sheets is one square metre.</p>]]>
	 &lt;p&gt;Posted August  8, 2013 12:12 AM by Doug Burbidge&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428338</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428338</guid>
         <pubDate>Thu, 08 Aug 2013 00:12:24 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #33 from Doug Burbidge</title>
         <description>comment from Doug Burbidge on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Mr. Kriesel's sample scans for room dimensions show the error in a rectangular block, containing the dimension number inside the block.  The JBIG2 encoder inside the Xerox scanner has decided that this rectangular block is a single glyph, and has (incorrectly) replaced the whole glyph.</p>

<p>Further down, he shows scans where '6' has been substituted with '8'.</p>

<p>Wikipedia says that JBIG2 can use either of two methods for encoding data that it thinks is text: pattern matching and substitution, or soft pattern matching.  Re the first method, it says "substitution errors could be made during the process if the image resolution is low."  The second method stores the differences, so nominally even if it guessed the wrong glyph, the JBIG2 viewer used to view the encoded file would start from the wrong glyph and apply the differences, thus producing something with the correct appearance, which is all you need.</p>

<p>You sometimes see this when copy-pasting text from a JBIG2-encoded PDF: the text looks roughly correct in the PDF, but when you paste into Word or whatever, you see 'I' substituted for '1', or 'O' for '0' or whatever.  (This could also be a problem, but not as insidious as the one Mr. Kriesel has identified.)<br />
</p>]]>
	 &lt;p&gt;Posted August  8, 2013 12:15 AM by Doug Burbidge&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428343</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428343</guid>
         <pubDate>Thu, 08 Aug 2013 00:15:57 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #34 from dcb</title>
         <description>comment from dcb on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>David Goldfarb @22: Yeah. I really only realised the US problem when I was visiting a specialist library and suggested to another person there that they could reduce-copy to give two pages to a side - which, as you know, is so easy with A4 etc. I was flabbergasted when I realised that two American letter size sheets didn't easily reduce-copy-70% (or 71%, since as Doug Burbidge says @32 it should be 70.7%) onto one sheet of American letter. Whoever designed the "A" system deservs a prize - and we just take the advantages for granted.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  4:10 AM by dcb&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428471</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428471</guid>
         <pubDate>Thu, 08 Aug 2013 04:10:36 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #35 from Lila</title>
         <description>comment from Lila on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Heather Rose Jones @#30, that was the first thing that leapt to mind about EMR (electronic medical records)--that the PDF file would make it look like someone had gotten the wrong dosage of meds, or too strenuous an exercise prescription (wrist curls with 8# 2 weeks after surgery?), and hello malpractice suit.</p>

<p>Scanning paper documents and storing them as PDFs is pretty common practice for EMR, at least in the small clinics I'm familiar with.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  7:47 AM by Lila&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428619</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428619</guid>
         <pubDate>Thu, 08 Aug 2013 07:47:14 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #36 from Dave Bell</title>
         <description>comment from Dave Bell on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Lila @35</p>

<p>I read the original report, and what struck me was that the examples they showed were of small print, only just large enough to resolve some of the distances between a 6 and an 8. It was the sort of thing you could get with a 9-pin dot-matrix printer, a huge relative pixel size. It was marginally readable even without the errors.</p>

<p>If critical records are in print that small, it maybe isn't the Xerox machine that would be your problem.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  8:24 AM by Dave Bell&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428644</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428644</guid>
         <pubDate>Thu, 08 Aug 2013 08:24:35 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #37 from oldster</title>
         <description>comment from oldster on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>dcb @34:</p>

<p>"Whoever designed the "A" system deserves a prize - and we just take the advantages for granted."</p>

<p>That would be Georg Christoph Lichtenberg for the really cool part of the insight (i.e. seeing the role of root-2), and Walter Porstmann for specifying it to the metric system and making it a standard.  (From the Wiki page for "paper size".)</p>

<p>Probably no prizes will be forthcoming, but some  people refer to the aspect ratio as the "Lichtenberg ratio."  Having your name used as a nickname for root-2 is pretty cool, and better than most prizes.  I mean, what prize could I possibly prefer to having some fundamental constant referred to as "the oldster number"? </p>

<p> Pie?</p>]]>
	 &lt;p&gt;Posted August  8, 2013  9:51 AM by oldster&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428707</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428707</guid>
         <pubDate>Thu, 08 Aug 2013 09:51:22 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #38 from Bob Webber</title>
         <description>comment from Bob Webber on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Xerox Workcenter like Bank of America: JBIG2 FAIL.</p>]]>
	 &lt;p&gt;Posted August  8, 2013 10:13 AM by Bob Webber&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428730</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428730</guid>
         <pubDate>Thu, 08 Aug 2013 10:13:58 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #39 from Clifton</title>
         <description>comment from Clifton on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Forwarded the original link to my wife yesterday, because I knew that she and her workplace have spent part of the past year scanning documents on a Xerox WorkCenter for permanent storage in PDF form.  Uh-oh....</p>

<p>The kind of work she does could be (and sometimes is) the subject of lawsuits and/or "administrative hearings" though it's not the kind that usually depends on the values of a few digits.</p>

<p>Meanwhile on Mefi, a user comments that they've seen this problem in the form of inappropriate keming, and had been wondering what caused that - so it's not just numbers, even if numbers seem to be particularly vulnerable .</p>]]>
	 &lt;p&gt;Posted August  8, 2013 12:46 PM by Clifton&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428827</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428827</guid>
         <pubDate>Thu, 08 Aug 2013 12:46:20 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #40 from SarahS</title>
         <description>comment from SarahS on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Did anyone else immediately think, "Oh my lord...royalty statements!"</p>]]>
	 &lt;p&gt;Posted August  8, 2013  2:56 PM by SarahS&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1428909</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1428909</guid>
         <pubDate>Thu, 08 Aug 2013 14:56:52 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #41 from David Goldfarb</title>
         <description>comment from David Goldfarb on  8.Aug.13</description>
         <content:encoded><![CDATA[<p><strong>Doug Burbidge @<a href="#1428338" rel="nofollow">32</a>:</strong> Thanks for the correction.</p>

<p>Half letter size is actually close enough in aspect ratio to letter that reducing two pages onto one isn't that hard; usually you just have to clip a little bit of whitespace around the edge.  The other stuff I mentioned is much more annoying.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  5:16 PM by David Goldfarb&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429007</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429007</guid>
         <pubDate>Thu, 08 Aug 2013 17:16:00 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #42 from Bill Higgins-- Beam Jockey</title>
         <description>comment from Bill Higgins-- Beam Jockey on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Oldster in #37:</p>

<p>That Lichtenberg guy must have been a million laughs.  <a href="http://beamjockey.livejournal.com/208495.html" rel="nofollow">He was the Jon Singer of his day</a>.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  6:35 PM by Bill Higgins-- Beam Jockey&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429056</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429056</guid>
         <pubDate>Thu, 08 Aug 2013 18:35:37 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #43 from Jacque</title>
         <description>comment from Jacque on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>This is fascinating, as I am, at this very moment, proofing a scanned document. Maybe I'm just behind the times, but I'm still boggled that the OCR tech is as good as it is. And, to be fair, the original document has <sup>eentsy weentsy teeny tiny</sup> type, printed on that gray, speckled "recycled" paper that was all the rage back in the '90s, so that the speckle in the paper is less than an order of magnitude smaller than the features in the font. So I am seeing quite a bit of...interpretation. Unsurprisingly, commas become semicolons and so on. Interestingly, less in the numerals than in the letters.</p>

<p>For example: </p>

<ul>
<li>Home &gt; Honie</li>
<li>Petroleum &gt;  Petroi"Lim</li>
<li>PROPERTIES &gt;  Pl'loPERTIES, or l'flOPER:ms</li>
<li>Equipment &gt; Equip,;ent, or Eqilipnieht (the latter is obviously the German spelling)</li>
</ul>

<p>(I find this perversely amusing. If you cock your head and squint just so, you can kinda see how it got there.)</p>

<p>But I do see some numeracy fails:</p>

<ul>
<li>222,195,990 &gt; 222, 195,99\)_</li>
<li>6,775,900 &gt; 6,775,90P</li>
<li>57,147,790 &gt; 5}; 147:790</li>
<li>71,242 &gt; 7<i>i</i> ,242</li>
</ul>

<p>Interestingly, the errors actually <i>look</i> correct on the PDF, as compared to the paper. It's only when the text is copied out and pasted into another program that the parsing errors show up.</p>

<p>I haven't yet caught it out swapping one number for another. But I haven't actually proofed my processed content against the actual paper yet, either.</p>

<p>But I would never, in a million years, even think about putting out the processed text as "correct," without proofing it first.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  6:41 PM by Jacque&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429058</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429058</guid>
         <pubDate>Thu, 08 Aug 2013 18:41:31 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #44 from Jacque</title>
         <description>comment from Jacque on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Okay, then. Just finished proofing, and I did, indeed, find three instances where it changed the numbers. (One of them where it saw 13 and read it as 8.)</p>]]>
	 &lt;p&gt;Posted August  8, 2013  7:11 PM by Jacque&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429076</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429076</guid>
         <pubDate>Thu, 08 Aug 2013 19:11:42 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #45 from P J Evans</title>
         <description>comment from P J Evans on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>OCR translations of text can be almost as fun as running things through Babelfish a few times.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  8:13 PM by P J Evans&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429108</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429108</guid>
         <pubDate>Thu, 08 Aug 2013 20:13:49 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #46 from Cally Soukup</title>
         <description>comment from Cally Soukup on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>Proofing OCRed text for Distributed Proofreaders for Project Gutenberg has made me smile every time I see the word "arid". Because, you see, in an OCRed text, ninety-nine times or more out of a hundred the word is supposed to be "and". Also: watch out for he/be errors. That's another very, very common swap, in both directions.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  8:26 PM by Cally Soukup&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429110</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429110</guid>
         <pubDate>Thu, 08 Aug 2013 20:26:03 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #47 from P J Evans</title>
         <description>comment from P J Evans on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>I spent a few months proofing OCRd text. My favorites were when the software scrambled 'District' to 'Omelet' and turned 'legal obligation' to 'lethal obligation' (well, it <em>was</em> about DC bonds).</p>]]>
	 &lt;p&gt;Posted August  8, 2013  8:33 PM by P J Evans&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429116</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429116</guid>
         <pubDate>Thu, 08 Aug 2013 20:33:32 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #48 from Erik Nelson</title>
         <description>comment from Erik Nelson on  8.Aug.13</description>
         <content:encoded><![CDATA[<p>This is what I deal with every day on the job.</p>]]>
	 &lt;p&gt;Posted August  8, 2013  8:46 PM by Erik Nelson&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429129</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429129</guid>
         <pubDate>Thu, 08 Aug 2013 20:46:03 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #49 from albatross</title>
         <description>comment from albatross on  9.Aug.13</description>
         <content:encoded><![CDATA[<p>eric:</p>

<p>The users will be blamed, but shipping a product with a normal setting that can cause this kind of damage is an amazingly bad idea.  And I'll bet that there are other copiers and scanners with the same problems.  <br />
</p>]]>
	 &lt;p&gt;Posted August  9, 2013 11:03 AM by albatross&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429817</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429817</guid>
         <pubDate>Fri, 09 Aug 2013 11:03:35 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #50 from Alan Hamilton</title>
         <description>comment from Alan Hamilton on  9.Aug.13</description>
         <content:encoded><![CDATA[<p>This is inherent to anything using JBIG2 compression, so yeah, it could affect other documents. I've seen the same issue on scanned documents converted to PDFs on a PC. For example, this <a href="http://www.airdisaster.com/reports/ntsb/AAR86-05.pdf" rel="nofollow">NTSB report</a>. There are a lot of typos and weird font substitutions. This was converted using PDFWriter 3 in 2000, so this is nothing new.</p>]]>
	 &lt;p&gt;Posted August  9, 2013 12:01 PM by Alan Hamilton&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1429875</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1429875</guid>
         <pubDate>Fri, 09 Aug 2013 12:01:54 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #51 from Lin Daniel</title>
         <description>comment from Lin Daniel on 10.Aug.13</description>
         <content:encoded><![CDATA[<p>Speech to text software does some amusing stuff, too. jan finder would send me email with uncorrected speech to text. There were times I'd have to read it out loud, with his inflections, to figure out what it was he was saying. And once, sent it back saying, "I love it when you talk dirty to me" because it was totally unintelligible. </p>]]>
	 &lt;p&gt;Posted August 10, 2013  7:28 PM by Lin Daniel&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1431485</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1431485</guid>
         <pubDate>Sat, 10 Aug 2013 19:28:27 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #52 from Kevin Reid</title>
         <description>comment from Kevin Reid on 11.Aug.13</description>
         <content:encoded><![CDATA[<blockquote>Interestingly, the errors actually look correct on the PDF, as compared to the paper. It's only when the text is copied out and pasted into another program that the parsing errors show up.</blockquote>

<p>An OCR'd PDF simply has the text computed by the OCR algorithm placed <em>behind</em> a copy of the original image, so that it is invisible (but still possible to select). When you select some text, you're actually getting the hidden character text, but it happens to line up with the part of the original image.</p>

<p>(Similar scenario that was a topic a while ago: failed redactions of PDFs, where black boxes are just added on top, but the original text is unaltered.)</p>]]>
	 &lt;p&gt;Posted August 11, 2013  1:09 PM by Kevin Reid&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1432384</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1432384</guid>
         <pubDate>Sun, 11 Aug 2013 13:09:20 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #53 from oldster</title>
         <description>comment from oldster on 11.Aug.13</description>
         <content:encoded><![CDATA[<p>Relax, people!  </p>

<p>It's all cool.  Xerox knew about this all along. In fact, they warned you about it, in their documentation.</p>

<p>From a user-manual: <br />
 “Normal/Small produces small files by using advanced compression techniques. Image quality is acceptable but some quality degradation and *character substitution errors may occur* with some originals.”  [asterisks added].</p>

<p>From the IBM spox: <br />
"You are also correct that we have documented in our user guides as well as within our devices that the high compression mode may cause character substitution which means we have known about the potential for this issue. Our design philosophy was to make available a very useful mode that creates small files while at the same time providing information about its limitations."</p>

<p>Both of these quotes are from their blog:</p>

<p>http://realbusinessatxerox.blogs.xerox.com/2013/08/07/update-on-scanning-issue-software-patches-to-come/#.UgfbRmTF2RY</p>

<p>where they also say that they are working on a software patch for this.</p>

<p>I give them some credit for being very honest on the blog.  Of course, Corporate Counsel's Office doesn't mind being honest, because they know that they warned about it in the original users manuals.  And that warning means that they are shielded from any liability.</p>

<p>Don't you feel more relaxed now?  Xerox does!</p>]]>
	 &lt;p&gt;Posted August 11, 2013  2:49 PM by oldster&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1432450</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1432450</guid>
         <pubDate>Sun, 11 Aug 2013 14:49:19 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #54 from Pfusand</title>
         <description>comment from Pfusand on 11.Aug.13</description>
         <content:encoded><![CDATA[<p>The OCR error I was most grateful to spot was the one that changed "She licked her lips" into "She licked her hips."</p>]]>
	 &lt;p&gt;Posted August 11, 2013  2:50 PM by Pfusand&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1432454</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1432454</guid>
         <pubDate>Sun, 11 Aug 2013 14:50:40 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #55 from David Weingart</title>
         <description>comment from David Weingart on 13.Aug.13</description>
         <content:encoded><![CDATA[<p>Pfusand @ 54:  She'd have to be pretty flexible</p>]]>
	 &lt;p&gt;Posted August 13, 2013  1:34 PM by David Weingart&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1434381</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1434381</guid>
         <pubDate>Tue, 13 Aug 2013 13:34:32 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #56 from Pfusand</title>
         <description>comment from Pfusand on 13.Aug.13</description>
         <content:encoded><![CDATA[<p>David @ 55,<br />
Well, she was an alien, but not that sort of alien.</p>]]>
	 &lt;p&gt;Posted August 13, 2013  3:39 PM by Pfusand&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1434480</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1434480</guid>
         <pubDate>Tue, 13 Aug 2013 15:39:44 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #57 from Allan Beatty</title>
         <description>comment from Allan Beatty on 13.Aug.13</description>
         <content:encoded><![CDATA[<p>Corporate counsel may think they are honest when there is a warning buried deep in a manual.</p>

<p>It would be more honest if the button on the screen said "Sub-Normal" instead of "Normal".</p>]]>
	 &lt;p&gt;Posted August 13, 2013  6:55 PM by Allan Beatty&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1434563</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1434563</guid>
         <pubDate>Tue, 13 Aug 2013 18:55:08 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #58 from Lee</title>
         <description>comment from Lee on 14.Aug.13</description>
         <content:encoded><![CDATA[<p>Allan, #57: Exactly. A format subject to errors of this type should never have been set as the DEFAULT. Designating it as "smallest" or "ultra-compressed" or, yes, "sub-normal" -- combined with the manual's warning that in using this mode you risk substitution errors -- would have been the ethical approach. <br />
</p>]]>
	 &lt;p&gt;Posted August 14, 2013 11:48 AM by Lee&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1435052</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1435052</guid>
         <pubDate>Wed, 14 Aug 2013 11:48:22 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #59 from Mary Aileen</title>
         <description>comment from Mary Aileen on 14.Aug.13</description>
         <content:encoded><![CDATA[<p>Lee (58): Daniel@10 says the overly compressed setting was not the factory default, but calling it "Normal" implies that it's perfectly okay. And yes, that's *very* problematic.</p>]]>
	 &lt;p&gt;Posted August 14, 2013  1:14 PM by Mary Aileen&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1435103</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1435103</guid>
         <pubDate>Wed, 14 Aug 2013 13:14:41 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #60 from Renee</title>
         <description>comment from Renee on 15.Aug.13</description>
         <content:encoded><![CDATA[<p>Hmm. I've seen read errors of this sort when using a hand-held scanner to do inventories of barcoded items. Here I thought such errors were all caused by smudged barcodes. Nice to know what causes this -- in the sense that 'nice' means I can now explain why random nonsense is showing up in the data.</p>

<p>...And now I get to worry about non-nonsensical number swaps. Sigh.</p>]]>
	 &lt;p&gt;Posted August 15, 2013 10:27 PM by Renee&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1436265</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1436265</guid>
         <pubDate>Thu, 15 Aug 2013 22:27:38 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #61 from paul</title>
         <description>comment from paul on 16.Aug.13</description>
         <content:encoded><![CDATA[<p>Not that it really matters, but how many of the people who use a typical officer copier even have access to the user manual, much less have read the thing cover to cover?</p>]]>
	 &lt;p&gt;Posted August 16, 2013  3:21 PM by paul&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1437017</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1437017</guid>
         <pubDate>Fri, 16 Aug 2013 15:21:37 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #62 from Kaleberg</title>
         <description>comment from Kaleberg on 18.Aug.13</description>
         <content:encoded><![CDATA[<p>The real problem is that only one in a thousand font designers realizes that "6" and "8" are different characters. The rest of them seem to lack the mental capacity to understand the concept. I'm not surprised a Xerox machine couldn't tell a six from an eight. Most of the time, I can't either.<br />
</p>]]>
	 &lt;p&gt;Posted August 18, 2013  7:40 PM by Kaleberg&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#1439676</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#1439676</guid>
         <pubDate>Sun, 18 Aug 2013 19:40:48 -0500</pubDate>
      </item>
      
      <item>
         <title>The Implications are Staggering -- comment #63 from Daniel Martin</title>
         <description>comment from Daniel Martin on  1.Sep.15</description>
         <content:encoded><![CDATA[<p>The discoverer of the bug has recorded an hour-long presentation on the experience of finding and reporting the bug here: <a href="https://www.youtube.com/watch?v=c0O6UXrOZJo" rel="nofollow">https://www.youtube.com/watch?v=c0O6UXrOZJo</a>.</p>

<p>It includes some interesting details not available when this first broke, and among other things I need to retract my statement in comment 10: <b>there are Xerox scanners that, in the factory default setting, mangled text including substituting some characters for others</b>.</p>

<p>Patches now exist for those pieces of hardware, but many places haven't applied them. If you depend on scanned documents that would be made useless by the transposition of one or more characters, ensure that your IT people know of this problem and have applied the appropriate patches.</p>]]>
	 &lt;p&gt;Posted September  1, 2015  2:03 PM by Daniel Martin&lt;/p&gt;</content:encoded>
         <link>http://nielsenhayden.com/makinglight/archives/015331.html#4217056</link>
         <guid isPermaLink="true">http://nielsenhayden.com/makinglight/archives/015331.html#4217056</guid>
         <pubDate>Tue, 01 Sep 2015 14:03:30 -0500</pubDate>
      </item>
      
   </channel>
</rss>