Index

Subject : [lug] Digest (1 messages)

From : lug-owner@lists.ncsu.[redacted]

Date : Thu, 11 Sep 2014 07:15:48 -0400


The Lug Digest
Volume 1 : Issue 333 : "text" Format

Messages in this Issue:
201409/3 : OCR Help
Jeffery Mewtamer <mewtamer@gmail.[redacted]>

----------------------------------------------------------------------

Date: Mon, 8 Sep 2014 21:39:38 +0000
From: Jeffery Mewtamer <mewtamer@gmail.[redacted]>
To: lug@lists.ncsu.[redacted]
Subject: OCR Help
Message-ID: <CAO2sX32p1SwEHo8wWsh4Q8A2kXrGMQO+X-G+oA7dGZTKH7nbXg@mail.gmail.[redacted]>

Good Evening,

I am a blind Linux user. As that relates to this message, it means I
have to do most things from the command line and I often have to
convert documents to plain text. For most formats I deal with on a
regular basis, I've found ways to extract the text that I can use
despite my disability, but I am having trouble working with documents
whose content is largely image based. Poppler-utils's pdfimages
command makes extracting images from PDF files, but I haven't found a
good way of extracting images from other document formats. However,
the more pressing issue at the moment is extracting text from images.

I've been using cuneiform to perform Ocular Character Recognition on
images, and while it works well enough on images with plain English
text, it tends to produce gibberish when processing images that
contain mathematical formulas and other math-related things, and
handouts and homework for the math classes I'm taking seem to be the
most common reason for me needing to do OCR.

Attached are several .pbm images I need OCR performed on along with
the output I got from cuneiform after cleaning it up a bit to
illustrate how poorly it's meeting my OCR needs. I've also attached
the bash scrip I use for processing many images at once. I've also
tried Tesseract, but its more complicated command line format makes it
harder to use and my tests seemed to indicate that cuneiform tended to
have better results for less effort.

Any suggestions for a command-line OCR program that could do a better
job at making useable text-files from pages of scanned mathematics
would be greatly appreciated.
Name: Jeffery Wright
Date: 2014/08/23
MATH 351-Linear Algebra Homework Assignment 1: Systems of Linear Equations

Solve the system by the substitution method.

1. x+2y=6
y = 2x + 3

x + 2(3x + 3) = 6
7x + 6 = 6
7x = 0
x = 0

Y = 3(0) + 3
y = 3

ordered pair = (0, 3)

2. xr = 7xz+ 9
xz = 5xz — 1

3. 7x, + 9xz = -44
4xx- xz=24

II. Solve the system by the Elimination of Addition Method.

1. x + y = -6
x — y= 2

2. Xz+ 4xz = 6
Sxz + 3xz = -4

3. 4lz — 5lz = -6
5lz — 3lz = -4

III. Solve the following system.

1. 6x — y+ 2z = 15
x+ 2y — z =-1
2x + 2y — 3z = -1

2. 5xz+ 3xz — Sxs = -11
3xz — 4xz + 3xs = 6
6xz — Sxz + 6xs = 21


[Attachment of type image/x-portable-bitmap removed.]
Name: Jeffery Wright
Date: 2014/08/23
MATH 351-Linear Algebra
Homework 2: Matrix Transformation

1. Give the size of the following matrices.
1 2 3 ' 0 9 A. 0 1 2 I B.— 6 4 C. D.1 2 3
4533 — 231245 2. Give the (1,1), (2,2), (3,3), (1,5), (2,4) and (3,2) element of the following matrix.
1 2 3 1 — 1 — 2 4 — 5 0 6
5 8 9 3 3
3. Write down the identity matrix for I4.

|1 0 0 0|
|0 1 0 0|
|0 0 1 0|
|0 0 0 1|

4. Determine the coefficient matrix (CM) and the augmented matrix (AM) of each of the following systems of equations.

A. x+4y= 7
2x-5y = -3

CM:
|1 4|
|2 -5|

AM:
|1 4 : 7|
|2 -5 : -3|

B. 5x + 2y — 4z = 8 x+ 3y+ 6z =4 4x+ 6y — 9z = 7 C. 5'+ 2x> — 4x> = 8 4x,+3x>=0
-x, = 7


xz =12
x> = 8 5. Interpret the following matrices as augmented matrices of systems of equations. Write down each system of equations.
8 7 5 B. 4 6 2
9 3 7

6. In the following exercise you are given a matrix followed by an elementary row operation. Determine each resulting matrix. 2 6 — 4 0 1 2 — 3 6 1. NR1 2. R1~R3 3 2 5 2 3 -1 1 7 1 1..R2+ R1 2. -2R1+ R3 -4 5 -3 7. Solve the following systems of equations. A. xz — 2xz = -8
2x> — 3xz = -11 B. x, + xg = 3 2xz — 2x> = -4 x,— 2x,= 5



[Attachment of type image/x-portable-bitmap removed.]


[Attachment of type image/x-portable-bitmap removed.]
MATH 351-Linear Algebra Homework Assignment 3-Gaussian and Gauss Jordan
Elimination

Determine whether or not each of the following matrices is
in row echelon form. 1 2 1 A. 0 1 3 0 0 0 1 5 6 2 B. 0 1 04 0 0 1 2 1 3
4 2 C.0025 0 0 0 1 0 0 0 0 DI1 24 6 0012 0 13 3 0001

II. Determine whether or not each of the following matrices is in
reduced row echelon form.

A.102 B.1256 1 0 0 C.O 1 0 0 0 1





D.I 1 0 0 3 2 '. '0 2 0 6 1! j00123! Solve the following by Gaussian Elimination. A.10241 . 0 1 -3 5 -6
00 00 0 B. x, +4xq+3x>=3. 2x, + 8x~ + 11x~ =7 x, + 6x~ + 7xs = 3 C. xy+xz+xs = 7 2x, + 3x~ + xq = 18 -xg+ xp -3' = 1 D. Xz+ 2' = 5 xg+2xz+Sxs= 13 xg+ 2' =4 Solve the following by Gauss-Jordan Elimination A 002-22
3-391
44-2 111 B. 3x,— 3x~+3xs= 9 2xg — xz+ 4' = 7 3x> -Sx> — x> = 7 C. Xy + xz + xz = 6 xg — xp+ xp = 2 x, + 2x~ + 3' = 14



[Attachment of type image/x-portable-bitmap removed.]


[Attachment of type image/x-portable-bitmap removed.]


[Attachment of type application/x-sh removed.]

------------------------------

End of [lug] Digest (1 messages)
**********