You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+11-9Lines changed: 11 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,6 +12,8 @@ Tesseract OCR Lambda Layer
12
12
13
13
> AWS Lambda layer containing the [tesseract OCR](https://github.com/tesseract-ocr/tesseract) libraries and command-line binary for Lambda Runtimes running on Amazon Linux 1 and 2.
14
14
15
+
> :warning:[The Amazon Linux AMI (Version 1) is being deprecated](https://aws.amazon.com/blogs/aws/update-on-amazon-linux-ami-end-of-life/). Users are advised to not use Lambda runtimes (i.e. Python 3.6) based on this version. Refer also to the [AWS Lambda runtime deprecation policy](https://docs.aws.amazon.com/lambda/latest/dg/lambda-runtimes.html#runtime-support-policy).
## Building a different tesseract version and/or language
@@ -155,21 +157,21 @@ Per default the build generates the [tesseract 4.1.3](https://github.com/tessera
155
157
The build process can be modified using different build time arguments (defined as `ARG` in `Dockerfile.al[1|2]`), using the `--build-arg` option of `docker build`.
156
158
157
159
| Build-Argument | description | available versions |
|`TESSERACT_VERSION`| the tesseract OCR engine |https://github.com/tesseract-ocr/tesseract/releases|
160
162
|`LEPTONICA_VERSION`| fundamental image processing and analysis library |https://github.com/danbloomberg/leptonica/releases|
161
163
|`OCR_LANG`| Language to install (in addition to `eng` and `osd`) |https://github.com/tesseract-ocr/tessdata (`<lang>.traineddata`) |
162
164
|`TESSERACT_DATA_SUFFIX`| Trained LSTM models for tesseract. Can be empty (default), `_best` (best inference) and `_fast` (fast inference). |https://github.com/tesseract-ocr/tessdata, https://github.com/tesseract-ocr/tessdata_best, https://github.com/tesseract-ocr/tessdata_fast|
163
-
|`TESSERACT_DATA_VERSION`| Version of the trained LSTM models for tesseract. (currently - in July 2022 - only `4.1.0` is available) |https://github.com/tesseract-ocr/tessdata/releases/tag/4.1.0|
165
+
|`TESSERACT_DATA_VERSION`| Version of the trained LSTM models for tesseract. (currently - in July 2022 - only `4.1.0` is available) |https://github.com/tesseract-ocr/tessdata/releases/tag/4.1.0|
164
166
165
167
166
168
**Example of custom build**
167
169
168
170
```bash
169
-
## Build a Dockerimage based on Amazon Linux 1, with Tesseract 4.0.0
0 commit comments