-
大小: 5KB文件類型: .java金幣: 1下載: 0 次發(fā)布日期: 2021-01-02
- 語言: Java
- 標(biāo)簽: PDFBox??坐標(biāo)定位??java??pdf??獲取坐標(biāo)??
資源簡介
基于PDFBox,根據(jù)指定字符串定位起始位置,然后進(jìn)行指定區(qū)域的文本提取。代碼較粗糙,供參考
代碼片段和文件信息
package?com.df.pdf.util;
import?java.awt.Rectangle;
import?java.io.ByteArrayOutputStream;
import?java.io.File;
import?java.io.IOException;
import?java.io.OutputStreamWriter;
import?java.io.Writer;
import?java.util.List;
import?org.apache.commons.lang3.StringUtils;
import?org.apache.pdfbox.pdmodel.PDDocument;
import?org.apache.pdfbox.pdmodel.PDPage;
import?org.apache.pdfbox.text.PDFTextStripper;
import?org.apache.pdfbox.text.PDFTextStripperByArea;
import?org.apache.pdfbox.text.TextPosition;
public?class?ExtractTextByArea?extends?PDFTextStripper?{
private?int?xBegin?=?0;
private?int?yBegin?=?0;
private?int?xEnd?=?0;
private?int?yEnd?=?0;
public?int?getxBegin()?{
return?xBegin;
}
public?void?setxBegin(int?xBegin)?{
this.xBegin?=?xBegin;
}
publ
評(píng)論
共有 條評(píng)論