Package 'pdftables'

Title: Programmatic Conversion of PDF Tables
Description: Allows the user to convert PDF tables to formats more amenable to analysis ('.csv', '.xml', or '.xlsx') by wrapping the PDFTables API. In order to use the package, the user needs to sign up for an API account on the PDFTables website (<https://pdftables.com/pdf-to-excel-api>). The package works by taking a PDF file as input, uploading it to PDFTables, and returning a file with the extracted data.
Authors: Eric Persson [aut, cre]
Maintainer: Eric Persson <[email protected]>
License: CC0
Version: 0.1
Built: 2025-01-02 03:10:42 UTC
Source: https://github.com/expersso/pdftables

Help Index


Convert PDF Tables to format more amenable to analysis

Description

Convert PDF Tables to format more amenable to analysis

Usage

convert_pdf(input_file, output_file = NULL, format = "csv",
  message = TRUE, api_key = Sys.getenv("pdftable_api"))

Arguments

input_file

The PDF file to be converted. If this is a url to a PDF rather than a file path then it is downloaded to a temporary file via download.file before being sent to pdftables for conversion. If input_file is a url then output_file must not be NULL.

output_file

The desired name for the output file

format

One of 'csv', 'xml', 'xlsx-single', 'xlsx-multiple'

message

If TRUE, outputs a message that conversion was successful

api_key

Your API key (from https://pdftables.com)

Value

Creates an output file with the converted PDF table

Examples

## Not run: 
write.csv(head(iris), file = "test.csv", row.names = FALSE)

# Open test.csv and print as PDF to "test.pdf"

convert_pdf("test.pdf", "test2.csv")

## End(Not run)

Retrieve the number of pages left on your account

Description

Retrieve the number of pages left on your account

Usage

get_remaining(api_key = Sys.getenv("pdftable_api"))

Arguments

api_key

Your API key (from https://pdftables.com)

Value

A numeric vector of length 1

Examples

## Not run: get_remaining()