San Diego Homelessness Contract Annotations

Processed TagTog annotations for homelessness contracts collected by the San Diego County Taxpayers Association.

sdcta.org-hl_contracts-1.1.1. Modified 2021-08-12T19:25:41

Resources

  • annotations. Extracted and processed annotations.
  • contexts. Surrounding paragraphs for the annotations, linked to them by the ‘part’ column.
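
Since the two resources share the ‘part’ column, each annotation can be paired with its surrounding paragraph by a join. The following is a minimal sketch, not the package author's own method: the sample rows and their values are hypothetical stand-ins for the real CSV contents, and it assumes each ‘part’ value appears at most once in contexts.

```python
import pandas as pd

# Hypothetical sample rows; real values come from annotations.csv and contexts.csv.
annotations_df = pd.DataFrame({
    "part": ["s1p1", "s1p1", "s1p2"],
    "classid": ["e_1", "e_2", "e_1"],
    "text": ["$1,200", "2021", "$300"],
})
contexts_df = pd.DataFrame({
    "part": ["s1p1", "s1p2"],
    "context": ["First paragraph ...", "Second paragraph ..."],
})

# A left join keeps every annotation, even when no context row matches its part.
annotated = annotations_df.merge(contexts_df, on="part", how="left")
```

If ‘part’ values turn out to repeat in contexts, this join would duplicate annotation rows, so it is worth checking the row count before and after.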

Documentation

Processed TagTog annotations for homelessness contracts collected by the San Diego County Taxpayers Association.

Contacts

Data Dictionary

annotations

Column Name        Data Type  Description
classid            string
part               string
offset_start       integer
text               text
coordinates        string
confidence         string
confidence_prob    number
fields             string
normalizations     string
who                string
file_name          string
html_file_name     string
coordinates_0_x    number
coordinates_0_y    number
coordinates_1_x    number
coordinates_1_y    number
value              integer
anno_type          string
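
The types in this table can be passed to pandas when reading the file, so that columns are not left to type inference. This is a sketch under stated assumptions: the mapping from the dictionary's type names to pandas dtypes ("number" to float64, "integer" to nullable Int64, "string" and "text" to string) is a choice made here, not something the package specifies, and the CSV sample is hypothetical and covers only a subset of the columns.

```python
import io
import pandas as pd

# Types transcribed from the data dictionary above. The pandas dtype chosen
# for each dictionary type is an assumption of this sketch.
ANNOTATION_DTYPES = {
    "classid": "string", "part": "string", "offset_start": "Int64",
    "text": "string", "coordinates": "string", "confidence": "string",
    "confidence_prob": "float64", "fields": "string",
    "normalizations": "string", "who": "string", "file_name": "string",
    "html_file_name": "string", "coordinates_0_x": "float64",
    "coordinates_0_y": "float64", "coordinates_1_x": "float64",
    "coordinates_1_y": "float64", "value": "Int64", "anno_type": "string",
}

# Hypothetical sample standing in for annotations.csv (subset of columns).
sample_csv = io.StringIO(
    "classid,part,offset_start,text,confidence_prob,value\n"
    "e_1,s1p1,12,\"$1,200\",1.0,1200\n"
    "e_2,s1p2,0,2021,0.5,\n"
)

# Read only the header first, then apply dtypes for the columns present.
header = pd.read_csv(sample_csv, nrows=0).columns
sample_csv.seek(0)
dtypes = {col: t for col, t in ANNOTATION_DTYPES.items() if col in header}
df = pd.read_csv(sample_csv, dtype=dtypes)
```

Using the nullable Int64 dtype lets an integer column such as value hold missing entries without being converted to floats.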

contexts

Column Name        Data Type  Description
part               string
context            text

References

URLs used in the creation of this data package.

  • data/homelessness-contracts-20210811.zip. Zip file downloaded from TagTog with text and annotations, dated 2021-08-11.

Packages

Accessing Data in Vanilla Pandas

import pandas as pd

# Read each resource's CSV file directly from the package URL
annotations_df = pd.read_csv('http://library.metatab.org/sdcta.org-hl_contracts-1.1.1/data/annotations.csv')
contexts_df = pd.read_csv('http://library.metatab.org/sdcta.org-hl_contracts-1.1.1/data/contexts.csv')

Accessing Package in Metapack

import metapack as mp
pkg = mp.open_package('http://library.metatab.org/sdcta.org-hl_contracts-1.1.1.zip')

# Create Dataframes
annotations_df = pkg.resource('annotations').dataframe()
contexts_df = pkg.resource('contexts').dataframe()