Package: RcppCWB 0.6.4

Andreas Blaette

RcppCWB: 'Rcpp' Bindings for the 'Corpus Workbench' ('CWB')

'Rcpp' Bindings for the C code of the 'Corpus Workbench' ('CWB'), an indexing and query engine to efficiently analyze large corpora (<https://cwb.sourceforge.io>). 'RcppCWB' is licensed under the GNU GPL-3, in line with the GPL-3 license of the 'CWB' (<https://www.r-project.org/Licenses/GPL-3>). The 'CWB' relies on 'pcre2' (BSD license, see <http://www.pcre.org/licence.txt>) and 'GLib' (LGPL license, see <https://www.gnu.org/licenses/lgpl-3.0.en.html>). See the file LICENSE.note for further information. The package includes modified code of the 'rcqp' package (GPL-2, see <https://cran.r-project.org/package=rcqp>). The original work of the authors of the 'rcqp' package is acknowledged with great respect, and they are listed as authors of this package. To achieve cross-platform portability (including Windows), using 'Rcpp' for wrapper code is the approach used by 'RcppCWB'.

Authors:Andreas Blaette [aut, cre], Bernard Desgraupes [aut], Sylvain Loiseau [aut], Oliver Christ [ctb], Bruno Maximilian Schulze [ctb], Stephanie Evert [ctb], Arne Fitschen [ctb], Jeroen Ooms [ctb], Marius Bertram [ctb], Tomas Kalibera [ctb]

RcppCWB_0.6.4.tar.gz
RcppCWB_0.6.4.zip(r-4.5)RcppCWB_0.6.4.zip(r-4.4)RcppCWB_0.6.4.zip(r-4.3)
RcppCWB_0.6.4.tgz(r-4.4-x86_64)RcppCWB_0.6.4.tgz(r-4.4-arm64)RcppCWB_0.6.4.tgz(r-4.3-x86_64)RcppCWB_0.6.4.tgz(r-4.3-arm64)
RcppCWB_0.6.4.tar.gz(r-4.5-noble)RcppCWB_0.6.4.tar.gz(r-4.4-noble)
RcppCWB.pdf |RcppCWB.html
RcppCWB/json (API)
NEWS

# Install 'RcppCWB' in R:
install.packages('RcppCWB', repos = c('https://polmine.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/polmine/rcppcwb/issues

Uses libs:
  • glib– GLib library of C routines
  • pcre2– New Perl Compatible Regular Expression Library
  • c++– GNU Standard C++ Library v3

On CRAN:

6.48 score 2 stars 1 packages 83 scripts 665 downloads 99 exports 2 dependencies

Last updated 9 months agofrom:38551b2dfe. Checks:OK: 3 NOTE: 6. Indexed: yes.

TargetResultDate
Doc / VignettesOKNov 21 2024
R-4.5-win-x86_64OKNov 21 2024
R-4.5-linux-x86_64OKNov 21 2024
R-4.4-win-x86_64NOTENov 21 2024
R-4.4-mac-x86_64NOTENov 21 2024
R-4.4-mac-aarch64NOTENov 21 2024
R-4.3-win-x86_64NOTENov 21 2024
R-4.3-mac-x86_64NOTENov 21 2024
R-4.3-mac-aarch64NOTENov 21 2024

Exports:attribute_sizecheck_corpuscheck_cposcheck_idcheck_p_attributecheck_pkg_registry_filescheck_querycheck_region_matrixcheck_registrycheck_s_attributecheck_strucscl_attribute_sizecl_charset_namecl_cpos2idcl_cpos2lboundcl_cpos2rboundcl_cpos2strcl_cpos2struccl_delete_corpuscl_find_corpuscl_id2cposcl_id2freqcl_id2strcl_lexicon_sizecl_list_corporacl_load_corpuscl_regex2idcl_str2idcl_struc_valuescl_struc2cposcl_struc2strcorpus_data_dircorpus_full_namecorpus_info_filecorpus_is_loadedcorpus_p_attributescorpus_propertiescorpus_propertycorpus_registry_dircorpus_s_attributescpos_to_idcpos_to_lboundcpos_to_rboundcpos_to_strcpos_to_struccpos2idcpos2strcqp_drop_subcorpuscqp_dump_subcorpuscqp_get_registrycqp_initializecqp_is_initializedcqp_list_corporacqp_list_subcorporacqp_load_corpuscqp_querycqp_reset_registrycqp_subcorpus_sizecqp_verbositycwb_charsetscwb_compress_rdxcwb_encodecwb_huffcodecwb_makeallcwb_versionget_cbow_matrixget_count_vectorget_pkg_registryget_region_matrixget_tmp_registryid_to_cposid_to_freqid2strids_to_count_matrixmatrix_to_subcorpusp_attrp_attr_defaultp_attr_lexicon_sizep_attr_sizeranges_to_cposregex_to_idregion_matrix_contextregion_matrix_to_count_matrixregion_matrix_to_idsregion_matrix_to_struc_matrixregion_to_strucss_attrs_attr_is_descendents_attr_is_siblings_attr_regionss_attr_relationships_attr_sizes_attribute_decodestr_to_idstruc_to_cposstruc_to_strstruc2cpossubcorpus_get_rangesuse_tmp_registry

Dependencies:fsRcpp

Writing performance code with RcppCWB

Rendered fromvignette.Rmdusingknitr::rmarkdownon Nov 21 2024.

Last update: 2023-03-17
Started: 2018-01-03

Readme and manuals

Help Manual

Help pageTopics
Rcpp Bindings for the Corpus Workbench (CWB).RcppCWB-package RcppCWB
Rcpp wrappers for CWB Corpus Library functionsattribute_size cpos2id cpos2str id2str struc2cpos
Check Input to Rcpp Functions.check check_corpus check_cpos check_id check_p_attribute check_query check_region_matrix check_registry check_strucs check_s_attribute
Check Paths in Registry Filescheck_pkg_registry_files
Get Attribute Size (of Positional/Structural Attribute).cl_attribute_size
Get charset of a corpus.cl_charset_name
Drop loaded corpus.cl_delete_corpus
Load corpus.cl_find_corpus
Get Lexicon Size.cl_lexicon_size
Show CL corporacl_list_corpora
Load corpuscl_load_corpus
Low-level CL access.cl_rework cpos_to_id cpos_to_lbound cpos_to_rbound cpos_to_str cpos_to_struc id_to_cpos id_to_freq p_attr p_attr_lexicon_size p_attr_size regex_to_id struc_to_cpos struc_to_str str_to_id s_attr s_attr_size
Check whether structural attribute has valuescl_struc_values
Using Positional Attributes.CL: p_attributes cl_cpos2id cl_cpos2str cl_id2cpos cl_id2freq cl_id2str cl_regex2id cl_str2id
Using Structural Attributes.CL: s_attributes cl_cpos2lbound cl_cpos2rbound cl_cpos2struc cl_struc2cpos cl_struc2str
Get information from registry filecorpus_data_dir corpus_full_name corpus_info_file corpus_properties corpus_property corpus_p_attributes corpus_registry_dir corpus_s_attributes
Check whether corpus is loadedcorpus_is_loaded
Initialize Corpus Query Processor (CQP).cqp_get_registry cqp_initialize cqp_is_initialized cqp_load_corpus cqp_reset_registry cqp_verbosity
List Available CWB Corpora.cqp_list_corpora
Execute CQP Query and Retrieve Results.cqp_drop_subcorpus cqp_dump_subcorpus cqp_list_subcorpora cqp_query cqp_subcorpus_size
Character sets supported by CWBcwb_charsets
CWB Tools for Creating Corporacwb_compress_rdx cwb_encode cwb_huffcode cwb_makeall
Get CWB versioncwb_version
Get CBOW Matrix.get_cbow_matrix
Get Vector with Counts for Positional Attribute.get_count_vector
Get Registry Directory Within Packageget_pkg_registry
Get Matrix with Regions for Strucs.get_region_matrix
Perform Count for Vector of IDs.ids_to_count_matrix
Create CWB subcorpus from matrix with regions.matrix_to_subcorpus
Get default p-attributep_attr_default
Get IDs and Counts for Region Matrices.ranges_to_cpos region_matrix_context region_matrix_ops region_matrix_to_count_matrix region_matrix_to_ids
Get min and max strucs of s-attribute present in regionregion_matrix_to_struc_matrix region_to_strucs
Explore XML structure of CWB corpuss_attr_is_descendent s_attr_is_sibling s_attr_relationship
Get regions defined by a structural attributes_attr_regions
Decode Structural Attribute.s_attribute_decode
Get ranges of subcorpussubcorpus_get_ranges
Use Temporary Registryget_tmp_registry use_tmp_registry