All functions

create_message()

Creates message string for reporting during procedures

dhms()

Formats time difference as X days HH:MM:SS

disambr_add_set_attr()

Add disambr attribures to focal set from template set and update some of them

disambr_filter_authors_by_researcher_ids()

Subsets WoS authors table with Researcher IDs

disambr_get_first_data_set()

Get first data set in list sets

disambr_get_last_set()

Gets last set in sets which strength less or equal than 0.5

disambr_get_last_unstrong_set()

Gets last set from sets with strength <= 0.5 and excludes from this set all sets with strength of 1

disambr_get_last_weak_set()

Gets last set in sets which strength less or equal than 0.5

disambr_get_output_set()

Gets output set from sets (in case we already made it)

disambr_get_strong_set()

Get sets with strength parameter of 1 and rbind them into single set

disambr_get_truth_set()

Get sets with strength parameter of 10 (ground thruth)

disambr_in_sets()

Checks if sets with certain attribures are present

disambr_make_wos_tsv_author_year_citations()

Makes citations table by matching first author - year keys. This, of course, can produce false positives matched

disambr_make_wos_tsv_authors()

Parses all WoS field related to authors and makes author table

disambr_make_wos_tsv_publications()

Make set of publications

disambr_make_wos_tsv_references()

Make references table from WoS tsv data

disambr_match_authors_if_sharing_coauthors()

Make a set of matched authors that share co-authors

disambr_merge_authors_if_citing_others_papers()

Make a set of matched authors bases on the cases when one author cites the others paper.

disambr_merge_authors_if_citing_self_citation()

Make a set of matched authors based on cases when one author cites others self citation. Self-citations here are detected based on DOI.

disambr_merge_authors_with_common_keywords()

Makes set of authors with number of keywords in their papers in common

disambr_merge_authors_with_common_references()

Make set of authors that have number of references in common

disambr_merge_authors_with_same_affiliation()

Makes set of matched authors based on same affiliation

disambr_merge_authors_with_same_emails()

Makes set of matched authors bases on same email addresses

disambr_merge_authors_with_same_researcher_ids()

For testing. Makes so called ground truth set based on same research IDs

disambr_merge_authors_with_similar_initials()

Makes set of similar authors based on their initials

disambr_merge_authors_with_similar_last_names()

Makes set of similar authors based on their last names

disambr_message()

Report a message with message()

disambr_message_finish()

Post a starting message for disambr procedure. Records time started in disambr_start_time variable in its parent.frame()

disambr_message_start()

Post a starting message for disambr procedure. Records time started in disambr_start_time variable in its parent.frame()

disambr_read()

Reads the data for disambiguation

disambr_read_file()

Reads file based on file extention

disambr_read_output_set()

Reads last output set saved on disk

disambr_read_tsv()

Read tsv file

disambr_read_tsv_wos()

Reads WoS tsv export file and makes disambr set out of it (just adding some attributes to the data.table)

disambr_save_set()

Save set and adds file attribute

disambr_set_attr()

Adds attribures to the set with data.table::setattr

disambr_split_authors_if_on_the_same_paper()

Makes sets of co-authors assuming that all authors on paper are different person.

disambr_stats()

Make a statistics for a collection of sets (specific algorithm) and writes to the file in sets_dir

disambr_stats_table()

Makes comparative table with statistics

disambr_stop()

Report a message with stop()

disambr_subsets()

Filters list of sets

disambr_warning()

Report a message with warning()

disambr_wos_tsv_parse_af()

Parses AF (author full name) column of WoS saved records export

disambr_wos_tsv_parse_au()

Parses AU column of WoS saved records export

disambr_wos_tsv_parse_c1()

Parses C1 (author adress/affiliation) column of WoS saved records export

disambr_wos_tsv_parse_cr()

Parses WoS CR (Cited References) record into separate talbe

disambr_wos_tsv_parse_em()

Parses EM (email) column of WoS saved records export

disambr_wos_tsv_parse_oi()

Parses OI column of WoS saved records export

disambr_wos_tsv_parse_ri()

Parses RI column of WoS saved records export

disambr_wos_tsv_parse_rp()

Parses RP (reprint author) column of WoS saved records export

get_file_extension()

Extention extractor. Same as tools::file_ext but for NULL input returns NULL instead of logical(0).

get_upper_triangle_index()

Returns the index of element in upper triangle of squared n by n matrix

match_fuzzy()

Fuzzy match all combinations of character vector

parse_files_path()

Returns vector of file paths from path(s) recursively

read_to_utf8()

Reads file as UTF-8, convert it if other encoding is deteted

recode_return_characters()

Fixed end of line characters in wierd text

stop_unless()

Stops process unless cond is true