Orthographic Measures ===================== Orthographic measures are based on the written form of words. Jiwar calculates the following orthographic measures: Orthographic N (orth_N) ----------------------- **Full Name:** Orthographic Neighborhood Size **Description:** Number and forms of words that differ by one letter via substitution only. **Output Columns:** * ``orth_N``: Number of orthographic neighbors which differ from the target word by one letter via substitution only. * ``orth_N_nbr``: A list of the forms of orthographic neighbors identified in 'orth_N' Orthographic Density (orth_density) ----------------------------------- **Full Name:** Orthographic Neighborhood Density **Description:** Number and forms of words which differ from the target word by one letter via substitution, addition, or deletion. **Output Columns:** * ``orth_density``: Number of orthographic neighbors which differ from the target word by one letter via substitution, addition, or deletion * ``orth_density_nbrs``: A list of the forms of orthographic neighbors identified in 'orth_density' OLD20 ----- **Full Name:** Orthographic Levenshtein Distance-20 **Description:** Average orthographic Levenshtein distance of the 20 closest neighbors. **Output Column:** ``OLD20`` Orthographic Network ----------------------- **Full Name:** Orthographic netowrk science measures **Description:** Measures the interconnectedness of a word's orthographic neighborhood at the near and distant neighbor levels. **Output Columns:** * ``orth_C``: The clustering coefficient C measures the degree to which a word's immediate orthographic neighbors are also orthographic neighbors of each other. * ``orth_2hop_density``: The 2-hop density measures the degree to which a word's immediate and distant orthographic neighbors are also orthographic neighbors of each other. Orthographic Neighbor Frequency ------------------------------- **Full Name:** Orthographic Neighborhood Frequency **Description:** Statistics about the frequencies of orthographic neighboring words. In this measure, neighbors are defined as words differing by one letter via substitution, addition, or deletion. **Output Columns:** * ``orth_nbr_fpm_m``: The mean frequency per million (fpm) of orthographic neighbors. * ``orth_nbr_fpm_SD``: The standard deviation of the frequency per million (fpm) of orthographic neighbors. * ``orth_nbr_fpm_higher_m``: The mean frequency per million (fpm) of orthographic neighbors that have a higher frequency than the target word. * ``orth_nbr_fpm_lower_m``: The mean frequency per million (fpm) of orthographic neighbors that have a lower frequency than the target word. * ``orth_nbr_zipf_m``: The mean Zipf value of orthographic neighbors. * ``orth_nbr_zipf_SD``: The standard deviation of the Zipf values of orthographic neighbors. * ``orth_nbr_zipf_higher_m``: The mean Zipf value of orthographic neighbors that have a higher frequency than the target word. * ``orth_nbr_zipf_lower_m``: The mean Zipf value of orthographic neighbors that have a lower frequency than the target word.