Skip to main content

Table 3 The top 25 most common words in the full unprocessed corpus of 358,949 words with exact counts and proportion of the word amongst all words shown

From: Unsupervised identification of crime problems from police free-text data

Word

Count

Proportion

Door

13,031

0.052

Dwelling

10,819

0.043

House

9217

0.037

Rear

8226

0.033

Window

6473

0.026

Entp

5714

0.023

Front

5473

0.022

Occupied

5336

0.021

Seen

4801

0.019

Detached

4097

0.016

Extp

3896

0.015

Suspect

3534

0.014

Unknown

3523

0.014

Property

3415

0.014

Semi

3070

0.012

Entry

3056

0.012

Lower

2918

0.012

Single

2767

0.011

Lock

2390

0.009

Attack

2328

0.009

Terraced

2302

0.009

Aentp

2261

0.009

Side

2017

0.008

Victim

1965

0.008

Kitchen

1879

0.007