Accidents dataset


 

Dataset summary

Records538,989
Columns4
Memory usage2.06 MB
Attribute Type / # categories Info Memory usage
Driver_Age_Band 8 16 - 20, 21 - 25, 26 - 35, 36 - 45, 46 - 55, 56 - 65, 66 - 75, Over 75 527.22 KB
Driver_IMD 10 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 526.56 KB
Sex 2 Female, Male 526.70 KB
Journey 5 2,Commuting to/from work, 3,Taking pupil to/from school, 4,Pupil riding to/from school, 5 Other/Not known, Part of work 527.00 KB

Variable profiles

Driver_Age_Band

Categorical Ordered
Categories 8
Most frequent 26 - 35 (102,643 values, 19.04%)
Least frequent Over 75 (13,041 values, 2.42%)
Missings 61,389 (11.39%)
Memory 527.22 KB
Driver_Age_Band Count Freq. Driver_Age_Band Frequency
16 - 20 70032 12.99% 16 - 20
12.99% 
21 - 25 67785 12.58% 21 - 25
12.58% 
26 - 35 102643 19.04% 26 - 35
19.04% 
36 - 45 92768 17.21% 36 - 45
17.21% 
46 - 55 69615 12.92% 46 - 55
12.92% 
56 - 65 42675 7.92% 56 - 65
7.92% 
66 - 75 19041 3.53% 66 - 75
 
3.53%
Over 75 13041 2.42% Over 75
 
2.42%
Driver_Age_Band Count Frequency
16 - 20 70032 12.99%
21 - 25 67785 12.58%
26 - 35 102643 19.04%
36 - 45 92768 17.21%
46 - 55 69615 12.92%
56 - 65 42675 7.92%
66 - 75 19041 3.53%
Over 75 13041 2.42%
Driver_Age_Band
16 - 20
12.99% 
21 - 25
12.58% 
26 - 35
19.04% 
36 - 45
17.21% 
46 - 55
12.92% 
56 - 65
7.92% 
66 - 75
 
3.53%
Over 75
 
2.42%

Driver_IMD

Categorical Ordered
Categories 10
Most frequent 2 (37,705 values, 7.00%)
Least frequent 10 (26,931 values, 5.00%)
Missings 199,869 (37.08%)
Memory 526.56 KB
Driver_IMD Count Freq. Driver_IMD Frequency
1 36829 6.83% 1
6.83% 
2 37705 7.00% 2
7.00% 
3 37102 6.88% 3
6.88% 
4 36079 6.69% 4
6.69% 
5 35090 6.51% 5
6.51% 
6 34549 6.41% 6
6.41% 
7 33076 6.14% 7
6.14% 
8 31671 5.88% 8
5.88% 
9 30088 5.58% 9
5.58% 
10 26931 5.00% 10
 
5.00%
Driver_IMD Count Frequency
1 36829 6.83%
2 37705 7.00%
3 37102 6.88%
4 36079 6.69%
5 35090 6.51%
6 34549 6.41%
7 33076 6.14%
8 31671 5.88%
9 30088 5.58%
10 26931 5.00%
Driver_IMD
1
6.83% 
2
7.00% 
3
6.88% 
4
6.69% 
5
6.51% 
6
6.41% 
7
6.14% 
8
5.88% 
9
5.58% 
10
 
5.00%

Sex

Categorical Ordered
Categories 2
Most frequent Male (365,266 values, 67.77%)
Least frequent Female (137,270 values, 25.47%)
Missings 36,453 (6.76%)
Memory 526.70 KB
Sex Count Freq. Sex Frequency
Female 137270 25.47% Female
25.47% 
Male 365266 67.77% Male
67.77% 
Sex Count Frequency
Female 137270 25.47%
Male 365266 67.77%
Sex
Female
25.47% 
Male
67.77% 

Journey

Categorical Ordered
Categories 5
Most frequent 5 Other/Not known (378,749 values, 70.27%)
Least frequent 4,Pupil riding to/from school (1,337 values, 0.25%)
Missings 0 (0.00%)
Memory 527.00 KB
Journey Count Freq. Journey Frequency
2,Commuting to/from work 47450 8.80% 2,Commuting to/from work
8.80% 
3,Taking pupil to/from school 7183 1.33% 3,Taking pupil to/from school
 
1.33%
4,Pupil riding to/from school 1337 0.25% 4,Pupil riding to/from school
 
0.25%
5 Other/Not known 378749 70.27% 5 Other/Not known
70.27% 
Part of work 104270 19.35% Part of work
19.35% 
Journey Count Frequency
2,Commuting to/from work 47450 8.80%
3,Taking pupil to/from school 7183 1.33%
4,Pupil riding to/from school 1337 0.25%
5 Other/Not known 378749 70.27%
Part of work 104270 19.35%
Journey
2,Commuting to/from work
8.80% 
3,Taking pupil to/from school
 
1.33%
4,Pupil riding to/from school
 
0.25%
5 Other/Not known
70.27% 
Part of work
19.35% 

Correlations

Cramér's V — categorical variables

Crosstab heatmaps — categorical pairs

Created by pandas-cat version 0.1.5