Blogger Export Analyzer (BEA) is a simple analyzer for Blogger export XML files. I created it to get some statistics from the data in a Blogger export file. I was hoping the Blogger Stats data would be part of the export file, but it isn’t. The code is written in Python 3 and licensed under the MIT License.

The following is a sample output:


= Blogger Export Analyzer 0.0.2 ==============================================

YJL --verbose by Yu-Jie Lin
Outputs directly from me <strike>about almost everything</strike>...

- General --------------------------------------------------------------------

930 Posts 258.727 per year 21.561 per month
391 Comments 108.777 per year 9.065 per months 0.420 per post
2 Pages
0 Drafts
2,041 Labels

First post <- 3.6 years -> Last post
Let's Make Some Garbages <- 43 months -> Multitasking with storytell...
2008-09-13 16:13:00-07:00 <- 1312 days -> 2012-04-18 10:56:00-07:00

- Posts ----------------------------------------------------------------------

246,339 Words 264.881 per post
1,315,911 Chars 1,414.958 per post
4,970 Labels 5.344 per post

- Comments -------------------------------------------------------------------

34 out of 391 Comments are not counted in this section.

. Top Commenters .............................................................

125 ( 35.0%): livibetter
9 ( 2.5%): Calidan
7 ( 2.0%): Vajrasar
6 ( 1.7%): Derick Dalton Lee
5 ( 1.4%): Mario César
4 ( 1.1%): Jain
4 ( 1.1%): Guilherme Lino
3 ( 0.8%): zizukabi
3 ( 0.8%): MHazell
3 ( 0.8%): Lenama7

. Most Commented Posts .......................................................

34 ( 9.5%): Get ready for this Falling Snow Season!
24 ( 6.7%): Stick div at top after scrolling
17 ( 4.8%): Using Django's I18N in Google App Engine
16 ( 4.5%): Adobe AIR 1.5 on Fedora 10 x86_64
14 ( 3.9%): Migrating to tmux from GNU/Screen
10 ( 2.8%): Follow mouse for x11grab of FFmpeg
9 ( 2.5%): Sigh, glad I still have Disqus on my side
9 ( 2.5%): jQuery plugin jk navigation
9 ( 2.5%): Bad value X-UA-Compatible for attribute http-equiv on eleme...
8 ( 2.2%): Installing Woopra 1.2 beta on Ubuntu amd64

. Most Commented Posts Over Days Since Published aka. Popular Posts ..........

1.000: Sigh, glad I still have Disqus on my side
0.400: One Day Without Shoes 2012
0.375: Silly retard filename for removal
0.308: The Reading list in Blogger Dashboard
0.111: Better Bitbucket Explore
0.062: Earth Hour, a one-hour globally fanatic phenomenon?
0.038: Follow mouse for x11grab of FFmpeg
0.033: Disquise
0.030: Stick div at top after scrolling
0.027: Get ready for this Falling Snow Season!

- Posts and Comments by Month ------------------------------------------------

YYYY-MM Posts | Comments
2008-09 18 #####| 1
2008-10 25 #######|## 6
2008-11 57 ##################|##### 11
2008-12 51 ################|############################## 63
2009-01 32 ##########|########## 23
2009-02 13 ####|### 8
2009-03 27 ########|######## 18
2009-04 41 #############|#### 10
2009-05 14 ####|# 3
2009-06 1 |# 4
2009-07 1 | 1
2009-08 1 | 2
2009-09 1 | 0
2009-10 27 ########|## 6
2009-11 18 #####|#### 9
2009-12 16 #####|#### 9
2010-01 16 #####|# 3
2010-02 3 |# 4
2010-03 2 | 0
2010-04 9 ##| 0
2010-05 19 ######| 0
2010-06 2 | 0
2010-07 1 | 0
2010-08 53 ################| 0
2010-09 62 ###################|########################### 57
2010-10 8 ##|# 3
2010-11 51 ################|##### 11
2010-12 10 ###|### 8
2011-01 26 ########| 1
2011-02 56 #################|####### 16
2011-03 5 #|#### 9
2011-04 0 | 2
2011-05 0 |# 3
2011-06 0 |# 4
2011-07 13 ####| 2
2011-08 18 #####|###### 14
2011-09 27 ########|##### 12
2011-10 0 |# 4
2011-11 0 |# 4
2011-12 0 | 1
2012-01 16 #####|# 3
2012-02 42 #############|###### 14
2012-03 94 ##############################|######### 20
2012-04 54 #################|########## 22

- General --------------------------------------------------------------------

2,041 Labels labled 4,970 times 2.435 Labeled per label

. Most Labeled Labels ........................................................

117 ( 2.4%): OldBlogBlogarbage
95 ( 1.9%): OldBlogTuxWearsFedora
93 ( 1.9%): Python
74 ( 1.5%): OldBlogGetCtrlBack
72 ( 1.4%): Bash
63 ( 1.3%): JavaScript
61 ( 1.2%): OldBlogmakeYJL
58 ( 1.2%): thought
51 ( 1.0%): Google
50 ( 1.0%): Blogger

. Least Labeled Rate .........................................................

1396 ( 68.4%) Labels labeled 1 times
293 ( 14.4%) Labels labeled 2 times
131 ( 6.4%) Labels labeled 3 times
62 ( 3.0%) Labels labeled 4 times
38 ( 1.9%) Labels labeled 5 times
24 ( 1.2%) Labels labeled 6 times
19 ( 0.9%) Labels labeled 7 times
16 ( 0.8%) Labels labeled 8 times
12 ( 0.6%) Labels labeled 9 times
4 ( 0.2%) Labels labeled 10 times
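
The rates in the General section can be reproduced by hand. The following is only a sketch of the arithmetic, not BEA's actual code: it assumes a year is counted as 365 days, a month as one twelfth of a year, and that only whole days between the first and last post are used. The function name `rates` is made up for illustration.

```python
from datetime import datetime, timedelta, timezone


def rates(count, first, last):
    """Return (per_year, per_month) for `count` items between two datetimes."""
    days = (last - first).days   # whole days only, partial day is dropped
    years = days / 365           # assumption: a year is 365 days
    per_year = count / years
    return per_year, per_year / 12


# Timestamps of the first and last post, taken from the sample output.
pdt = timezone(timedelta(hours=-7))
first = datetime(2008, 9, 13, 16, 13, tzinfo=pdt)
last = datetime(2012, 4, 18, 10, 56, tzinfo=pdt)

per_year, per_month = rates(930, first, last)
print('%d days' % (last - first).days)
print('%.3f per year, %.3f per month' % (per_year, per_month))
print('%.3f comments per post' % (391 / 930))
```

With the numbers from the sample output, this gives 1312 days, 258.727 posts per year, 21.561 per month, and 0.420 comments per post, which matches what BEA printed.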

The output is plain text, like my Google Analytics report, and I don’t intend to add options for customization unless someone requests them with a good reason. The statistics you see in the sample output above are pretty much everything I have planned. I will only add new stuff when I get a new idea.
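
For the curious, the bars in "Posts and Comments by Month" are easy to draw in plain text. This is a rough sketch rather than BEA's real implementation: it assumes each bar is scaled against the busiest month to a width of 30 characters and truncated to a whole number of `#`, which is consistent with the sample output. `month_row` and its parameters are hypothetical names.

```python
def month_row(month, posts, comments, max_posts, max_comments, width=30):
    """Format one row: posts bar grows leftwards, comments bar rightwards."""
    # Integer arithmetic truncates, so a bar never exceeds `width` characters.
    post_bar = '#' * (posts * width // max(max_posts, 1))
    comment_bar = '#' * (comments * width // max(max_comments, 1))
    return '%s %3d %s|%s %3d' % (
        month, posts, post_bar.rjust(width), comment_bar.ljust(width), comments)


# A few (month, posts, comments) rows copied from the sample output.
data = [('2008-11', 57, 11), ('2008-12', 51, 63), ('2012-03', 94, 20)]
max_posts = max(p for _, p, _ in data)
max_comments = max(c for _, _, c in data)

for month, posts, comments in data:
    print(month_row(month, posts, comments, max_posts, max_comments))
```

Right-justifying the posts bar and left-justifying the comments bar keeps the `|` separator in one column, so both series can be compared at a glance.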

One number I wanted to see is the label count, which has confirmed that my labeling has gone out of control: 1,396 labels are used only once. I always knew I was unable to tame my labeling misbehavior. ;p
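
Counting how often each label is used takes only a few lines. The sketch below is not BEA's code. It assumes the export file is an Atom feed in which labels appear as `category` elements with the scheme `http://www.blogger.com/atom/ns#`, so check your own export before relying on it. `SAMPLE` is a made-up two-post feed and `label_usage` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET
from collections import Counter

ATOM = '{http://www.w3.org/2005/Atom}'
# Assumption: Blogger marks label categories with this scheme.
LABEL_SCHEME = 'http://www.blogger.com/atom/ns#'

# Made-up miniature export, only for illustration.
SAMPLE = '''<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <category scheme="http://www.blogger.com/atom/ns#" term="Python"/>
    <category scheme="http://www.blogger.com/atom/ns#" term="Bash"/>
  </entry>
  <entry>
    <category scheme="http://www.blogger.com/atom/ns#" term="Python"/>
  </entry>
</feed>'''


def label_usage(xml_text):
    """Return a Counter mapping each label to the number of entries using it."""
    root = ET.fromstring(xml_text)
    usage = Counter()
    for entry in root.iter(ATOM + 'entry'):
        for cat in entry.iter(ATOM + 'category'):
            if cat.get('scheme') == LABEL_SCHEME:
                usage[cat.get('term')] += 1
    return usage


usage = label_usage(SAMPLE)
# How many labels were used once, twice, and so on.
rate = Counter(usage.values())
for times, n in sorted(rate.items()):
    print('%d (%5.1f%%) Labels labeled %d times'
          % (n, 100 * n / len(usage), times))
```

In my case the same calculation gives 1,396 out of 2,041 labels used once, or 68.4%.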