This project analyzes what profiles profitable applications have in common in the App Store and Google Play, the two major application stores. As of 2018, there were 4 million apps in the App Store and Google Play. Analyzing all of them would take significant time and money, so we are going to analyze a sample instead. I'm going to use
The goal of the project is to identify the traits of profitable applications and to define a strategy for creating one.
We are only analyzing free apps aimed at an English-speaking audience. We are going to remove:
As the Instagram example shows, there are duplicates in the data. There is only one Instagram app, but there are four records of it. One way to identify the most recent record is the number of reviews: the assumption is that the more reviews a record has, the more recent it is.
Steps to identify duplicates
Steps to identify the most recent data
Steps to store data without duplicates
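The duplicate handling described above can be sketched roughly as follows. This is a minimal sketch, not necessarily the exact code used in this project: the function name `remove_duplicates`, the column positions, and the sample rows (including the review counts) are hypothetical and only illustrate the "keep the record with the most reviews" assumption.

```python
def remove_duplicates(rows, name_index, reviews_index):
    """Keep one row per app name: the row with the most reviews."""
    # Pass 1: record the highest review count seen for each app name.
    max_reviews = {}
    for row in rows:
        name = row[name_index]
        n_reviews = float(row[reviews_index])
        if name not in max_reviews or n_reviews > max_reviews[name]:
            max_reviews[name] = n_reviews

    # Pass 2: keep the first row that matches the highest count.
    # The set guards against two rows sharing the same maximum.
    clean = []
    already_added = set()
    for row in rows:
        name = row[name_index]
        n_reviews = float(row[reviews_index])
        if n_reviews == max_reviews[name] and name not in already_added:
            clean.append(row)
            already_added.add(name)
    return clean


# Made-up rows for illustration: [app name, category, review count]
sample = [
    ["Instagram", "SOCIAL", "100"],
    ["Instagram", "SOCIAL", "250"],
    ["Instagram", "SOCIAL", "100"],
    ["Instagram", "SOCIAL", "90"],
    ["Notes", "PRODUCTIVITY", "12"],
]
deduplicated = remove_duplicates(sample, name_index=0, reviews_index=2)
print(deduplicated)
```

Two passes keep the logic simple: the first finds the target review count per app, the second copies exactly one matching row into the clean list.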
The goal is to determine which profiles are popular on both platforms, because the more users you attract, the more likely your app is to be profitable. To minimize risk and overhead, the validation strategy for an app is:
It looks like both data sets provide Genre or Category columns.
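One way to compare such columns is a frequency table expressed in percentages. The sketch below assumes the data is held as a list of rows; the names `freq_table` and `display_table` and the toy rows are hypothetical, and the percentages it prints describe only those toy rows, not the real data sets.

```python
def freq_table(rows, index):
    """Return {value: percentage of rows} for the column at `index`."""
    counts = {}
    for row in rows:
        value = row[index]
        counts[value] = counts.get(value, 0) + 1
    total = len(rows)
    return {value: count / total * 100 for value, count in counts.items()}


def display_table(rows, index):
    """Print the frequency table sorted from most to least common."""
    table = freq_table(rows, index)
    ordered = sorted(table.items(), key=lambda item: item[1], reverse=True)
    for value, percentage in ordered:
        print(f"{value}: {percentage:.2f}")


# Toy rows for illustration only: [app name, genre]
toy_rows = [
    ["App A", "Games"],
    ["App B", "Games"],
    ["App C", "Social Networking"],
    ["App D", "Education"],
]
genre_table = freq_table(toy_rows, 1)
display_table(toy_rows, 1)
```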
From the result above we can say
Compared to the iOS data, it's much messier to navigate. It also seems to be more detailed. For instance, Role Playing, Strategy, Adventure, etc. would most likely all be categorized as Games in the iOS data.
Now this data is tidier. What we see is that:
One way to see how many users an app has is to look at how many times it has been installed. This information is missing from the iOS data. The next best thing we can use is the total number of ratings.
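Using the rating count as a stand-in for installs, the average number of ratings per genre could be computed along these lines. This is a sketch under assumptions: the helper name `average_by_genre`, the column positions, and the toy rows are hypothetical.

```python
def average_by_genre(rows, genre_index, rating_count_index):
    """Return {genre: average number of ratings per app in that genre}."""
    totals = {}
    counts = {}
    for row in rows:
        genre = row[genre_index]
        n_ratings = float(row[rating_count_index])
        totals[genre] = totals.get(genre, 0.0) + n_ratings
        counts[genre] = counts.get(genre, 0) + 1
    return {genre: totals[genre] / counts[genre] for genre in totals}


# Toy rows for illustration only: [app name, genre, total rating count]
toy_ios = [
    ["App A", "Games", "100"],
    ["App B", "Games", "300"],
    ["App C", "Social Networking", "1000"],
]
averages = average_by_genre(toy_ios, genre_index=1, rating_count_index=2)
for genre, avg in sorted(averages.items(), key=lambda item: item[1], reverse=True):
    print(genre, avg)
```

An average like this can be dominated by a handful of very large apps, so it is worth looking at the individual apps in a genre before drawing conclusions from it.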
It's interesting. There are lots of games out there. However, what people use most in the App Store is Social Networking. M
Now we are going to look at the Android data. It does have the number of installations, but it's not precise. The values are in the format 100+, 1000+, 5000+, etc., so we don't know the exact number: 5000+ could mean 6000, 7000, or 9999. For our purposes here, that should be enough.
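To do arithmetic on these values, the strings first have to be turned into numbers. The sketch below treats each value as its lower bound, so 5000+ becomes 5000. The function name `parse_installs` is hypothetical, and the handling of thousands separators such as 1,000+ is an assumption about how the raw column might be formatted.

```python
def parse_installs(text):
    """Convert an install bucket such as '5000+' to its lower bound."""
    # Strip the trailing '+' and any thousands separators (assumed format).
    cleaned = text.strip().replace(",", "").replace("+", "")
    return float(cleaned)


# Illustrative values only, in the formats described above.
raw_values = ["100+", "1000+", "5000+", "1,000,000+"]
parsed = [parse_installs(value) for value in raw_values]
print(parsed)
```

Because every bucket is rounded down, totals and averages built on these numbers underestimate the true install counts, but the ordering between categories is still usable for comparison.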
The result is the same as iOS. The most popular catego