As you can see, the vast majority of the tweets did not become very popular, while a very small number (too few to generate a bar more than a pixel or two high) went extremely viral. Imbalanced distributions like this one are a significant challenge in this dataset. Remember that the core idea behind visualizing data is to gain meaningful insight into the underlying data. This histogram is too unbalanced to achieve that goal. The leftmost bar, for example, lumps together tweets that range from 0 to several thousand retweets. Shouldn't a visualization also capture the nuances within this wide range?
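To make the imbalance concrete, here is a minimal sketch that compares equal-width bins with bins whose edges grow by powers of ten. The retweet counts are made up for illustration (they are not taken from the dataset), and the helper `bin_counts` is a hypothetical name introduced only for this example. Log-spaced bins are one common way to handle heavy-tailed data, not necessarily the approach used elsewhere in this text.

```python
from bisect import bisect_right

# Synthetic, made-up retweet counts with a heavy tail (1,000 tweets in total).
retweets = (
    [0] * 500 + [1] * 200 + [5] * 120 + [40] * 90
    + [300] * 50 + [2500] * 25 + [18000] * 10 + [95000] * 5
)


def bin_counts(values, edges):
    """Count values per bin [edges[i], edges[i+1]); the last bin also includes its right edge."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        idx = bisect_right(edges, v) - 1
        idx = min(idx, len(counts) - 1)  # place the maximum value in the last bin
        counts[idx] += 1
    return counts


# Ten equal-width bins from 0 to the maximum, as in a default histogram.
n_bins = 10
max_value = max(retweets)
linear_edges = [max_value * i / n_bins for i in range(n_bins + 1)]
linear_counts = bin_counts(retweets, linear_edges)

# Bins whose width grows by a factor of ten, with a separate bin for zero.
log_edges = [0, 1, 10, 100, 1000, 10000, 100000]
log_counts = bin_counts(retweets, log_edges)

print("equal-width bins:", linear_counts)
print("log-spaced bins: ", log_counts)
```

With equal-width bins, the first bin spans 0 to 9,500 retweets and swallows 985 of the 1,000 synthetic tweets, which mirrors the problem described above. The log-spaced bins spread the same tweets across six bins, so the differences among the less popular tweets become visible. In a plotting library such as matplotlib, a similar effect can usually be obtained by passing log-spaced bin edges to the histogram call or by putting an axis on a logarithmic scale.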