Data Guy on the Author Earnings Methodology
29 responses to “Data Guy on the Author Earnings Methodology”
[…] via Data Guy on the Author Earnings Methodology | Hugh Howey. […]
QUOTE: “Our Author Earnings charts say “Daily $ Revenue to Authors” because that’s what actually matters to 99.9% of traditionally-published authors END QUOTE
With Amazon’s new pricing tool (in beta) Amazon is helping authors determine pricing strategies to make the most daily income. I’ve priced a few of my books at their higher suggested price to see if it pans out. As long as Amazon is implementing tools to help authors reach their readers, self publishers could gain market share.
I would love more numbers. I would love if Amazon didn’t just show reviews, but number of sales. How many times have i been interested in a book that has very few reviews? lots. If I could see it sold a thousand copies and then see no bad reviews, I would be more likely to take a shot at it.
I would also like Amazon to poll it’s own writers, find out how many people are earning over a certain amount a year writing for them. Is it a thousand, ten thousand? Doesn that mean Amazon self employs more than Walmart?
There might be a hundred writers making a good living in the world of the big 5, and a few hundred more making an okay living, I would like to see Amazons numbers to see once and for all that self publishing supports more people.
It may be obvious to most readers of this blog, but it might be helpful to also point out the fact that ebooks continue to earn royalties forever. So unlike in the past when a typical print book would sell for the first 3-6 months or so and then disappear from bookstore shelves, an book with an ebook version can take 5 to 10 or many more years to earn out, but it is fairly certain that it will eventually do so. This is in contrast to the past when it was widely reported that something like 7 out of 10 books did not earn out their advances which would alter the effective percentage those authors received.
As William says, traditional publishers still think in a “produce model” of thinking that the only sales figures that matter are the first year or so when the book is “fresh”. In contrast ebooks are like the items at a hardware store. Even some little bin of obscure screws or bolts that only sell once in a while will eventually pay for itself.
To be fair, there is a certain amount of truth to the view of most fiction being similar to a produce model. By far the most sales happen in the first few months of a book’s release. If you want confirmation, just look at the statistics from Michael J. Sullivan http://amazingstoriesmag.com/2014/03/authors-earnings-part-3-mid-list-print-sales/
Notice how the sales fall off and then stabilize at a fairly low level. Now some books don’t see that tremendous drop because they start at a modest level and therefore don’t have far to fall. But then again, notice that “long tail” of sales that just keeps bumping right along selling week after week and month after month. Look at that as consistent, ongoing income like Dean Wesley Smith described in today’s Think Like a Publisher post about the return on investment of a book (http://www.deanwesleysmith.com/think-like-a-publisher-2015-chapter-5-return-on-investment/). Notice the small figures of sales that he describes that just keep going and represent a continuing “dividend” on that investment.
Man, I don’t understand this obsession you self-pub guys have with facts and whatnot. We all know Amazon is evil and are trying to kill literature and that the tsunami of crap will doom us all lest we stop it by denying all these losers and their lame “books” that people may or may not want to read.
Facts schmacks. Get over yourselves!
Trad publishing #1!
Thanks Jack. I really needed that wake up call. I will immediately stop checking my ranks and sales on Amazon and go back to writing query letters. I was running low on scrap paper and those rejection letters really com in handy for that.
You two are hilarious!
I desperately needed that laugh. You don’t even know. Thank you so much.
This blog post is most welcome, because the advances of the very few authors at the top of traditional publishing are a frequent argument to dismiss authors earnings conclusions. People like Mike Shatzkin who defend trad pub often use it. I guess that’s it’s a mean for them to reintroduce the Ponzi’s scheme of traditional publishing.
There is a nice straightforward report about authorearnings on Publishers Weekly:
One other significant fact about advances,
If an author doesn’t earn out their advance, they are far less likely to get another one (except for the mega-bestsellers who, as you say, are really just getting lump-sum payments rather than being paid royalties)
so for anyone trying to make a career out of writing, failing to earn out the advances isn’t getting paid more, it’s the end of the road.
Thanks for the post. I’ve always felt Jeremy Greefield’s remarks were not unfounded criticism, and it’s good to see Data Guy answer them this way.
The biggest interrogation that will always remain, though, is who Data Guy is, and how exactly you’ve been able to pull those numbers. I still feel like you ask people for a leap of faith and belief in this study from within the Amazon numbers.
And personally, I’m ready to do this leap of faith and do believe that your numbers reflect reality in today’s author earnings. Because I believe in your cause. But I can also get that people who are persuaded of the contrary just won’t buy numbers where they cannot see solid, tangible proofs of validity.
I’m not saying Data Guy should reveal himself, I’m just saying that it’s comprehensible if some people don’t believe in these reports. Again, I’m not one of these people. At all.
Data guy has explained how he gets the data from amazon. He’s explained it in more detail than most readers can even absorb. The trouble with this is that it is not possible to go beyond what he has already explained. The only thing more he could do would be to post large amounts of the computer code used in his crawler. Obviously very few people would understand the code.
I don’t believe it is terribly complex, but it is definitely technical in a way that is beyond the knowledge of the vast majority of people. Those who do understand the code could verify that it does what he says it does. (I’m not suggesting he do this, by the way.) Or he could put in notes every few lines explaining what the following lines do. He could break that down into exactly how they do it, which would then require trying to educate every reader in the fundamentals of code logic and how they build on each other to make those lines of code work. We’re talking about at least a large textbook full of writing that would take many months to complete.
This is not only unrealistic, it’s a complete waste of time. Who would read that? But my point is, apart from doing that, Data Guy has already done the only thing a technical guy can do in this situation by saying that it’s a crawler (or spider) of some kind, the data is publicly accessible, so it just goes onto the Amazon site and pulls the relevant information needed to make the calculations, which have also been described in detail.
As I mentioned, I doubt if the software is very complex, because the logic required to do this is not very complex. Cross-referencing and compiling the collected data is somewhat more complex, but it isn’t genius-level stuff. I’m sure Data Guy would agree with me that thousands of developers out there could produce the same results if it was something they were interested in. Data Guy happens to be interested in it, so he wrote the code out of curiosity to see what he could find out. The results were so fascinating that he continued to refine and develop the code until it accounted for more and more possibilities. The current results are almost completely unassailable.
The dirty little secret here is not that Data Guy might be an evil genius trying to trick the world, it’s that traditional publishing or any other interested party could easily contract a few developers to write something similar and view the results. The reason they don’t is because they KNOW the data is accurate. (Heck, they might have already done this privately, then buried the results.)
Now don’t get me wrong, in the future they might hire some shady group of devs to purposely produce FAKE results, then put their propaganda machine behind promoting the fake results. They are absolutely not above that. In fact, I think I might go on record as predicting it right now. They really are irredeemable as a group. It’s not surprising, but it is disappointing that they are actually that bad.
Data guy has explained how he gets the data from amazon… I doubt if the software is very complex, because the logic required to do this is not very complex… I’m sure Data Guy would agree with me…
Agree 100%. People seem to get unnecessarily hung up on the technical method of data collection. But there’s nothing here that can’t be done with a notepad and a pencil. This ain’t exactly rocket science :)
Edward W. Robertson has independently done some similar analysis in various genres without using any software code at all. His manual tallies of bestseller count by publisher type match the Author Earnings results quite well.
You don’t need to trust some anonymous “Data Guy” or his software spider to see how ebook market share divides up.
The answers are staring you in the face whenever you check the bestseller lists.
Yeah, just had a look at the spreadsheets. It’s not that technical indeed. A statistician would not be a fan of your daily book sales interpolation method based on friend’s testimonials, but well, that’s another story.
In any case, thanks for pointing this out to me and making me look closely at the numbers :)
Our sales-to-rank calculations are based on a little more than “friends testimonials,” though. Many dozens of authors contributed data and we found it all remarkably consistent.
I think one of the unfortunate ironies of the Author Earnings effort is that anyone who already has a self-published book selling decently on Amazon can take one look at our sales-to-rank graph and immediately say, “Yep. Those are precisely the numbers I see with my own books.” But the folks who could benefit most from this information — the ones who don’t have any self-published books of their own to compare with — are the most likely to doubt it :)
Dear Hugh and Data Guy,
Hi – just so you know where I am coming from. I’m a hybrid author who isn’t doing as well at Amazon with my backlist as I wish – and am all for hybrid/indie authors. So understand this note is not a challenge or argument but I’m bad at math and mathematical concepts so I might be totally off on this, but I’m confused about the report.
Branded top authors such as Lee Child, James Patterson, Nora Roberts, Steve Berry, James Rollins and on and on report that Amazon accounts for 15% or less of their sales since they are in every store from airports to supermarkets.
So how do you factor that into your reports? Or don’t you? And does it matter – or doesn’t it?
Since indie books are not for sale in any of those airports to supermarket places and Amazon accounts for 60%-100% of their sales and since Amazon supports and gives indies pushes -doesn’t it makes sense that indie authors are getting 1/3 or more of the pie chart?
How can we look at the charts and make sense of them in terms of the millions of books selling outside of Amazon.
We’ve been working to get a handle on the print side, too — by looking at BookScan as a source. (Nielsen BookScan is only 65%-70% accurate, but it’s better than nothing).
Sales for a small tier of mega-bestsellers like Patterson, King, Evanovich, Roberts, etc. skew toward brick & mortar print and away from ebooks and online because of the broad brick-and-mortar visibility you mention in airports, supermarkets, etc., and especially because of paid co-op placement in bookstores, which they benefit from disproportionately (Because publishers concentrate marketing spend disproportionately in their biggest-name tentpole authors).
The vast majority of traditionally published and hybrid fiction authors see a very different mix — one that skews much more heavily toward ebooks and online print sales.
Traditionally-published mid-list authors we’ve spoken with report that, according to their royalty statements, 60%-65% of their sales are coming from ebooks now.
Right now, brick-and-mortar print sales make up no more than 33% – 36% of all trade publishing unit sales, averaged across all genres. Online print and audiobook sales (at Amazon and elsewhere) make up another 14% – 17% of trade publishing’s unit sales, while the remaining 45% – 52% are ebooks.
For genres like non-fiction, children’s, and literary the brick-and-mortar print % will be higher, but for romance, sci-fi/fantasy, etc. less than a third(!) of sales are currently coming from brick-and-mortar print.
So I guess my takeaway would be to evaluate each publishing opportunity based on who our peers are in usual sales numbers, as well as genre. If I’m Grisham or Evanovich, and my publisher will guarantee front-rack airport and supermarket placement, buying giant wall-of-my-books displays at the front of bookstores, and a huge dedicated marketing spend, then brick & mortar print will be major factor in my earnings. Otherwise, there probably ain’t much there there, compared to the loss of royalties on the e-book side.
As always, YMMV :)
From the above-referenced blog post:
Now, the biggest bestsellers in the industry — say, James Patterson, or Doug Preston, or Richard Russo, or Scott Turow — sell the majority of their books in paper. After all, they’ve won the distribution lottery and their books are available in every airport kiosk, Wal-Mart, drugstore, and supermarket across the land. So their interest in retarding the growth of digital — where the same distribution is available to everyone — and in preserving the position of paper is identical to that of their publishers. It stands to reason they would fight to maintain the system that has made them so rich. But if you’re a legacy-published author whose sales are increasingly digital, you need to understand that the legacy strategy of pricing ebooks high is costing you money. Is that really something you want to help perpetuate? Yes, it works for James Patterson, but what is it costing you?