While the primary purpose of this list is to help software recognise a name when it sees one, it's understandable that some people will also want to use it to 'look up' how a specific name "should" be translated. However:
I've been wanting to put as much as possible into the public domain, so that commercial software like Wenlin, Pleco, Hanping and ChinaScribe as well as community projects like CC-CEDICT and online services can all help learners to read by incorporating these words by default instead of an "after-market" addition. But I was held back by possible 'intellectual property' considerations: if I (as a learner) saw a word in a text, and wanted my software to recognise it next time, I'd add it to my personal dictionary (with extra notes on where and when I saw it, and maybe other thoughts too), but that by itself doesn't mean I can share it: how do I know my source doesn't have some kind of "trademark rights" to their particular way of writing it?
I now understand that most countries' copyright laws do include a provision for third-party indexing, so you can say "I saw this word on page 234 of that book" and not be held liable for copyright infringement of favourite books that feature too often in your list: at worst, your list is an index of your books, which is (in countries that have those provisions) allowed. But you still run the risk of accidentally defaming a book by writing wrong notes---there are "free speech" laws protecting reviews (up to a point), but I quite like the books I read and didn't want to cast them in a bad light by publishing all my misunderstandings.
Eventually, I found a large Chinese Internet search engine that hadn't yet prohibited being used by automatic programs, and was able to write a program that queried it for every word in my database, to get some measure of which words were common enough to warrant disregarding my reading notes and just saying "here's a translation that's 'out there' and worth recognising". I had to be careful to ensure the search results really showed the word in common use (not just illegal copies of the source I read), and I also had to beware of having documented a rare different use-case of an otherwise common word.
I managed to subset and edit my database, and can now present 53% of the 'specialist' words I collected between 2009 and 2018 as confirmed "public domain" words you can do what you want with (i.e. please do add them to products to help learners---and email me if you'd like me to mention here that you've added it to your product). The other 47% (and my reading notes) have not been added to CedPane, but I hope it's already useful.
CedPane is a table that you should be able to copy into the spreadsheet software of your choice (use Select All, Copy, Paste). The columns are:
Of course it goes without saying that, despite my best efforts, mistakes are possible anywhere (as is true of every dictionary) and I'm happy to receive corrections.
There is an SVN repository thanks to Cameron Wong: svn co http://svn.code.sf.net/p/e-guidedog/code/ssb22/CedPane
and there is a Git repository on GitHub: git clone https://github.com/ssb22
(I also have a separate collection of Chinese words that are in typical dictionaries, with short English definitions that have either been confirmed by multiple independent sources to the extent that it is reasonable to believe they are public domain, or that I've written myself. This separate collection is not likely to help with software that already has a good normal dictionary, but it might be useful for developers to prototype interlinear annotators etc. It is in the Git repository as
PD-English-Definitions.txt but has not been included in the main CedPane files.)
If you're using Pleco, you can add the first four columns via these Pleco user dictionaries: CPN-CE.pqb and CPN-EC.pqb. You might wish to update these periodically (sorry there's not yet a notification system for third-party Pleco dictionary updates).