Code Recollection

Firefly III

2023-12-03T00:00:00+00:00

For over a decade, I’ve been tracking our monthly spending habits with Mint. This was a free service from Intuit aggregating data across their user’s purchases. Despite the resistance from my financial institutions to provide reporting access for my account data, Mint worked well enough that I could track spending and expenses. However, Intuit is terminating the Mint service, offering something called “Credit Karma” instead, only for those in the US.

So, I went looking for alternatives to Mint. After trying a number of paid services, I found them all to be wanting. I’m not interested in paying a monthly fee for “account disconnected from sync” errors every other day. If these services can’t sync reliably, I figured I’d try a self-hosted system, relying on an old-fashioned monthly import of a CSV-format export from my finacial instuitions.

Firefly III

Based on a Reddit discussion, I came across an open-source solution named Firefly III. There is Docker template for this application on the Unraid Community Apps, so I figured I’d give it a shot. My goal was to import and categorize transactions from my chequing, savings, and credit cards into Firefly accounts.

PostgreSQL

First, off Firefly requires a separate database. Not sure why an embedded database like SQLite couldn’t have been used by the author to simplfiy setup for new users, but this may be a planned technical hurdle to weed out users who would be better paying for a managed web service. Again the Community Apps section of my Unraid server made setting up a fresh Postgres database simple. The Firefly documentation doesn’t cover this database setup, but I’ve been using psql for many, many years now.

Data Importer

For reasons of “security”, Firefly has a completely separate Docker container that runs alongside the main Firefly container. To perform an import of CSV account data from my bank, I have carefully create OAuth credentials, carefully de-selecting a default option that will break the import process, which I learned digging through this app’s discussion forums.

Then I have to browse to another Console page entirely from the main Firefly WebUI. Then I enter the URI of the main WebUI and go through the import process.

This screen was a bit finicky, so be careful about the choices you make here, depending on the exact format of your CSV. Picklist menus are not sorted alphabetically despite containing tens of menu items.

There seems to be no way to save all these import choices so I’ll need to remember these steps each month I guess. Hopefully, I don’t make any mistakes.

The initial import worked alright, and even subsequent imports a few weeks later were fine. Overlapping transactions in the CSV files didn’t result in duplicate entries in Firefly. So the base functiionality is there, but the main annoyance is the need to repeat a whole set of configurations on every single import. It would be very user-friendly, if an account’s import settings were remembered. For example, why do I need to enter the Firefly URI, an unchanging clientid, an unchanging date format, etc every single time. I can see this being a pain I will want to do at most once a month.

WebUI

After creating the accounts with a starting balance, importing transactions, then deleting and retrying a few times… I was able to get our chequing, saving, and credit cards (Visa & Mastercard) into Firefly.

Conclusion

While I have a monthly manual import working now, I have zero categorization of the incoming transactions yet. So, I’m not able to answer the only questions I am interested in:

How much did I spend last month total?
How much did I spend, by category, last month?

Still lots of work to go, but as manual replacement for Mint, I think Firefly-III is promising.

Coming Wave

2023-11-15T00:00:00+00:00

So this hold finally came in to my local library and I had a chance to read it. I don’t really buy books anymore when I can get them for free months later. I wouldn’t call this a blockbuster anyway. Overall, the author argues that containment of Artifical General Intelligence (AGI) is both nearly impossible and absolutely essential. By the end of the book, I’d been convinced of neither.

While the ability to control and even shutdown an intelligence greater than our own seems important, I am pessimistic that humanity will get its act together as there is too much financial incentive for nations and corporations to build AGI first. Unlike nuclear weapons, the hardware for AGI will get cheaper each year. Just look at what hobbyists are doing with ChatGPT alternatives.

Overall, I’m in the camp that nearterm problems from disinformation controlling the populace is a bigger problem today than AGI in the future. Finally, I’m more worried about climate chaos leading to serious war and refugee problems over some billionaires with a lot of GPUs.

Chia Layoffs

2023-10-02T00:00:00+00:00

In the bad news column for the Chia cryptocurrency, the company (CNI) controlling the reference platform has annoucned quite large layoffs and a delay in their efforts to take company public.

While I can’t predict the future, its never a good sign when a software company lays off that many of their developers. I can’t see how this doesn’t impact the maintenance, let alone the future development, of their software.

Anecdotally, as developer of a tool that builds upon their platform, I’m already encountering serious defects that remain open for months and months (indefinitely?). Not a good sign at all.

Lidarr Importing

2023-09-23T00:00:00+00:00

So, I finally got sick of Youtube Music constantly playing me the same 10 songs when say “OK Google, play some music” to my house speakers. I still remember the good old days of the Google Play Music service when that prompt would result in a nice mix of old favorites and also music I’d not encountered before. Tragically, Google knifed GPM a few years ago, despite Youtube Music being a far inferior service.

Instead, I’ve dusted off my decades old collection of MP3s, ripped from my long-gone CDs. After pointing Lidarr at my collection, I was able to start adding additional artists. I was intrigued by the list of “1001 Albums to Listen to Before You Die” by Robert Dimery.

After initially trying to use a Trackt list to import into Lidarr, I had just a mess of incorrect matches. So, restarting after a cleanup, I wrote my own Python script to add the correct albums into Lidarr. This took a bit of time as fuzzy matching was required for differing namings between the book editor and the MusicBrainz database that Lidarr uses.

Eventually, I was able to get a lot of neat new music (to me anyway) loaded in. Now I listen around the house with Plex and find myself discovering interesting new artists from past decades.

Gigahorse Fees

2023-06-22T00:00:00+00:00

So back at the start of the year, I added support for the Gigahorse closed-source GPU farmer of the Chia blockchain. While GPU-supported plotting and farming was an interesting enhancement to the Chia ecosystem, I was less than enthused by the Gigahorse offering. Therefore, I generated a grand total of 2 Gigahorse-proprietary plots, simply for testing the new support in Machinaris.

My biggest concerns with Gigahorse included:

The closed source binaries approach. While understandable due to the developer fee, I always prefer open source software where I can get it.
The single Gigahorse developer. Despite Madmax coding circles around the entire “Chia Network Inc” (CNI) development group, it was clear that he is a single point of failure for Gigahorse.

Dev Fees

After apparently negotiating with CNI for months, eventually the Gigahorse developer chose to implement a set of dev fees to get paid for his work. Fair enough, as his work was at least 6 months in ahead of what CNI was able to do themselves. As shown in the screenshot of his Github README, the dev fee should be capped at just over 3%.

However, a keyword above is “on average” as the fee is collected by grabbing entire block rewards occassionally, averging out to the percentages above. One user on Reddit reported hitting a stretch where they encountered a much higher fee rate, closer to 17%.

Basically, the fee is being collected probabilistically, which no doubt does average out to the ~3% quoted by the developer, across all users over time. However, one can understand the frustration of a particular user experiencing higher than average fee collections.

Conclusion

So, clearly one needs to be mindful of the average keyword in the statement about fees. As someone who dabbles in crypto tech, not a business-oriented farmer, I was not interested in buying GPUs solely for this project. I am quite glad to have kept my setup simpler, despite forgoing the possibility of earning a few extra dollars from my hobby farm.

Campsite Booking

2023-05-10T00:00:00+00:00

Obstensibly, camping is an outdoor pursuit, not deserving of discussion on a technology blog such as this. While possibly true in decades past, today booking a campsite is now an escalating arms race between new online booking systems run by National/Provincial/State parks and automated alert systems such as Campnab and Schnerp.

These days, most campsites are reservable months in advance and are often booked solid minutes afterward. Canada parks such Banff and Jasper often have a single time slot to get a reservation, requiring an early morning in the middle of the winter. Provincial campsites are often on rolling windows 60, 90, or 120 days in advance. Naturally weekends, particularly long weekends, book up fastest.

Beyond campsites, the national parks are becoming so overwhelmed they are rationing access to shuttles and even roads. For example, it has long been the case that spots on the bus up to beautiful Lake O’Hara in Yoho National Park have been hard to come by. In the past few years, Glacier National Park even started rationing slots to drive the famous Going to the Sun Road.

Booking Effectively

So given the limited spots and the challenge of booking for a family many months in advance, I’ve discovered that there is a sweet spot about 3-6 weeks in advance of your desired vacation dates. This window is caused by all those folks who woke up early in January realizing their plans don’t line up and having to cancel their prized bookings. These cancellations then appear on the Park’s booking site. You want to snap these up as soon as they become available.

Canadian Bookings

For the last few years, I have used Schnerp to snag spots in both National parks like Waterton, Banff, and Jasper. As well, I’ve used this site for provincial campgrounds in British Columbia and Alberta. For example, this allowed us to camp twice on the shoreline of Two Jack Lake in Banff

American Bookings

For an upcoming trip I planned to Glacier National Park, I relied heavily on Campnab to book cheap stays at campgrounds inside the park. For about $30/night, I was able to avoid $90/night private campgrounds outside the park, and even better avoid $300/night hotels outside the park. Booking a campground inside the park also covers the vehicle registrations for access to relevant roads.

Using Campnab about 2 months in advance we have a week’s worth of camping booked for this summer, despite most people saying one needs to book a full year in advance. As well, we are using an itinerary from a retired Glacier Park Ranger.

Conclusion

Given the overall growing demand for all things these days (inflation anyone), it’s become clear that one needs to employ technilogical assistance in all vactation planning and especially bookings.

ChatGPT Detection

2023-04-27T00:00:00+00:00

Now that ChatGPT is widely available, there have been concerns raised from educators that students are using this generative AI to do their assignments for them. This has been deemed “cheating” and instructors have been searching for tools to identify AI-generated content.

Is it actually cheating?

So first off, let’s consider if using ChatGPT is truly cheating. To do this, I think we need to distinguish the different ways students can use this generative AI to create content for their assignments. Similar to how students have used the Google search engine for years, there is a slippery slope between research on one end and outright plagarism on the other end.

Plagarism: “Presenting work or ideas from another source as your own, with or without consent of the original author, by incorporating it into your work without full acknowledgement.” - definition from University of Oxford

Oxford recently updated the definiton to include: “All published and unpublished material, whether in manuscript, printed or electronic form, is covered under this definition, as is the use of material generated wholly or in part through use of artificial intelligence (save when use of AI for assessment has received prior authorisation e.g. as a reasonable adjustment for a student’s disability).”

Do take the time to read Oxford’s “Forms of Plagarism” section. It’s clear that “Verbatim” copying of text without attribution is cheating. However, drawing the line on “Paraphrasing” becomes more difficult. What is ChatGPT doing execept summarizing a massive corpus of online human knowledge when it generates content? How do we cite ChatGPT’s primary sources?

In the end, I think using ChatGPT to research a topic, as if asking a knowledgable colleague for directions to investigate further, is not plagarism. On the other hand, prompting for an essay to simply copy verbatim is wrong.

Alright, how to identify cheaters?

One of the best known “identifiers” of AI-generated content is TurnitIn. Educators can paste text into the TurnitIn system and a percentage confidence will be output, indicating likelihood that AI generated it. I wasn’t able to try out this service, as one needs to subscribe. However, they have some images on their site if you’re interested.

Another site, ZeroGPT did successfully flag my exact copy/pasted essay:

A key question is how accurate such an identification system in the face of paraphrasing… in particular how often are false positives flagged? Leave alone the question of what sanctions an educational institution will apply in a suspected case. This is clearly a difficult challenge to apply a fair system across all students.

How do cheaters avoid detection?

As this discussion on Reddit showed, it is possible to tweak AI-generated content to minimize detection. In particular, asking for shorter sentences, using the playground mode, and heavily editing and changing the text can decrease the detection percentage.

However, a cheater, by definition is lazy. They don’t want to put in the effort to do the work properly themselves. So one needs to ask whether all the effort a cheater might apply to avoid detection, would not simply be better spent in organizing their own thoughts, properly researching the topic, and attributing all sources.

Conclusion

Given that plagarism exists on a spectrum, it’s clear that identifying cheaters is becoming harder and harder with generative AI tools these days. As editing copied content was used in past years to avoid detection, now one can simply tweak their ChatGPT prompts to minimize detection. Much like in-person math tests without calculators, limiting an exam on writing to an in-person situation can help, but greatly limits the type of writing being validated.

At a higher level though, why do we write? To organize our thoughts and effectively communicate them with others. When looked at from that viewpoint, plagarism is really a means of cheating yourself out of the skills to succeed in the future. One day, you’ll be in a work place situation where an organized thought process will be needed and faking it isn’t a possibility. That’s when you’ll wish you hadn’t cheated yourself out of much needed practice.

Update

Here’s an excellent comment from Reddit for students who wish to avoid being incorrectly fingered for plagarism:

Work on Google Docs so that you have version histories saved and can prove your process, including all edits.
If accused, request to be verbally pop-quizzed on the material.
Show your prof articles from credible sources about the flaws in these AI checkers.
Show your prof FAQ pages of the AI checkers themselves, which state not to use these tools for consequential purposes.

OpenAI Partnering

2023-03-30T00:00:00+00:00

I saw that OpenAI’s ChatGPT will become available on Microsoft’s Azure cloud platform, probably with additional integration to their Bing search engine soon.

Google seems to have missed out on the deal with OpenAI as they are pushing their own generative AI, named Bard.

Anybody remember how Google pushed their own “Google Videos” service way back when, before finally declaring defeat and buying out Youtube? I wonder how this space will mature in the future…

Gigahorse

2023-02-20T00:00:00+00:00

On my Machinaris cryptocurrency platform, I have added support for Gigahorse, which is a new GPU-based plotter and farmer for the Chia blockchain. From the developer that released the Madmax plotter and his own MMX blockchian, this new software is a closed-source alternative to the official Chia software that offers more plots per disk, increasing a farmer’s chances of finding a block on the network.

By adding GPU hardware into the mix, an arms race has been created in the Chia ecosystem that will push small farmers out, much like happened for blockchains like Bitcoin and Ethereum before it.

Chia goes PoW!

Not really proof-of-work of course. However, Chia was originally envisaged as a storage-based blockchain that used free space on hard drives, requiring no additional hardware for farmers to get started. In particular, outside of tuning for your hardware’s electrical efficiency and buying cheap used disks, there was basically an even playing field for small and large farmers.

However, with the introduction of Gigahorse, a hint of the old Bitcoin/Ethereum proof-of-work model requiring GPU acceleration has arrived in the Chia ecosystem. It is undeniable that those farmers with a spare Nvidia GPU (or two) will benefit from Gigahorse’s improved efficiency over the standard Chia model. By running a supported GPU, Gigahorse farmers can expose more Chia plots to the network, using a smaller amount of disk space. This advantage translates into a higher probability of block wins and increased pool payouts, as compared to someone farming un-compressed Chia plots.

As the table above shows, as “compression” of the plot file goes up, more GPU power is required to farm it during each blockchain challenge, however the plot file itself gets smaller and smaller, allowing more plots to be held on a given amount of disk space.

Is this Madmax’s Fault?

Absolutely not. Given the ability to throw additional hardware at a computing problem in pursuit of profits, the incentive meant this addition of GPU plotting/farming was inevitable for the Chia ecosystem, despite the original intent of the designers.

In response, Chia Network Inc. will eventually offer GPU-enabled plotting and farming as well. So, no one is to blame for the situation. However that does mean that a GPU will be the bare minimum hardware required to be competitive in the Chia farming space.

Conclusion

With a dedicated GPU now being the admission required for plotting and farming chia in a competitive landscape, one really needs to question whether there is any room left for small farmers, as has been the case for the first 2 years of Chia’s existence. Given the history of the Bitcoin blockchain that eventually required custom-built ASIC hardware to mine, this really shouldn’t have come as a surprise. Needless to say, I am disappointed in what this means for the future of the Chia blockchain and it’s so-called “green” advantage.

ChatGPT Prompts

2023-01-31T00:00:00+00:00

So with ChatGPT released in public preview at the end of last year, the ability to craft effective prompts for ChatGPT became an impressive skill. For the last few decades, masters of the Google search box have been wowing co-workers, but now acolytes of the prompt are the new hotness.

Prompting AI

As with Google, how you ask ChatGPT a question greatly determines the quality and usefulness of the reponse returned. To that end, here are some of things you can ask ChatGPT. Each type of query can be tuned to get better responses, by refining the prompt.

Answer my Question!

Asking a question with a clear answer is really where Google shined, so ChatGPT needs to nail this one. So far, on questions with a clear answer, ChatGPT seems to be doing well, correctly identifying the musician, artist, and actor Guy Davises are all more talented than the nerdy computer one. :)

Write for Me!

This is perhaps the most impressive feature of ChatGPT, asking it to create new text given a topic and a style. For example, asking it to write a short story in the style of Stephen King on the topic of werewolves.

Teach Me!

Asking for a step-by-step guide to completing a task is another common type of prompt. Examples include asking for a recipe to bake a cake:

Suggestions Please!

Finally, you can ask ChatGPT for a set of suggestions to a problem you are facing. Examples include asking for 3 ways to request a raise at work.

Limitations

Incorrect Responses

A less charitable read on this is that ChatGPT can sometimes make up compelling bullshit. The response may sound right, but on closer inspection a domain expert can see the response is simply not true. This is sometimes known as AI hallucination.

Biased Answsers

Given human-created inputs any AI system will potentially be affected by bias, simply reflecting the bias cataloged in the world. Not unique to ChatGPT, but rather any AI system trained on data not screened for bias in advance will show this problem.

Out of Date

While impressive, the ChatGPT model (free version) does have some limitations such as currency. It was trained on a corpus from a year ago so current events are beyond its purview:

Conclusion

Overall, ChatGPT is really an impressive technology and will definitely give Google a run for their money. I’ve been noticing that the top results in my recent Google searches have been declining in quality for years now, often clogged with blogspam such as above. :) With Chat interfaces improving and providing adequate summaries and responses, will users bother clicking through to original sources anymore?

Code Recollection

Firefly III

Firefly III

PostgreSQL

Data Importer

WebUI

Conclusion

Coming Wave

Chia Layoffs

More in this series…

Lidarr Importing

Gigahorse Fees

Dev Fees

Conclusion

More in this series…

Campsite Booking

Booking Effectively

Canadian Bookings

American Bookings

Conclusion

ChatGPT Detection

Is it actually cheating?

Alright, how to identify cheaters?

How do cheaters avoid detection?

Conclusion

Update

More in this series…

OpenAI Partnering

More in this series…

Gigahorse

Chia goes PoW!

Is this Madmax’s Fault?

Conclusion

More in this series…

ChatGPT Prompts

Prompting AI

Answer my Question!

Write for Me!

Teach Me!

Suggestions Please!

Limitations

Incorrect Responses

Biased Answsers

Out of Date

Conclusion

More in this series…