Rockstone Data Ltd : MVP Software Development

System Partitioning (and Predicting Tides)

Refine the product idea
Review the market
Elaborate the use cases
Elaborate the system
Define monetization
Choose the tooling and methodology
Deploy and maintain

3rd November 2023

Maximising software value and managing those bicycle shed walls

20th Oct 2022

Server Monitoring with Grafana and Prometheus

4th April 2022

Time Series Database (TSDB) Technologies

28th Jan 2022

DevOps, Data Engineering and MLOps

5th Oct 2021

Digital Twin and Internet of Things Data

2nd July 2021

CONTACT

Software innovation blog

System Partitioning (and Predicting Tides)

Today we're looking at how to kick off projects with very high level goals by referring back to the AnyTide project.

Back in 2013, the National Oceanography Centre approached Winchester Innovation (an earlier incarnation of this company) to generate impact using their bathymetric model and tidal prediction technologies.

The steps taken were :

This is neatly summarized in a presentation at the British Geological Survey Smartphone Conference, with the resulting app going live in Android and iOS app stores in late 2013 and remaining for many years. See the Archive.org link for the iOS version here.

One key aspect to this process is system partitioning, shown in slide 13. Maximally satisfying a number of competing goals such as user experience, security, data protection and performance requires a great deal of experience of desiging and building complex systems.

The net result was a very popular and unique app with many tens of thousands of users. It was later upgraded to become a global tidal prediction app before being retired in 2021.

The app generated much user value and publicity over the years.

If you are a university or company with under-utilized intellectual property, talk to us about turning it into impactful products and publicity.

Maximising software value and managing those bicycle shed walls

On a previous project in a large organisation, we had a phrase 'the bicycle shed walls', for referring to management procrastination through focussing on the trivial.

Today, we manage software using an agile process, with a task 'backlog' (aka todo list) and weekly client/user meetings to prioritise the next weekly 'sprint' of activity.

It's vital that these weekly meetings avoid procrastination.

We have to frame the discussion around outcomes and benefits to the user's 'world'. And how will this software change their 'world' for the better ? Use this lens for discussions around features and priorities.

Do this and you'll find your new software tools provide more business benefits for the money and deliver them more quickly.

And the bike shed walls may grow a little tatty.

Server Monitoring with Grafana and Prometheus

We use the industry leading Grafana dashboard app with some of its huge library of open source dashboards. The Prometheus tool is used to scrape metrics from the micro services and feed the data into Grafana. Prometheus also has functionality to create alerts by email / SMS to on-call DevOps staff.

In general, each microservice comes with a ready made metrics generator. For postgres it is pgexporter, Cadvisor for monitoring Docker and ClickhouseDB has its own metrics generator. We use node_exporter to generate metrics for the host PC itself.

We can see whether we're over or under provisioned with CPU, disk and memory, and quickly flag up any Dockerized micro-services that need attention.

It's a game changer.

If any of this is of interest to you and your cloud or SaaS or IoT or Digital Twin RCM system, then please contact us.

Time Series Database (TSDB) Technologies

We're developing a remote condition monitoring system that has been saving time series data into a PostgreSQL database hosted on a DigitalOcean Droplet and Volume. It is working well as a proof of concept, which is the intention.

And columnar databases are particularly suited to the task. There are a number to choose from . Further due diligence has led us to take ClickHouse for a spin.

There are other TSDBs to choose from and we may well test some more. An independent industry standard set of database performance benchmarks would help.

The next stage is to expand the table towards the billion row mark for stress testing.

Do contact us if your projects are using any of the technologies mentioned in these blogs posts, and we'd be happy to help.

DevOps, Data Engineering and MLOps

In the beginning there was software (well actually discreet logic, but we have to start somewhere). Software became more complex, so needed management tools such as Subversion, then Git.

And so on and so on until we arrived at DevOps which is a set of tools and processes used by an organisation to ensure deployed software is fit for purpose.

More software led to more applications and hence more data. New terms started to spring up like 'Data Lake' and 'Big Data'. This in turn has spawned a new range of '***Ops', namely DataOps or Data Engineering and MLOps for machine learning operations.

And to prove there really is nothing new, take a look at the 1999 Trimedia Software Streaming Architecture (TSSA) - which certain members of Rockstone Data worked on/with 'back in the day' so to say.

So what do all these 'Ops provide us with ?

The answer is control. Control of your development, R&D, deployment and testing. Without these 'Ops your machine learning data pipeline cloud deployment will at some point become unmanageable.

Digital Twin and Internet of Things Data

It's now standard practice to connect any new piece of machinery or plant to the cloud to record and use the data. Often referred to as creating a 'Digital Twin' or 'The Internet of Things'.

This is partly driven by the almost negligable cost and power requirements of sophisticated embedded systems and by the rise of cloud services such as Amazon AWS and Microsoft Azure with associated state of the art AI and ML tools.

So we are currently installing remote condition monitoring collectors (RCMC) to hundreds of remote devices. Each RCMC is collecting data from a number of sensors such as: Logic signalling events Analogue values Data recorders Electronic control unit logs Environmental sensors GPS coordinates

If you are interested in or require help with projects involving any of these topics, please contact us.

CONTACT

We're developing a remote condition monitoring system that has been saving time series data into a PostgreSQL database hosted on a DigitalOcean Droplet and Volume.
It is working well as a proof of concept, which is the intention.

So we are currently installing remote condition monitoring collectors (RCMC) to hundreds of remote devices. Each RCMC is collecting data from a number of sensors such as:

Logic signalling events

Analogue values

Data recorders

Electronic control unit logs

Environmental sensors

GPS coordinates