Counting Sand

Bonus: Season 1 Recap

Episode Summary

Both research and practice create an optimum balance in delivering the best solutions. Angelo reflects on Season 1 with topics of how Computer Science changes and impacts our lives.

Episode Notes

Angelo begins this episode with reflections on history and what brought us to the AI Winter.  Why do we need a balance between research and practice?  You don’t want to rediscover what has already been discovered or settle for something that could be better if you took the time to research a bit more.  

In episode 4 we meet Angelo’s friend Andy Lee who talks about computer science predicting our biological age.  Andy actually met Greg Fahy who talked about longevity.  The study focused on injecting the thymus gland with a growth hormone that produced regeneration effects.  The effects were measured through the epigenetic clock known as DNA methylation.

In Episode 6, Jim Shalaby talks with Angelo about how COVID-19 changed healthcare forever.  Patients don’t have to wait in waiting rooms, they don’t have to find transportation to get there, and the patient has access to the clinicians.  

The hard problems associated with explainability in artificial neural networks, we talked about in Episode 8.  Angelo’s friend Nikos explained to us about five classic problems, one of which includes data privacy.  Another big issue is developing a machine learning system to create adversarial attacks on the existing system.

In episode 7, Angelo’s friend Manos shared how complicated it is for people to invoke their right to have their data removed from a system.  Typically those systems have to schedule deletions to remove the data through tombstones and a process called compacting.

What is on the horizon and what should we be paying attention to?  We are going to run against barriers of technology. For instance, Moore's law is coming to an end. What do we do about that?  What is happening in the short-term and how do we get past this barrier to the next?  And then how do we blow away all those barriers with moonshots like quantum computing?

Finally, wrapping up our first season, Angelo wants to reflect on gratitude.  Gratitude for you our listeners.  Thank you so much for joining us on this journey. We really want to hear about your thoughts. The show is evolving just as the world is and we want to make sure that we're covering topics that you're interested in.

We would love for you to follow, rate, and review the show on your favorite podcast platform so that others can find us too. Thank you so much for listening.

 

Our Guests - Thank you!:

Nikos Myrtakis on  LinkedIn

Manos Athanassoulis on LinkedIn and Boston University
Jim Shalaby on Twitter and LinkedIn

Andy Lee on Twitter and LinkedIn

 

About the Host

Angelo Kastroulis is an award-winning technologist, inventor, entrepreneur, speaker, data scientist, and author best known for his high-performance computing and Health IT experience. He is the principal consultant, lead architect, and owner of Carrera Group, a consulting firm specializing in software modernization, event streaming (Kafka), big data, analytics (Spark, elastic Search, and Graph), and high-performance software development on many technical stacks (Java, .net, Scala, C++, and Rust). A Data Scientist at heart, trained at the Harvard Data Systems Lab, Angelo enjoys a research-driven approach to creating powerful, massively scalable applications and innovating new methods for superior performance. He loves to educate, discover, then see the knowledge through to practical implementation.

Host: Angelo Kastroulis

Executive Producer: Kerri Patterson

Producer: Albert Perrotta;

Communications Strategist: Albert Perrotta;

Audio Engineer: Ryan Thompson

Music: All Things Grow by Oliver Worth

Episode Transcription

Angelo: This season we asked the question, can computer science make our  lives better? Join us as we recap season one. I'm your host Angelo Kastroulis  and this is Counting Sand. 

First, we need to go back to a statement I made in episode one. And that was  that I said, you have to have a balance between research and application. Both  are important. Why is that balance so important? Well, you will not get the best  solution unless you apply both. Why can I say that? Well, back in the second  and third century B.C., the Library of Alexandria was established. 

Now we don't know exactly how much information was stored there, but some  estimates go as high as 400,000 scrolls. What made it important was not just  that it was a library with a lot of information in it, which there were other  civilizations that had libraries, maybe not at that time, but new works in  mathematics, astronomy physics, natural science were housed there. 

So when I say new works, I mean, there was new, original research being done.  It wasn't just a library, it was a place of study. It had empirical standards that  were applied and it was one of the first and certainly the most strongest homes  for serious textual criticism. In fact, today, when we talked about in that same  episode, how research papers come about and the fact that they're peer reviewed  and criticized, that's key, because you're able to kind of go through some of the  things in the paper that make it strong and make it reproducible. 

The reason we do that is so that we can build on knowledge and before we were  able to have something like this, a place where people could go study and take  the entire breadth of knowledge, and then build on top of it and then keep doing  that generation after generation. That's how you're able to springboard  knowledge. 

That value is that scholars could gather there and they could hyper-focus their  knowledge so that a succeeding researcher could then build on top of it. And  according to legend and our namesake, the Syracuse and inventor Archimedes  invented the Archimedes screw, a pump for moving water while studying at the  Library of Alexandria. 

But what made it, again, interesting was that this was a parallelizable process.  Now, you didn't have to start over again or even have one person moving  knowledge. You could have people from all over the world come here, bring 

their knowledge and in parallel, in different directions, move human knowledge  forward. 

Now, I want to also mention something and that is the value of peer review. If  you remember in season five, we talked about the AI winter, and that was what  happens when we don't do that. If we just believe everything we read, like for  example, the criticism of the perceptron and we just kind of took that at face  value and effectively halted all research in neural networks in that area.  

It caused decades long, an AI winter, which prevented us from moving the ball  forward in that. So that is certainly a lesson learned. Don't believe everything  you read. Be critical of it and take a scholarly approach to things. Now, the one  thing I want to say about cutting out research.  

Obviously, if you cut out research and you go straight to application, you're  either going to spend too much time and money getting to some mediocre  solution or you're just going to rediscover something that's already been  discovered. Tread ground that's already been tread. If you cut out application  and you only do research, you're forever going to be in the pursuit of perfection. 

So what happens in both of these cases? You waste time and you waste money.  What else can we learn from computer science? I'm going to talk about three  perspectives. First, I think there's the personal perspective. For example, did  computer science make my life better or did computer science make another  person who decided to become a computer scientist’s life better? I think so and I  think they're grateful for finding a job they can do. 

For instance, who would have ever thought that you can make the living just  pushing buttons on a computer, that people will pay you to do that. But I think  what I mean when I ask, can computer science make our lives better, we're  talking about something different. Maybe something collective. That's another  perspective. 

Can we make the world a better place in ways that are hard to measure? For  example, one of my partners was telling me about a project he was working on  that detected birds around windmills. And if a bird that's a particular endangered  species comes near a windmill, it could slow down or stop to make sure that the  bird doesn't get harmed. Does that make the world a better place? Certainly.  

And we can apply that to many other things—energy, general health. Which  takes us, I think, to the third category and that is, our individual perspective. 

Can we improve another person's health? Can we ease their suffering? Can it  make us happier? We spent a lot of season one in this area here. 

In episode four, I had a chance to speak with an old friend, Andy Lee and Andy  and I talked a bit about a way of getting actionable information to change our  lives. A way of having computer science predict our biological age.  

This is what Andy had to say about that paper and the research that they're  doing in that regard. 

Andy: Greg Fahy is the first author and I actually had the opportunity to see him  speak in a longevity webcast, just this week and got to ask him a couple of  questions, which was pretty exciting. He started a study using growth hormone  as a treatment to regenerate the thymus. And from that, reverse the biological  age as measured through this epigenetic clock. 

And then another approach looking at plasma and some markers in the blood  other than the DNA methylation that can do the same thing. So through this,  they were able to show reversal of age. So people would come in, you calculate  their baseline by looking at their biological age minus their chronological age to  figure out are they above or below coming in and then track them over a year of  treatment and see are they further above or below their chronological age at the  end? 

And they were on average able to shift people. Reduce their biological age by  about two and a half years over a one year treatment plan. So really turning  back the clock and making people younger throughout this pretty exciting  concept that you may be able to take a treatment and become younger. 

Angelo: So computer science definitely was able to make a change in some  people and even reduce their biological age—effectively turned back the clock.  So in that case, computer science did change their lives, but notice what it can't  do. It didn't make the decision for them. It wasn't a magical pill automatically. 

It still requires us to do something. I think that that's the key that we should  expect out of computer science.  

In episode six, I had an opportunity to speak with a friend of mine, Jim Shalaby.  And we talked about how COVID-19 changed healthcare forever. Telemedicine  had been there had been around for decades, but it really hadn't taken hold for  whatever reason, mainly, because of privacy, computer science, and  interpersonal issues of trust.

But when it was forced upon us by COVID-19, things changed. Notice what  Jim said. 

Jim: Simple things like that really lessen that anxiety on the clinical side as to  whether they're getting reliable data or whether they're going to be legally  responsible for monitoring that data on a regular basis. It wasn't as bad as they  thought. 

It doesn't substitute for face-to-face, but boy that's convenient. That level of  accessibility of the patient having access to their providers, certainly they don't  have to wait in a waiting room, they don't have to find transportation to get  there. They have almost immediate access to their clinician without going  through the waiting room issue. The clinicians also liked it because if they had a  quick question, if they needed a quick followup, they knew they could actually  engage with patients without having to play phone tag.  

They can schedule something and do it. So, I think it's now become a part of  healthcare. COVID has really changed the way patients and the clinicians. And  the reason I say clinicians is that I don't just include physicians. I include the  whole allied staff that supports a physician, the nurse practitioner, the  physician’s assistant, the pharmacist, the dietician, even the case manager and  the social worker are using the same modalities to communicate with the patient  and they've all gotten into a rhythm. It's not perfect, but they've incorporated it  into their daily workflows, which is really promising. It opens up new avenues  for improving health and really the primary thing is it allows a new way to  assess outcomes of interventions. 

Angelo: Again, computer science wasn't a magical cure for anything, but it was  a tremendous enabler for us to be able to do something different and be able to  still have really high quality care. 

Like artificial intelligence and artificial neural networks. We find that they  become so complicated that they become difficult to use, or they have side  effects. For example, in episode eight, we talked about the explainability of  machine learning and how hard of a problem it is. 

I was joined by Nikos Myrtakis, a PhD student at the University of Crete. We  talked a bit about artificial neural networks and how hard it is to explain  complex models like those and SVMs and others. Notice what he said. 

Nikos: So it is very easy to understand why the system came up with a specific  decision. On the other hand now, we have more complex models, nonlinear, 

highly parameterized models, such as SVMs, such as neural networks, random  forest, etc. These models are not interpretable by nature. So what we do is to  explain them in a post-hoc fashion. 

Angelo: So while expert systems tend to have a little bit of explainability built  into them, these other systems don't, but that was a fascinating solution using a  kind of neural network that is more explainable to try to reverse engineer and  figure out why a different machine learning model made the decision it did. But  it's not without its problems. 

In that episode, we talked about five problems that are classically part of  explainability and AI, but two of those problems I'll talk about right now. One is  privacy of data. Nikos explains. 

Nikos: Because to explain anything, we need access to the original data. So  imagine that you have a database and you're provided with a model that you get  from a different company. So we give the company the data in an encrypted  manner. 

Another very subtle issue is the adversarial attacks. So imagine that  interpretability might enable people, or programs, to manipulate the system. So  if one knows that by for instance, having three credit cards can increase his  chance of getting a loan. 

Angelo: And that's another problem I hadn't thought about. What happens when  someone tries to reverse engineer the system to kind of scam it. Whether it's the  individual whose credit is being pulled or some other third-party system trying  to learn how the algorithms work so they can manipulate it. 

In addition to those privacy concerns there are many more, not related  necessarily to AI, but to our data in general. For example, in episode seven, I  had a chance to speak with a friend of mine, Manos Athanassoulis, who's an  assistant professor at Boston university where he's researching trying to solve  problems like, how do you make sure your data is actually deleted from a  database? He explains it this way. 

Manos: We're trying to solve a very specific problem, which is, when you task  your data with a data provider in the cloud, you have rights based on recent  legislation. You have the right to ask them to delete your data. And if you ask  them to delete your data, then they have to comply within a specific amount of  time. They cannot just keep your data forever because it's not theirs. It's your  data, right? When you generated something, it's your data.

For anybody who's familiar with LSM trees, which is not necessarily, you know,  thousands of people in the world, but there's a few. Essentially every we delete  we insert the tombstone, but eventually we'll meet the corresponding file that  contains the invalidated data. And then we'll discard this data. And this merging  is, which is called compaction, happens over time depending on how much data  you have and how much data you're actually getting into your system.  

So what we do is that instead of actually propagating these compactions with  random order or with an order that depends on, I think which files are more  frequently accessed. 

I think the first one we call it lithy, by the way, the deleted related paper, we call  it lithy, which comes from the Ancient Greek word which means forgetfulness.  And the goal is to forget things. 

Angelo: In all of these examples, we saw either practical application or theory.  Many times he saw them together. Speaking to professors like Manos or  research Nikos is doing, or even Andy, taking those papers and applying them  with real code in real situations, we see that computer science can make a huge  impact in our lives. 

And in fact, those are not the only ones researching, either creating research or  applying research. New research is being created in all different directions on  the same issue. For example on privacy, we don't just worry about deletes.  There's research being done in all aspects of privacy and all aspects of machine  learning. 

And imagine if we could take all of that together and start putting it into  cohesive systems. Taking this piece of what Manos is working on and putting  that in our installation of rocksDB and then maybe adding something else from  over here and putting that into our implementation and taking this knowledge  and creating something new. 

We also, I think, had a lot of cautions. In addition to the AI winter, we talked  about a little while ago. There were so many others. In episode three, we talked  about the problems of micro-optimizing. Instead of looking at research and  doing the hard work, kind of pulling that into implement it we sometimes get  afraid of it and we get lost in finding a library or trying to micro-optimize  something to get a mediocre solution. 

Or, kind of being stuck in an old way of thinking. In that episode, we talked  about designing for performance instead of tuning for it. Tuning for it is the way 

we used to think decades ago. Now you can't build a solution then tune it later.  Now, that's not to say you can't build a solution and then optimize it, but it's to  say that do not expect to get big gains. 

You have to start with a technology and an architecture that lends itself to the  problem at hand. You can't just use some general purpose technology and then  expect to tune it for big gains. 

In episode two, we talked a little bit about the dangers of artificial intelligence.  Here’s a quote that I really like it. I'm going to say it again that, “all machine  learning models are bad. Some are less bad than others.” And that's because we  build bias into these models. And that, we talked about a bit in episode two, we  also talked about the bad side of technologies like social media that are  supposed to bring people together. 

But in reality, they don't, they don't make us happier. They make us a bit  antisocial. So I think from that, we learned that you cannot foresee what's going  to happen. Sometimes there are unforeseen side effects to technology even  though we might be well meaning. 

I'd like to add one more lesson to this, and that is, think about gratitude. The  people that have gotten you where you are. You may have met people like this  when you went to grad school or maybe you've met them in your day-to-day job  and you may have met individuals, maybe you're building a company. 

And the people who have worked with you and helped you get where you are,  the mentors who have taught you through the years, your employees, take a  moment to make sure that you have gratitude for those people around you. That  will really go a long way. And in fact, I think that that's the right way to lead. 

In episode one, I think I said that Chios Island, which is where my family comes  from, has a long history of entrepreneurship. And I mentioned that there's a  story there for another day. So I think it's time to make good on that here. Here's  the basic story and it's going to take you back over 150 years to March 1822. 

In retaliation to a Greek revolt against Turkish occupation, the Sultan sent Kara  Ali Pasha, an Admiral with 40,000 Turkish troops to the Island of Chios. They  set at a fire and they killed all infants under age three, all males, 12 and older  and all females, 40 and older, then they took 52,000 slaves and they murdered  another 52,000. The population of the island was only 120,000. Only 2000 were  left on the island. 21,000 managed to escape and there were probably about a  thousand that were out of the island because Chios was a seafaring center. So 

they had lots and lots of ships and they were in places like Marseilles, London,  and Liverpool. 

And so they happen to be in these areas. So some of them ended up settling in  those areas and when they did it, they became members of that community and  entrepreneurs there. They restarted and rekindled. For instance, in London, F  Sotros Raleigh laid the foundation stone in St. Sophia's cathedral and he and his  brothers formed the finance committee. Another famous Yoti moved to  Marseilles with his parents and his son then formed the Baltic exchange in  London.  

And the story can be repeated many, many times until 1832 when they moved  back to Chios and kind of started rebuilding for about a hundred years until  1930s. And when the Germans came in and occupied Chios and they went  through their massacre and so then they had to, again, rebuild. 

You have individuals who are entrepreneurs and they continue that legacy and  rebuild. The reason I say this is because I think the story of Chios is one of  rebirth. Adversity, sure, but then landing on your feet and starting over. I think  that that has affected me in several ways. For example, I distrust five-year plans.  Look at COVID. What happened to everybody's five-year plan? Instead, I think  it's really about opportunity, about creating doors of opportunity, and then later  deciding which ones to go through. 

I'm not saying we never plan, but what can you foresee five years in advance?  It's actually really hard to do that.  

Season two is going to be about opportunity. What is on the horizon and what  should we be paying attention to. We're going to be talking about, how do we  improve existing techniques? For example, we talked about expert systems and  rules engines. We've done a ton of those. How can we improve that? How can  we continue to protect privacy while threats are actually evolving at the same  time. How can we make better and faster decisions in a world where threats to  healthcare and other things appear out of nowhere like COVID-19 did. 

And now we're having to kind of catch up, how do we get ahead of that? And  then we're going to run against barriers of technology. For instance, Moore's  Law is coming to an end. What do we do about that? So there's two sides to  that. There's a, what's happening in the short-term. How do we get past this  barrier to the next?

And then how do we blow away all those barriers with moonshots like quantum  computing? So going back to my theme of gratitude, I wanted to make sure I  express gratitude to all my listeners. Thank you so much for joining us on this  journey. We really want to hear about your thoughts. The show is evolving just  as the world is and we want to make sure that we're covering topics that you're  interested in. 

And we also want to know what is it that you want to deep dive on? I'm your  host, Angelo Kastroulis and this has been Counting Sand. Please take a minute  to follow, rate, and review the show on your favorite podcast platform so that  others can find us. Thank you so much for listening.