Voices in AI – Episode 86: A Conversation with Amir Husain

[voices_in_ai_byline]

About this Episode

Episode 86 of Voices in AI features Byron speaking with fellow author Amir Husain about the nature of Artificial Intelligence and Amir’s book The Sentient Machine.

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

Transcript Excerpt

Byron Reese: This is Voices in AI brought to you by GigaOm, and I’m Byron Reese. Today my guest is Amir Husain. He is the founder and CEO of SparkCognition Inc., and he’s the author of The Sentient Machine, a fine book about artificial intelligence. In addition to that, he is a member of the AI task force with the Center for a New American Security. He is a member of the board of advisors at UT Austin’s Department of Computer Science. He’s a member of the Council on Foreign Relations. In short, he is a very busy guy, but has found 30 minutes to join us today. Welcome to the show, Amir.

Amir Husain: Thank you very much for having me Byron. It’s my pleasure.

You and I had a cup of coffee a while ago and you gave me a copy of your book and I’ve read it and really enjoyed it. Why don’t we start with the book. Talk about that a little bit and then we’ll talk about SparkCognition Inc. Why did you write The Sentient Machine: The Coming Age of Artificial Intelligence?

Byron, I wrote this book because I thought that there was a lot of writing on artificial intelligence—what it could be. There’s a lot of sci-fi that has visions of artificial intelligence, and there’s a lot of very technical material about where artificial intelligence is as a science and as a practice today. So there’s a lot of that literature out there. But what I also saw was that there was a lot of angst back in 2014 and 2015. I actually had a personal experience in that realm, where outside one of my South by Southwest talks there was an anti-AI protest.

So just watching those protesters and seeing what their concerns were, I felt that a lot of the philosophical, existential questions around the advent of AI come down to this: if AI indeed ends up being like Commander Data—it has sentience, it becomes artificial general intelligence—then it will be able to do our jobs better than we can, and it will be more capable in, let’s say, the ‘art of war’ than we are. Does this mean that we will lose our jobs, that our lives will be lacking in meaning, and that maybe the AI will kill us?

These are the kinds of concerns that people have had around AI, and I wanted to reflect on notions of man’s ability to create—the aspects of that which are embedded in our historical and religious traditions, our conception of man versus his creator, he who can create—and how that influences how we see this age of AI, where man might be empowered to create something which can in turn create, which can in turn think.

There’s a lot of folks also that feel that this is far away, and I am an AI practitioner and I agree I don’t think that artificial general intelligence is around the corner. It’s not going to happen next May, even though I suppose some group could surprise us, but the likely outcome is that we are going to wait a few decades. I think waiting a few decades isn’t a big deal because in the grand scheme of things, in the history of the human race, what is a few decades? So ultimately the questions are still valid and this book was written to address some of those existential questions lurking in elements of philosophy, as well as science, as well as the reality of where AI stands at the moment.

So talk about those philosophical questions just broadly. What are those kinds of questions that will affect what happens with artificial intelligence?

Well I mean one question is a very simple one of self-worth. We tend to define ourselves by our capabilities and the jobs that we do. Many of our last names in many cultures are literally indicative of our profession—Goldsmith as an example, Farmer as an example. And this is not just a European thing. Across the world you see this phenomenon of last names reflecting the profession of a woman or a man. And it is to this extent that we internalize the jobs that we do as essentially being our identity, literally to the point where we take it on as a name.

So now when you de-link a man or a woman’s ability to produce or to engage in that particular labor that is a part of their identity, then what’s left? Are you still the human that you were with that skill? Are you less of a human being? Is humanity in any way linked to your ability to conduct this kind of economic labor? And this is one question that I explored in the book, because I don’t know whether people really contemplate this issue so directly and think about it in philosophical terms, but I do know that subjectively people get depressed when they’re confronted with the idea that they might not be able to do the job that they are comfortable doing or have been comfortable doing for decades. So at some level it’s obviously having an impact.

And the question then is: is our ability to perform a certain class of economic labor in any way intrinsically connected to identity? Is it part of humanity? And I sort of explore this concept and I say, “OK well, let’s take this away, let’s cut this away, let’s take away all of the extra frills, let’s take away all of what is not absolutely, fundamentally, uniquely human.” And that was an interesting exercise for me. The conclusions that I came to—I don’t know whether I should spoil the book by sharing them here—but in a nutshell, and this is no surprise, are that our cognitive function, our higher-order thinking, our creativity, these are the things which make us absolutely unique amongst the known creation. And it is that which makes us unique and different. So this is one question of self-worth in the age of AI, and another one is…

Just to put a pin in that for a moment: in the United States the workforce participation rate is only about 50% to begin with, so only about 50% of people work, because you’ve got adults that are retired, you have people who are unable to work, you have people that are independently wealthy… I mean, we already have about half of adults not working. Does it really rise to the level of a philosophical question when it’s already something we have thousands of years of history with? What are the really meaty things that AI gets at? For instance, do you think a machine can be creative?

Absolutely I think the machine can be creative.

You think people are machines?

I do think people are machines.

So then if that’s the case, how do you explain things like the mind? How do you think about consciousness? We don’t just measure temperature, we can feel warmth, we have a first person experience of the universe. How can a machine experience the world?

Well, you know, look—there’s this age-old discussion about qualia, and there’s this discussion about the subjective experience, and obviously that’s linked to consciousness, because that kind of subjective experience requires you to first know of your own existence and then apply the feeling of that experience to you in your mind. Essentially you are simulating not only the world, but you also have a model of yourself. And ultimately, in my view, consciousness is an emergent phenomenon.

You know the very famous Marvin Minsky hypothesis of The Society of Mind. And in all of its details I don’t know that I agree with every last bit of it, but the basic concept is that there are a large number of processes that are specialized in different things that are running in the mind, the software being the mind, and the hardware being the brain, and that the complex interactions of a lot of these things result in something that looks very different from any one of these processes independently. This in general is a phenomenon that’s called emergence. It exists in nature and it also exists in computers.

One of the first few graphical programs that I wrote as a child, in BASIC, drew straight lines, and yet on a CRT display what I actually saw were curves. I’d never drawn curves, but it turns out that when you light a large number of pixels with a certain gap in the middle, and it’s on a CRT display, there are all sorts of effects and interactions, like the moiré effect and so on and so forth, where what you thought you were drawing was lines, and yet it shows up, if you look at it from an angle, as curves.

So I mean the process of drawing a line is nothing like drawing a curve; there was no active intent or design to produce a curve, the curve just shows up. It’s a very simple example—a kid writing a few lines of BASIC can do this experiment and look at it—but there are obviously more complex examples of emergence as well. And so consciousness to me is an emergent property, an emergent phenomenon. It’s not about the one thing.
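To make that concrete, here is a minimal sketch of the same effect—not the original BASIC program, just an illustration in Python with matplotlib: every primitive drawn is a straight line, yet the family of lines traces out a visible curve (their envelope).

```python
# A minimal sketch of the "straight lines that look like a curve" effect.
# Not the original BASIC program - just an illustration of emergence:
# every primitive drawn below is a straight line, yet their envelope
# appears to the eye as a smooth curve (a parabola, in this construction).
import matplotlib.pyplot as plt

n = 30
for i in range(n + 1):
    t = i / n
    # Connect a point sliding down the y-axis to a point sliding along the x-axis.
    plt.plot([0, t], [1 - t, 0], color="black", linewidth=0.8)

plt.gca().set_aspect("equal")
plt.title("Only straight lines are drawn - the curve is emergent")
plt.show()
```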

I don’t think there is a consciousness gland. I think that there are a large number of processes that interact to produce this consciousness. And what does that require? It requires for example a complex simulation capability which the human brain has, the ability to think about time, to think about objects, model them and to also apply your knowledge of physical forces and other phenomena within your brain to try and figure out where things are going.

So that simulation capability is very important, and then the other capability that’s important is the ability to model yourself. So when you model yourself and you put yourself in a simulator and you see all these different things happening, there is not the real pain that you experience when you simulate, for example, being struck by an arrow, but there might be some fear. And why is that fear emanating? It’s because you watch your own model in your imagination, in your simulation, suffer some sort of a problem. And that is very internal, right? None of this has happened in the external world, but you’re conscious of it happening. So to me, at the end of the day, it has some fundamental requirements. I believe simulation and self-modeling are two of those requirements, but ultimately it’s an emergent property.

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

[voices_in_ai_link_back]

Byron explores issues around artificial intelligence and conscious computers in his new book The Fourth Age: Smart Robots, Conscious Computers, and the Future of Humanity.

Voices in AI – Episode 85: A Conversation with Ilya Sutskever

[voices_in_ai_byline]

About this Episode

Episode 85 of Voices in AI features host Byron Reese and Ilya Sutskever of OpenAI talking about the future of general intelligence and the ramifications of building a computer smarter than us.

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

Transcript Excerpt

Byron Reese: This is Voices in AI brought to you by GigaOm and I’m Byron Reese. Today my guest is Ilya Sutskever. He is the co-founder and the chief scientist at OpenAI, one of the most fascinating institutions on the face of this planet. Welcome to the show Ilya.

Ilya Sutskever: Great to be here.

Just to bring the listeners up to speed, talk a little bit about what OpenAI is, what its mission is, and kind of where it’s at. Set the scene for us of what OpenAI does.

Great, for sure. The best way to describe OpenAI is this: at OpenAI we take the long-term view that eventually computers will become as smart as or smarter than humans in every single way. We don’t know when it’s going to happen—some number of years, something [like] tens of years, it’s unknown. And the goal of OpenAI is to make sure that when this does happen, when computers which are smarter than humans are built, when AGI is built, then its benefits will be widely distributed. We want it to be a beneficial event, and that’s the goal of OpenAI.

And so we were founded three years ago, and since then we’ve been doing a lot of work in three different areas. We’ve done a lot of work in AI capabilities, and over the past three years we’ve done a lot of work we are very proud of. Some of the notable highlights are: our Dota results, where we had the first and very convincing demonstration of an agent playing a real-time strategy game, trained with reinforcement learning with no human data. We’ve trained robot hands to reorient a block. This was really cool—it was cool to see it transfer.

And recently we’ve released GPT-2—a very large language model which can generate very realistic text as well as solve lots of different [language] problems [with] a very high level of accuracy. And so this has been our work in capabilities.

Another thrust of the work that we are doing is AI safety, which at [its] core is the problem of finding ways of communicating a very complicated reward function to an agent, so that the agent that we build can achieve goals with great competence while taking human values and preferences into account. And we’ve done a significant amount of work there as well.

And the third line of work we’re doing is AI policy, where we basically have a number of really good people thinking hard about what kind of policies should be designed and how should governments and other institutions respond to the fact that AI is improving pretty rapidly. But overall our goal, eventually the end game of the field, is that AGI will be built. The goal of OpenAI is to make sure that the development of AGI will be a positive event and that its benefits are widely distributed.

So 99.9% of all the money that goes into AI is working on specific narrow AI projects. I tried to get an idea of how many people are actually working on AGI and I find that to be an incredibly tiny number. There’s you guys, maybe you would say Carnegie Mellon, maybe Google, there’s a handful, but is my sense of that wrong? Or do you think there are lots of groups of people who are actually explicitly trying to build a general intelligence?

So, explicitly—OK, a great question. Explicitly… most people, most research labs, indeed do not have this as their goal, but I think that the work of many people indirectly contributes to it. For example, better learning algorithms, better network architectures, better optimization methods—all tools which are classically categorized as conventional machine learning—are also likely to be directly contributing to this…

Well let’s stop there for a second, because I noticed you changed your word there to “likely.” Do you still think it’s an open question whether narrow AI—whatever technologies we have that do that—has anything to do with general intelligence, or is it still the case that a general intelligence might have absolutely nothing to do with backpropagation, neural nets and machine learning?

So I think that’s very highly unlikely. Sorry—I want to make it clear, I think that the tools the field of machine learning is developing today, such as deep networks and backpropagation, are immensely powerful tools, and I think that it is likely that they will stay with us, with the field, for a long time, all the way until we build true general intelligence. At the same time I also believe, and I want to emphasize this, that important missing pieces exist and we haven’t figured out everything. But I think that deep learning has proven itself to be so versatile and so powerful, and it’s basically been exceeding our expectations at every turn. And so for these reasons I feel that deep learning is going to stay with us.

Well let’s talk about that though, because one could summarize the techniques we have right now as: let’s take a lot of data about the past, let’s look for patterns in that data and let’s make predictions about the future, which isn’t all that exciting when you say it like that. It’s just that we’ve gotten very good at it.

But why do you believe that method is the solution to things like creativity, intuition, emotion and all of these kinds of human abilities? It seems, at an intuitive level, that if you want to teach a machine to play Dota or Go or whatever, yeah, that works great. But when you come down to human-level intelligence, with its versatility, with transfer learning, with all the things we do effortlessly, it doesn’t seem at first glance to be a match. So why do you suspect that it is?

Well I mean I can tell you how I look at it. So for example you mentioned intuition as one thing—you used a certain phrase to describe the current tools, where you kind of look for patterns in past data and you use that to make predictions about the future, and therefore it sounds not exciting. But I don’t know if I’d agree with that statement. And on the question of intuition, I can tell you a story about AlphaGo. So… if you look at how AlphaGo works, there is a convolutional neural network.

OK actually let me give you a better analogy—I believe there is a book by Malcolm Gladwell where he talks about experts, and one of the things that he has to say about experts is that an expert, as a result of all their practice, can look at a very complicated situation and instantly tell, like, the three most important things in this situation. And then they think really hard about which of those things is really important. And apparently the same thing happens with Go players, where a Go player might look at the board and instantly see the most important moves and then do a little bit of thinking about those moves. And like I said, instantly seeing what those moves are—this is their intuition. And so I think it’s basically unquestionable that the neural network that’s inside AlphaGo does this calculation very well. So I think it’s not correct to say that intuition cannot be captured.
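As a rough illustration of that idea—not AlphaGo’s actual architecture, just a sketch—here is a minimal convolutional ‘policy’ network that looks at a board position and immediately assigns a probability to every candidate move, the analogue of the expert’s instant sense of which moves matter. The network below is untrained and the input is a random placeholder.

```python
# An illustrative sketch (not AlphaGo's actual network) of a convolutional
# policy network: it maps a Go board position to a probability for every
# point on the board, a rough analogue of "instantly seeing" the key moves.
import numpy as np
import tensorflow as tf

BOARD = 19
policy_net = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(BOARD, BOARD, 3)),    # a few feature planes per point
    tf.keras.layers.Conv2D(32, 3, padding="same", activation="relu"),
    tf.keras.layers.Conv2D(32, 3, padding="same", activation="relu"),
    tf.keras.layers.Conv2D(1, 1),                       # one logit per board point
    tf.keras.layers.Flatten(),
    tf.keras.layers.Softmax(),                          # probability of playing each point
])

position = np.random.rand(1, BOARD, BOARD, 3).astype("float32")  # placeholder position
move_probs = policy_net(position).numpy().reshape(BOARD, BOARD)
print("Most 'intuitive' move:", np.unravel_index(move_probs.argmax(), move_probs.shape))
```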

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

[voices_in_ai_link_back]

Byron explores issues around artificial intelligence and conscious computers in his new book The Fourth Age: Smart Robots, Conscious Computers, and the Future of Humanity.

How Intel’s Newest Product Enhancements Could Redefine the Future of Infrastructure Design

On the 2nd of April, at their Data-Centric Innovation Day, Intel announced a slew of new products, including brand spanking new ones as well as updates to existing product line-ups. There was some news for everybody—whether your specific interests are in the datacenter, the edge, or infrastructure in general. Among other things, what impressed me the most was the new Optane DC Persistent Memory DIMM, and even more so its implications when coupled with the new (56-core) Intel Xeon Platinum 9200 CPU.

Datacenter Optanization!

Up until yesterday, Optane was already considered a great technology, although to some extent it was seen as a niche product. More and more vendors have been adopting it as a substitute for NVDIMMs, as a tier 0 or a cache for their storage systems, but it was still hard to foresee a broader adoption of the technology. Yes, it is cheaper than RAM and faster than a standard NAND-based device, but that’s about it.

Maybe it was part of Intel’s strategy. In fact, the first generation of Intel Optane products was developed with Micron, and perhaps Intel didn’t want to be too aggressive with that first generation—something which has most likely changed radically after their divorce. The introduction of the Optane DC Persistent Memory DIMM actually gives a good idea of the real potential and benefits of this technology, and demonstrates how it could change the way infrastructures will be designed in the future, especially for data-intensive workloads. In practice, and in a very simplistic way, Optane DC Persistent Memory DIMMs work in conjunction with standard DDR4 RAM DIMMs. They are bigger (up to 512GB each) and slightly slower than DDR4 DIMMs, but they allow you to configure servers with several TBs of memory at a very low cost. Optane is slower than RAM, but caching techniques, and the fact that these DIMMs sit next to the CPU, make the solution a good compromise. At the end of the day, you are trading some performance for a huge amount of capacity, avoiding data starvation for a CPU that would otherwise have to access data on SSDs or, even worse, on HDDs or the network.

How Does It Work?

There are two operation modes for Optane DIMMs, persistent and non-persistent. I know it could sound confusing, but it’s actually very straightforward.

The way the DIMM operates is selected at the beginning of the bootstrap process. When non-persistent mode is selected, the DIMMs look like RAM, and the real RAM is used as a cache. This means that you don’t have to re-write your app, and practically any application can benefit from the increased memory capacity. On the other hand, when Optane DIMMs operate in persistent mode, it is the application that stores data directly in the Optane and manages RAM and Optane as it sees fit and, as the name suggests, your data is still there after a reboot. SAP HANA, for example, has already demonstrated this mode, and there are several other applications that will follow suit. Take a look at the video below, recorded at the Tech Field Day event that followed the main presentation; it’s a good deep dive on the product.
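To give a feel for the persistent mode, here is a hypothetical sketch of the typical programming pattern: the persistent memory is exposed as a DAX-mounted filesystem and the application memory-maps a file on it. The mount point, file name and sizes below are made up, and production code would normally use a library such as PMDK rather than raw mmap.

```python
# Hypothetical sketch of using Optane DIMMs in persistent (App Direct) mode.
# The persistent memory is exposed as a DAX-mounted filesystem; the app
# memory-maps a file on it so loads and stores go straight to the media.
# Path and sizes are made up; real code would typically use PMDK instead.
import mmap
import os

PMEM_FILE = "/mnt/pmem0/appdata"      # hypothetical DAX mount point
SIZE = 64 * 1024 * 1024               # 64 MB region

fd = os.open(PMEM_FILE, os.O_CREAT | os.O_RDWR)
os.ftruncate(fd, SIZE)
buf = mmap.mmap(fd, SIZE)

buf[0:5] = b"hello"                   # this write lands in persistent memory
buf.flush()                           # make sure it is durable before relying on it
buf.close()
os.close(fd)
```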

There Are Several Benefits (And a Few Trade-Offs)

All the performance tests shown during the event demonstrated the benefits of this solution. Long story short, and crystal clear: every application that relies on large amounts of RAM will see a huge benefit. This is mostly because the latency to reach active data is lower than with any other solution on the market, but also because it does so at a fraction of the cost of an all-RAM configuration—which isn’t even possible today, given the size and cost of RAM DIMMs. All of this translates into faster results, fewer servers to obtain them, better efficiency and, of course, lower infrastructure costs.

Furthermore, with more and more applications taking advantage of the persistent memory mode, we will see interesting applications with Optane DIMMs that could replace traditional, and expensive, NVDIMMs in many scenarios like, for example, storage controllers. What are the trade-offs then? Actually, these are not real trade-offs, but the consequences of the introduction of this innovative technology. In fact, Optane DIMMs work only on new servers based on the latest Intel CPUs, those announced alongside the DIMMs. The reason for this is that the memory controller is in the CPU, and older CPUs wouldn’t be able to understand the Optane DIMM nor manage the interaction between RAM and Optane.

Maintaining the Balance

As mentioned earlier, Intel announced many other products alongside Optane Persistent Memory DIMMs. All of them are quite impressive and, at the same time, necessary to justify each other; meaning that it would be useless to have multi-TB RAM systems without a CPU strong enough to get the work done. The same goes for the network, which can quickly become a bottleneck if you don’t provide the necessary bandwidth to get data back and forth from the server.

From my point of view, it’s really important to understand that we are talking about data-crunching monsters here, with the focus on applications like HPC, Big Data, AI/ML and the like. These are not your average servers for VMs, not today at least. On the other hand, it is also true that this technology opens up many additional options for enterprise end users too, including the possibility to create larger VMs or consolidate more workloads on a single machine (with all its pros and cons).

Another feature which I thought noteworthy is the new set of instructions added to the new Xeon Platinum 9200 CPU for deep learning. We are far from having general purpose CPUs competing against GPUs, but the benchmark given during the presentations shows an incredible improvement in this area regarding Inference workloads (the process that is behind the day-to-day ML activity after the neural network is trained). Intel has done a great job, both on hardware and software. With more and more applications taking advantage of AI/ML to work, an increasing number of users will be able to benefit from it.

Closing the Circle

In this article, I’ve covered only a few aspects of these announcements, those that are more related to my job. There is much more to it, including IoT, edge computing, and security. I was especially impressed because the announcements express a vision that is broad, clear and unambiguous, providing answers for the most challenging aspects of modern IT.

Most of the products presented during this announcement are focused on ultimate performance and efficiency or, at least, on finding the best compromise to serve next-gen, data-hungry, highly demanding applications—something which is beyond the reach of many enterprises today and more in the ballpark of web and hyperscalers. That said, even if on a smaller scale, all enterprises are beginning to face these kinds of challenges and, no matter whether the solutions come from the cloud or from their on-prem infrastructure, the technology to do it is now more accessible than ever.

Voices in AI – Episode 84: A Conversation with David Cox

[voices_in_ai_byline]

About this Episode

Episode 84 of Voices in AI features host Byron Reese and David Cox discussing classifications of AI and how the research has been evolving and growing.

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

Transcript Excerpt

Byron Reese: This is Voices in AI, brought to you by GigaOm and I’m Byron Reese. I’m so excited about today’s show. Today we have David Cox. He is the Director of the MIT IBM Watson AI Lab, which is part of IBM Research. Before that he spent 11 years teaching at Harvard, interestingly in the Life Sciences. He holds an AB degree from Harvard in Biology and Psychology, and he holds a PhD in Neuroscience from MIT. Welcome to the show David!

David Cox: Thanks. It’s a great pleasure to be here.

I always like to start with my Rorschach question which is, “What is intelligence and why is Artificial Intelligence artificial?” And you’re a neuroscientist and a psychologist and a biologist, so how do you think of intelligence?

That’s a great question. I think we don’t necessarily need to have just one definition. I think people get hung up on the words, but at the end of the day, what makes us intelligent, what makes other organisms on this planet intelligent is the ability to absorb information about the environment, to build models of what’s going to happen next, to predict and then to make actions that help achieve whatever goal you’re trying to achieve. And when you look at it that way that’s a pretty broad definition.

Some people are purists and they want to say this is AI, but this other thing is just statistics or regression or if-then-else loops. At the end of the day, what we’re about is we’re trying to make machines that can make decisions the way we do and sometimes our decisions are very complicated. Sometimes our decisions are less complicated, but it really is about how do we model the world, how do we take actions that really drive us forward?

It’s funny, the AI word, too. I’m a recovering academic, as you said. I was at Harvard for many years, and I think as a field we were really uncomfortable with the term ‘AI,’ so we desperately wanted to call it anything else. In 2017 and before, we wanted to call it ‘machine learning’ or we wanted to call it ‘deep learning’ [to] be more specific. But in 2018, for whatever reason, we all just gave up and we just embraced this term ‘AI.’ In some ways I think it’s healthy. But when I joined IBM I was actually really pleasantly surprised by some framing that the company had done.

IBM does this thing called the Global Technology Outlook or GTO which happens every year and the company tries to collectively figure out—research plays a very big part of this—we try to figure out ‘What does the future look like?’ And they came up with this framing that I really like for AI. They did something extremely simple. They just put some adjectives in front of AI and I think it clarifies the debate a lot.

So basically, what we have today like deep learning, machine learning, tremendously powerful technologies are going to disrupt a lot of things. We call those Narrow AI and I think that narrow framing really calls attention to the ways in which even if it’s powerful, it’s fundamentally limited. And then on the other end of the spectrum we have General AI.  This is a term that’s been around for a long time, this idea of systems that can decide what they want to do for themselves that are broadly autonomous and that’s fine. Those are really interesting discussions to have but we’re not there as a field yet.

In the middle—and I think this is really where the interesting stroke is—there’s this notion of Broad AI, and I think that’s really where the stakes are today. How do we have systems that are able to go beyond what we have that’s narrow, without necessarily getting hung up on all these notions of what ‘General Intelligence’ might be? So things like having systems that are interpretable, having systems that can work with different kinds of data, that can integrate knowledge from other sources—that’s the domain of Broad AI. Broad Intelligence is really what the lab I lead is all about.

There’s a lot in there and I agree with you. I’m not really that interested in that low end and what’s the lowest bar in AI. What makes the question interesting to me is really the mechanism by which we are intelligent, whatever that is, and does that intelligence require a mechanistic reductionist view of the world? In other words, is that something that you believe we’re going to be able to duplicate either… in terms of its function, or are we going to be able to build machines that are as versatile as a human in intelligence, and creative and would have emotions and all of the rest, or is that an open question?

I have no doubt that we’re going to eventually, as a human race, be able to figure out how to build intelligent systems that are just as intelligent as we are. I think with some of these things, we tend to think about how we’re different from other kinds of intelligences on Earth. We do things like… there was a period of time where we wanted to distinguish ourselves from the animals, and we thought reason—the ability to reason and do things like mathematics and abstract logic—was what was uniquely human about us.

And then computers came along and all of a sudden computers can actually do some of those things better than we can, even in arithmetic and solving complex logic problems or math problems. Then we moved towards thinking that maybe it’s emotion—maybe emotion is what makes us uniquely human rather than rationality. There was a kind of narcissism, I think, to our own view, which is understandable and justifiable. How are we special in this world?

But I think in many ways we’re going to end up having systems that do have something like emotion. Even if you look at reinforcement learning—those systems have a notion of reward. I don’t think it’s such a far reach to think maybe we’ll even, in a sci-fi world, have machines that have senses of pleasure and hopes and ambitions and things like that.

At the end of the day, our brains are computers. I think that’s sometimes a controversial statement, but it’s one that I think is well grounded. It’s a very sophisticated computer. It happens to be made out of biological materials. But at the end of the day, it’s a tremendously efficient, tremendously powerful, tremendously parallel nanoscale biological computer. These are like biological nanotechnology. And to the extent that it is a computer—to the extent that we can agree on that—computer science gives us equivalencies. We can build a computer with different hardware. We don’t have to emulate the hardware. We don’t have to slavishly copy the brain, but it is sort of a given that we will eventually be able to do everything the brain does in a computer. Now of course that’s all farther off, I think. Those are not the stakes—those aren’t the battlefronts that we’re working on today. But I think the sky’s the limit in terms of where AI can go.

You mentioned Narrow and General AI, and this classification you’re putting in between them is broad, and I have an opinion and I’m curious of what you think. At least with regards to Narrow and General they are not on a continuum. They’re actually unrelated technologies. Would you agree with that or not?

Would you say that a narrow AI gets a little better, then a little better, then a little better, then a little better, and ta-da! One day it can compose a Hamilton? Or do you think that they may be completely unrelated—that this model of ‘Hey, let’s take a lot of data about the past and let’s study it very carefully to learn to do one thing’ is very different from whatever General Intelligence is going to be?

There’s this idea that if you want to go to the moon, one way to go to the moon—to get closer to the moon—is to climb the mountain.

Right. Exactly.

And you’ll get closer, but you’re not on the right path. And maybe you’d be better off on top of a building, or with a little rocket that maybe only goes as high as the tree or as high as the mountain, but it’ll get you where you need to go. I do think there is a strong flavor of that with today’s AI.

And today’s AI, if we’re plain about things, is deep learning. This model… what’s really been successful in deep learning is supervised learning. We train a model to do every part of seeing based on classifying objects, and you classify a lot—many images; you have lots of training data and you build a statistical model. And that’s everything the model has ever seen. It has to learn from those images and from that task.
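For readers who want the supervised-learning recipe in concrete terms, here is a minimal sketch—MNIST simply stands in for “lots of labeled examples”; it illustrates the paradigm being discussed, not any specific system mentioned in the interview.

```python
# A minimal sketch of the supervised-learning recipe: many labeled images in,
# a statistical model out. MNIST is used purely as a stand-in for "lots of
# labeled training data"; this illustrates the paradigm, not any specific system.
import tensorflow as tf

(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Everything the model will ever "know" comes from these labeled examples.
model.fit(x_train, y_train, epochs=1, verbose=0)
print("held-out accuracy:", model.evaluate(x_test, y_test, verbose=0)[1])
```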

And we’re starting to see that actually the solutions you get—again, they are tremendously useful, but they do have a little bit of that quality of climbing a tree or climbing a mountain. There’s a bunch of recent work suggesting that, basically, these models are looking at texture—a lot of what the supervised solution ends up keying on is rough texture.

There are also some wonderful examples where you take a captioning system—a system can take an image and produce a caption. You can produce wonderful captions in cases where the images look like the ones it was trained on, but you show it anything just a little bit weird like an airplane that’s about to crash or a family fleeing their home on a flooding beach and it’ll produce things like an airplane is on the tarmac at an airport or a family is standing on a beach. It’s like they kind of missed the point, like it was able to do something because it learned correlations between the inputs it was given and the outputs that we asked it for, but it didn’t have a deep understanding. And I think that’s the crux of what you’re getting at and I agree at least in part.

So with Broad, the way you’re thinking of it, it sounds to me just from the few words you said, it’s an incremental improvement over Narrow. It’s not a junior version of General AI. Would you agree with that? You’re basically taking techniques we have and just doing them bigger and more expansively and smarter and better, or is that not the case?

No. When we think about Broad AI, we really are thinking about a little bit ‘press the reset button, don’t throw away things that work.’ Deep learning is a set of tools which is tremendously powerful, and we’d be kind of foolish to throw them away. But when we think about Broad AI, what we’re really getting at is how do we start to make contact with that deep structure in the world… like commonsense.

We have all kinds of common sense. When I look at a scene I look at the desk in front of me, I didn’t learn to do tasks that have to do with the desk in front of me by lots and lots of labeled examples or even many, many trials in a reinforcement learning kind of setup. I know things about the world – simple things. And things we take for granted like I know that my desk is probably made of wood and I know that wood is a solid, and solids can’t pass through other solids. And I know that it’s probably flat, and if I put my hand out I would be able to orient it in a position that would be appropriate to hover above it…

There are all these affordances and all this super simple commonsense stuff that you don’t get when you just do brute force statistical learning. When we think about Broad AI, we’re really thinking about is ‘How do we infuse that knowledge, that understanding and that commonsense?’ And one area that we’re excited about and that we’re working on here at the MIT IBM Lab is this idea of neuro-symbolic hybrids.

So again, this is in the spirit of ‘don’t throw away neural networks.’ They’re wonderful at extracting certain kinds of statistical structure from the world—a convolutional neural network does a wonderful job of extracting information from an image, and LSTMs and recurrent neural networks do a wonderful job of extracting structure from natural language—but building in symbolic systems as first-class citizens in a hybrid system combines those all together.

Some of the work we’re doing now is building systems where we use neural networks to extract structure from these noisy, messy inputs of vision and different modalities, but then actually having symbolic AI systems on top. Symbolic AI systems have been around basically contemporaneously with neural networks. They’ve been ‘in the wings’ all this time. Neural networks—deep learning—anyway, everyone knows this is a rebrand of the neural networks from the 1980s that are suddenly powerful again. They’re powerful for the first time because we have enough data and we have enough compute.

I think in many ways a lot of the symbolic ideas—sort of logical operations, planning, things like that—are also very powerful techniques, but they haven’t really been able to shine yet, partly because they’ve been waiting for something, just the way that neural networks were waiting for compute and data to come along. I think in many ways some of these symbolic techniques have been waiting for neural networks to come along—because neural networks can bridge that [gap] from the messiness of the signals coming in to this sort of symbolic regime where we can start to actually work. One of the things we’re really excited about is building these systems that can bridge across that gap.
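To make the pattern concrete, here is a toy sketch of a neuro-symbolic hybrid—an illustration of the general idea only, not of the lab’s actual systems: a stand-in for a neural network turns messy pixels into a discrete symbol, and a small symbolic layer then reasons over that symbol with hand-written commonsense rules.

```python
# A toy sketch of the neuro-symbolic pattern: a (stand-in) neural network maps
# messy pixels to a discrete symbol, and a symbolic layer applies simple
# commonsense rules over that symbol. Illustrative only.
import numpy as np

def neural_perception(image: np.ndarray) -> str:
    """Stand-in for a trained CNN classifier: maps an image to a symbolic label."""
    return "desk" if image.mean() > 0.5 else "unknown"

# Tiny symbolic knowledge base: facts about the symbols the perception layer emits.
FACTS = {
    "desk": {"made_of": "wood", "surface": "flat"},
    "wood": {"is_a": "solid"},
}

def infer(label: str) -> list:
    """Chain a couple of commonsense rules: desk -> wood -> solid; flat -> supports objects."""
    conclusions = []
    props = FACTS.get(label, {})
    material = props.get("made_of")
    if material and FACTS.get(material, {}).get("is_a") == "solid":
        conclusions.append(f"a {label} cannot be passed through (it is solid)")
    if props.get("surface") == "flat":
        conclusions.append(f"objects can rest on a {label}")
    return conclusions

image = np.random.rand(224, 224) * 0.4 + 0.4   # placeholder "photo of a desk"
symbol = neural_perception(image)
print(symbol, "->", infer(symbol))
```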

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

[voices_in_ai_link_back]

Byron explores issues around artificial intelligence and conscious computers in his new book The Fourth Age: Smart Robots, Conscious Computers, and the Future of Humanity.

The Future of Software Innovation? Hardware-Enabled AI & ML Innovation

Hardware innovation is a fickle beast.  It takes money, lots of money.  It takes time, a great team, and execution in more than twenty separate domains, many of which are often overlooked until it’s too late (certifications anyone?!).  But, I’ve got some good news. Hardware is back and it’s about to get really exciting.

You’re probably thinking right now, “All I ever read about is how AI, ML, blockchain, and XR are ready to revolutionize the world,” and that’s exactly the point. There are some amazing software technologies coming out, but this “new” software has hardware in its DNA. Until recently, smart technologies have largely been limited by their access points: computers, tablets, smartphones, etc. Going forward, hardware innovations will become increasingly integral and valuable as the interface for tomorrow’s software. Hardware will capture the data—through wearables, hearables, cameras and an increasing variety of sensors—and will then serve as the output for interacting with the world, as robots, drones, and the myriad other IoT devices that are being developed.

As the ecosystem of devices, computation, connection, and data evolves, the platforms, tools and systems are naturally finding more synergy and lowering the barriers to integration. The line between what is a hardware or a software product will continue to blur. The sensor technologies leading the way are cameras and microphones. If there is a camera, there’s a good chance there’s an AI stack behind it, with self-driving cars being the most prominent example. On the microphone/speaker side, the smart home assistants—Amazon Echo, Google Home, and others—are obvious and ubiquitous.

The beauty of hardware-enabled AI/ML is that it not only crosses the boundary between the physical and the virtual, but also between analog and digital. It’s particularly valuable when interacting with the world and dealing with its messy data. The next generation of AI hardware startups will take all that messy analog data and transform it into productive, executable knowledge that provides better experiences, all the way from shopping to cancer treatments, enabling personalized health care at scale.

While the future is clear, the hurdles are as well.  Processing power, robustness, generalization and cost are all tradeoffs future hardware products will need to balance.  Unlike the on-demand and scalable cloud and other services software enjoys, each hardware product will have onboard processing, sensors, connectivity tech, and other requirements that all make their way into the product cost.  Sure a GPU can be thrown into the BOM, but can the market accept the price?  High performance computing at the edge is still in its nascent stages so real-time processing of images and other data can be expensive as well.

At the same time, specific tools are being developed to improve this integration. We’re seeing a lot of edge computing hardware, such as NVIDIA’s Jetson line and Google’s Edge TPUs. TensorFlow is probably the most common AI framework, since it has such broad support for hardware deployment, including the Raspberry Pi. ROS is still fairly popular despite being a jumble of mismatched and complicated software, and people have done ports to OpenAI’s Gym environments.
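As a rough illustration of what that deployment path looks like in practice, here is a minimal sketch of on-device inference with TensorFlow Lite, the sort of thing you might run on a Raspberry Pi or similar edge device; the model file name is a placeholder for whatever converted model you deploy.

```python
# A minimal sketch of edge inference with TensorFlow Lite (e.g. on a Raspberry Pi).
# "model.tflite" is a placeholder for whatever converted model you deploy.
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="model.tflite")
interpreter.allocate_tensors()

input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Fake sensor frame, shaped and typed to whatever the model expects.
frame = np.zeros(input_details[0]["shape"], dtype=input_details[0]["dtype"])

interpreter.set_tensor(input_details[0]["index"], frame)
interpreter.invoke()
prediction = interpreter.get_tensor(output_details[0]["index"])
print("edge inference output shape:", prediction.shape)
```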

The future of hardware is bright and full of highly accessible, processed, happiness-inducing data.

Join 600 hardware innovators, entrepreneurs, disruptors and investors at HardwareCon 2019, the premier event for hardware innovation.  Plan to attend April 17-18 at the Computer History Museum in Mountain View, California.  Use promo code: GIGA-OM-IL for a special 20% discount on the ticket. Visit www.hardwarecon.com to redeem the discount.

Author Bio

Greg Fisher is all about hardware innovation. As founder/CEO of Berkeley Sourcing Group, Greg has spent the last 13 years working with over 1000 hardware startups to develop and manufacture innovative products. Living in China one third of that time, he worked with hardware startups and factories to help improve their designs for manufacturing, qualify and select factories, manage factory negotiations and relationships, and develop and implement quality control processes. With this history, Greg has a unique perspective and immense passion for what it takes for hardware startups to build the right foundation and scale their operations.

Seeing the need for more support for hardware startups to realize success, Greg started Hardware Massive, which is now the leading Global Community/Platform for Hardware Startup Innovation, and HardwareCon, the Bay Area’s premier hardware innovation conference. Their missions are to empower hardware startups to succeed through networking, events, education, and providing access to resources.

How IBM is Rethinking its Data Protection Line-Up

Following up on my take on the evolution of product and strategy at companies like Cohesity and NetApp, today I’d like to talk about IBM and its new data protection solution, Spectrum Protect Plus.

IBM AND ITS DATA PROTECTION LINE-UP

In short, not too long ago IBM changed the names of all its storage products. I totally understand why they did it and it makes a lot of sense from a marketing point of view, but it is still confusing for people like myself that were familiar with the products before this change. Besides, with products now having similar names, it could be difficult to discern who does what.

In this particular case, data protection, you now have two products:

  • IBM Spectrum Protect: the good old TSM. While this product is one of those that wrote backup’s history, and it supports a myriad of operating systems and applications for backup, it is complex to operate and designed for large environments. Furthermore, it was designed well before the advent of hypervisors and modern applications, making it really tough to protect these environments efficiently.
  • IBM Spectrum Protect Plus: a new product designed from the ground up for modern environments, including hypervisors, NoSQL DBs and more. It has a very modern snapshot-based design that pairs nicely with VMware CBT (Changed Block Tracking), for example. It’s easy to use and can be adopted by IT organizations of all sizes.

Videos from SFD18 can give you a good idea of the features and the potential of IBM Spectrum Protect Plus, and there are a few aspects that I think are interesting to note:

IBM Spectrum Protect Plus might be a good companion for IBM TSM customers. Although the two don’t share anything technically, it is still an IBM product, and from the procurement and budgetary standpoints it could be much easier to adopt this solution instead of others.

Licensing is pretty flexible, making this product competitive from a cost standpoint on smaller infrastructures too. And this also makes it easier to place it in large infrastructures, aligning the cost with what is actually being protected.

Spectrum Protect Plus is not yet at the level of features you can find in more mature products like Veeam, but the team behind it is very committed and has a very aggressive release schedule.

This product has a good, scalable architecture, and the roadmap shows great potential for future releases, especially when it comes to sophisticated features around data reuse and management.

CLOSING THE CIRCLE

As I wrote above, Spectrum Protect Plus might be a good option for IBM customers that already have TSM for their legacy infrastructure. What the IBM Spectrum Protect family lacks most for this type of customer at the moment is a sort of unified GUI, to allow sysadmins to speed up operations and have better control of the backup infrastructure. But, as far as I can tell, it seems I’m not the first one to note this deficiency… the development team is already looking into it.

NetApp NDAS Integrates On-Premises & Cloud as One

At the end of February 2019, at Storage Field Day 18, NetApp presented another tool aimed at integrating its on-premises solutions with the cloud: NetApp Data Availability Services (NDAS). As already mentioned in a previous post, this tool might be somewhat immature, but it has huge potential if developed in the right way.

TWO WORDS ABOUT NDAS

Long story short, NDAS takes advantage of the SnapMirror functionality available on NetApp arrays and syncs volumes to the cloud. The cool part is that the content of the volumes is converted into objects (taking advantage of AWS S3). In the short term, it’s all about saving money, because S3 is way cheaper than Elastic Block Storage (EBS), but the real deal comes from the fact that data stored in this format is much more reusable for a large number of use cases, including index and search, ransomware protection, analytics and so on. Take a look at the videos recorded during their SFD18 session to get an idea of what I’m talking about.
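To illustrate why landing the data in object storage matters, here is a hypothetical sketch of the general pattern—snapshot data stored as S3 objects with metadata that later makes it indexable and searchable. This is not NetApp’s NDAS implementation or its on-disk format; bucket and key names are made up.

```python
# A hypothetical sketch of the general pattern described above: snapshot data
# landing in S3 as objects, with metadata that makes it easy to index, search
# and reuse later. NOT NetApp's NDAS implementation or format; names are made up.
import boto3

s3 = boto3.client("s3")
BUCKET = "example-backup-bucket"   # hypothetical bucket

def store_snapshot_block(volume: str, snapshot: str, block_id: int, data: bytes) -> None:
    key = f"{volume}/{snapshot}/block-{block_id:08d}"
    s3.put_object(
        Bucket=BUCKET,
        Key=key,
        Body=data,
        Metadata={"volume": volume, "snapshot": snapshot},  # enables later index/search
    )

store_snapshot_block("vol1", "snap-2019-02-28", 0, b"\x00" * 4096)
```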

NETAPP IS (ALSO) A CLOUD COMPANY

NetApp already had me a few years back when they presented their Data Fabric vision. But, as is the case with any vision, it’s only good on paper until it gets executed properly. If I needed confirmation of how well they are executing, NDAS—and especially the talk that Dave Hitz gave at the beginning of the session—is what sustained my excitement about them. He gave clear examples of their cloud products, of their approach with customers, of partnerships that are all-in with cloud, and of cloud-native end users—all while keeping an eye on traditional customers and how to support them in their journey to the cloud.

The risk they are taking is to cannibalize some of their on-premises sales or, better, their traditional product sales … but this is paying off, both in terms of mindshare as well as overall results. And if we look at company growth and financial results in the last three years, they seem nothing but positive.

CLOSING THE CIRCLE

Many storage vendors are now more cloud-focused than in the past. NetApp just started sooner and had the courage to disrupt its internal status quo: they listened to end users, hired people with a different mindset, designed new cloud-focused products and services, and also opened their core products to better cloud integration. And this is paying off big time.

Usually, you can expect this kind of turnaround from smaller, younger, and nimbler companies, but it’s always refreshing to see it happen at a company of NetApp’s size.

Originally posted on Juku.it

Voices in AI – Episode 83: A Conversation with Margaret Mitchell

[voices_in_ai_byline]

About this Episode

Episode 83 of Voices in AI features host Byron Reese and Margaret Mitchell discussing the nature of language and its impact on machine learning and intelligence.

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

Transcript Excerpt

Byron Reese: This is Voices in AI brought to you by GigaOm and I’m Byron Reese. Today my guest is Margaret Mitchell. She is a senior research scientist at Google doing amazing work. And she studied linguistics at Reed College and Computational Linguistics at the University of Washington. Welcome to the show!

Margaret Mitchell: Thank you. Thank you for having me.

I’m always intrigued by how people make their way to the AI world, because a lot of times what they study in University [is so varied]. I’ve seen neuroscientists, I see physicists, I see all kinds of backgrounds. [It’s] like all roads lead to Rome. What was the path that got you from linguistics to computational linguistics and to artificial intelligence?

So I followed a path similar to, I think, some other people who’ve had sort of linguistics training and then go into natural language processing, which is sort of [the] applied field of AI focusing specifically on processing and understanding text, as well as generating it. And so I had been kind of fascinated by noun phrases when I was an undergrad. So that’s things that refer to people, places, objects in the world and things like that.

I wanted to figure out: is there a way that I could, like, analyze things in the world and then generate a noun phrase? So I was kind of playing around with just this idea of ‘How could I generate noun phrases that are humanlike?’ And that was before I knew about natural language processing, that was before this new wave of AI interest. I was just kind of playing around with trying to do something that was humanlike, from my understanding of how language worked. Then I found myself having to code and stuff to get that to work—like mock up some basic examples of how that could work if you had different knowledge about the kinds of things that you’re trying to talk about.

And once I started doing that, I realized that I was doing essentially what’s called natural language generation. So generating phrases and things like that based on some input data or input knowledge base, something like that. And so once I started getting into the natural language generation world, it was a slippery slope to get into machine learning and then what we’re now calling artificial intelligence because those kinds of things end up being the methods that you use in order to process language.
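As a toy illustration of what generating a noun phrase from a knowledge base can look like—not the actual research system described here, just a much-simplified version of the classic incremental algorithm for referring expressions—consider the sketch below; the objects and attributes are invented for the example.

```python
# A toy illustration (not the research system described above) of generating
# a noun phrase from a small knowledge base: add just enough attributes to
# pick the target object out from the other objects it could be confused with.
knowledge = {
    "obj1": {"type": "mug", "color": "red", "size": "small"},
    "obj2": {"type": "mug", "color": "blue", "size": "large"},
}

def generate_noun_phrase(obj_id: str, kb: dict) -> str:
    target = kb[obj_id]
    distractors = [props for oid, props in kb.items() if oid != obj_id]
    modifiers = []
    # Add attributes one at a time until no distractor matches the description.
    for attr in ("color", "size"):
        if distractors:
            modifiers.append(target[attr])
            distractors = [d for d in distractors if d.get(attr) == target[attr]]
    return "the " + " ".join(modifiers + [target["type"]])

print(generate_noun_phrase("obj1", knowledge))   # -> "the red mug"
```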

So my question is: I always hear these things that say “computers have ninety-x-point-whatever percent accuracy in transcription,” and I fly a lot. My frequent flyer number of choice has an A, an H and an 8 in it.

Oh no.

And I would say it never gets it right.

Right.

And it’s only got 36 choices.

Right.

Why is it so awful?

Right. So that’s speech processing. And that has to do with a bunch of different things, including just how well the speech stream is being analyzed, and the sort of frequencies that are picked up are going to be different depending on what kind of device you’re using. And a lot of times the higher frequencies are cut off. And so words or sounds that we hear really easily face to face are muddled more when we’re using different kinds of devices. Especially on things like telephones, that ends up cutting off a lot of the higher frequencies that really help those distinctions. And then there are just general training issues, so depending on who you’ve trained on and what the data represents, you’re going to have different kinds of strengths and weaknesses.

Well I also find that in a way, our ability to process language is ahead of our ability, in many cases, to do something with it. I can’t say the names out loud because I have two of these popular devices on my desk and they’ll answer me if I mention them, but they always understand what I’m saying. But the degree to which they get it right… like if I say, “What’s bigger—a nickel or the sun?” they never get it. And yet they usually understand the sentence.

So I don’t really know where I’m going with that other than, do you feel like you could say your area of practice is one of the more mature, like hey, we’re doing our bit, the rest of you common sense people over there and you models of the world over there and you transfer learning people, y’all are falling behind, but the computational linguistics people—we have it all together?

I don’t think that’s true. And the things you’re mentioning aren’t actually mutually exclusive either, so in natural language processing you often use common sense databases or you’re actually helping to do information extraction in order to fill out those databases. And you can also use transfer learning as a general technique that is pretty powerful in deep learning models right now.

Deep learning models are used in natural language processing as well as image processing as well as a ton of other stuff.

So… everything you’re mentioning is relevant to this task of saying something and having the device on your desktop understand what you’re talking about. And that whole process isn’t just simply recognizing the words; it’s taking those words and then mapping them to some sort of user intent and then being able to act on that intent. That whole pipeline, that whole process involves a ton of different models and requires being able to make queries about the world and extract information based on—usually it’s going to be—the content words of the phrase: nouns, verbs, things that are conveying the main ideas in your utterance, and using those in order to find information relevant to that.
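As a toy sketch of that pipeline—purely illustrative, not how any particular assistant actually works—you can picture the step from recognized words to a crude intent like this:

```python
# A toy sketch of the pipeline described above: keep the content words of the
# utterance and map them to a crude "intent". Illustrative only; real assistants
# use far richer models for tokenization, tagging and intent resolution.
STOP_WORDS = {"whats", "what", "is", "a", "an", "the", "or", "and", "to", "of"}

def content_words(utterance: str) -> list:
    cleaned = utterance.lower().replace("?", "").replace(",", "").replace("'", "")
    return [t for t in cleaned.split() if t not in STOP_WORDS]

def to_intent(words: list) -> dict:
    # A real system would map content words to a structured query against a
    # knowledge source; here we just recognize a size-comparison question.
    if "bigger" in words:
        return {"intent": "compare_size", "entities": [w for w in words if w != "bigger"]}
    return {"intent": "unknown", "entities": words}

print(to_intent(content_words("What's bigger, a nickel or the sun?")))
# -> {'intent': 'compare_size', 'entities': ['nickel', 'sun']}
```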

So the Turing test… if I can’t tell if I’m talking to a person or a machine, you got to say the machine is doing a pretty good job. It’s thinking according to Turing. Do you think passing the Turing test would actually be a watershed event? Or do you think that’s more like marketing and hype, and it’s not the kind of thing you even care about one way or the other?

Right. So the Turing test as originally construed has this basic notion that the person who is judging can’t tell whether or not it’s human-generated or machine-generated. And there are lots of ways to do that. That’s not exactly what we mean by human-level performance. So, for example, the machine could trivially pass the Turing test if it were pretending to be a person who doesn’t understand English well, right? So you could say, “Oh, there’s a person behind this, they’re just learning English for the first time—they might get some things mixed up.”

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

[voices_in_ai_link_back]

Byron explores issues around artificial intelligence and conscious computers in his new book The Fourth Age: Smart Robots, Conscious Computers, and the Future of Humanity.

Voices in AI – Episode 82: A Conversation with Max Welling


About this Episode

Episode 82 of Voices in AI features host Byron Reese and Max Welling discussing the nature of intelligence and its relationship with intuition, evolution, and need.

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com

Transcript Excerpt

Byron Reese: This is Voices in AI brought to you by GigaOm, and I’m Byron Reese. Today my guest is Max Welling. He is the Vice President, Technologies at Qualcomm. He holds a Ph.D. in theoretical physics from Utrecht University and he’s done postdoc work at Caltech, University of Toronto and other places as well. Welcome to the show Max!

Max Welling: Thank you very much.

I always like to start with the question [on] first principles, which is: What is intelligence and why is artificial intelligence artificial? Is it not really intelligent? Or is it? I’ll start with that. What is intelligence and why is AI artificial?

Okay. So intelligence is not something that’s easily defined in a single sentence. I think there is a whole broad spectrum of possible intelligence, and in fact in artificial systems we are starting to see very different kinds of intelligence. For instance, you can think of a search engine as being intelligent in some way, but it’s obviously a very different kind of intelligence than a human being’s, right?

So there’s human intelligence, and I guess that’s the ability to plan ahead and to analyze the world, to organize information, these kinds of things. But artificial intelligence is artificial because it sits in machines, not in human brains. That’s the only reason why we call it ‘artificial.’ I don’t think there is any reason why artificial intelligence couldn’t be the same as or very similar to human intelligence. I just think that that’s a very restricted kind of intelligence, and we could imagine having a whole broad spectrum of intelligence in machines.

I’m with you on all of that, but maybe because human intelligence is organizing information and planning ahead, and machines are doing something different, like search engines and all that, maybe I should ask the question: what isn’t intelligence? At some point, doesn’t the word lose all its meaning if it covers that much? What are we really talking about when we come to intelligence? Are we talking about problem solving? Are we talking about adaptation? Or is it so broad that it has no definition?

Well yeah, it depends on how broadly you want to define it. I think it’s not a very well-defined term per se. I mean, you could ask yourself whether a fish is intelligent. And I think a fish is intelligent to some degree, because it has a brain, it processes information, it adapts perhaps a little bit to its environment. So even a fish is intelligent, but clearly it’s a lot less intelligent than a human.

So I would say it’s anything that has the purpose of sensing, of acquiring information from its environment, and computing on that information to its own benefit. In other words, surviving better is the goal, or maybe reproducing is the ultimate goal. And so basically, once you’ve taken in information and computed on it, you can act: you can use that information to bring the world into a state that’s more beneficial for you, right? So that you can survive better, reproduce better. So it’s anything that processes information in order to reach a goal, to achieve a particular goal, which in evolution is reproducing or surviving.

But… in artificial systems it could be something very different. In an artificial system, you could still sense information, you could still compute and process information, but in order to satisfy your customers, which might mean providing them with better search results or something like that. So that’s a different goal, but the same phenomenon underlies it, which is processing information to reach that goal.

Now, you mentioned adaptation and learning, and I think those are super important parts of being intelligent. A system that can adapt and learn from its environment and from experience is a system that can keep improving itself, and therefore become more intelligent or better at its task, or adapt when the environment is changing.

So these are really important parts of being intelligent, but not strictly necessary, because you could imagine a self-driving car that is completely pre-programmed. It doesn’t adapt, but it still behaves intelligently in the sense that it knows when things are happening, it knows when to overtake other cars, it knows how to avoid collisions, etcetera.

So in short, I think intelligence is actually a very broad spectrum of things. It’s not super well-defined, and of course you can define more narrow things like human intelligence, for instance, or fish intelligence, or search engine intelligence, and then it would mean something slightly different.

How far down in simplicity would you extend that? So if you have a pet cat and you have a food bowl that refills itself when it gets empty…it’s got a weight sensor, and when the weight sensor shows nothing in there, it opens something up and then fills it. It has a goal which is: keep the cat happy. Is that a primitive kind of artificial intelligence?

It would be a very, very primitive kind of artificial intelligence. Yes.
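
The feeder described here boils down to a single sense-compute-act rule. Here is a minimal sketch, with the weight sensor and hopper left as hypothetical hooks and the threshold chosen purely for illustration.

```python
# Sketch: the cat feeder as a minimal sense-compute-act loop.
# `read_weight_grams` and `open_hopper` are hypothetical hardware hooks.
import time

EMPTY_THRESHOLD_G = 5  # below this, treat the bowl as empty (assumed value)

def feeder_loop(read_weight_grams, open_hopper, portion_g=50):
    while True:
        weight = read_weight_grams()          # sense
        if weight < EMPTY_THRESHOLD_G:        # compute: one threshold rule
            open_hopper(portion_g)            # act toward the goal: a fed cat
        time.sleep(60)
```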

Fair enough. And going back centuries before that, I read that the first vending machines, the first coin-operated machines, were built to dispense holy water. You would drop a coin in a slot, the weight of the coin would push down a lever that opened a valve and dispensed some water, and as the water was dispensed the coin would fall out and the valve would close again. Is that a really, really primitive artificial intelligence?

Yeah, I don’t know. You can drive these things to an extreme with many of these definitions. Clearly this is some kind of mechanism. I guess there is a bit of sensing, because it’s sensing the weight of a coin, and then it has a response to that, which is opening something. It’s a completely automatic response, and humans actually have many of these reflexes. If the doctor taps your knee with a reflex hammer, your knee jerks up, and that’s actually done through a nervous pathway that doesn’t even reach your brain; I think it’s handled somewhere in your spinal cord. So it’s very, very, very primitive, but still you could argue it senses something, it computes something and it acts. So it’s like the very most fundamental, simple form of intelligence. Yeah.

So the technique we’re using to make a lot of the advances in artificial intelligence in computers now is machine learning, and I guess it’s really a simple idea: let’s study data about the past, look for patterns, and make projections into the future. How powerful is that technique, and what do you think are the inherent limits of that particular way of gaining knowledge and building intelligence?

Well, I think it’s kind of interesting if you look at the history of AI. In the old days, a lot of AI was hard-coding rules. You would think about all the eventualities you could encounter, and for each one of those you would program a response, an automatic response. Those systems did not necessarily look at large amounts of data from which they would learn patterns and learn how to respond.

In other words, it was all up to humans to figure out what the relevant things were to look at, to sense, and how to respond to them. And if you make enough of those rules, a system like that actually looks like it’s behaving quite intelligently. Even nowadays, I think a large component of self-driving cars is made of lots and lots of these rules hard-coded into the system. So if you have many, many of these really primitive pieces of intelligence together, they might look like they act quite intelligently.
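
As a toy illustration of that older, rule-based style, here is a minimal sketch; every rule and threshold below is invented purely for illustration and is not taken from any real driving system.

```python
# Sketch: the older "hand-coded rules" style of AI. Every eventuality a human
# could think of gets its own rule; the thresholds here are invented examples.
def brake_decision(distance_m: float, closing_speed_mps: float, wet_road: bool) -> str:
    if distance_m < 5:
        return "emergency_brake"
    if closing_speed_mps > 10 and distance_m < 30:
        return "brake"
    if wet_road and distance_m < 50:
        return "slow_down"
    return "maintain_speed"

print(brake_decision(distance_m=25, closing_speed_mps=12, wet_road=False))  # "brake"
```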

Now there is a new paradigm. It has always been there, but it has basically become the dominant mainstream in AI. The new paradigm, I would say, is: “Well, why are we actually trying to hand-code all of these things by hand, when basically you can only do this to the level of what the human imagination is able to come up with, right?”

So think about detecting, let’s say, whether somebody is suffering from Alzheimer’s from a brain MRI. Well, you can look at the size of the hippocampus, and it’s known that that organ shrinks if you are starting to suffer from the memory issues that are correlated with Alzheimer’s. A human can think about that and put it in as a rule, but it turns out that there are many, many more far more subtle patterns in that MRI scan, and if you sum all of those up, you can get a much better prediction.

But humans wouldn’t even be able to see those subtle patterns, because it’s something like: if this brain region and this brain region and this brain region, but not that brain region, show this particular pattern, then that’s a little bit of evidence in favor of Alzheimer’s, and there are hundreds and hundreds of those things. Humans lack the imagination, or the capacity, to come up with all of these rules. And we basically discovered that you can just provide a large data set and let the machine itself figure out what these rules are, instead of trying to hand-code them in. This is the big change with deep learning, for instance in computer vision and speech recognition.
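
By contrast, the learning-from-data approach hands the pattern-finding to a model. Here is a minimal sketch with scikit-learn, using synthetic data as a stand-in for the many subtle per-region measurements; the feature count and labels are invented for illustration.

```python
# Sketch: instead of hand-coding rules, let a model find the combination of
# many subtle features. The data here is synthetic and purely illustrative.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 50))          # 50 hypothetical brain-region measurements
true_w = rng.normal(size=50)             # unknown combination the model must discover
y = (X @ true_w + rng.normal(scale=0.5, size=1000) > 0).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("held-out accuracy:", clf.score(X_te, y_te))
```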

Let’s first take computer vision. People had many hand-coded features that they would try to identify in the image, and then from there they would make predictions about, say, whether there was a person in the image or something like that. But then we basically said, “Well, let’s just throw all the raw pixels at a neural net (a convolutional neural net, in this case) and let the neural net figure out what the right features are, let it learn which features to attend to when it needs to do a certain task.” And it works a lot better, again because there are many very subtle patterns that it now learns to look at which humans simply didn’t think to look at.
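
For readers who want to see what “throwing raw pixels at a convolutional neural net” looks like in code, here is a minimal sketch in PyTorch; the layer sizes and input resolution are illustrative choices, not a reference architecture.

```python
# Sketch: a small convolutional net that takes raw pixels and learns its own features.
# Layer sizes are illustrative only.
import torch
import torch.nn as nn

class TinyConvNet(nn.Module):
    def __init__(self, num_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(32 * 8 * 8, num_classes)  # assumes 32x32 input

    def forward(self, x):
        x = self.features(x)               # learned features, no hand-coded detectors
        return self.classifier(x.flatten(1))

logits = TinyConvNet()(torch.randn(1, 3, 32, 32))  # raw pixels in, class scores out
```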

Another example is maybe AlphaGo. In AlphaGo something similar happened. Humans had analyzed the game and come up with all sorts of rules of thumb for how to play it. But AlphaGo figured out things that humans can’t comprehend because they’re too complex, and still those things made the algorithm win the game.

So I would say it’s a new paradigm that goes well beyond trying to hand-code human-invented features into a system, and therefore it’s a lot more powerful. And in fact this is of course also the way humans work. I don’t see a real limit to this, right? If you pump more data through it, in principle you can learn a lot of things, basically everything you need to learn in order to become intelligent.

Listen to this one-hour episode or read the full transcript at www.VoicesinAI.com


Byron explores issues around artificial intelligence and conscious computers in his new book The Fourth Age: Smart Robots, Conscious Computers, and the Future of Humanity.

DEMOCRATIZING DATA MANAGEMENT

Lately, I’ve written a lot about data management for unstructured data and, more generally, about the relationship between data management and secondary storage. While recently attending Storage Field Day 18, I received confirmation of what is coming in the near future for data management technologies. Simplification and democratization of data management will be key for end-user adoption and success.

DATA MANAGEMENT, A SORT OF BUZZWORD

Unfortunately, the term ‘data management’ is becoming a buzzword among vendors, especially data protection vendors. Some backup vendors are replacing ‘data protection’ with ‘data management’ when describing their services in marketing material. Although it’s becoming their main message, when you ask them to elaborate on the data management aspects, they struggle to articulate how their services actually align with the terminology.

Yes, there are exceptions. Some vendors have very clear roadmaps. But, as often happens in this industry, it seems many vendors are counting their chickens before they hatch.

WHAT DOES “DEMOCRATIZING DATA MANAGEMENT” MEAN?

Although most vendors are still refining their services and messaging, I saw a few exceptions at SFD18: providers with very clear roadmaps.

One exception is Cohesity. I have long been confident they’re heading in the right direction with their strategy and product, and I have noted the promise of the Cohesity Analytics Workbench, a tool with great, but so far only theoretical, potential. In a new step forward announced at SFD18, Cohesity’s system can now run full-fledged applications, and the Analytics Workbench will soon become a thing of the past.

As I said, the Analytics Workbench was a great idea, but the name of this tool tells the real story. ‘Workbench’ means a lot of work, and this is why, even if it’s exciting, it can’t be broadly adopted. It is powerful, being based on Hadoop, but you need to know how to use it and how to write applications. I’m sure it has been used, but the reality is that for the traditional enterprise it is only cool on paper. And finding somebody who can write and maintain a big data application is not at all easy, especially if there is no direct business return from it.

Standard apps that run on Cohesity’s platform are a totally different thing: easy to use, deployed and managed transparently by the platform itself and, above all, ready to prove their value in a matter of minutes. The app catalog, or marketplace, doesn’t have many solutions yet, but some of them come from Cohesity partners such as Splunk and Imanis Data.

Without going into the technical details (videos of the sessions are also available on YouTube), Cohesity demonstrated how quickly an app can be deployed and used, all without being a data scientist or a developer. You just take a snapshot, run the app against a copy of your data, get the result, and act accordingly. And it can be automated! Think about virus scanning, ransomware protection, log analytics, or advanced DB management (and I’m not using my imagination here, because these are already available!).
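
To give a flavor of what such an app might do, here is a minimal sketch of a ransomware-style check run against a mounted, read-only snapshot copy; the mount path and the list of suspicious extensions are invented for illustration and are not part of any vendor’s product.

```python
# Sketch: the sort of check a marketplace app might run against a mounted
# snapshot copy. The mount path and extension list are illustrative assumptions.
import os

SUSPICIOUS_EXTENSIONS = {".locked", ".encrypted", ".crypt"}  # crude ransomware signal

def scan_snapshot(mount_path: str) -> list[str]:
    hits = []
    for root, _dirs, files in os.walk(mount_path):
        for name in files:
            if os.path.splitext(name)[1].lower() in SUSPICIOUS_EXTENSIONS:
                hits.append(os.path.join(root, name))
    return hits

# Example: run against a read-only copy, never the production data itself.
print(scan_snapshot("/mnt/snapshots/latest"))
```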

Now the challenge for Cohesity is to involve more partners and to build a solid ecosystem. I recommended they release all the components to the open source community and try to standardize these apps for all vendors, making the catalog really big… but I know this is pure wishful thinking at the moment.

The idea of giving the average user this kind of power is amazing, and NetApp is on the same wavelength, showing a pretty exciting potential roadmap. At SFD18 they presented an interesting solution which allows using standard replication tools (SnapMirror, for those familiar with NetApp’s portfolio) to make copies of data in the cloud. The product is still very immature, but the potential is huge. In fact, alongside the standard use case of remote replication (without needing a second NetApp appliance in this case), there are other possibilities ready to be exploited and leveraged to augment the value of the data stored in these systems (and in the cloud).

In short, if you don’t want to watch the video embedded above: they provide a management tool (a GUI, to simplify a bit) that runs in your Amazon account and can use SnapMirror to copy data directly to AWS and convert it into objects stored in an S3 bucket. Files and metadata are accessible and searchable, but this is only the first step. During the session, they demonstrated a custom application that can access a copy of that data and operate on it, and any enabled user on that platform could do the same. More or less, we are talking about a standard S3 bucket, available on Amazon, that you can use as a data set for any application. Unfortunately, as was the case for Cohesity with its Analytics Workbench, only pre-packaged, easy-to-use applications will unleash the full potential of this solution when it comes to day-to-day data management.
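
To illustrate how ordinary that resulting bucket is, here is a minimal sketch that enumerates the replicated objects with boto3; the bucket name and prefix are placeholders, not real resources.

```python
# Sketch: once the replicated data lands in a plain S3 bucket, any application
# can enumerate and inspect it. Bucket name and prefix are placeholders.
import boto3

s3 = boto3.client("s3")

def list_replicated_objects(bucket: str, prefix: str = ""):
    paginator = s3.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        for obj in page.get("Contents", []):
            yield obj["Key"], obj["Size"], obj["LastModified"]

for key, size, modified in list_replicated_objects("example-snapmirror-bucket"):
    print(key, size, modified)
```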

WHAT IS THE BENEFIT?

Making data easily accessible and re-usable by a large number of individuals in your organization is the real benefit here. They could run different applications, each one of them for different reasons, and get insights needed to improve productivity, security, privacy and so on.

At the end of the day, we’re talking about a sort of revolution here. We are not there yet, of course, but we are finally seeing how data can be effectively reused without having a Ph.D. in computer science, or being proficient in MapReduce, Java, or any other programming language!

Yes, as I said, we’re not there yet, but neither are we very far from achieving this goal, and I’m sure that Cohesity’s and NetApp’s initiatives will soon be followed by others.

CLOSING THE CIRCLE

Cohesity and NetApp are executing their respective strategies superbly. On one side, you have NetApp becoming more and more cloud-ish and more data- than storage-centric while, on the other you have Cohesity pushing aggressively on its secondary storage vision with products and solutions that are absolutely spot on.

In the coming weeks, I’ll be spending some time analyzing what went on during their sessions, as well as other moments I’ve spent with them recently. I’ll be sharing my thoughts with you… so stay tuned!

Originally posted on Juku.it