Software Engineering Daily

softwareengineeringdaily.com/archives
Technical interviews about software topics.


Security Monitoring with Marc Tremsal
Jul 31 • 51 min
Logs are the source of truth. If a company is sufficiently instrumented, the logging data that streams off of the internal infrastructure can be refined to tell a comprehensive story of what is changing across that infrastructure in real time. This…
DEV and Forem with Ben Halpern
Jul 30 • 59 min
Dev.to has become one of the most popular places for developers to write about engineering, programming languages, and everyday life. For those who have not seen it, DEV is like a cross between Twitter and Medium, but targeted at developers. The content…
Drug Simulations with Bryan Vicknair and Jason Walsh
Jul 29 • 58 min
Drug trials can lead to new therapeutics and preventative medications being discovered and placed on the market. Unfortunately, these drug trials typically require animal testing. This means animals are killed or harmed as a result of needing to verify…
Access Control Management with Fouad Matin and Dan Gillespie
Jul 28 • 54 min
Across a company, there is a wide range of resources that employees need access to: documents, S3 buckets, Git repositories, and many others. As access to resources changes across the organization, a history of the changes to permissions can be useful for…
Acquired Podcasting with David Rosenthal and Ben Gilbert
Jul 27 • 57 min
Acquisitions are part of the technology industry. A successful corporation will often have an “exit”, either going public or becoming acquired. And with each of these corporations, there is a set of stories that narrate the company from beginning to end.…
Ray Applications with Richard Liaw
Jul 24 • 54 min
Ray is a general purpose distributed computing framework. At a low level, Ray provides fault-tolerant primitives that support applications running across multiple processors. At a higher level, Ray supports scalable reinforcement learning, including the…
Modin: Pandas Scalability with Devin Petersohn
Jul 23 • 58 min
Pandas is a Python data analysis library, and an essential tool in data science. Pandas allows users to load large quantities of data into a data structure called a dataframe, over which the user can call mathematical operations. When the data fits…
Sourcegraph: Code Search and Intelligence with Beyang Liu
Jul 22 • 59 min
A large codebase cannot be searched with naive indexing algorithms. To search through a codebase the size of Uber’s, it is necessary to build a much more sophisticated indexing system than pure text search. Sourcegraph is a system for…
Digital Experience Analytics with Michael Morrissey
Jul 21 • 57 min
Users do not use web applications in the way that you might expect. And it is not easy to get the data that is necessary to get a full picture. But a newer API within browsers does make this more possible by capturing DOM mutations. The change capture of…
Cortex: Microservices Management with Anish Dhar and Ganesh Datta
Jul 20 • 60 min
Managing microservices becomes a challenge as the number of services within the organization grows. With that many services come more interdependencies: downstream and upstream services that may be impacted by an update to your service. One solution to…
ADP Engineering with Tim Halbur
Jul 17 • 55 min
ADP has been around for more than 70 years, fulfilling payroll and other human resources services. Payroll processing is a complex business, involving the movement of money in accordance with regulatory and legal strictures. From an engineering point of…
Capital Allocation with Blair Silverberg and Chris Olivares
Jul 16 • 58 min
Software companies can be funded in a variety of ways: venture capital, self-funding, and debt, among others. In order to receive financing, a company is evaluated on its ability to generate future cash flows. After all, a valuation is a number that…
GitHub Mobile with Brian Lovin and Ryan Nystrom
Jul 15 • 51 min
GitHub has been a social network for developers for many years. Most social networks are centered around mobile applications, but GitHub sits squarely in a developer’s browser-based desktop workflow. As a result, the design of a mobile app for GitHub is…
Multimesh with Luke Kysow
Jul 14 • 53 min
A service mesh provides routing, load balancing, policy management, and other features to a set of services that need to communicate with each other. The mesh can simplify operations across these different services by providing an interface to configure…
Metaflow: Netflix Machine Learning Platform with Savin Goyal
Jul 13 • 56 min
Netflix runs all of its infrastructure on Amazon Web Services. This includes business logic, data infrastructure, and machine learning. By tightly coupling itself to AWS, Netflix has been able to move faster and have strong defaults about engineering…
Strapi: Headless CMS with Pierre Burgy
Jul 10 • 49 min
WordPress has been a dominant force in the world of online publishing for many years because of how battle-tested it is. WordPress is the definitive leader in CMS technology. But there have always been alternatives. Drupal, Ghost, and other open source…
Chronosphere: Scalable Metrics Database with Rob Skillington
Jul 9 • 41 min
M3 is a scalable metrics database originally built to host Uber’s rapidly growing data storage from Prometheus. When Rob Skillington was at Uber, he helped design, implement, and deploy M3. Since leaving Uber, he has co-founded a company around a hosted…
Determined AI: Machine Learning Ops with Neil Conway
Jul 8 • 48 min
Developing machine learning models is not easy. From the perspective of the machine learning researcher, there is the iterative process of tuning hyperparameters and selecting relevant features. From the perspective of the operations engineer, there is a…
The Good Parts of AWS with Daniel Vassallo
Jul 7 • 64 min
AWS has over 150 different services: databases, log management, edge computing, and many others. Instead of being overwhelmed by all of these products, an engineering team can simplify their workflow by focusing on a small subset of AWS services–the…
Pull Request Environments with Eric Silverman
Jul 6 • 47 min
The modern release workflow involves multiple stakeholders: engineers, management, designers, and product managers. It is a collaborative process that is often held together with brittle workflows. A developer deploys a new build to an ad hoc staging…
Deepgram: End-to-End Speech Recognition with Scott Stephenson
Jul 3 • 53 min
Deepgram is an end-to-end deep learning platform for speech recognition. Unlike the general purpose APIs from Google or Amazon, Deepgram models are custom-trained for each customer. Whether the customer is a call center, a podcasting company, or a sales…
DynamoDB with Alex DeBrie
Jul 2 • 61 min
DynamoDB is a managed NoSQL database service from AWS. It is widely used as a transactional database to fulfill key-value and wide-column data models. In a previous show with Rick Houlihan, we explored how to build a data model and optimize the query…
Snowplow Analytics: Data Collection Platform with Alex Dean
Jul 1 • 57 min
As a user browses a webpage, that browser session generates events that need to be recorded, validated, enriched, and stored. This data is sometimes called customer data infrastructure, or CDI. This data requires a full stack of different tools: a system…
Postman: API Development with Abhinav Asthana
Jun 30 • 55 min
A software company manages and interacts with hundreds of APIs. These APIs require testing, performance analysis, authorization management, and release management. In a word, APIs require collaboration. Postman is a system for API collaboration. It allows…
Cresta: Speech ML for Calls with Zayd Enam
Jun 29 • 59 min
At a customer service center, thousands of hours of audio are generated. This audio provides a wealth of information to transcribe and analyze. With the additional data of the most successful customer service representatives, machine learning models can…
React Native Ecosystem with Nader Dabit (Summer Break Repeat)
Jun 26 • 51 min
Originally published July 6, 2017. We are taking a few weeks off. We’ll be back soon with new episodes. React Native allows developers to reuse components from one user interface on multiple platforms. React Native was introduced by Facebook to reduce the…
Traces: Video Recognition with Veronica Yurchuk and Kostyantyn Shysh (Summer Break Repeat)
Jun 25 • 60 min
Originally published October 8, 2019. We are taking a few weeks off. We’ll be back soon with new episodes. Video surveillance impacts human lives every day. On most days, we do not feel the impact of video surveillance. But the effects of video…
Envoy Mobile with Matt Klein (Summer Break Repeat)
Jun 24 • 54 min
Originally published July 25, 2019. We are taking a few weeks off. We’ll be back soon with new episodes. Envoy is an open source edge and service proxy that was originally developed at Lyft. Envoy is often deployed as a sidecar application that runs…
Data Intensive Applications with Martin Kleppmann (Summer Break Repeat)
Jun 23 • 64 min
Originally published May 2, 2017. We are taking a few weeks off. We’ll be back soon with new episodes. A new programmer learns to build applications using data structures like a queue, a cache, or a database. Modern cloud applications are built using more…
freeCodeCamp with Quincy Larson (Summer Break Repeat)
Jun 22 • 133 min
Originally published December 20, 2019. We are taking a few weeks off. We’ll be back soon with new episodes. freeCodeCamp was started five years ago with the goal of providing free coding education to anyone on the Internet. freeCodeCamp has become the…
Facebook Open Source with Tom Occhino (Summer Break Repeat)
Jun 19 • 62 min
Originally published April 14, 2017. We are taking a few weeks off. We’ll be back soon with new episodes. Facebook’s open source projects include React, GraphQL, and Cassandra. These projects are key pieces of infrastructure used by thousands of…
Redis with Alvin Richards (Summer Break Repeat)
Jun 18 • 53 min
Originally published October 24, 2019. We are taking a few weeks off. We’ll be back soon with new episodes. Redis is an in-memory database that persists to disk. Redis is commonly used as an object cache for web applications. Applications are composed of…
HTTP with Julia Evans (Summer Break Repeat)
Jun 17 • 59 min
Originally published November 21, 2019. We are taking a few weeks off. We’ll be back soon with new episodes. HTTP is a protocol that allows browsers and web applications to communicate across the Internet. Everyone knows that HTTP is doing some important…
Stripe Machine Learning Infrastructure with Rob Story and Kelley Rivoire (Summer Break Repeat)
Jun 16 • 63 min
Originally published June 13, 2019. We are taking a few weeks off. We’ll be back soon with new episodes. Machine learning allows software to improve as that software consumes more data. Machine learning is a tool that every software engineer wants to be…
Architects of Intelligence with Martin Ford (Summer Break Repeat)
Jun 15 • 64 min
Originally published January 31, 2019. We are taking a few weeks off. We’ll be back soon with new episodes. Artificial intelligence is reshaping every aspect of our lives, from transportation to agriculture to dating. Someday, we may even create a…
Cruise Simulation with Tom Boyd
Jun 12 • 58 min
Cruise is an autonomous car company with a development cycle that is highly dependent on testing its cars–both in the wild and in simulation. The testing cycle typically requires cars to drive around gathering data, and that data to subsequently be…
Grafana with Torkel Ödegaard
Jun 11 • 51 min
Grafana is an open source visualization and monitoring tool that is used for creating dashboards and charting time series data. Grafana is used by thousands of companies to monitor their infrastructure. It is a popular component in monitoring stacks, and…
Apache Airflow with Maxime Beauchemin, Vikram Koka, and Ash Berlin-Taylor
Jun 10 • 64 min
Apache Airflow was released in 2015, introducing the first popular open source solution to data pipeline orchestration. Since that time, Airflow has been widely adopted for dependency-based data workflows. A developer might orchestrate a pipeline with…
Human in the Loop Data Analytics with Aditya Parameswaran
Jun 9 • 48 min
The life cycle of data management includes data cleaning, extraction, integration, analysis and exploration, and machine learning models. It would be great if all of this data management could be handled with automation, but unfortunately that is not an…
Tilt: Kubernetes Tooling with Dan Bentley
Jun 8 • 52 min
Kubernetes continues to mature as a platform for infrastructure management. At this point, many companies have well-developed workflows and deployment patterns for working with applications built on Kubernetes. The complexity of some of these deployments…
Uber’s Data Visualization Tools with Ib Green
Jun 5 • 48 min
Uber needs to visualize data on a range of different surfaces. A smartphone user sees cars moving around on a map as they wait for their ride to arrive. Data scientists and operations researchers within Uber study the renderings of traffic moving…
Prisma: Modern Database Tooling with Johannes Schickling
Jun 4 • 50 min
A frontend developer querying a backend server typically issues that query through an ORM or a raw database query. Prisma is an alternative to both of these data access patterns, allowing for easier database access…
Tecton: Machine Learning Platform from Uber with Kevin Stumpf
Jun 3 • 55 min
Machine learning workflows have had a problem for a long time: taking a model from the prototyping step and putting it into production is not an easy task. A data scientist who is developing a model is often working with different tools, or a smaller data…
HoloClean: Data Quality Management with Theodoros Rekatsinas
Jun 2 • 60 min
Many data sources produce new data points at a very high rate. With so much data, the issue of data quality emerges. Low quality data can degrade the accuracy of machine learning models that are built around those data sources. Ideally, we would have…
Disaggregated Servers with Yiying Zhang
Jun 1 • 57 min
Server infrastructure traditionally consists of monolithic servers containing all of the necessary hardware to run a computer. These different hardware components are located next to each other, and do not need to communicate over a network boundary to…
Kubernetes vs. Serverless with Matt Ward
May 29 • 50 min
Kubernetes has become a highly usable platform for deploying and managing distributed systems. The user experience for Kubernetes is great, but is still not as simple as a full-on serverless implementation–at least, that has been a long-held assumption.…
Distributed Systems Research with Peter Alvaro
May 28 • 54 min
Every software company is a distributed system, and distributed systems fail in unexpected ways. This ever-present tendency for systems to fail has led to the rise of failure testing, otherwise known as chaos engineering. Chaos engineering involves the…
Brex Engineering with Cosmin Nicolaescu
May 27 • 52 min
Brex is a credit card company that provides credit to startups, mostly companies which have raised money. Brex processes millions of transactions, and uses the data from those transactions to assess creditworthiness, prevent fraud, and surface insights…
Edge Machine Learning with Zach Shelby
May 26 • 63 min
Devices on the edge are becoming more useful with improvements in the machine learning ecosystem. TensorFlow Lite allows machine learning models to run on microcontrollers and other devices with only kilobytes of memory. Microcontrollers are very…
Software Daily
May 23 • 8 min
For the last five months, we have been working on a new version of Software Daily, the platform we built to host and present our content. We are creating a platform that integrates the podcast with a set of other features that make it easier to learn from…
RedwoodJS with Tom Preston-Werner
May 22 • 64 min
Over the last 5 years, web development has matured considerably. React has become a standard for frontend component development. GraphQL has seen massive growth in adoption as a data fetching middleware layer. The hosting platforms have expanded beyond…
ArcGIS: Geographic Information Software with Max Payson
May 21 • 56 min
Geospatial analytics tools are used to render visualizations for a vast array of applications. Data sources such as satellites and cellular data can gather location data, and that data can be superimposed over a map. A map-based visualization can allow…
RudderStack: Open Source Customer Data Infrastructure with Soumyadeb Mitra
May 20 • 47 min
Customer data infrastructure is a type of tool for saving analytics and information about your customers. The company that is best known in this category is Segment, a very popular API company. This customer data is used for making all kinds of decisions…
Matterport 3-D Imaging with Japjit Tulsi
May 19 • 49 min
Matterport is a company that builds 3-D imaging for the inside of buildings, construction sites, and other locations that require a “digital twin.” Generating digital images of the insides of buildings has a broad spectrum of applications, and there are…
Frontend Performance with Anycart’s Rafael Sanches
May 18 • 51 min
There are many bad recipe websites. Every time I navigate to a recipe website, it feels like my browser is filling up with spyware. The page loads slowly, everything seems broken, I can feel the 25 different JavaScript adtech tags interrupting each…
AWS Virtualization with Anthony Liguori
May 15 • 59 min
Amazon’s virtual server instances have come a long way since the early days of EC2. There is now a wide variety of configuration options for spinning up an EC2 instance, which can be chosen based on the workload that will be scheduled onto…
International Consumer Credit Infrastructure with Brian Regan and Misha Esipov
May 14 • 45 min
A credit score is a rating that allows someone to qualify for a line of credit, which could be a loan such as a mortgage, or a credit card. We are assigned a credit score based on a credit history, which could be related to work history, rental payments,…
Grapl: Graph-Based Detection and Response with Colin O’Brien
May 13 • 62 min
A large software company such as Dropbox is at a constant risk of security breaches. These security breaches can take the form of social engineering attacks, network breaches, and other malicious adversarial behavior. This behavior can be surfaced by…
Static Analysis for Infrastructure with Guy Eisenkot
May 12 • 60 min
Infrastructure-as-code tools are used to define the architecture of software systems. Common infrastructure-as-code tools include Terraform and AWS CloudFormation. When infrastructure is defined as code, we can use static analysis tools to analyze that…
Social Distancing Data with Ryan Fox Squire
May 11 • 50 min
Social distancing has been imposed across the United States. We are running an experiment unlike anything before it in history, and it is likely to have a lasting impact on human behavior. By looking at location data of how people are moving around today,…
Dropbox Engineering with Andrew Fong
May 8 • 54 min
Dropbox is a consumer storage product with petabytes of data. Dropbox was originally started on the cloud, backed by S3. Once there was a high enough volume of data, Dropbox created its own data centers, designing hardware for the express purpose of…
Pravega: Storage for Streams with Flavio Junqueira
May 7 • 53 min
“Data stream” is a term that can be used in multiple ways. A stream can refer to data in motion or data at rest. When a stream is data in motion, an endpoint is receiving new pieces of data on a continual basis. Each new data point is sent over the wire…
Advanced Redis with Alvin Richards
May 6 • 53 min
Redis is an in-memory object storage system that is commonly used as a cache for web applications. This core primitive of in-memory object storage has created a larger ecosystem encompassing a broad set of tools. Redis is also used for creating objects…
Multicloud MySQL with Jiten Vaidya and Anthony Yeh
May 5 • 52 min
For many applications, a transactional MySQL database is the source of truth. To make a MySQL database scale, some developers deploy their database using Vitess, a sharding system built on top of Kubernetes. Jiten Vaidya and Anthony Yeh work at…
Isolation with Courtland Allen and Anurag Goel
May 4 • 57 min
We are all living in social isolation due to the quarantine from COVID-19. Isolation is changing our habits and our moods, ravaging the economy, and changing how we work. One positive change is that more people have been reconnecting with their friends…
Data Lakehouse with Michael Armbrust
May 1 • 59 min
A data warehouse is a system for performing fast queries on large amounts of data. A data lake is a system for storing high volumes of data in a format that is slow to access. A typical workflow for a data engineer is to pull data sets from this slow data…
JAMStack Content Management with Scott Gallant, Jordan Patterson, and Nolan Phillips
Apr 30 • 55 min
A content management system (CMS) defines how the content on a website is arranged and presented. The most widely used CMS is WordPress, the open source tool that is written in PHP. A large percentage of the web consists of WordPress sites, and WordPress…
Prefect Dataflow Scheduler with Jeremiah Lowin
Apr 29 • 64 min
A data workflow scheduler is a tool used for connecting multiple systems together in order to build pipelines for processing data. A data pipeline might include a Hadoop task for ETL, a Spark task for stream processing, and a TensorFlow task to train a…
CockroachDB with Peter Mattis
Apr 28 • 56 min
A relational database often holds critical operational data for a company, including user names and financial information. Since this data is so important, a relational database must be architected to avoid data loss. Relational databases need to be a…
Dask: Scalable Python with Matthew Rocklin
Apr 27 • 61 min
Python is the most widely used language for data science, and there are several libraries that are commonly used by Python data scientists including Numpy, Pandas, and scikit-learn. These libraries improve the user experience of a Python data scientist by…
Rasa: Conversational AI with Tom Bocklisch
Apr 24 • 59 min
Chatbots became widely popular around 2016 with the growth of chat platforms like Slack and voice interfaces such as Amazon Alexa. As chatbots came into use, so did the infrastructure that enabled chatbots. NLP APIs and complete chatbot frameworks came…
Cloudburst: Stateful Functions-as-a-Service with Vikram Sreekanti
Apr 23 • 59 min
Serverless computing is a way of designing applications that do not directly address or deploy application code to servers. Serverless applications are composed of stateless functions-as-a-service and stateful data storage systems such as Redis or…
NGINX API Management with Kevin Jones
Apr 22 • 53 min
NGINX is a web server that can be used to manage the APIs across an organization. Managing these APIs involves deciding on the routing and load balancing across the servers which host them. If the traffic of a website suddenly spikes, the website needs to…
Frontend Monitoring with Matt Arbesfeld
Apr 21 • 57 min
Web development has historically had more work being done on the server than on the client. The observability tooling has reflected this emphasis on the backend. Monitoring tools for log management and backend metrics have existed for decades, helping…
Zoom Vulnerabilities with Patrick Wardle
Apr 20 • 60 min
Zoom video chat has become an indispensable part of our lives. In a crowded market of video conferencing apps, Zoom managed to build a product that performs better than the competition, scaling with high quality to hundreds of meeting participants, and…
Facebook OpenStreetMap Engineering with Saurav Mohapatra and Jacob Wasserman
Apr 17 • 61 min
Facebook applications use maps for showing users where to go. These maps can display businesses, roads, and event locations. Understanding the geographical world is also important for performing search queries that take into account a user’s location. For…
NGINX Service Mesh with Alan Murphy
Apr 16 • 59 min
NGINX is a web server that is used as a load balancer, an API gateway, and a reverse proxy, among other purposes. Core application servers such as Ruby on Rails are often supported by NGINX, which handles routing the user requests between the different…
Shopify React Native with Farhan Thawar
Apr 15 • 56 min
Shopify is a platform for selling products and building a business. It is a large e-commerce company with hundreds of engineers and several different mobile apps. Shopify’s engineering culture is willing to adopt new technologies aggressively, trying new…
Ceph Storage System with Sage Weil
Apr 14 • 54 min
Ceph is a storage system that can be used for provisioning object storage, block storage, and file storage. These storage primitives can be used as the underlying medium for databases, queueing systems, and bucket storage. Ceph is used in circumstances…
Collaborative SQL with Rahil Sondhi
Apr 13 • 48 min
Data analysts need to collaborate with each other in the same way that software engineers do. They also need a high quality development environment. These data analysts are not working with programming languages like Java and Python, so they are not using…
Reserved Instances with Aran Khanna
Apr 10 • 59 min
When a developer spins up a virtual machine on AWS, that virtual machine could be purchased using one of several types of cost structures. These cost structures include on-demand instances, spot instances, and reserved instances. On-demand instances are…
Snorkel: Training Dataset Management with Braden Hancock
Apr 9 • 58 min
Machine learning models require the use of training data, and that data needs to be labeled. Today, we have high quality data infrastructure tools such as TensorFlow, but we don’t have large high quality data sets. For many applications, the state of the…
Cadence: Uber’s Workflow Engine with Maxim Fateev
Apr 8 • 56 min
A workflow is an application that involves more than just a simple request/response communication. For example, consider a session of a user taking a ride in an Uber. The user initiates the ride, and the ride might last for an hour. At the end of the…
kSQLDB: Kafka Streaming Interface with Michael Drogalis
Apr 7 • 48 min
Kafka is a distributed stream processing system that is commonly used for storing large volumes of append-only event data. Kafka has been open source for almost a decade, and as the project has matured, it has been used for new kinds of applications.…
Godot Game Engine with Juan Linietsky
Apr 6 • 54 min
Building a game is not easy. The development team needs to figure out a unique design and gameplay mechanics that will attract players. There is a great deal of creative work that goes into making a game successful, and these games are often built with…
V8 Lite with Ross McIlroy
Apr 3 • 55 min
V8 is the JavaScript engine that runs Chrome. Every popular website makes heavy use of JavaScript, and V8 manages the execution environment of that code. The code that processes in your browser can run faster or slower depending on how “hot” the codepath…
Serverless Development with Jeremy Daly
Apr 2 • 59 min
Serverless tools have come a long way since the release of AWS Lambda in 2014. Serverless apps were originally architected around Lambda, with the functions-as-a-service being used to glue together larger pieces of functionality and API services. Today,…
Audio Data Engineering with Allison King
Apr 1 • 53 min
Cortico is a non-profit that builds audio tools to improve public dialogue. Allison King is an engineer at Cortico, and she joins the show to talk about the process of building audio applications. One of these applications was a system for ingesting radio…
Facebook Messenger Engineering with Mohsen Agsen
Mar 31 • 61 min
Facebook Messenger is a chat application that millions of people use every day to talk to each other. Over time, Messenger has grown to include group chats, video chats, animations, facial filters, stories, and many more features. Messenger is a tool for…
Pika Dependency Management with Fred Schott
Mar 30 • 60 min
Modern web development involves a complicated toolchain for managing dependencies. One part of this toolchain is the bundler, a tool that puts all your code and dependencies together into static asset files. The most popular bundler is webpack, which was…
Cloud Kitchen Platform with Ashley Colpaart
Mar 27 • 50 min
Food delivery apps have changed how the restaurant world operates. After seven years of mobile food delivery, the volume of food ordered through these apps has become so large that entire restaurants can be sustained solely through the order flow that…
Remote Team Management with Ryan Chartrand
Mar 26 • 58 min
Remote engineering work makes some elements of software development harder, and some elements easier. With Slack and email, communication becomes more clear cut. Project management tools lay out the responsibilities and deliverables of each person. GitHub…
Sorbet: Typed Ruby with Dmitry Petrashko
Mar 25 • 56 min
Programming languages are dynamically typed or statically typed. In a dynamically typed language, the programmer does not need to declare if a variable is an integer, string, or other type. In a statically typed language, the developer must declare the…
Datomic Architecture with Marshall Thompson
Mar 24 • 63 min
Datomic is a database system based on an append-only record keeping system. Datomic users can query the complete history of the database, and Datomic has ACID transactional support. The data within Datomic is stored in an underlying database system such…
Google Cloud Networking with Lakshmi Sharma
Mar 23 • 49 min
A large cloud provider has high volumes of network traffic moving through data centers throughout the world. These providers manage the infrastructure for thousands of companies, across racks and racks of multitenant servers, and cables that stretch…
ClickUp Engineering with Zeb Evans and Alex Yurkowski
Mar 20 • 65 min
Over the last fifteen years, there has been a massive increase in the number of new software tools. This is true at the infrastructure layer: there are more databases, more cloud providers, and more open-source projects. And it’s also true at a higher…
Pulumi: Infrastructure as Code with Joe Duffy
Mar 19 • 59 min
Infrastructure-as-code allows developers to use programming languages to define the architecture of their software deployments, including servers, load balancers, and databases. There have been several generations of infrastructure-as-code tools. Systems…
Infrastructure Investing with Vivek Saraswat
Mar 18 • 82 min
Software investing requires a deep understanding of the market, and an ability to predict what changes might occur in the near future. At the level of core infrastructure, software investing is particularly difficult. Databases, virtualization, and large…
Sisu Data with Peter Bailis
Mar 17 • 59 min
A high volume of data can contain a high volume of useful information. That fact is well understood by the software world. Unfortunately, it is not a simple process to surface useful information from this high volume of data. A human analyst needs to…
Location Data with Ryan Fox Squire
Mar 16 • 56 min
Physical places have a large amount of latent data. Pick any location on a map, and think about all of the questions you could ask about that location. What businesses are at that location? How many cars pass through it? What is the soil composition? How…