discussion-main 2021-10-06 | Devops Enterprise Summit Slack Archive

Laura Henry - American Airlines [she/her]14:10:23

@lbmkrishna Nice to see you!

🙏 1

kristin valters14:10:09

@jroa it appears Suncor's DevOps journey started with one business segment versus the whole technology org transforming simultaneously - is this accurate or was it big bang?

Love the quote - very empowering for the software engineers

Gene Kim, ITREV, Program Chair14:10:36

“BubbleStorm” — this sounds like one of those wonderful projects that torture people trying to preserve CAP theorem objectives, @cleng!!

🙏 1

Christof Leng (Google)14:10:49

Being tortured by CAP constraints is one of my favorite hobbies!

Gene Kim, ITREV, Program Chair14:10:50

“We now have 3000 SREs” — all reporting to VP 24/7 Engineering, Ben Treynor-Sloss.

😲 4

Joey Roa14:10:04

@nickeggleston We're still early in our maturity for DevOps. Our DevOps COE (small team initially), defined the capabilities that make up DevOps at Suncor and then prioritized which ones we felt we should emphasize. From there, we began the change and comms work to educate teams and leaders about the why . For the leaders, the messaging was a reinforcement of quality (repeatability via automation), resilience (being able to restore or deploy in a rapid fashion) and speed (doing 1000's of tests in minutes via tech). This would allow for shorter feedback loops with the customers to help the time to value equation. HTH. If not, hit me up and we can talk more.

Jeffrey Fredrick, Author-Agile Conversations14:10:09

the chemistry background for @jpetoff makes sense to me. a discipline that created the “things I won’t work with” series seems good background for SRE

Kurt A, Clari14:10:24

We will be publishing a podcast with @jpetoff later on this month in case you're interested

😊 1

👍 1

Jennifer Petoff15:10:40

@kboth_does can't wait to hear how the podcast turned out!

Jennifer Petoff14:10:26

BTW @genek did the talk start early? I dialed in at X;12 and we were already a few slides in...

Ann Perry - IT Revolution14:10:57

We did – running shockingly ahead this morning. so sorry!!

Gene Kim, ITREV, Program Chair14:10:07

I’m so sorry! Not sure — @annp would know. But all good, we’re all so happy that you’re here! (@annp, can you send the link to the complete video? 🙏 )

Jennifer Petoff14:10:49

All good. Just wanted to make sure I wasn't late accidentally 🙂

Jennifer Petoff14:10:36

Missed your intro :'-(

Gene Kim, ITREV, Program Chair14:10:23

The functional nature of SRE at Google is so interesting to me — and I’m so excited that @jpetoff and @cleng are sharing some of the “engagement models” between the product and SRE orgs, including the economics.

👏 2

🙏 1

Jennifer Petoff14:10:28

@jtf STEM subjects like chemistry are a great foundation for SRE: it's like applying the scientific method in a pressure cooker 🙂

❤️ 9

😆 1

👏 1

Jeffrey Fredrick, Author-Agile Conversations14:10:35

I completely agree! physics-chemistry double major myself. went from an interest in computational physics/chemistry into software.

Jennifer Petoff15:10:36

Very nice! I did synthetic chemistry myself so software is a bit of a departure, but I love it!

👍 1

Joey Roa14:10:34

@kristin.valters TBH, it's neither. We have targeted work going on across all the business units/areas. The messaging and vision has been communicated to everyone (org wide) with an emphasis on digital (some groups don't do anything tech so the message gets lost with them). Then, based on pull (or leader push, on occasion), we focus on a # of small teams in that area. Allows for tailoring of technologies used and business practices employed. Early in the agile transformation work I was leading, I quickly discovered you get the most "bang for the dollar" by going to the teams and areas that want you vs. selling to the groups that don't. The laggards will come along eventually. HTH

Gene Kim, ITREV, Program Chair14:10:40

“Both sides must agree to start the relationship; either side can end it.” (I love the way @jpetoff describes this.)

😊 2

Andrew Davis - AutoRABIT - DevSecOps for Salesforce14:10:55

“SRE is a scarce resource by design” - is that to ensure that Devs retain some level of ownership?

Jennifer Petoff14:10:52

Yes. and also to ensure that we aren't taken for granted and only working on the highest value added things.

Craig Cook - IBM14:10:57

Are "developers" on-call at google?

Jennifer Petoff14:10:52

Indeed. For SRE supported services, Devs may also share a portion of the oncall load and many teams that don't have SRE support handle their own oncall.

❤️ 2

Craig Cook - IBM14:10:40

Is the team that owns the service the first one that gets alerts for it, or are first level alerts sent to a different team?

Jennifer Petoff15:10:06

Yes, the team that owns the service typically gets paged. There is no general triage queue.

❤️ 1

Daniel Cahill - Engineer - Ontario Systems14:10:18

How does the decision to hand a project back to a product team go? Do the SREs collectively decide that or the product team or someone in management?

Gene Kim, ITREV, Program Chair14:10:35

“Work must be challenging to SRE teams. It must improve the reliability of systems thru engineering.” — @jpetoff

Bryan Finster - Defense Unicorns (Speaker)14:10:37

“We aren’t ops.” Yes!

🎯 1

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)14:10:40

Excellent description of how SRE works with ops!

👍 1

Nick Eggleston (free radical)14:10:59

What is the decision process around funding SREs for those who hold that budget line? @jpetoff

Jennifer Petoff14:10:38

Paging @cleng to weigh in on how this works in practice.

Christof Leng (Google)16:10:29

I posted my response in the main chat earlier, but reposting here to make it easier to find: The budget for SRE headcount comes from the Dev org. They "pay" SRE via HC, but once transferred, SRE is in complete control of the HC (until the engagement is ended by either side).

Amy Cheng - TELUS14:10:02

Love it, "Fix it and fix it once and for all!"

Angel Diaz14:10:05

Hello everyone!!

Gene Kim, ITREV, Program Chair14:10:23

So glad you’re here, and thank you for presenting!

Bryan Finster - Defense Unicorns (Speaker)14:10:14

If I’m not accountable for my quality decisions, quality is a fantasy.

👍 1

Michael Winslow14:10:33

@genek told me I was going to love this one! This is so amazing @jpetoff nd @cleng!a

🙏 2

BMK-SECTION6-TransformationArchitect14:10:52

SRE is not Ops Team - Love that; - @jpetoff

Ganga Narayanan14:10:57

"SRE is not an ops team"! 🙂 Yes

Joe Arrowood14:10:11

@bryan.finster486 Well said

❤️ 1

Jon Smart [Sooner Safer Happier]14:10:13

Is there anything that prevent the product teams from thinking, "I'll just leave that for SRE to fix, as I know that they are there to catch things for me, so I don't need to think about it as much as I would have done if SRE didn't exist"?

👀 2

Jennifer Petoff14:10:33

That sounds like a 'throw it over the fence' mindset to me which I'd consider an antipattern. @cleng anything to add?

Jon Smart [Sooner Safer Happier]14:10:23

Yes, definitely an antipattern. What behaviour do you see at Google? Is there anything intentional which prevents that antipattern?

Jon Smart [Sooner Safer Happier]16:10:10

I think there is an educational and communication component to it. SRE leadership and engineers on the ground communicating about the key SRE principles and best practices and how teams should work together for maximum impact. This is something that @cleng is actively working on. My team is also working to share reliability best practices more broadly. Reliability needs to be everyone's concern, not just SRE.

👍 1

Thanks @jpetoff

👍 1

Christof Leng (Google)16:10:52

There are a number of remedies to this challenge: • Write down who's responsible for what and who has authority over what. If SRE can block bad launches, Dev will try to work something out (However, the job of SRE is NOT to block, it's to advise!) • Work together. Have Devs work a little bit on infrastructure and ops and have SREs work a little bit on product - not too much, because it waters down the role specialization, but enough to maintain a mutual understanding. • Establish ultimate accountability for the product's reliability with the product team. SRE's job is to help them achieve that, but at the end of the day, they remain accountable. • If the situation spirals out of control, declare a "production freeze" (only stability fixes get deployed) and/or "code yellow" (reliability work trumps all other project work until the exit criteria are met). You need support from senior Dev leadership for either. If you can't get support from anyone in the reporting chain on the Dev side, you should find a better Dev org to work with.

➕ 1

👍 1

Jon Smart [Sooner Safer Happier]16:10:01

Thanks @cleng. "Establish ultimate accountability for the product's reliability with the product team. SRE's job is to help them achieve that, but at the end of the day, they remain accountable.", this resonates for me as to how to avoid the human tendency to think that it's someone else's problem.

Denver Martin, Dir DevSecOps, he/him14:10:21

The last 3 companies where I helped bring DevOps to life, I had to spear head getting Ops involved in the DevOps.. This can be challenging as most of these teams have been covered up in firefighting and are staffed to min staffing levels...

Gene Kim, ITREV, Program Chair14:10:22

What’s so remarkable of this talk is that everything is grounded in economics — deliberate surfacing that funding SREs is at expense of product devs; that SREs can’t be “bought” to do non-novel work, etc.

👍 2

Andy Nortrup - Director of PM at Tanium14:10:04

This sounds very much like when you bring a manufacturing engineering into a physical production process in order to address quality or throughput issues. My Dad did this at Pratt and Whitney to go in and help other teams or contractors figure out how to build a part correctly when they were having trouble. And they were an expensive resource to bring in.

👍 1

BMK-SECTION6-TransformationArchitect14:10:42

or for that matter - not Sys Admins renamed as "SRE" 🙂

🎉 1

Jennifer Velasquez14:10:48

Are you seeing the need to address the measurement or incentives for individuals in org? If so, how are you messaging that?

Christof Leng (Google)14:10:25

@nickeggleston The budget for SRE headcount comes from the Dev org. They "pay" SRE via HC, but once transferred, SRE is in complete control of the HC (until the engagement is ended by either side).

Denver Martin, Dir DevSecOps, he/him14:10:27

I see SRE as the breakout group that can be the champion in both Dev and Ops... they bridge the gap in many ways...

👍 2

BMK-SECTION6-TransformationArchitect14:10:47

@jpetoff - question, I guess the SRE comes into picture only for web scale systems at Google, correct? I see some refer about SRE for every IT Service/Systems. Just wanted to stand corrected.

Jennifer Petoff14:10:59

SRE support is typically limited to the most mission critical services. It's part of the cost benefit equation.

BMK-SECTION6-TransformationArchitect14:10:47

Love that "Mission Critical Services" - most IT systems owners think that all their services are "Critical" 🙂 in a typical Enterprise (the talk about availability and reliability) but not about the cost to achieve that

👍 1

Maria Luisa Polo14:10:00

@jpetoff what are the main responsabilities of the SRE Education Director, why this role is neccesary in your Organization?

👍 1

BMK-SECTION6-TransformationArchitect14:10:31

Thanks for asking! I lead the learning and development function for SRE. My team is responsible for onboarding, getting folks ready to go oncall and ongoing education opportunities. We also bring reliability-focused education to all of engineering. Reliability needs to be a priority for everyone, not just SRE. This may sound fluffy, but we also foster a strong oral tradition and passdown of the SRE organizational culture through storytelling in our classes.

👏 1

❤️ 1

"Throw it over the wall mentality" 🙂

Gene Kim, ITREV, Program Chair14:10:33

“highly customized infrastructure make it difficult for SREs”, especially in situations when SREs handle multiple services” ==> drives/encourages standardization.

Andrew Davis - AutoRABIT - DevSecOps for Salesforce14:10:36

“You can’t build a wall and then complain about a ‘throw it over the wall’ mentality”

❤️ 6

Jeffrey Fredrick, Author-Agile Conversations14:10:44

standardization again. relevant in truck maintenance and software environments. under appreciated I think.

💯 4

Jon Smart [Sooner Safer Happier]14:10:38

For the knowable, or meta level patterns in context, I would suggest 🙂 A headwind for the unknowable (treating the unknowable as if it's one size fits all)

➕ 2

Vaidik Kapoor (Speaker) - Technology Consultant14:10:49

What are some of the ways that are used at google to “teach how to fish”, especially when something is on fire?

👍 1

Christof Leng (Google)14:10:44

• Sharing ops work to some extent • Escalating during incidents, debugging togehter

🙏 1

Christof Leng (Google)14:10:22

• reviewing postmortems together • co-design sessions • ops/architecture training sessions for Dev

Denver Martin, Dir DevSecOps, he/him14:10:54

We have the SRE team lead the BPM (Blameless Post Mortems) they are in a great place to look at issues deeper and can then help those involved get to the root cause easier and see the action going as far as possible with in Development or in better Operations.. I know I am not at Google but thought I would chime in... hope that is okay.

❤️ 2

Jennifer Petoff14:10:00

Thanks for chiming in @mr.denver.martin always interested in how others do it.

Christof Leng (Google)14:10:43

@mr.denver.martin That's actually very similar to how many Google teams organize postmortem reviews.

Denver Martin, Dir DevSecOps, he/him14:10:04

I can also see SRE building the tools and process for Fishing, not just teaching other to Fish.. but they are not Fisher People... 🙂

Nitin Kulkarni14:10:56

@jpetoff I wonder how you maintain guard rails though and not create a chaos of tools and strategies if SRE is an optional engagement?

Christof Leng (Google)16:10:51

I think @cleng is best placed to tackle this one.

It's complicated. The preferred strategy is to make the tools and strategies you want the teams to use the most attractive ones (incentives). They can be most accessible, easy-to-use, best supported, most feature complete. Don't try to force a solution on your engineers that isn't working for them. They'll find ways around it. The second approach is on the relationship level. Just because a Dev team doesn't have (full) SRE support, doesn't mean that they're not exposed to SRE. There can be consulting, SRE Love, training programs, tech talks, etc. When SRE has a reputation to be helpful, the devs will ask you for advice and follow your best practices. SRE is production evangelism. The third approach is "the stick": Policies about what you can/cannot use, automated compliance measurement/enforcement, nice-or-naughty dashboards for senior leadership, etc. I would generally advise against these, but there are corner cases when they are necessary. Typically, when you can convince 80-90% of the org with the other strategies, it can be sufficient to show that number to non-compliant team (and/or their bosses). Then again, not everything always needs to be uniform. If you don't have to deal with it, let them do what they feel is right. If you always follow the one-size-fits-all approach, you stifle innovation. Listen to why they chose a different path. Maybe they have good reasons.

Sean D. Mack14:10:04

@jpetoff @cleng Amazing talk! Does Google have any sort of 24 x 7 monitoring team/Operations or is that all delegated to the dev teams? If it is responsibility for the development teams how do you handle legacy applications which may not be under active development?

👀 3

Gene Kim, ITREV, Program Chair14:10:12

What’s so bad about discussing SLOs after software is written? What could go wrong? 😆 😆 😆 (“Overengineering something at the expense of valuable features”)

😆 2

BMK-SECTION6-TransformationArchitect14:10:18

for the scale, pace of questions, comments for this subject - we need to engage SRE - cc @jpetoff (How are you scrolling all these comments)

👍 1

Jennifer Petoff14:10:57

Ha! working to get through the Qs as quickly as possible. Also listening to the talk at the same time which is distracting me. I'm terrible at multi-tasking 🙂

😆 1

BMK-SECTION6-TransformationArchitect14:10:45

I could imagine.

🙂 1

Andrew Davis - AutoRABIT - DevSecOps for Salesforce14:10:23

“Important to not see SRE as a human abstraction layer over production. That’s an invitation for complexity to flourish” - @jpetoff

👍 4

☝️ 3

Andy Nortrup - Director of PM at Tanium14:10:14

The explicit funding model of the engagement process described reminds me of the economic principal of optionality that Gene has talked about in the most recent few episodes of the Ideal Cast. You pay for an SRE to come over if you think that is going to increase your overall value more than spending that money on something else (like another Dev, designer, PM, whatever).

➕ 1

Gene Kim, ITREV, Program Chair14:10:18

“For products in new business units, you could get away with lots of Baseline engagements”

Scott Kellerman (DevEx Product Owner, Vanguard)14:10:54

hey @cleng, can you tell us more about how you measure team maturity?

👍 1

Christof Leng (Google)14:10:16

Service maturity: • SLO quality (user-oriented) + compliance • Ops workload (tickets + incidents) • Data integrity processes • Capacity planning / efficiency • postmortem processes and hygiene • Release automation Team maturity: • OKR planning processes • Staffing/attrition • ops workload • SRE/Dev relationship

Scott Kellerman (DevEx Product Owner, Vanguard)14:10:28

thanks @cleng! really insightful, love how staffing attrition is a factor in team maturity, often overlooked but it's a key leading indicator to predict the quality that a team might deliver

Christof Leng (Google)16:10:44

Unfortunately attrition can be a trailing indicator, because it takes time to build up enough frustration that the engineers actually leave. Also, be careful to measure this only for organizations big enough that you get a meaningful signal. When you look at a 5 people team and 2 of them leave for reasons completely unrelated to the team health, you get a huge spike. A noisy signal is not useful.

👍 1

Gene Kim, ITREV, Program Chair14:10:20

I love the benefit of being able to call experienced SREs when things go wildly wrong in a major incident!

Nick Eggleston (free radical)14:10:46

Get that project some “SRE ❤️”

❤️ 2

Gene Kim, ITREV, Program Chair14:10:15

“SRE Love” — when devs write proposals / requests for SRE help, to aid in knowledge transfer, mentoring, skill upleveling. “Helps build relationships between devs and SREs.”

❤️ 1

Amy Cheng - TELUS14:10:47

The SRE Office hours and Continuous Learning in mentoring the devs is awesome!

❤️ 1

Brian W. Spolarich - Cal Poly14:10:47

Fascinating how this feels like a "service you can buy based on your goals and funding" all grounded in economics.

💯 1

➕ 1

José Chanto14:10:15

I don't get yet how SRE is different to an operation team

Jennifer Petoff14:10:13

I think a big difference is the focus on automating away toil so that the team can scale sublinearly to the size of the service rather than throwing more bodies at the problem.

❤️ 1

Scott Jaffa (Principal Engineer, ValidaTek)14:10:28

Think of it as SREs provide operational (production) expertise. They’ll help fix an operational issue, but the goal is to understand the issue such that they can engineer away the problem from happening again.

❤️ 1

Scott Jaffa (Principal Engineer, ValidaTek)14:10:35

Or the actual expert answered 😄

Jon Smart [Sooner Safer Happier]14:10:54

i.e. writing better fire fighting equipment, so that don't need to spend time fire fighting?

➕ 1

Denver Martin, Dir DevSecOps, he/him14:10:58

Hi @jchanto17 I see SRE as the bridge between Ops and Dev, these are people that are pulled out of or not part of Dev or Ops, but they have skills and knowledge to be able to get to root cause and then can look at 1. fix temporary with work arounds 2. figure out how to respond and restore faster 3. fix long term how to keep it from happening again.

Jennifer Petoff14:10:13

well stated @scott.jaffa!

Pete Nuwayser - IBM14:10:53

Might an SRE also look for risks before they become issues? Is that inherent to eliminating toil?

Jennifer Petoff14:10:28

Yes @nuwayser, SREs are empowered to make tomorrow better than today by focusing on the things that will have the most impact on improving reliability of the services they support.

👍 1

Sujay Solomon14:10:12

SREs make more sense in places where dev teams actually own dev & ops - more common when your apps are cloud-native.

Sujay Solomon14:10:38

in traditional datacenter based apps, SREs may play the role of a bridge between dev & ops

Sujay Solomon14:10:41

i'm involved in some research projects on whether SREs fit into the hybrid world of cloud-native + legacy datacenter apps. It's been interesting and challenging

José Chanto14:10:23

Ok thanks all for the comments, now it makes sense to me. And what I think is that I can take this type of firefighting work out of the Dev team, because in my case Dev team use to tackle this type of problems and slowdown the value delivery

👍 1

Pete Nuwayser - IBM14:10:36

> Yes @nuwayser, SREs are empowered to make tomorrow better than today by focusing on the things that will have the most impact on improving reliability of the services they support. Thank you @jpetoff :) @jchanto17 to your question - as someone who has worked in traditional ops before, an SRE is someone with my mindset + developer skillset, and who is also empowered / expected / measured on their ability to • participate in incident response to close issues • proactively mitigate risks • leverage error budget to find problems make the service more resilient By developer skillset I mean more than python/bash/perl scripting: it's someone who has the skills and license to get into the app code and improve it proactively. @jonathansmart1 to your earlier point re: measurement alignment between SREs and Product owners - curious to know if you think the two roles are responsible for the same outcomes or different ones.

👍 1

Jon Smart [Sooner Safer Happier]17:10:52

Hi @nuwayser, not sure, keen to understand the case study point of view (SREs and Product Teams, not only POs). I guess that it will depend on org by org, as to how that is done. With more aligned incentives or less aligned incentives. Keen to understand and how risk is mitigated.

Daniel Cahill - Engineer - Ontario Systems14:10:27

Are there any examples of SRE Love projects that we could use as a example?

Christof Leng (Google)14:10:35

Typical examples: • Helping a Dev team to set up their monitoring/alerting • Analyzing SLOs / refining them • Architecture review • Picking the right tools / infrastructure for a new "greenfield" service • A migration to new infrastructure (e.g. database) • Improving ops processes

❤️ 1

Jon Smart [Sooner Safer Happier]14:10:28

How does incentivisation (performance appraisals, pay, promotion, rewards) work for SREs? How are SRE incentives and Product Team incentives kept in line, to avoid the antipatterns which would easy to occur (e.g. SREs incentivisation not inline with Product Team org incentives, e.g. lighting fires in order to put them out, as an extreme example)? Thanks!

👀 3

Christof Leng (Google)14:10:50

Core aspects of SRE performance are impact (measurable improvements for users/Dev/SRE/budget) and simplicity (standardization, deprecations, cleaner architectures, less dependencies). Heroics can be rewarded temporarily but are a generally not going to get you a promo. Specifically, the focus for performance evaluation is on designs and landing engineering projects not "keeping the lights on".

👍 1

Jennifer Petoff14:10:09

Service Level Objectives are nominally the tool that allow SREs and Devs (and other business stakeholders) to speak the same language and align on incentives

👍 1

Jon Smart [Sooner Safer Happier]17:10:59

Thanks @cleng and @jpetoff

Jon Smart [Sooner Safer Happier]17:10:33

I like "simplicity"

👍 1

BMK-SECTION6-TransformationArchitect14:10:43

@genek - we need a slack plugin to move all these "Gold nuggets" to a ever living doc (kind of shortform or notion) - Goodness me - there is so much to read thru

💯 10

🎉 1

➕ 3

❤️ 1

Graham McGregor14:10:44

Are dev-teams on call overnight? Is there any pattern of follow-the-sun rotation handing off to another team for overnight?

Jennifer Petoff14:10:00

Response time would typically depend on the criticality of the service.

Graham McGregor14:10:22

So are the dev teams are on-call?

Matt Ring (he/him) - Sr. Product/Engineering Coach, John Deere14:10:24

"You build it, you run it." 😉

Graham McGregor14:10:15

I'm pretty sure that's how it is, but I was hoping for confirmation. There's folks in our org who don't agree with that.

Christof Leng (Google)14:10:50

Yes, Dev teams are oncall, some of them 24x7 when they don't have multiple sites across the globe. That's not what we would do for business-critical services though. Being paged at 3am when the rest of your team is soundly sleeping is not a recipe for success. However, SRE escalation support via baseline can be helpful in these scenarios.

Graham McGregor14:10:39

Thank you for the response! What would the size of those teams be? How many people around the globe? We have some teams of 5-7 that are responsible for critical services and currently hand off to operations overnight because they don't want to be woken at 3am, but the operations team is becoming overloaded.

Christof Leng (Google)16:10:52

For an SRE team, we recommend at least 6 engineers in each of the two sites. That's a significant investment, so each SRE team is typically working on a large set of services (or a few very big/critical ones). Dev being oncall during business hours and SRE being oncall during off-hours is also a model that is working well for Google SRE teams that are using it. It increases the coordination overhead a bit (more cross-team handoffs), but makes dealing with faulty releases easier (releases should be done when the devs are oncall - they're more familiar with the changes). It also exposes Dev to production, but with a safety net - there's always an SRE familiar with the service you can escalate to.

Joe Waid - Manager, Delivery Engineering - Columbia Sportswear14:10:16

The clearly defined different engagement models is really interesting to me. Having that so well defined to give teams the ability to decide what they can afford and not just what they would ideally want. It seems really useful to keep the smaller and less important projects from swamping the SRE org.

👍 6

Gene Kim, ITREV, Program Chair14:10:09

Haunted Graveyards!!!!

😂 4

🪦 1

👻 4

Javier Magaña - Walmart14:10:24

Hmmm... seems like an SRE role aligns a lot with my interests. Is there any good resources to start learning and grow this muscle? I assume the SRE O'reilly book is a good place to start?

BMK-SECTION6-TransformationArchitect14:10:54

https://sre.google/workbook/table-of-contents/

👍 3

➕ 1

Jennifer Petoff14:10:01

@nepobunceno There are lots of resources to check out at sre.google including our various books. There are some good large system desigjn exercises and shorter form articles too.

BMK-SECTION6-TransformationArchitect14:10:38

There are great books by Google (Free) and also there are number of courses in LinkedIn and Youtube available. The one thing I will recommend strongly the micro learning videos on this subject by Google Seth Vargo and our own @lizf

➕ 2

Daniel Cahill - Engineer - Ontario Systems14:10:06

@jpetoff Where can I find the system design exercises? I'm scrolling through the site and recognize books and articles I've grown from and helped my org change processes to reflect.

Gene Kim, ITREV, Program Chair14:10:41

@dacahill7 check out http://sre.google/classroom

🙏 2

I’m fascinated by the path one is required to go through to get to Full SRE Support — and how products going thru hypergrowth will likely need it!!'

BMK-SECTION6-TransformationArchitect14:10:09

I have to confess here (A general statement): All these good work by various organizations - SRE, DevOps, Product Management, Funding, Transformation, Team composition, Organizational design - all these get over complicated in their own way in many Enterprises and a common statement comes to answer all the time - WE ARE NOT GOOGLE - cc - @genek, @jpetoff . Reminds @jonathansmart1 book - antipatterns

💯 4

Christof Leng (Google)14:10:12

Google is an enterprise with its own challenges. With 140k employees and >20 years of history there are many complications on the ground. Individual SRE PAs find their own solutions on how to adapt to their space. Please don't take Google SRE as a blueprint to be applied verbatim to your org. It's a case study. We keep evolving the way we do things. Because we have to adapt all the time.

➕ 2

BMK-SECTION6-TransformationArchitect14:10:04

^^^^ THIS - Love this @cleng (the same applies to all the models right - Spotify, Etsy, Google, Amazon)

Christof Leng (Google)16:10:06

I definitely don't want to become a preacher for a cargo cult. 😉

🙏 1

Jeffrey Fredrick, Author-Agile Conversations14:10:16

I like that the SREs can vote themselves off the island

❤️ 3

😆 1

Nick Eggleston (free radical)14:10:22

How are SREs on a project evaluated for performance/promotions? How frequently and by whom? @jpetoff @cleng

Christof Leng (Google)14:10:42

SRE's performance is evaluated by SREs (up to a certain level of seniority), but SRE managers and promo committees expect positive peer feedback from Dev peers as critical support for high ratings.

➕ 1

Jeffrey Fredrick, Author-Agile Conversations14:10:32

that autonomy is a really useful source of information

Gene Kim, ITREV, Program Chair14:10:38

“When SREs have all left/abandoned a [problematic service], well, the developers are left wearing the pager anyway.” 🙂 I love these “tough love” statements from @cleng.

❤️ 6

Jennifer Velasquez14:10:48

“smart engineering” not “brute force”. love it

❤️ 5

Matt Wheeler14:10:24

So many gems in this talk. Definitely one to rewatch and share.

💯 3

Gene Kim, ITREV, Program Chair14:10:35

“you never fully understand a system until you see it burst into flames” — @cleng 😆

👏 7

😆 2

🙌 1

Virginia Laurenzano NSA14:10:37

So. True.

Christof Leng (Google)14:10:02

@genek "Some of the best firefighters can think like arsons."

Gene Kim, ITREV, Program Chair14:10:36

😆 😆 Like a connoisseur of disasters, if you will. 🙂

😋 1

Gene Kim, ITREV, Program Chair14:10:17

I’ve always admired the way that Google has talked about how they’ve defined SRE as a career path, as a set of skills, so much pioneered by @jpetoff.

❤️ 2

Denver Martin, Dir DevSecOps, he/him14:10:43

These books are amazing ...

Jennifer Petoff14:10:50

Thank you! PSA that all the books are available online for free at sre.google/books

Nick Eggleston (free radical)14:10:44

Google got their own TLD? I didn’t know that!

🤯 1

🙂 1

Kurt A, Clari14:10:48

They also have .prod 😄

👍 1

Christof Leng (Google)14:10:48

Try docs.new 😉

Nick Eggleston (free radical)11:10:12

When can I get “Nick.”? :rolling_on_the_floor_laughing:

Use other profile14:10:45

Slides for this talk: https://github.com/devopsenterprise/2021-virtual-us/blob/main/DOES21%20_Petoff_Leng_How%20Google%20SRE%20and%20developers%20work%20together.pdf

🙏 1

Gene Kim, ITREV, Program Chair14:10:40

Thank you so much @jpetoff and @cleng for giving us a glimpse of how SRE works inside of Google!! This is something I’ve wanted to better understand for nearly a decade! 🙏

❤️ 7

Jennifer Petoff14:10:49

Thanks so much for inviting. us, @genek. Will continue to work our way through the backlog of Qs!

Christof Leng (Google)16:10:07

It's been an honor and a pleasure. Fantastic community at DOES! Great questions, many things to learn from others too!

Jeffrey Fredrick, Author-Agile Conversations14:10:47

Conferences are for conferring 🙂

👏 3

🙂 1

BMK-SECTION6-TransformationArchitect14:10:07

Challenge to Enterprise Technology Leaders - "WE ARE NOT GOOGLE" - WE WILL NEVER BE; 🙂

🙂 1

Malcolm McAlpin14:10:16

Thank you!!

Jeffrey Fredrick, Author-Agile Conversations14:10:26

👏

👍 1

Jennifer Collings14:10:29

Thank you!!

👍 1

Ganga Narayanan14:10:31

This is amazing and very timely with some of the work we've begun doing! Thank you @jpetoff and @cleng! Reminder to self to read up on your SRE books!

👍 1

Khan, Humayoun at TELUS14:10:32

What was the SRE site again

Jeffrey Fredrick, Author-Agile Conversations14:10:49

https://sre.google

Khan, Humayoun at TELUS14:10:19

thanks i also googled it

Glenn Wilson, Author of DevSecOps14:10:37

Nice...I see what you did there 🙂

🙂 1

Wow - so much to take in. I’ll be watching this one again

❤️ 1

Jeff Gallimore (CTIO - Excella)14:10:39

Virginia Laurenzano NSA14:10:45

@eleonravinez @jeff.gallimore There is a lot of unpack there. I think the best way to sum it up is through this conference talk that I gave at FailoverConf last year: "Swim Don't Sink: Why Training Matters to an SRE Practice" https://www.youtube.com/watch?v=8iaNMMwozCc

👍 2

When I get the "we're not Google lecture" I try to find ways to say "but we can aspire to be"

👏 4

❤️ 6

🙌 2

Jennifer Petoff14:10:39

@vmshook Many of the foundational principles of SRE can be applied no matter what your size (e.g., SLOs and error budgets and a 'vanquish toil' mindset)

👏 1

Nick Eggleston (free radical)14:10:42

I love that the whole conference is focused on these topics at the same time. It’s creates a great shared community context.

➕ 2

Javier Magaña - Walmart14:10:05

Thanks a lot @jpetoff and @cleng. Very informative.

👍 2

🙏 1

BMK-SECTION6-TransformationArchitect14:10:18

thank you @jpetoff, @cleng - Brilliant session (I can manage without a coffee now for a while)

🙂 2

💯 2

Maria Luisa Polo14:10:24

Thanks for sharing!! very usefull, I’ll be watching this one again

✔️ 1

👍 2

🎉 1

Stan14:10:28

When working with an SRE, how do you decide whether to go with an incremental approach over redesigning a whole system to meet reliability goals?

Jennifer Petoff14:10:26

Another good one for @cleng to tackle!

Christof Leng (Google)14:10:46

When you have been working closely with the system and know all the hidden timebombs: An incremental approach is generally preferred, because a clean-slate rewrite is costly and high risk. However, sometimes you have explored all incremental options and they either don't get you to an acceptable level or are even costlier/riskier than a rewrite. Then it's time to sharpen your design pencils. When you're new to the system: A thorough architecture review and a regular ops review to assess the long- and short-term viability of the system should get you close to the above.

❤️ 1

Jason Cox - Disney14:10:49

@jpetoff and @cleng - Thank you! Can you post a snapshot of the SRE in a nutshell slide here?

Jennifer Petoff14:10:06

Here you go. Also lots of good material at https://sre.google

Jason Cox - Disney14:10:42

Thank you! Great talk.

❤️ 1

BMK-SECTION6-TransformationArchitect14:10:27

AS @genek says - DOES is like a recharging battery station. I managed to charge by (Inspiration) and Battery over last 6 Years - JUST by DOES

💯 2

🙏 4

Christina Biangslev14:10:28

@cleng @jpetoff Were there ever a valid counter argument for putting SRE inside the product teams? (If this is covered in books/papers I haven't read, please share! this is the single hottest topic where I'm at right now.)

👏 2

👀 1

Javier Magaña - Walmart14:10:31

@christina.biangslev This is not something that has been considered at Google AFAIK. Keeping SRE and Dev reporting lines ensures that reliability is a first class feature. If everyone reported up to Dev, it's possible that the orgs leaders might be tempted to trade off reliability for feature velocity or to take on more tech debt than is wise. I know other companies have approached SRE differently though and have found success with a more embedded or consulting approach.

👍 1

👆 1

I agree. What I see often is compromising quality because of the pull from business to deliver new features. I think this would also be something that would make it a second class citizen.

➕ 1

Christof Leng (Google)15:10:40

See https://youtu.be/n4Wf14e2jxQ?t=497 for a good argument. Summary by Ben Sloss: "So for that reason SRE has to be its own team. It's my basic thesis. If you don't have a team who views their mission in life as making sure that the product works, you will ignore availability and reliability until you're in real trouble."

🙏 1

👍 1

Christina Biangslev15:10:49

I completely acknowledge this risk, that Feature Fetisch might taking over the SRE agenda. I'm struggling to fit the ideal of the truly independent product team with the experience of our SRE pioneers. I'm sure there's a place for a variety of setups; I'm wondering what the deciding factors would be to consider the truly independent model.

Christof Leng (Google)16:10:00

There's a whole website for the many organizational options you have: https://web.devopstopologies.com/

🎯 1

Nick Eggleston (free radical)14:10:26

Speaking of talks happening at the same time… if we go back later to listen to one we missed, which channel is best for the follow-up discussion? (since we don’t have one channel per talk or (set of) speaker(s))

🙏 1

Denver Martin, Dir DevSecOps, he/him14:10:06

Maybe do DM to the presenter and maybe they would still be watching the Slack Workspace... just a thought.. but then you miss on others that may have knowledge besides the speaker... hmm.

Jeffrey Fredrick, Author-Agile Conversations14:10:14

#ask-the-speaker-more

Nick Eggleston (free radical)14:10:26

It’s tough to know. I love staying on one channel throughout the day for the lively engagement and community, but it would be super cool if the thread discussion for a given talk were moved to a dedicated channel for continuing discussion… @genek

Ann Perry - IT Revolution14:10:20

👏:skin-tone-2: Let's get ready to welcome the team from Capital One – @girija.rao, @denee.ferguson @jennifer.miles, presenting Productizing the Network: Square Peg, Round Hole?:clap::skin-tone-2:

👏 2

Brian W. Spolarich - Cal Poly14:10:45

I love to see this panel!

❤️ 3

👏 1

Gene Kim, ITREV, Program Chair14:10:35

I’m so excited that @girija.rao @denee.ferguson and @jennifer.miles will be talking about they brought DevOps principles for core networking that enables (all?) major bank operations!

Nick Eggleston (free radical)14:10:48

This reminds me of the talk BMW did about applying DevOps principles to their core IT support org

Gene Kim, ITREV, Program Chair14:10:14

“My favorite: Wireless LAN”. from @denee.ferguson (I’m getting stressed out hearing about all these mission critical services. “It’s always the firewall or network.“)

❤️ 1

😆 1

Jeffrey Fredrick, Author-Agile Conversations14:10:09

timely after facebook’s experience this week…

Brian W. Spolarich - Cal Poly14:10:10

Actually, its always DNS.

😆 2

Jeffrey Fredrick, Author-Agile Conversations14:10:29

That was my first thought!

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)14:10:30

@jtf for sure!

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)14:10:46

lol... @genek I like to say "bring it on"... most of the time, it isn't!

BMK-SECTION6-TransformationArchitect14:10:11

so DevNetOps is a thing, right? cc - @denee.ferguson

👍 1

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)14:10:19

It is!!!!! Incorporating the development/automation has been key to getting out of the firedrill mode, and bringing sanity to our pace of delivery

❤️ 3

Scott Prugh (ETLS PC / CTO Uturn Data)14:10:43

Ah.. @girija.rao is talking about creating Build/Run teams for Network!

Nick Eggleston (free radical)14:10:57

DevEngOps teams

Matt Wheeler14:10:29

DNS - DevNetSRE

Gene Kim, ITREV, Program Chair14:10:30

“Previously, we had Engineering and Run reporting to two different executives” — @girija.rao cc @scott.prugh

👏 1

Scott Prugh (ETLS PC / CTO Uturn Data)14:10:28

This is awesome. Parallels to some other stories:

👍 1

Scott Heaberlin16:10:16

@scott.prugh may I ask what that’s from?

Scott Prugh (ETLS PC / CTO Uturn Data)16:10:50

https://www.youtube.com/watch?v=6afD-sQm03E

❤️ 1

Craig Cook - IBM14:10:38

I heard this org change referred to as "reverse conways law". If you don't like your architecture, change your organization to reflect what you want your architecture to be.

❤️ 7

Vaidik Kapoor (Speaker) - Technology Consultant14:10:23

align it to business outcomes and the architecture gets fixed.

Gene Kim, ITREV, Program Chair14:10:25

(More difficult to rearchitect your core switches, which are global in nature, than most software. 🙂

Vaidik Kapoor (Speaker) - Technology Consultant14:10:46

but sometimes it doesnt. sometimes figuring out the right archtiecture is also really hard

Pete Nuwayser - IBM15:10:06

@cncook001 We should call it The Law of One Foot

Kurt A, Clari15:10:11

Team Topologies in action

✔️ 1

Meghan Glass - PrdMgr Best Buy14:10:05

Network Engineering + Network Operations separate teams => Eng + Ops team but this is still IT focused, correct? These product teams don't include non-IT operations providing value to customer?

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)14:10:52

The org itself is IT focused... But the org also has non-IT staff members that are integral to our success.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)14:10:12

@cncook001 Indeed!!! Over 160 people changed managers as part of this transformation....

❤️ 2

Pete Nuwayser - IBM14:10:44

"Shoulder Tapping" - these words are giving me a mild anxiety attack.

💯 2

😂 3

Pete Nuwayser - IBM15:10:29

Yet another reason why open floor plans can kill productivity. At my previous studio job, somebody created paper indicators that you cut out to put on your monitor: • Green: I can be interrupted • Yellow: I can be interrupted but we can't talk about your cat • Red: DND

BMK-SECTION6-TransformationArchitect14:10:54

@cncook001 - This reminds me Martin Fowler quote "You change the organization or change your organization" 🙂

👍 2

Charlie Betz14:10:56

Oooh. "Shoulder-tapping" as A Thing. Stealing that.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)14:10:16

@char aka "drive bys"

✅ 1

Gene Kim, ITREV, Program Chair14:10:32

The scale of this talk blows me away: 14,000 devices, 185k carrier assets (!!)

👍 1

🤯 1

Gene Kim, ITREV, Program Chair14:10:48

Is that 14K networking devices, @denee.ferguson?

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)18:04:49

yes

Brian Smith15:10:24

I have a manager who loves drive by's . I developed PTSD from every time I heard his office door open.

😆 2

😢 1

Matt Wheeler15:10:54

I had the same, started working in the cafeteria.

BMK-SECTION6-TransformationArchitect15:10:29

Most of the "Productive" Engineers prefer to work from home (as they do not need to see these managers and other distractions)

👆 1

Jennifer Miles15:10:11

You would think remote working would provide some speed bumps for those drive bys but not always the case. Where there is a will there is a way!

💯 1

Brian Smith15:10:45

Now that we are remote, life is much better.

👆 1

Virginia Laurenzano NSA15:10:44

mine is back in force w/no telework options and a move back to an open office. less time to think

Brian Smith15:10:10

I wondered that yesterday about your workforce.

Virginia Laurenzano NSA15:10:28

some jobs have that option. my last one did. my current does not.

Virginia Laurenzano NSA15:10:58

it's definitely something IC/DoD is needing to rethink

Charlie Betz15:10:47

The biggest problem I am hearing in infra product team transitions (aka platform teams) is that 1) engineers don't easily skill into product manager roles and 2) have an antibody reaction if you parachute in someone with product skills who is not an engineer.

👍 4

👀 1

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:40

The biggest lesson is the product managers NEED to understand the technology...

✅ 3

Charlie Betz15:10:11

That can be a purple squirrel quest...

👍 1

Jennifer Miles15:10:12

Completely agree with @denee.ferguson

BMK-SECTION6-TransformationArchitect15:10:20

This leads to me to question - The PdM's do they need to be technical or Business 🙂 or both?

Pete Nuwayser - IBM15:10:24

I ❤️this thread and it's barely gotten started.

Jennifer Miles15:10:46

They are both in our org or at least that is the desired state

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:50

@lbmkrishna both

👍 1

Pete Nuwayser - IBM15:10:04

At my previous job we had motion in both directions (engineering->product crafts and vice versa)

❤️ 1

Chris Gallivan (Planview)15:10:28

My attempts at product teams with Infra were underwhelming at first, but I found several months later they came back to me with an appreciation for what they learned. It just took longer to click

❤️ 3

✅ 1

Pete Nuwayser - IBM15:10:54

I think what we found is that engineering->product manager was better than the other way around

Geri Pohl15:10:56

@char re the purple squirrel quest, one of my challenges is to "train" the infrastructure "POs" at the moment. Im just getting started.

Pete Nuwayser - IBM15:10:36

common theme: I was a developer and kept building features that nobody used or wanted. I became a PM to prevent that from happening

👍 1

BMK-SECTION6-TransformationArchitect15:10:50

In some org (Enterprises) - The Product Management is (abused). The PdM's act as Service Delivery Managers (approving people work)

Charlie Betz15:10:50

Has anyone tried a "two in a box" strategy for platform teams?

Pete Nuwayser - IBM15:10:31

@charles.page like PM+Engineering Lead?

✅ 2

Geri Pohl15:10:35

@chris.gallivan421 Id love to hear more about your successes with infra POs. what did they come back to you for?

💡 1

Brian Gallop15:10:38

Can you expound on that?

Jason Trent15:10:50

@char,two in a box??

Chris Gallivan (Planview)15:10:59

@gerijotoole we did a dojo with them (Joel Tosi and I)

Chris Gallivan (Planview)15:10:42

They said they liked mobbing, rather than working in individual silos

Pete Nuwayser - IBM15:10:50

hey @ckissler this needs to be a Lean Coffee topic

❤️ 1

Charlie Betz15:10:58

Many product teams are matrices with consensus leadership shared by two "pyramid" reps

Chris Gallivan (Planview)15:10:09

they also said, they felt like a real team for the first and only time in the dojo

❤️ 1

Charlie Betz15:10:22

eg a product "pyramid" and an engineering "pyramid"

❤️ 1

Geri Pohl15:10:30

ah.. I have experience with those and have even talked about the quick learnings, but at this time, they are barely "walking," not to mention leadership buy in

Geri Pohl15:10:11

@chris.gallivan421 those comments have been ones ive witnessed as well. it's a beatiful thing, isn't it?

👍 1

Jeffrey Fredrick, Author-Agile Conversations15:10:12

engineers don’t easily skill into product manager roles < I’m wondering what specific problems you’ve seen with people who wanted to make the transition.

👀 3

Chris Gallivan (Planview)15:10:22

at Stellantis we favored the 2 in a box, but it was mostly because we needed a place to put the managers 🙂

Geri Pohl15:10:58

@jtf same. I am just getting started, week 3 at a new org, with this setup

Charlie Betz15:10:21

@jtf These are field reports I am hearing. Just went through this in detail yesterday with a large Australian investment brokerage.

Chris Gallivan (Planview)15:10:22

@bryan.finster486 has some good takes on 2 in a box

✔️ 1

Chris Gallivan (Planview)15:10:42

on app side this led to typing pools rather than product teams

Geri Pohl15:10:50

@bryan.finster486 has some good takes on lots of things. thanks for the tip

Charlie Betz15:10:02

@jtf have heard similar at least 3 or 4 other times.

Bryan Finster - Defense Unicorns (Speaker)15:10:02

I mean, I have OPINIONS.

😂 4

❤️ 1

Bryan Finster - Defense Unicorns (Speaker)15:10:13

I’d cross check them if I were you.

Chris Gallivan (Planview)15:10:21

so much so, we have renamed TPO from Technical Product owner to Typing Pool Owner

Bryan Finster - Defense Unicorns (Speaker)15:10:29

I love that.

Pete Nuwayser - IBM15:10:36

Opinions are free, "good takes" are a fee

Bryan Finster - Defense Unicorns (Speaker)15:10:54

Nah, I open source anything I can.

👍 1

🆒 1

👏 2

Charlie Betz15:10:25

@jtf here is a quote from JP Morgan Chase

👍 3

Brian Gallop15:10:07

@charlie TRUTH!

Jeffrey Fredrick, Author-Agile Conversations15:10:19

@char: I complete agree it happens. I think it can be a problem to move from engineer --> product manager for many different reasons. I’ve also seen problems for people moving from business analyst --> product manager or scrum manager --> product manager. I’m not clear there’s anything uniquely challenging about engineer --> product manager. When I’ve seen problems in transition the biggest issue is the lack of mentoring in the transition and/or a weak (or unhealthy) product management culture in the organization. Might that apply in these cases?

👍 2

Jennifer Miles15:10:49

One of the issues I have seen with engineering moving to product is that they sometimes can have a hard time not solutioning.

Chris Gallivan (Planview)15:10:53

A lot of the people I have seen in these roles are long term employees who used to code. Over the years they have built up a lot of domain knowledge, more so than on the business side

Scott Jaffa (Principal Engineer, ValidaTek)15:10:39

Is there any effect on how the org is structured? Ie. is someone moving from engineering to product because that’s where growth is, vs. because they specifically want to be in product? Curious if organizational dynamics are a major driver of seeing that transition succeed.

Chris Gallivan (Planview)15:10:46

That was in an org that did a lot of outsourcing

Jeffrey Fredrick, Author-Agile Conversations15:10:56

That failure mode for engineers makes sense to me @char. One thought that came to mind for me is that we try to have our engineers oriented to client needs in any case; the product owner acts as a customer proxy and tries to talk in client language to the teams (and sometimes the engineers join client meetings). I suspect that might make an easier transition compared to an environment where the product owner acts as a translation layer and client language doesn’t make it into the engineering team.

❤️ 1

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:57

@scott.jaffa so far we have not had any engineers shift over to product management...

Geri Pohl15:10:58

I plan to use customer empathy to begin to help these folks, since who we serve work daily in an environment that demands extreme safety. I just got here 3 weeks go, so thinking to start there.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:00

@jtf Having the engineers start paying attention to the user experience that their products provided has been key... having the former engineering team members be on call also helped them get religion (quickly!) over the importance of details

❤️ 4

👏 1

Charlie Betz15:10:10

@jtf These are all good hypotheses, reflective of growth mindset. I would also, as a counterbalance, inquire as to whether there is fixed mindset operating in a way that's not easily disrupted, even with the best training available.

✔️ 1

👍 1

Charlie Betz15:10:53

@denee.ferguson This is exactly what an SRE at a FAANG told me yesterday.

Craig Cook - IBM15:10:45

When IBM Marketplace start 5+ years ago, we adopted "3 in a box" model. Development Manager, Product Owner and Technical lead. We called it "intentional tension in the system". Those 3 had to agree on what to work on next. It worked really well.

❤️ 3

Geri Pohl15:10:48

@char my group may benefit from the shift of fixed to growth, simply by nature of our business where we serve teams who's lives can be in jeopardy if we don't shift. thanks for the reminder of fixed v growth. that's a great way to bring in customer empathy with them

Charlie Betz15:10:03

@jtf I do believe that one of the major constraints in the whole transition to product-centric operating model is workforce lack of product managers. I mean, there is not even a well accepted educational/training pipeline. (Not to dismiss the efforts and valuable offerings of the boutique sector. But we need more scale IMO.)

➕ 1

Bryan Finster - Defense Unicorns (Speaker)15:10:13

How can the engineers become product experts? I was in distribution centers as often as possible.

👍 1

Scott Jaffa (Principal Engineer, ValidaTek)15:10:14

I’ve been sending all the engineers to the customer product demos, user trainings, etc. Too new to know if it is having success, but that team did just get an amazing kudos from one of the users in a training about the application quality…

👍 2

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:20

@bryan.finster486 We haven't tried cracking that nut yet... the people leader for the engineering teams is the product owner... so we focus heavily on enhancing the user centricity of the product owner. The engineers get there kinda by osmosis.... We do have tight relationships with other internal teams that help us understand specific use cases among our customer base (e.g., call center agents, traders, executives, etc.)

Jeffrey Fredrick, Author-Agile Conversations15:10:37

@char: reminds me of the early days in agile, where lack of people with experience in agile was a bottleneck to widespread adoption. There was much more demand in 2005 than there were people to satisfy it. It seems something similar is happening now with product management. Everyone can see this is a better way to operate, but precisely because it is new there’s a dearth of experienced practitioners. (Mind the Product is great, but only gets you so far.)

💯 2

✅ 1

Charlie Betz15:10:24

@denee.ferguson Are engineers reporting to the product managers in terms of career pathing etc?

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:40

@char Engineers report to product owner; product owners report to me. Product managers report into our Agile org, led by @jennifer.miles While product management could be a potential path for the engineers, I'm not seeing much interest from the engineers in making that move. Most want to remain deeply technical, while a few have aspirations to become product owners/people leaders.

BMK-SECTION6-TransformationArchitect15:10:52

Interesting take. Question - then the PdM and PO spend more time on people management activities than the "Product" management? cc - @denee.ferguson

Chris Gallivan (Planview)15:10:59

While they may not be plentiful, I have met some amazing developers who are passionate about customers, products and clean code

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:09

@lbmkrishna product managers are not necessarily people leaders. Product owners are. We have found that limiting number of direct reports a product owner has is key. For one of my teams, there are 8 engineers (soon to be more as we onboard new hires). One of the engineers that wanted to become a people leader will take over product owners responsibilities for one of the products this team owns, and will become the manager of 2-3 engineers on the team. This will reduce the direct report load on the current product owner for that team. Generally, 6-7 is the max number of direct reports a product owner should have in our org.

👍 1

BMK-SECTION6-TransformationArchitect15:10:50

Thank you @denee.ferguson - Thank you for providing the context, much appreciated.

Gene Kim, ITREV, Program Chair15:10:11

PS: something I learned earlier this year: load balancers and all those Nginx systems shield developers from having to implement all sorts of new endpoints and protocols, such as HTTP/2, QUIC, SSL, and all sorts of other things I’ve only heard of. I gained a whole new appreciation for all those “commodity” devices! (Without them, we’d have to change every app to do things like handle devices roaming between WiFi and cell connections!!)

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:16

Load balancers definitely make technical implementations simpler!!! Now if I could just get MACs to roam better!!!

Gene Kim, ITREV, Program Chair15:10:33

Ha!! When I learned what load balancers did for devs, my jaw dropped in awe. I had no idea!!!

Gene Kim, ITREV, Program Chair15:10:22

Did I get that right, @denee.ferguson?

Craig Cook - IBM15:10:09

oh, triggered... Days without critical incident is not a good metric.

Craig Cook - IBM15:10:04

This can lead to teams not reporting incidents.

👍 1

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:42

@cncook001 not possible in our world

Javier Magaña - Walmart15:10:33

It depends on the culture. Needs to be a constant reminder for transparency, vulnerability and blamelessness

Jeffrey Fredrick, Author-Agile Conversations15:10:44

I disagree, I think it is a good metric… but only if you understand that good metrics allow you to ask questions. They can’t give you answers.

👍 2

Matt Wheeler15:10:49

Is there a metric that can't be used for bad reasons? good to be aware of the risks and build the right conversations around them.

Girija Rao15:10:55

We have blameless postmortems as part of our overall culture

❤️ 1

Javier Magaña - Walmart15:10:48

A metric by itself can definitely be abused. That is why it is important to identify multiple metrics and track them in tandem to be used as guard rails.

Jeffrey Fredrick, Author-Agile Conversations15:10:31

Is there a metric that can’t be used for bad reasons? < I have a friend who says “if it can’t be used for evil it isn’t a superpower.”

❤️ 2

Craig Cook - IBM15:10:56

Yes, it does depend on culture.

Nathan Kampwerth15:10:16

What were the new Infra Product teams oriented to? Was it more outcome focused or technology focused?

Girija Rao15:10:55

outcome based - oriented around the product evolution and customer experience

👏 1

Jennifer Miles15:10:28

👍@girija.rao

Brian W. Spolarich - Cal Poly15:10:21

Girija - what are a couple of examples of team focuses?

Nathan Kampwerth15:10:50

@girija.rao - So many questions. Would that mean you had a branch focus team that would include WAN design as well? Would FW access requests be part of datacenter connectivity?

Girija Rao15:10:35

@brianspo for e.g. improving the wifi experience, understanding usage painpoints and wish lists through empath interviews and surveys, and determining technology enhancements to drive those outcomes

🙏 1

Girija Rao15:10:24

@nkampwerth we have a core security team and perimeter security team and each one focuses on the related use cases

Marc Price - Nationwide Building Society (Speaker)15:10:10

did you see incidents are seasonal?

Nicole Forsythe15:10:16

Can you say more about using velocity and story points - do you standardize how teams assign story points?

👏 1

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:42

This is team driven

Nicole Forsythe15:10:09

Would that invalidate how the velocity metric rolls up?

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:36

@jennifer.miles

Jennifer Miles15:10:38

no, teams use similar methods to story size

Jennifer Miles15:10:45

the real benefit we found is measuring at the team level but we are able to roll up to see trends

John Awesome Rowe - Best Buy15:10:56

What is the purpose of your story pointing then? I've always stuck to the matra that story points mean absolutely nothing outside of a product team. Since the numbers are all made up and unique to the team, I use it to ensure consistency and predictability of the teams and use other, more real, metrics to see how teams are performing over time

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:03

@john.rowe It helps the team understand how much work they can take on each sprint... Each team knows how many story points they have been able to successfully deliver on average during a sprint.... We do include stories for things like training, vacation, PI Planning meetings etc so that our process is consistent from sprint to sprint. As I understand it (@jennifer.miles is the agile guru), story points isn't a metric that should be compared across teams.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:18

Q4 tends to be a lot lower... change freeze after black friday

👍 1

Marc Price - Nationwide Building Society (Speaker)15:10:26

so your days without incident metric in Q4 is not used? or do you track trends year on year?

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:43

@marc.price the recording was made in mid-September, so we didn't have full Q3 stats at that point, much less Q4. We do track trends year on year

Matt Ring (he/him) - Sr. Product/Engineering Coach, John Deere15:10:26

@denee.ferguson I completely empathize with this challenge of having infra / network teams adopt Scrum and apply it to building out hardware and network configurations. Big aha moment for me when working with teams in my org. It was contextual by team, but we eventually de-emphasized the importance of Scrum ceremonies and transitioned more teams toward Scrumban or Kanban. Provided the benefits of lean/agile but without as much "overhead" that Scrum introduced.

👏 1

Nicole Forsythe15:10:33

Yeah scrum was never intended to be used in this way. We see parts of Scrum as a “starting point” less a “end state”

👍 1

Scott Jaffa (Principal Engineer, ValidaTek)15:10:17

Yeah, Kanban and flow at the speed of the work for infra / hardware. Physical constraints are better handled via kanban and blocker states and you move on when you can.

Matt Ring (he/him) - Sr. Product/Engineering Coach, John Deere15:10:13

Regarding metrics for our teams, we paid less attention to velocity and more on lean metrics like Cumulative Flow Diagram (CFD), WIP and throughput. Flow Metrics (from @mik) would work here too. We also spent time in value stream mapping to identify quality issues (low % C/A) and delays (in scheduling or hand-offs). These were where we got the most "bang for buck" with our infra teams.

👏 2

❤️ 1

👍 1

Charlie Betz15:10:00

Scrum always raises a red flag for me with anything operational as it's not suited to interrupt-driven work.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:08

@char That was part of the square peg, round hole challenge for us... honestly I have found that using scrum ended up improving delivery (quantity, quality... still working on predictability). But we had to teach the engineers how to chunk up their work into bite-size pieces.... It used to be that they might have a single story that would hang out on their kanban board for months (something like "deploy X")... and there was no visibility into how that effort was going, where the team was getting stuck... We have found 2 week sprints to be best fit for my team... The 2 week interval allows us to incorporate unplanned work without disrupting the current sprint, but keep whoever is escalating the need to do X happy.

❤️ 1

Charlie Betz15:10:06

@denee.ferguson It sounds like you have some flexibility then with the Scrum masters accepting emergent tasks that aren't specified at the start of the sprint - some purists (not me)might raise an eyebrow :face_with_rolling_eyes:

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:28

@char yes. As much as I would like to say we will not take on stories mid-sprint, that's not our business reality. My team in particular isn't fully staffed (we're hiring!); hoping that as reqs are filled this will be less of a problem because we will be able to accomplish more each sprint with more engineers.

Brian Smith15:10:03

I am wondering if making more frequent changes in smaller batches reduce the number of severe incidents?

👍 3

Marc Price - Nationwide Building Society (Speaker)15:10:06

I guess it can cause more incidents if you try to change too frequently. faster doesn't always mean better

Brian Smith15:10:33

thanks

Scott Prugh (ETLS PC / CTO Uturn Data)15:10:39

Smaller batches that reduce the blast area are key. In our loadbalancer space we reduced failure group size first by adding more load balancers enabled via automation.

👆 1

Scott Prugh (ETLS PC / CTO Uturn Data)15:10:09

So then the failure of any change affected a much smaller area.

Scott Prugh (ETLS PC / CTO Uturn Data)15:10:53

This also greatly helps scheduling dependencies since dependencies increase risk exponentially.

👍 1

Mark Peters15:10:20

Depends on how you define an incident. This is where the A/B and canary deployments can help greatly. Have lately been converted to feature flagging as a way to rapidly fix deployed incidents, as long as you minimize embedding

👍 1

Scott Prugh (ETLS PC / CTO Uturn Data)15:10:07

@tiny.mpetersii Agreed. Feature flags are a great technique to 1) reduce blast area, 2) reduce dependencies and risk, 3) inject operational thinking upstream to dev(shift left on ops concerns)

👍 1

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:44

@brian.m.smith There's a balance... The smaller the number of devices in a window, the more windows you need. Since our windows are normally at night, wear and tear on the team is a consideration... So... I personally strive to do enough changes (pilots) that we have the kinks worked out... then ramp up the devices per window. Some of our technologies (e.g., SDWAN, Wireless) it is generally a big bang deployment... There are some ways of limiting the blast radius, but you can only do that so much. In those situations thorough lab testing and bug scrubs are key... Unfortunately, vendors tend to have a lot of bugs.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:09

@brian.m.smith frequency of execution is actually positive... we get to be a well-oiled machine.... but once we get to that point, we're finding that we need to reduce the total number of windows to say... upgrade all devices in the network... so we can get it done in a shorter time period.

Gene Kim, ITREV, Program Chair15:10:51

(I am always in awe of how in this domain, even the smallest changes can have catastrophic global impact: I.e., global outage. firewalls, core switches, etc.)

Dave Fugleberg15:10:55

as in, Facebook's BGP issue this week...

Gene Kim, ITREV, Program Chair15:10:06

totally.

Nick Eggleston (free radical)15:10:24

It’s amazing how one “little” routing change or firewall statement can have unintended and hard-to-backtrace consequences.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:33

yup....... not the first time I've seen BGP route change have significant impact. fortunately not caused by my teams 😉

Manny15:10:06

Dynamic routing is a double edged sword. Once a dynamic routing change propagates, you can't just reverse it as quickly as a "no" statement.

Gene Kim, ITREV, Program Chair15:10:44

“the wrong JIRA structure”

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:56

@brian.m.smith The #devices in scope for a change isn't generally the driver of likelihood of incidents.. Attention to detail in testing, design, adherence to best practice.

👍 1

Raghu Tumuluri15:10:44

Great presentation. QQ please. How did you measure the business outcomes and NPS?

Gene Kim, ITREV, Program Chair15:10:31

A lively thread on the impact of projects like this on infrastructure teams…

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:45

@raghu.tumuluri614 we haven't measured NPS for our products.... we did try to go there for wireless, but honestly found too many cases were users complaints really involved things that were not wireless......

Raghu Tumuluri15:10:56

Thank you Denee.

Raghu Tumuluri15:10:08

did you try to get the pulse surveys at the end of every PI? just to get the business and technology to provide feedback and maybe help to drive NPS in a desired way?

BMK-SECTION6-TransformationArchitect15:10:48

There is a thread growing (a breakout QA thread full of gold)

Gene Kim, ITREV, Program Chair15:10:56

Testimonial:

Gene Kim, ITREV, Program Chair15:10:30

Higher res:

Marc Price - Nationwide Building Society (Speaker)15:10:34

how did you role out your agile training? lead by internal champions?

Andrew Machen15:10:42

@jennifer.miles What is your approach for avoiding "ivory towers" when teams want to opt out of product model with less than complete data points as to why it won't work?

Jennifer Miles15:10:41

@andrew.machen continued exposure to the benefits of the model, basically a wear them down approach. We have one team who is still resistant but have adopted some of the processes. It is an iterative process with them.

Andrew Machen15:10:27

Our experience is similar. Certainly great metrics and more energy from engineers in making things better as the months and years roll by. Great talk from you and your team!

Gene Kim, ITREV, Program Chair15:10:44

> I’m incredibly pleased at the transformation we achieved with our product-oriented agile-driven restructuring - it enabled us to establish a unified mission and sense of identity, full visibility and prioritization of work, improved execution and delivery, and clear accountability internally and with our stakeholders. This structure also allowed us to easily incorporate several new functions over the past two years. It’s an ongoing journey as we continue to iterate upon this foundation to best meet the evolving needs of our dynamic organization and the services we provide. > Girija Rao Vice President > > The unification of efforts and ownership across the architecture, engineering, and operational aspects of product teams, in concert with the ability to effectively manage priorities has enabled us to transform our technical capabilities while maintaining stable business operations in a more focused and optimized manner. > Vince Gutosky > Senior Director & Chief Network Architect

❤️ 1

Gene Kim, ITREV, Program Chair15:10:56

Thank you, @girija.rao @denee.ferguson @jennifer.miles!!!

Malcolm McAlpin15:10:03

Thank you!!

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:09

Thank you all for listening!!! We are hiring!!! look on the hiring tab for an opening on my team...

👍 2

👏 1

❤️ 1

Marc Price - Nationwide Building Society (Speaker)15:10:34

great session, thanks for sharing!

❤️ 1

Charlie Betz15:10:39

Very interesting! Exactly where I am focusing current research.

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:08

@char happy to answer questions via linkedin

❤️ 1

Ann Perry - IT Revolution15:10:10

🌟 Up next, we welcome @angeldiazrodriguez and @sheilalodhia from Discover, sharing their presentation, How Discover Financial Services Puts Engineering “Craftsmanship” at the Center of Our Digital Transformation. Joining us for questions will be @kevinjosephallen 🌟

👏 1

👍 1

❤️ 1

Khan, Humayoun at TELUS15:10:00

I recall Capital One was or is a pioneer in SAFe implementation. Is CO still managing this framework @jennifer.miles

👀 1

☝️ 1

Jennifer Miles15:10:46

@humayoun.khan we still use SAFe although in the Network space we have adopted a hybrid approach at this point, using a product based teams in a SAFe like environment.

👍 1

Gene Kim, ITREV, Program Chair15:10:01

Up next: Dr. @angeldiaz and @sheilalodhia!

👏 3

❤️ 2

Manny15:10:48

@genek I ❤️ Apple Pay.

👍 1

❤️ 2

Gene Kim, ITREV, Program Chair15:10:00

(Yes, me, too! I had shared my amazement of how Discover powers Apply Pay with several folks lately — several had said something like, “oh, yeah, I saw their name when I accepted the TOS when I set up my credit cards, and now I know why!“)

❤️ 2

Mark Peters15:10:29

Had a chat with Discover the other day, their fraud prevention folks didn't know about some of the changes to improve the system, like apple pay. So the multiple small charges in a day were getting flagged as fraud, and turned down, when marketing was advocating make those small charges for extra points

Mark Peters15:10:58

Runway stops when you take off... An 80k runway means more time on the ground

Chris Gallivan (Planview)15:10:42

#dojo

🎉 2

Angel Diaz15:10:45

It’s all about reaching escape velocity

👍 1

Roland Krocin15:10:22

@tiny.mpetersii Runway is a way for us to brand the transformation. There are different altitudes that we’re iterating through, but the core point is to get off the ground quickly and keep climbing consistently.

❤️ 3

👍 1

Mark Peters15:10:57

understood, too much Air Force time...

😁 1

Chris Gallivan (Planview)15:10:39

4 week...we always used 6 weeks

Chris Gallivan (Planview)15:10:39

4 week...we always used 6 weeks

Gene Kim, ITREV, Program Chair15:10:35

How did you converge upon 6 weeks?

C15:10:31

Everything takes six weeks

Gene Kim, ITREV, Program Chair15:10:05

Nothing takes 6 weeks. 🙂 😆

Chris Gallivan (Planview)15:10:15

We worked with the authors of Creating Your Dojo: Upskill Your Organization for Digital Evolution - Joel Tosi and Dion Stewart who started Dojos at Target. They recommended 6 weeks,

Chris Gallivan (Planview)15:10:34

Over time we saw that it took about 6 weeks for learning to stick

👍 1

Chris Gallivan (Planview)15:10:49

assuming at least 30 hours together a week

Chris Gallivan (Planview)15:10:58

we experimented with other time frames

Chris Gallivan (Planview)15:10:46

based on a lot of data across numerous teams

C15:10:48

sorry for formatting.

👍 1

C15:10:03

let me edit, I'm new to this slack thing

Joe Waid - Manager, Delivery Engineering - Columbia Sportswear15:10:10

I’d like to learn more about the concept of a Dojo, is that book the place to start?

C15:10:45

week 1 - A man, a plan, a canal; Panama During week 1 you make and begin executing a plan Week 2 - No plan survives contact with the enemy If you are tracking to a two week sprint, you realize you didn't plan well enough, and your plan isn't executable. Week 3 - A plan for a real plan Take your lumps from failing during the first two weeks, Week 4 - Get the work done This is the week where the real work happens, because the team finally knows and understands which direction its going Week 5 - If you find yourself going through hell, keep going Week 6 - Celebrate

Matt Ring (he/him) - Sr. Product/Engineering Coach, John Deere15:10:58

Same approach at Principal Financial Group (prior company). Six week dojo challenges. Influenced by talking with folks at Target and the Dojo Consortium.

C15:10:34

@genek - everything* takes six weeks, you just need to scope to ensure that

Chris Gallivan (Planview)15:10:46

it doesnt start to hockey stick until week 4 or 5

Chris Gallivan (Planview)15:10:54

at least that is what i observed

🎉 1

Chris Gallivan (Planview)15:10:07

week 2 or 3 is often despair

❤️ 1

C15:10:40

@chris.gallivan421 I've boiled it down to: week 1 - let's do this (excitement) week 2 - oh this is bad (overcommitment) week 3 - oh this is really bad (we have no idea what is going on week 4 - TADA! week 5 - I knew we were fine! week 6 - wrong rock

Chris Gallivan (Planview)15:10:48

yep

Matt Ring (he/him) - Sr. Product/Engineering Coach, John Deere15:10:11

Also this: "It takes anywhere from 18 to 254 day... to form a new habit and an average of 66 days for the new behavior to become automatic." https://www.healthline.com/health/how-long-does-it-take-to-form-a-habit#takeaway

Chris Gallivan (Planview)15:10:48

along with Ebbinghaus forgetting curve

Matt Ring (he/him) - Sr. Product/Engineering Coach, John Deere15:10:50

5 days a week x 6 weeks is only 30 work days of habit forming. So even six weeks has a risk of teams leaving a dojo and new habits not sticking.

👏 2

C15:10:42

Why 6 weeks? Why are 6 week boundaries attractive? Well because math. People are really bad at predicting the future, and therefore we have to develop means and mechanisms to understand how to break up time, which is a human construct as well. People have been dividing their time into smaller parts since the beginning, of well, time.

C15:10:36

@mring - which is why quarters are useful - form - 6 weeks storm, 6 weeks storm/norm, 1 week retro, repeat

Chris Gallivan (Planview)15:10:40

I agree with @mring - ideally it would be more than that amount of time - that was our minimum

👏 2

Chris Gallivan (Planview)15:10:43

we also saw cases of unlearning, where 20 hours spent in dojos, 20 hours spent at desk unlearning

💯 2

Chris Gallivan (Planview)15:10:58

it needs focus

C15:10:02

and repetition

Chris Gallivan (Planview)15:10:37

I love this topic - I’ll be around anytime to discuss this more

C15:10:27

#same - its my philosophical rule num (#) 2

C15:10:38

esp in the gov.

Roland Krocin16:10:52

Discover’s dojos are typically 4 to 6 weeks, with a hands-on and interactive experience that covers real-life scenarios which you wouldn't get from classroom-based training. The time line comes from the selection of learning modules that has been crafted to meet the product team. Dojos give product teams a thorough understanding of core concepts while leveraging their existing product backlogs for pairing and upskilling, creating an experiential learning environment that accelerates adoption in the team's backlog immediately. Its the combination of workshops in the morning, and pairing in the afternoon. Old habits take time to break as they practice new patterns to form new habits.

Chris Gallivan (Planview)16:10:38

we tried 4 weeks, but we found teams felt more pressure to deliver something. 6 weeks gave them some more room to breathe and learn

C16:10:29

I feel the same way with a 2 week sprint - lots of time spent churning, esp early on.

Chris Gallivan (Planview)16:10:49

software development is like repairing a tractor. sometimes you need to take a few bolts off to understand the problem

Bryan Finster - Defense Unicorns (Speaker)15:10:29

Yes, tracking improvement!

👍 4

Roland Krocin15:10:57

DORA metrics!

Jeffrey Fredrick, Author-Agile Conversations15:10:33

drink!

😁 1

Gene Kim, ITREV, Program Chair15:10:37

I particularly enjoyed @bryan.finster486’s talk on weaponizing DORA metrics yesterday — was absolutely fascinating!

👏 2

Gene Kim, ITREV, Program Chair15:10:41

Stories from three teams across some of the most important business units at Discover: • Priya Gupta, Sr Manager, Customer & Account Data • Joe Mathew, Sr Manager, Line Increase Request • Lakshmi Rupanagunta, Manager, Authorized User

👍 1

Angel Diaz15:10:03

Curious as to how do other foster in-open source approach to Dojo’s

Virginia Laurenzano NSA15:10:07

"coding in a closet" resonates with me

👍 2

Bryan Finster - Defense Unicorns (Speaker)15:10:16

@sheilalodhia we also put heavy focus on CI metrics at the WM Dojo. Code merge frequency and dev cycle time. Any thoughts?

Angel Diaz15:10:17

Hi - we do have a bunch of team metrics we use in addition to the ones we highlighted today. Code merge, refactor, re-use etc. included

👍 2

Sheila15:10:51

The metrics I presented were the minimums for all teams. The teams are free to measure with other metrics that help them solve their individual problems.

Bryan Finster - Defense Unicorns (Speaker)15:10:15

👍

Denee (de-NAY) Ferguson - Director, Technology - Capital One (Speaker)15:10:06

@marc.price Domenica Degrandis (Tasktop) did some internal training for us; we also had product owner training and to some extent an internal agile coach.... I personally think we should have done more, ensuring more role specific discussions about how this would work on day in day out basis. Having an agile coach working directly with each team would have cut the learning curve substantially.

👍 4

Marc Price - Nationwide Building Society (Speaker)15:10:41

we have a central team of enablement specialists that help with training and helping teams adopt new ways of working, sounds like we are heading in the right direction, thank you for sharing

Bryan Finster - Defense Unicorns (Speaker)15:10:27

We started using this to horizontally scale the knowledge of the goals. https://www.engineeringthedigitaltransformation.com/

Bryan Finster - Defense Unicorns (Speaker)15:10:49

Trying to push knowledge to the edge.

👍 1

Gene Kim, ITREV, Program Chair15:10:52

Thank you for getting these testimonials from your teams, @angeldiaz — these were great stories from the actual teams solving their actual problems, as opposed to executives talking “Powerpoint to Powerpoint”. 🙂

❤️ 3

👍 2

Gene Kim, ITREV, Program Chair15:10:45

and thank you for those testimonials, @sheilalodhia!

👍 2

Luke Rettig - Target, Sr Director-Global Inventory Mangement15:10:46

theme of this Summit: Connection > PtP

❤️ 4

Jon Tarrant15:10:56

PtP? PowerPoint or Point To Point?

Javier Magaña - Walmart15:10:25

Peer to Peer?

Luke Rettig - Target, Sr Director-Global Inventory Mangement15:10:07

powerpoint to powerpoint

👍 2

BMK-SECTION6-TransformationArchitect15:10:51

I love the Discover team presentation with all the micro interviews embedded in their talk - Love it cc - @angeldiaz @sheilalodhia

👍 4

❤️ 2

👏 1

Angel Diaz15:10:12

Thank you Our team is so excited and proud to share with everyone!

Roland Krocin15:10:25

We learn, we share! We get better together! That is a core Discover behavior!

Sheila15:10:42

Really appreciate your comment! Love sharing our journey.

Nick Eggleston (free radical)15:10:53

I love the notion of “what objective evidence is there that a certain person is actually good at X, Y, or Z.” 🙂

👍 2

✅ 1

John Allspaw15:10:35

If I may submit a patch, Gene: “what ~~objective~~ evidence is there that a certain person is perceived by others at being good ~~actually good~~ at X, Y, or Z.”

Those are two different but highly related questions

Nick Eggleston (free radical)15:10:02

And important

Gene Kim, ITREV, Program Chair16:10:20

😆 Nice. Just wanted to have a test case that detects when someone is not actually good at something. (Or anything. 🙂 😆

Nick Eggleston (free radical)12:10:09

Exactly… dunning-Kruger effect is very real, and there are folks who are amazing at expressing confidence while actually creating problems due to lack of competence in ways much harder to detect quickly

Geri Pohl15:10:02

Leaders and Product folks who get the chance to see and understand the value of dojos are the dream.

👍 2

❤️ 1

Olivier Jacques - AWS - DevEx15:10:03

:thinking_face: I wonder if some here started to leverage Nicole Forsgren's (GitHub / Microsoft), Margaret-Anne Storey, Chandra Maddila, Thomas Zimmermann, Brian Houck, and Jenna Butler "SPACE" framework on measuring Developer Productivity. https://queue.acm.org/detail.cfm?id=3454124. Including during and after Dojos.

Jon Tarrant15:10:31

Thanks for the reference. Looks very interesting.

👍 1

Mark Peters15:10:13

Converting innovation to invention - good thought - need to make sure not measured as innovation not resulting in invention is bad. 80% of new ideas tend to fail

👍 1

Gene Kim, ITREV, Program Chair15:10:40

Thank you so much, Dr. @angeldiaz and @sheilalodhia Lohdia!

❤️ 5

👍 2

👏 1

Chris Gallivan (Planview)15:10:48

great work on the gemba

Malcolm McAlpin15:10:51

Thank you!!

Topo pal15:10:54

Awesome @angeldiaz

❤️ 1

👍 2

Angel Diaz15:10:14

Thanks Topo!!

Craig Larsen - he/him - Solution Design Group Mpls15:10:56

Ok, well done!

Swarup Panja15:10:56

I loved presentation from Discover team!!

Angel Diaz15:10:14

Thank you

Sheila15:10:31

Thank you for taking out time to listen our story!

jeff.thomas15:10:57

Is the Discover Academy open to anyone.. It would be great to see ;o)

Roland Krocin15:10:34

Not yet, but stay tuned! ;)

👏 1

Catie Martin_Design Manager_BlueCrossBlueShield of SC15:10:01

Would love to hear more abut the academy!

👍 1

Roland Krocin15:10:09

You’ll see us here next year talking lots more about it. And we’ll be sure to engage with this community as we talk more about the Discover Tech Academy in the interim.

Ann Perry - IT Revolution15:10:02

:unicorn_face: Coming up next, our very own @genek and @steve773 with The Four Characteristics of Structure Needed to Get Great Dynamics

🙌 1

Jennifer Collings15:10:04

Up next: Dr. @steve773 and, umm, me! 🙂

🎉 1

❤️ 1

Thank you that was great!

❤️ 1

Roland Krocin15:10:10

Fantastic presentation!

❤️ 1

Raghu Tumuluri15:10:37

great presentation. thank you.

Javier Magaña - Walmart15:10:38

Great presentation

Bryce Miller15:10:59

Really enjoyed the recent podcast episode you did together

❤️ 1

Steve Spear15:10:01

Hoping this next speaker is worth listening too… :-)

🙏 1

😄 6

Margueritte Kim (CEO, IT Revolution)15:10:05

Hi @steve773!

Steve Spear15:10:17

yo!

Alyssa Lundgren - Centil - Product Owner15:10:20

YES!! Dr. @steve773 ! Get ready for a fun presentation, all!!

Steve Spear15:10:33

Thanks!

Gene Kim, ITREV, Program Chair15:10:21

This presentation represents how @steve773 and I are trying to prove that four characteristics of structure predict high- vs. low performance.

Ferrix Hovi - Principal Engineering Avocado - SOK (S Group)15:10:39

I'm afraid that @steve773 is going to help us drink from a firehose again.

😄 6

🧑‍🚒 2

🌊 1

➕ 1

Craig Larsen - he/him - Solution Design Group Mpls15:10:51

Ooh, sounds like this will also be a good talk

Tashfeen Mahmood15:10:57

As my 4-year old says: This is going to be fun!

➕ 2

Steve Spear15:10:44

👍

Chris15:10:05

Second consecutive DOES without bow tie for Dr @steve773 The begining of a new era} 😂

😭 3

Steve Spear15:10:31

Blame that on covid. I’m lucky if I remember to put on big boy pants.

Margueritte Kim (CEO, IT Revolution)15:10:46

I do not accept this!

☝️ 1

Klaudia Breslavets - Vanguard15:10:48

@angeldiaz @sheilalodhia thank you for sharing the Discover story! I've observed that learning culture is equal parts enablement with curated resources like internal academies but equally encouraging new behaviors for leaders & employees. Do you agree? What kinds of experiences has Discover had with the latter?

👍 3

Roland Krocin15:10:17

@kxbres agreed. It's the means and the ways. We equally coach on the adoption of the curated resources, alongside our core Discover behaviors that enable community, curiosity and innovation (maybe there’s an upcoming DOES talk on those ;)). Neither one can stand alone.

Sheila15:10:15

Great question! When changing how we work we tend to focus on the people within the teams and the steps they need to take….and never get to the leaders. This time as we rolled out Runway- we started with leaders in making the changes to the product model to seed the ‘why’ and problem we are solving mindset. Additionally, DTA has created learning journeys for leaders to help them understand the concepts the teams are learning as they get up skilled on DEVOPS.

Gene Kim, ITREV, Program Chair15:10:49

> What are the structures and dynamics necessary to unleash the distributed and collective human creativity and potential to compete and win, in an age being tumultuously disrupted by scientific and technological innovation, market transformations, and political and societal realignments. > > How and why over the last 150 years are some organization able to generate and deliver better ideas, quicker, faster, and more reliably. > > How do they create this magical dynamic that high-performing organizations use to unleash and empower everyone’s innate human creativity and intelligence to advance business and societal needs.

👍 1

Gene Kim, ITREV, Program Chair15:10:42

This is the classic Dev vs. Ops dynamic. But it’s also Merchant vs. Ops dynamic described yesterday by @lucas.rettig. And for that matter, Team of Teams.

Gene Kim, ITREV, Program Chair15:10:08

Did I get that right, @lucas.rettig Merchants vs. ___ ???

👍 1

Chris15:10:14

These two model of communication in the organization make me think of an email from Elon Musk to tesla,..

Nick Eggleston (free radical)15:10:27

Wow this Intro is so cognitively dense!! @genek

Jeff Gallimore (CTIO - Excella)15:10:30

configure vs run for the role of the leader. that’s a powerful lens.

Gene Kim, ITREV, Program Chair15:10:23

The suggestion is that the leader can look at the structure of the system, simulate it in their head, and predict how it will behave. LIke how @jtf looked at MIT Beer Game, and immediately said, “That’ll never work! One way communications, slow feedback!”

🍻 3

🍞 1

Matt Ring (he/him) - Sr. Product/Engineering Coach, John Deere15:10:45

@genek you reference the MIT beer game in your podcast and elsewhere frequently. Do you have a good source you like to reference for those who are new to the concept or want to see it in action?

Gene Kim, ITREV, Program Chair15:10:31

email me at <mailto:genek@itrevolution.com|genek@itrevolution.com> — I’ll send you some show notes in the Idealcast (or just look there).

Gene Kim, ITREV, Program Chair15:10:09

“queues” – what @scott.prugh hates most.

Steve Spear15:10:03

Who feels like that dude in the middle of the system? ideas, requests, inputs coming in from everywhere and outputs expected everywhere else. Holy 😱 Batman.

👍 1

🙌 1

☝️ 5

🙈 2

Chris15:10:57

Can’t find vishnu or ganesha multiarms emoji 😆

This is what was described in an Idealcast episode on modularity — the Eppinger Design Structure Matrix, where every node is connected to every other node. A completely full adjacency matrix.

👍 3

Gene Kim, ITREV, Program Chair15:10:24

https://en.wikipedia.org/wiki/Design_structure_matrix

Jeff Gallimore (CTIO - Excella)15:10:46

@steve773 @genek simplification, standardization, stabilization, synchronization — is it more useful to start with one of these? or treat them as all inter-related and use them all at once?