RRA Methodology

We all regularly use a risk-based methodology when making decisions in day-to-day life, without thinking about it. The Rapid Risk Assessment or Rapid Risk Analysis (RRA) methodology helps formalize this type of decision making and ensures that the process is reproducible and consistent, and that the results are easy to communicate.

This document spells out how RRAs are typically run for a service (which is the most common and recommended use case).

See also Risk TL;DR for an introduction to risk and our processes related to risk.

Rapid Risk Assessment for a service

A typical Rapid Risk Analysis/Assessment (RRA) should take about 30-60 minutes ⏰.

It is not a security review, a full threat-model, a vulnerability assessment, or an audit. These types of activities may however follow an RRA if deemed appropriate or necessary.

The main objective of the RRA is to understand the value of a service and its impact on the reputation, finances, and productivity of the project or business. It is based on the data processed, stored, or simply accessible by the service.

Note that the RRA does not focus on enumerating and analyzing security controls. The RRA process is intended for analyzing and assessing services, not processes or individual controls.

Preamble

Data is the most important item in risk management. Software, websites, infrastructure, networks and people handle, process, exchange and store data.

The RRA focuses on creating a summary of the risks associated with your data. Key points:

Quick! The RRA takes 30 to 60 minutes maximum.
Very high-level. Details are for complete threat models. The RRA can become a complete threat model over time though!
Concise, readable. Short and with clear risk levels.
Easy to update. Can be run during any phase of the project development and continuously updated.
Informative. Collects risk impact and a data dictionary. Also collects information about how the service functions.
Lets you know what to do. The RRA includes the list of recommendations from the security team with a priority for each item.

This helps to make the following types of risk-based decisions:

Is the security provided by a given platform appropriate to host a specific classification of data?
How much should we care about maintenance, etc?
Is there anything obvious we should really look at fixing right now?
Where should we focus our efforts to significantly increase the security of the service?
Did we forget anything, or have any blind spots we hadn’t thought of?

Sample use scenario

Firefox accounts store user data:

What happens if that data is disclosed to the world? What happens if the service goes down? What happens if the data is modified by someone unauthorized?
Do these events put us at financial risk? Is our productivity affected? Is our reputation affected?
What types of data does Firefox accounts handle? How sensitive is the primary type of data used by Firefox accounts?

The RRA risk table facilitates discovering the answers to these questions.

How-to: Attending and running RRAs

When to run RRAs? What do I need to bring or do?

RRAs are designed to be created and updated as needed, at any time, with or without an associated meeting. However, you really should run the first RRA during the design or architecture phase of new services together with a trained risk analyst.

It is also strongly recommended to have the following things available for the RRA creation:

Name of the person and/or team responsible for the service.
Data flow diagram.
List of the kinds of data that will be processed or stored: secrets, credentials, public data, confidential data, and anything else that may be important for this service.
An understanding of how the service works.

When NOT to run an RRA?

RRAs should only be run for services. If you have a question about a specific feature or design choice and how it’s going to impact other services, find which service your feature is tied to and see if there is already an RRA available. Otherwise, request an RRA for that service. This will help us (and you) assess your feature or design choice!

(Re)scoping large services

Large services may be split into multiple smaller services or sub-services that each handle a specific type of data and expose a limited set of features. This choice has to be made when running the RRA. If the sub-services are owned by different teams, it is a strong indicator that multiple RRAs should be run.

Large services that cannot be split up not only lead to a complex assessment, but also may indicate that the service itself needs to be re-designed in a more secure fashion. Any service that is too large to be understood within the time allocated for an RRA is too complex and will inevitably lead to unknown risks.

What to focus on during an RRA?

Getting value for the service owner (not the risk analyst). The service owner, developer, designer, architect, etc. need to understand what is most important to protect and whether they have any blind spots.
Data. Fill in the data dictionary. You need to know most of the data the service will have access to, store, or process.
Impact assessment. The RRA is the authority for impact levels and these are paramount. How bad can things get? What’s the worst-case scenario?
Recording threat scenarios. What attack scenarios were considered? Would someone else understand it?

What to NOT focus on:

Gathering security controls and figuring out how effective they are. Don’t do that! This information may be recorded if it comes up, but do not focus on it as this is very time-consuming. If the service is considered risky, specific processes can be recommended in the RRA recommendations section to assess specific controls separately.
Likelihood, Security provided by service. Don’t spend much time there! It is very hard to assess likelihood in most scenarios, and easy to get lost in “what if”. We have specific, separate processes to assess likelihood. Some quick questions may still help though, such as “what’s the security history of this service?”

Guided process for risk analysts: Running your RRA in ~30 minutes

This is a guided example of how to run an initial RRA. You will:

Invite the relevant people to a meeting.
Help them figure out risk impacts and record everything in the RRA doc.
Help them figure out the next steps.
Make them feel like they own the RRA document (and they do!).

All steps in this process are important. Skipping steps is likely to lead to issues for the future you.

Before the initial RRA meeting

Ensure no previous RRA exists; if one does, just enhance the current RRA document.
Create a copy of the RRA template in your RRA Google Drive directory.
Invite 1 or 2 members (product/service owners, lead engineers, etc.) related to the service with a bit of technical knowledge.
Ensure the invitees bring a data flow diagram and have an understanding of the data the service stores/processes.
You do not want more than 4 or 5 people total as this will slow down the RRA significantly. Many RRAs are run 1 on 1 (i.e. 2 people total).
Make sure everyone invited has edit rights to the document, and has the document open in front of them when the RRA starts (share your screen).
If this is anyone’s first RRA, ensure they understand the goals of the RRA and give them a short introduction to what the different steps will be. This both helps them follow and shows that you have control of the meeting (see the next section on time management).

Initial RRA meeting

Time management - take control

You will be responsible for the time management when running the first RRA for a service. You will sometimes have to cut a discussion short and be assertive: we have a tendency to jump directly to security controls during risk discussions. While valuable, this is not the main focus of the RRA: controls can be better discussed once the impacts have been clearly defined.

Examples of red flags that indicate you should re-focus the current discussion:

The discussion languishes (>1 minute) around “how to mitigate this very issue” or “we’re doing X to ensure this never happens”.
The discussion focuses on process preferences, changes, etc. instead of just filling in the RRA document.
The discussion about how the service works takes too long (>15 minutes) and the owner has to look up every single detail (you only need an overview at this stage, or the owner has to come back when they know what service they want to look at).

A good tip is to reserve 60 minutes of RRA time in the calendar, and plan to run the RRA for only 30 minutes.

This leaves you some room for error and lets you handle services that weren’t well understood by their owners. Best-case scenario, everyone will be happy when you cut the meeting short after only 30 minutes. In any case, always watch the clock ⏰!

Taking longer than 60 minutes for the initial RRA is considered a failure and should not happen.
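
As a quick sanity check on the 30-minute target, the per-section time budgets given in the section headings of this guide can be added up. The sketch below is only illustrative: the names AGENDA_MINUTES, PLANNED_MINUTES, RESERVED_MINUTES and agenda_bounds are made up for this example and are not part of any RRA tooling.

```python
# Per-section time budgets in minutes as (shortest, longest), copied from the
# section headings of this guide. The structure itself is an assumption.
AGENDA_MINUTES = {
    "Metadata": (1, 1),
    "Service Notes": (5, 5),
    "Data Dictionary": (5, 10),
    "Threat Scenarios": (5, 10),
    "Recommendations": (5, 5),
}

PLANNED_MINUTES = 30   # target duration of the meeting
RESERVED_MINUTES = 60  # calendar slot; going past it counts as a failure


def agenda_bounds(agenda):
    """Return the (shortest, longest) total duration of the agenda."""
    shortest = sum(low for low, _ in agenda.values())
    longest = sum(high for _, high in agenda.values())
    return shortest, longest


shortest, longest = agenda_bounds(AGENDA_MINUTES)
print(f"Agenda takes {shortest}-{longest} minutes")  # Agenda takes 21-31 minutes
print(f"Slack in the reserved slot: {RESERVED_MINUTES - longest} minutes")
```

The budgeted sections land close to the 30-minute plan, and the reserved hour leaves room for the wrap-up and for discussions that run over.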

Running the RRA meeting

If this is your first RRA, ensure that someone who has run RRAs previously is present to help you. It is a good idea to have attended multiple RRAs before starting your own. Your experience and understanding are key to running a successful RRA that will help the teams and keep the service safe.

RRA Utilities: There is a menu at the top of the RRA document called the “RRA Utilities” menu. Use it to set risk impacts, levels, and data classifications, and to mark the RRA as reviewed. Do use it, as our scripts rely on it to copy RRAs to our RRA API.

Metadata (1min)

Fill in the service name.
Fill in the service owner: if it’s unclear, ask who would make the decision to turn the service off in case of an incident. That is the service owner.
Look up the owner’s closest director or VP and add this as well.
Leave “service data classification” and “highest risk impact” empty for now.

Service Notes (5min)

This is where you put any notes that you feel are relevant to the understanding of the service, security, etc.

Ask the service owner what the service does and a little bit of how it works. Ensure that you understand the service well.

You should be able to reformulate what the service does, and the service owner should agree with your formulation.

You will want to copy a diagram (if possible, a data flow diagram) and have links back to the RRA request (with whichever bug/issue tracking you choose to use), notes, and the service’s own website (which may be a vendor).

Optional: This is also a good time to mention the vendor questionnaire if this is a vendor and it hasn’t been filled in. Your organization may have its own process instead.

Feel free to go back to this section (Service Notes) at any time to add any further notes.

Data Dictionary (5-10 minutes)

We want to know about all data the service will process or store (and not just store). Any data the service can touch or see is to be considered as part of the assessment.

You will need to ask the team or service owner about what kind of data the service processes or stores. Here are some examples that you should expect:

Specific configuration data
User or service credentials, secrets
User data (often called Personally Identifiable Information, or PII)
etc.

Set a data classification for each data type in the dictionary, such as “PUBLIC”, “STAFF CONFIDENTIAL”, etc. by using the “RRA Utilities” menu.

When you figure out what the bulk of the data is, or what the most important data is, set this as the “Service Data Classification” in the RRA metadata.
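
To make the data dictionary and the “Service Data Classification” more concrete, here is a minimal sketch of how the dictionary could be represented. Everything in it is an assumption made for illustration: the DataEntry fields, the example entries and their labels, and the SENSITIVITY_ORDER list, which only contains the two classification labels quoted in this guide. In practice the classification is set through the “RRA Utilities” menu and the final choice is the analyst’s judgment.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class DataEntry:
    """One row of the data dictionary (field names are illustrative)."""
    name: str
    classification: str


# Hypothetical ordering from least to most sensitive. Only the two labels
# quoted in this guide are listed; extend it with your organization's levels.
SENSITIVITY_ORDER = ["PUBLIC", "STAFF CONFIDENTIAL"]


def bulk_classification(entries):
    """Classification shared by the largest number of data types."""
    counts = Counter(entry.classification for entry in entries)
    return counts.most_common(1)[0][0]


def most_sensitive_classification(entries):
    """Highest classification present, according to SENSITIVITY_ORDER."""
    return max(
        (entry.classification for entry in entries),
        key=SENSITIVITY_ORDER.index,
    )


# Made-up example entries.
dictionary = [
    DataEntry("Published documentation", "PUBLIC"),
    DataEntry("Release notes", "PUBLIC"),
    DataEntry("Service configuration", "STAFF CONFIDENTIAL"),
]

print(bulk_classification(dictionary))            # PUBLIC
print(most_sensitive_classification(dictionary))  # STAFF CONFIDENTIAL
```

The two functions give the two candidates named above (the bulk of the data, or the most important data). When they disagree, as in this example, the analyst and the service owner decide which one best describes the service.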

Threat Scenarios (5-10 minutes)

This is where we discuss potential attack scenarios and figure out how bad things could go (i.e. the worst-case scenario).

The RRA document itself contains tips about this section as well, which should help you as you go along.

Do not record the threat types, attacker types, etc. in this model. Instead, think about the easiest attack vectors (“threat scenarios”) with examples of how they may occur. While you should focus on recording impact, it is also important to ask whether anything has already happened. If so, take a note, as this indicates a possibly higher probability of the impact occurring.

The following are the high-level properties that you will consider a threat scenario for:

Confidentiality: What happens if all the data is disclosed to the world?
Integrity: What happens if the data is incorrect, misleading, website defaced, etc.?
Availability: What happens if the data or service is missing, deleted, or currently unreachable?

For each, run through the questions for these categories, and assign an impact level:

Reputation issues
- Do we get in mainstream news? (MAXIMUM impact)
- Do we get in the technical news? (HIGH impact)
- Do we receive emails, bugs, twitter messages, etc? (MEDIUM impact)
- Not much? (LOW impact)
Productivity issues
- Are small teams occupied dealing with the issue for
  - Less than 24h? (LOW impact)
  - Less than 2 days? (MEDIUM impact)
  - Less than a week? (HIGH impact)
  - More? (MAXIMUM impact)
- How about large teams, or the entire company, or our user-base?
  - Less than 2h? (LOW impact)
  - Less than 24h? (MEDIUM impact)
  - Less than 2 days? (HIGH impact)
  - More? (MAXIMUM impact)
Financial issues?
- Would it cost money? How much?

Record all results and make sure that you set an impact level (use the “RRA Utilities” menu for this).
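
The reputation and productivity questions above amount to small lookup tables. The following sketch encodes them, assuming that durations are expressed in hours, that “a week” means 7 × 24 hours, and that “less than” is a strict bound. The names Impact, REPUTATION_IMPACT and productivity_impact are hypothetical and do not come from the RRA template or its scripts.

```python
from enum import IntEnum


class Impact(IntEnum):
    """Impact levels used in the RRA, ordered from lowest to highest."""
    LOW = 1
    MEDIUM = 2
    HIGH = 3
    MAXIMUM = 4


# Reputation outcomes from the questions above; the keys are illustrative.
REPUTATION_IMPACT = {
    "not_much": Impact.LOW,
    "emails_bugs_twitter": Impact.MEDIUM,
    "technical_news": Impact.HIGH,
    "mainstream_news": Impact.MAXIMUM,
}

# (upper bound in hours, impact). A duration matches the first bound it is
# strictly below; anything beyond the last bound is MAXIMUM.
SMALL_TEAM_THRESHOLDS = [
    (24, Impact.LOW),
    (2 * 24, Impact.MEDIUM),
    (7 * 24, Impact.HIGH),
]
LARGE_TEAM_THRESHOLDS = [
    (2, Impact.LOW),
    (24, Impact.MEDIUM),
    (2 * 24, Impact.HIGH),
]


def productivity_impact(hours_occupied, large_team=False):
    """Map the time spent dealing with an issue to an impact level.

    Set large_team=True for large teams, the entire company, or the user base.
    """
    thresholds = LARGE_TEAM_THRESHOLDS if large_team else SMALL_TEAM_THRESHOLDS
    for upper_bound, impact in thresholds:
        if hours_occupied < upper_bound:
            return impact
    return Impact.MAXIMUM


print(productivity_impact(12).name)                   # LOW
print(productivity_impact(12, large_team=True).name)  # MEDIUM
print(REPUTATION_IMPACT["technical_news"].name)       # HIGH
```

The same 12-hour disruption is LOW for a small team but MEDIUM once a large team, the whole company, or the user base is affected, which is why the two sets of thresholds are kept separate.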

It is fine to enhance these scenarios in the future, eventually creating a complete threat model (especially if the assessed impacts are HIGH or MAXIMUM), if further security work is required or the RRA is revisited.

Additional tips

Whenever the productivity impact is HIGH or MAXIMUM, there is probably also a financial impact due to the cost of the workforce being impacted.
Financial risk is sometimes hard to define, in particular when tied to contracts. When in doubt, it means there is no good data and that you should skip it.
Educate the project owners and lead developers of the project about the meaning of the risks you have described, and how the RRA can help them make decisions such as which operational environment to select, what technologies to use, and how much effort to put into securing the project (see also “Recommendations” below).

Recommendations (5 minutes)

While the RRA is not meant to be a complete review, recommendations do come up and this is a great time to chat about these.

Ensure all recommendations that came up (from you or the team) are mentioned here. It’s also ok to fill them as you go!
Use the “RRA Utilities” menu to set an impact for the recommendation (“if followed, how much would it help the service?”) as this allows service owners to prioritize work. HIGH and MAXIMUM threat scenarios are likely to have HIGH or above recommendations.
Ensure detection controls and access control have been mentioned, not just prevention controls. Can these be improved? Should we alert on events?
Does this service have an incident response plan defined?
Is the service using SSO for login? If so, how?
Is there a website or application that can be further assessed?
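
Because each recommendation carries an impact, service owners can sort the list to decide what to start with. Here is a minimal sketch of that prioritization, assuming recommendations are kept as simple dictionaries. The example recommendations and their impacts are made up, and IMPACT_ORDER and prioritize are hypothetical names.

```python
# Impact levels from lowest to highest, as used throughout the RRA.
IMPACT_ORDER = ["LOW", "MEDIUM", "HIGH", "MAXIMUM"]


def prioritize(recommendations):
    """Return the recommendations sorted from highest to lowest impact.

    Python's sort is stable, so recommendations with the same impact keep
    the order in which they were recorded.
    """
    return sorted(
        recommendations,
        key=lambda rec: IMPACT_ORDER.index(rec["impact"]),
        reverse=True,
    )


# Made-up recommendations for illustration only.
recommendations = [
    {"title": "Alert on suspicious events", "impact": "MEDIUM"},
    {"title": "Define an incident response plan", "impact": "HIGH"},
    {"title": "Use SSO for login", "impact": "MAXIMUM"},
    {"title": "Document the data flow", "impact": "LOW"},
]

for rec in prioritize(recommendations):
    print(f"{rec['impact']:<8} {rec['title']}")
```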

Wrapping up

Make sure you’ve filled in the “Service Data Classification” up top according to your data dictionary (what you consider to be either the bulk of the data or the most important data is your classification).
Make sure you’ve also filled in the “Highest Risk Impact”. This is the highest impact recorded in the threat scenarios.
Double check the findings with the team:
- Present the current risk impact and ask if they think it’s reasonable.
- Present the current recommendations and ask if they think anything’s missing and what they should start with.
- Consider requiring leadership to acknowledge any recommendation that is HIGH or MAXIMUM before they ship or use the service, especially if these aren’t followed.
- Ask the team if there is any additional security related question they want to ask or if anything wasn’t covered.
Add Google docs comments to the recommendations and assign the comment to the service owner(s), to ensure they’ve also been notified by email.
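
The “Highest Risk Impact” field mentioned in the wrap-up steps is simply the maximum of all impact levels recorded in the threat scenarios. A minimal sketch, assuming the scenarios are stored as a nested dictionary keyed by property (Confidentiality, Integrity, Availability) and category. The values below are invented, and IMPACT_ORDER and highest_risk_impact are hypothetical names.

```python
# Impact levels from lowest to highest, as used throughout the RRA.
IMPACT_ORDER = ["LOW", "MEDIUM", "HIGH", "MAXIMUM"]

# Made-up results of a threat scenario discussion.
threat_scenarios = {
    "Confidentiality": {"reputation": "HIGH", "productivity": "MEDIUM", "financial": "LOW"},
    "Integrity": {"reputation": "MEDIUM", "productivity": "MEDIUM", "financial": "LOW"},
    "Availability": {"reputation": "LOW", "productivity": "HIGH", "financial": "MEDIUM"},
}


def highest_risk_impact(scenarios):
    """Return the highest impact level recorded across all scenarios."""
    levels = (
        level
        for impacts in scenarios.values()
        for level in impacts.values()
    )
    return max(levels, key=IMPACT_ORDER.index)


print(highest_risk_impact(threat_scenarios))  # HIGH
```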

Reference documents and similar work

RRA Google Docs templates and code

See the integrations!