How to measure whether your company kick off actually worked
Kick off measurement is not “did people enjoy it?”. It is “did the year begin?”. A three-horizon framework for measuring what kick offs are actually for.
It is the Monday after the kick off. The CEO has asked, in passing, “how did it land?”. The CFO has booked a fifteen-minute slot in two weeks to review the spend. The board pack for next month has a line that says “kick off impact summary”. You have an attendance figure of 91%, an in-room satisfaction score of 8.4 out of 10, and a sense that the energy on the day was good. None of that is going to survive the board meeting, and you know it. You need a measurement model that maps to what the kick off was actually for, not the one your post-event survey tool spat out automatically.
This article is for that conversation. It treats kick off measurement as a strategic design problem, not a survey design problem. It also treats measurement as the closing of a loop that begins with the strategic intent of the kick off and runs through the 90 days that follow.
Direct Answer
Kick off measurement is not “did people enjoy it?” – it is “did the year begin?”. The strongest frameworks use three horizons. Momentum on the day, measured in-room and online. Strategy recall and language adoption at 30 days. Behaviour and decision shift at 90 days. Each horizon answers a different leadership question and requires a different method. None of them is a satisfaction score.
Key takeaways - At a glance
- A kick off is a launch event, not a learning event. The measurement should map to launch, not to satisfaction.
- The three horizons: momentum on the day, recall at 30 days, behaviour at 90 days.
- A pre-event baseline is the difference between a defensible model and a guess.
- Satisfaction scores and NPS are momentum signals at best. They do not tell you whether the strategy landed or behaviour shifted.
- The board-ready report is one slide: three horizons, three or four signals per horizon, one paragraph of narrative.
Why satisfaction scores don’t tell you if a kick off worked
The standard post-event measurement tool is a survey. It captures NPS, an overall satisfaction score, a few open-text comments and an attendance breakdown. It is delivered to the CEO three days after the event and put into the next leadership review.
The honest problem with this is structural. The survey measures the day. It does not measure what the day was for.
A kick off exists to launch a year. The measurement question is therefore not “did people enjoy it?” but “did the year begin?”. Did the workforce arrive at the kick off in one state and leave in another? Are the strategic priorities visible thirty days later in the language the workforce uses? Are they visible ninety days later in the decisions the organisation is making? Those are the questions a CFO is going to ask, eventually, even if the first conversation is about the survey result.
The deeper problem is that satisfaction scores are weakly correlated with whether a kick off actually worked. An audience can rate a beautifully produced, well-catered, emotionally satisfying event highly and still leave with no clearer sense of the year’s strategy than they came in with. The reverse is also true. A more austere event that landed the strategy sharply can produce a lower satisfaction score and a much stronger 90-day outcome.
Gallup’s State of the Global Workplace research puts global employee engagement at 21% in 2025. Engagement and clarity are linked: workforces that strongly agree they know what is expected of them are materially more likely to be engaged. The measurement question is whether the kick off moved that needle, not whether the audience enjoyed the day. They are different questions, and they need different answers.
What should you actually measure?
The strongest measurement framework for a kick off uses three horizons. Each horizon answers a different leadership question, uses a different method and produces a different kind of signal.
Momentum on the day. Did the audience arrive with energy and leave with energy? Did the in-room and online experiences both work? Did leadership land what they were on stage to land? Method: in-room observation, online engagement analytics, leadership self-rating. Signal: directional, not conclusive. The day-of measurement tells you whether the launch happened. It does not yet tell you whether the launch worked.
Recall and language adoption at 30 days. Can a representative sample of the workforce name the year’s priorities in their own words? Has the new strategy language started to appear in team meetings, planning conversations and internal comms? Method: pulse survey, listening exercise, manager interviews. Signal: the most important measurement in the whole framework, because it tells you whether the translation work succeeded.
Behaviour and decision shift at 90 days. Are the priorities visible in board papers, objective-setting, hiring decisions, internal comms language and the way managers actually run their teams? Method: structured decision audit, document review, manager listening. Signal: this is the test that decides whether the launch was real or theatrical.
The three horizons are sequential, but they are also designed to be planned together. Each horizon’s measurement is part of the brief, not added after the event. CIPD’s employee communication factsheet reinforces the principle that effective internal communication is planned and evaluated, not delivered and abandoned.
On the day: measuring momentum
The day-of measurement is the easiest layer to do badly and the easiest to confuse with the real measurement question. The goal is not a satisfaction score. The goal is a small set of momentum signals that tell you whether the launch happened cleanly.
In-room energy, observed structurally. Not a vibe check. A small number of named observers (Internal Comms, HR, an external partner) watching for specific signals: laughter at the right moments, attention curves across blocks, audience participation in the breakouts, body language during the strategy reveal. The observations are written up in a one-page note.
Online engagement analytics. For the remote audience, the data is structural. Concurrent viewer count across the day. Drop-off curve (where did people leave?). Chat activity and quality. Poll participation rates. Average watch time vs total event length. The shape of the curve tells you whether the hybrid layer worked, which is a question the in-room experience cannot answer.
Leadership self-rating, structured. Each executive who was on stage rates their own block on three dimensions (clarity of message, audience response, what they would change) within 24 hours. This produces a useful leadership view that is separate from the audience view, and that is sometimes more accurate at predicting how a strategy actually landed.
A short audience pulse, optional. A two-question pulse the day after, not the day of. One question on whether the audience can name what was launched, one open question on what stood out. NPS and satisfaction scores can be included for completeness, but they should not be the headline measurement. They are momentum signals, not launch signals.
The day-of measurement is directional. Its job is to tell you that the launch happened, not whether it worked. That question gets answered later.
At 30 days: measuring recall and language adoption
This is the most important horizon in the framework, and the one most often skipped.
Recall survey, structured. A short survey to a representative sample of the workforce (not the whole organisation, not just leadership). The headline question is whether the respondent can name the year’s priorities in their own words, unprompted. A separate prompted question can follow. The unprompted answer is the real measurement.
The headline number to track is the proportion of respondents who can name the priorities accurately. A strong launch tends to produce 65 to 80% accurate recall at 30 days. Below 50% is a signal that the cascade is not working. Above 85% is a signal that either the launch was unusually strong or the survey design is leading the respondents.
Language listening, qualitative. Ask five managers in different parts of the organisation, in a short conversation, to describe the year ahead in their own words. What language do they use? Where does the new strategy phrasing appear and where does it not? Where is it being translated accurately and where is it being softened? This produces a different, more textured signal than the survey, and the two together give you a useful read.
Comms audit. A quick scan of internal comms content across the 30 days post-event. Is the new strategy language showing up in non-kick-off comms? In team Slack channels? In email subject lines from leadership? The presence or absence of the language in the organic flow is a strong signal.
McKinsey’s research on organisational transformations found that successful transformations used line-manager briefings as a primary cascade mechanism in 65% of cases. The 30-day measurement is largely a measurement of whether that line-manager layer carried the cascade. If recall is weak at 30 days, the diagnosis is almost always that the manager toolkit did not get used.
At 90 days: measuring behaviour and decision shift
The 90-day question is whether the launch is real or theatrical. Recall on its own is not enough. The test is whether the priorities are showing up in the decisions the organisation is making.
Decision audit. A structured review of the documents the organisation has produced in the 90 days since the kick off. Are the new priorities named in board papers? In quarterly objective-setting templates? In hiring decision criteria? In internal comms language for non-kick-off topics? The audit does not need to be elaborate. A two-page note with named examples is the right size.
Manager behaviour signal. A second short pulse to line managers. Are they running team conversations that reference the year’s priorities? Are they using the language in their one-to-ones? Have they had any pushback or confusion from their teams that they have not been able to resolve? The aim is to surface the gaps, not to grade managers.
Visible examples, named. Leadership surfaces two or three concrete examples of teams or individuals making decisions in line with the new priorities. This is both a measurement and a reinforcement act. The act of naming the examples is part of the cascade.
Outcome metrics, where they exist. Some strategic priorities have clean lead indicators. Customer adoption metrics, pipeline indicators, hiring metrics, retention signals. Where they exist and are short-cycle enough to read at 90 days, include them. Where they do not, do not invent them.
By the end of the 90-day phase, the launch is either real or it is not, and the measurement framework gives you a defensible answer in either direction. If the launch did not land, the measurement also tells you where it broke (cascade, language, behaviour) so the next launch can be designed better.
Kick off measurement is the closing of a loop, not a post-event survey. The loop starts with strategic intent and ends with whether the workforce is making decisions in line with it ninety days later.
Mike Walker, Managing Director, MGN Events
How do you report this back to leadership?
The answer is short, and that is part of the point. A kick off measurement report should be one slide. Three horizons, three or four signals per horizon, one paragraph of narrative that names what worked, what did not, and what to do differently next time. That is the format that survives a board pack.
The slide structure that works in MGN’s experience.
Header: the question the kick off was launched to answer.
Three columns: momentum on the day, recall at 30 days, behaviour at 90 days. Each column has three or four signals, presented as numbers or short phrases. Each column has a clear verdict (working, partial, gap).
Narrative paragraph: four to six sentences that close the loop. What the launch was for. What the evidence says. What is being recommitted to. What is being adjusted.
That is it. No 30-page deck. No appendix of survey screenshots. The credibility of the measurement comes from its clarity, not its length. A leadership team reading the slide should be able to ask sharper questions because the format makes the answers visible.
The Event Effectiveness Audit is a useful first step for any organisation that has not yet measured a kick off in this way. It is a structured assessment of where measurement is strong, weak or absent.
Bringing it together
Kick off measurement is the closing of a loop that begins with strategic intent and runs through the 90 days that follow. The three horizons (momentum, recall, behaviour) map cleanly to what a kick off is for. None of them is a satisfaction score. All of them are designable into the brief from the start.
MGN scopes measurement alongside the event itself, not as a post-event afterthought. The framework, the survey design, the manager pulse, the audit structure and the board-ready slide are all part of the proposal from day one. If you want measurement built into your next kick off from the brief stage, the conversation worth having is structural.
Call 01932 22 33 33 or email hello@mgnevents.co.uk.
You can also explore our Company Kick Offs service page, our guide on planning a company kick off, or the Event Effectiveness Audit as a starting point.
Measure company Kick off Success FAQs
IS NPS A USEFUL KICK OFF METRIC?
Only as a momentum signal on the day. NPS does not tell you whether the strategy landed or behaviour shifted. It can be included for completeness, but it should not be the headline measurement. Reporting a kick off to leadership on NPS alone almost guarantees the next conversation will be about why the kick off is worth doing.
WHAT’S THE SINGLE MOST USEFUL 30-DAY METRIC?
Strategy recall. Can a representative sample of employees name the year’s priorities in their own words, unprompted? It is the cleanest signal that the launch worked, and it predicts the 90-day behaviour shift more reliably than any other single measurement.
HOW DO YOU MEASURE BEHAVIOUR AT 90 DAYS?
Decision audit. Look at board papers, objective-setting templates, hiring decisions and internal comms language. Are the new priorities named, weighted and decided on? The audit does not need to be elaborate. A two-page note with named examples is the right size.
DO WE NEED A PRE-EVENT BASELINE?
Yes. Without it, post-event measurement has no comparison point. A simple pre-event pulse survey, two or three weeks before the kick off, covers it. The survey asks the same recall question the 30-day measurement will ask, so the shift can be read directly.
SHOULD WE REPORT THIS IN THE SAME FORMAT AS CONFERENCE MEASUREMENT?
No. Kick off measurement maps to launch, not to learning. The conference measurement framework (covered in MGN’s piece on measuring conference impact) uses a different set of horizons because conferences answer a different question. The two frameworks are sibling, not interchangeable.
Written by MGN Events, a UK creative events agency specialising in corporate events and brand experiences.





