Results and plotting¶

This tutorial covers how to inspect, export, and plot results from an STIsim simulation.

Setup¶

Let's run a simple gonorrhea simulation to work with:

In [1]:

Copied!

import stisim as sti
import matplotlib.pyplot as plt

sim = sti.Sim(diseases='ng', n_agents=2000, start=2010, stop=2030)
sim.run(verbose=0)
import stisim as sti
import matplotlib.pyplot as plt

sim = sti.Sim(diseases='ng', n_agents=2000, start=2010, stop=2030)
sim.run(verbose=0)

Initializing sim with 2000 agents

Out[1]:

Sim(n=2000; 2010—2030; networks=structuredsexual, maternalnet; diseases=ng)

Browsing available results¶

After running a sim, all results are stored in sim.results. Each disease has its own results object. You can list all available keys with sim.results.ng.keys(), or access individual results directly. STIsim stores results at multiple levels of detail: overall, by sex, and by age group.

In [2]:

Copied!





# Access individual results
prev = sim.results.ng.prevalence
print(f'{prev.label}: min={prev.values.min():.3f}, mean={prev.values.mean():.3f}, max={prev.values.max():.3f}')

new_inf = sim.results.ng.new_infections
print(f'{new_inf.label}: min={new_inf.values.min():.0f}, mean={new_inf.values.mean():.1f}, max={new_inf.values.max():.0f}')

# There are many more -- including by sex and age group
print(f'\nTotal result keys: {len(sim.results.ng.keys())}')
# Access individual results
prev = sim.results.ng.prevalence
print(f'{prev.label}: min={prev.values.min():.3f}, mean={prev.values.mean():.3f}, max={prev.values.max():.3f}')

new_inf = sim.results.ng.new_infections
print(f'{new_inf.label}: min={new_inf.values.min():.0f}, mean={new_inf.values.mean():.1f}, max={new_inf.values.max():.0f}')

# There are many more -- including by sex and age group
print(f'\nTotal result keys: {len(sim.results.ng.keys())}')

Prevalence: min=0.007, mean=0.077, max=0.096
New infections: min=3, mean=42.5, max=64

Total result keys: 200

Each result is an ss.Result object. The raw data is in .values (a NumPy array) and the time axis is in .timevec:

In [3]:

Copied!

print(f'Values (first 5): {prev.values[:5]}')
print(f'Timevec:          {prev.timevec[0]} to {prev.timevec[-1]} ({len(prev.values)} timesteps)')
print(f'Values (first 5): {prev.values[:5]}')
print(f'Timevec:          {prev.timevec[0]} to {prev.timevec[-1]} ({len(prev.values)} timesteps)')

Values (first 5): [0.00675219 0.00740741 0.00940228 0.00938338 0.01137885]
Timevec:          2010.01.01 to 2030.01.01 (241 timesteps)

Automatic plotting¶

The simplest way to visualize results is sim.plot(). By default, it shows only the high-level results (not every age/sex subgroup):

In [4]:

Copied!

sim.plot()
sim.plot()

Figure(933.333x700)

No description has been provided for this image

Plotting specific results¶

To plot specific results, pass the key argument. You can use a single key or a list:

In [5]:

Copied!

# Plot just prevalence and new infections
sim.plot(key=['ng.prevalence', 'ng.new_infections'])
# Plot just prevalence and new infections
sim.plot(key=['ng.prevalence', 'ng.new_infections'])

Figure(800x600)

You can also plot results that are hidden from the default view. For example, sex-stratified prevalence:

In [6]:

Copied!

sim.plot(key=['ng.prevalence_f', 'ng.prevalence_m'])
sim.plot(key=['ng.prevalence_f', 'ng.prevalence_m'])

Figure(800x600)

Exporting results¶

As a DataFrame¶

Export all results to a Pandas DataFrame with sim.to_df():

In [7]:

Copied!

df = sim.to_df()
print(f'Shape: {df.shape}')
df.head()
df = sim.to_df()
print(f'Shape: {df.shape}')
df.head()

Shape: (241, 207)

Out[7]:

	timevec	structuredsexual_n_edges	ng_n_susceptible	ng_n_infected	ng_n_exposed	ng_n_asymptomatic	ng_n_symptomatic	ng_n_seeking_care	...	ng_new_symptomatic_m_50_65	ng_symp_prevalence_m_50_65	n_alive
0	2010-01-01	353.0	1984.0	15.0	1.0	16.0	0.0	0.0	...	0.0	0.000000	2000.0
1	2010-01-31	435.0	1981.0	16.0	3.0	11.0	8.0	4.0	...	1.0	0.005988	2000.0
2	2010-03-03	497.0	1981.0	19.0	0.0	10.0	9.0	0.0	...	0.0	0.005952	2000.0
3	2010-04-02	531.0	1978.0	19.0	3.0	13.0	9.0	1.0	...	0.0	0.005917	2000.0
4	2010-05-03	568.0	1976.0	22.0	2.0	13.0	11.0	2.0	...	1.0	0.011561	2000.0

5 rows × 207 columns

You can also export a single result:

In [8]:

Copied!

prev_df = sim.results.ng.prevalence.to_df()
prev_df.head()
prev_df = sim.results.ng.prevalence.to_df()
prev_df.head()

Out[8]:

	timevec	value
0	2010-01-01	0.006752
1	2010-01-31	0.007407
2	2010-03-03	0.009402
3	2010-04-02	0.009383
4	2010-05-03	0.011379

Resampling to annual results¶

STIsim runs with monthly timesteps by default, but you'll often want annual results for reporting or comparison with data. Use the resample() method on any result:

In [9]:

Copied!





# Monthly new infections (raw)
monthly = sim.results.ng.new_infections
print(f'Monthly: {len(monthly.values)} timesteps')

# Resample to annual (sums monthly counts)
annual = monthly.resample('year')
print(f'Annual:  {len(annual)} years')
print(annual)
# Monthly new infections (raw)
monthly = sim.results.ng.new_infections
print(f'Monthly: {len(monthly.values)} timesteps')

# Resample to annual (sums monthly counts)
annual = monthly.resample('year')
print(f'Annual:  {len(annual)} years')
print(annual)

Monthly: 241 timesteps
Annual:  21 years
Result(Gonorrhea.new_infections: min=45, mean=487.238, max=639)

The resample() method automatically chooses the right aggregation: sum for counts (like new_infections) and mean for rates/proportions (like prevalence). You can also resample to a DataFrame:

In [10]:

Copied!

annual_df = sim.results.ng.new_infections.resample('year')
print(f'Annual result: {len(annual_df.values)} years')
print(annual_df)
annual_df = sim.results.ng.new_infections.resample('year')
print(f'Annual result: {len(annual_df.values)} years')
print(annual_df)

Annual result: 21 years
Result(Gonorrhea.new_infections: min=45, mean=487.238, max=639)

You can also resample the entire sim's results at once via to_df():

In [11]:

Copied!

annual_all = sim.to_df(resample='year', use_years=True)
annual_all.head()
annual_all = sim.to_df(resample='year', use_years=True)
annual_all.head()

Out[11]:

	timevec	structuredsexual_n_edges	ng_n_susceptible	ng_n_infected	ng_n_exposed	ng_n_asymptomatic	ng_n_symptomatic	ng_n_seeking_care	...	ng_new_symptomatic_m_50_65	ng_symp_prevalence_m_50_65	n_alive
0	2010	557.250000	1965.500000	27.583333	6.916667	20.666667	13.833333	3.000000	...	5.0	0.006692	2000.0
1	2011	658.250000	1885.416667	96.416667	18.166667	62.666667	51.916667	6.833333	...	18.0	0.032234	2000.0
2	2012	685.500000	1852.333333	128.416667	19.250000	82.416667	65.250000	7.583333	...	20.0	0.050674	2000.0
3	2013	714.833333	1847.333333	133.166667	19.500000	84.416667	68.250000	8.416667	...	19.0	0.038104	2000.0
4	2014	737.833333	1846.833333	131.583333	21.583333	84.250000	68.916667	8.833333	...	28.0	0.047867	2000.0

5 rows × 207 columns

Custom plots with Matplotlib¶

For publication-quality figures or custom layouts, use the result values directly with Matplotlib:

In [12]:

Copied!





fig, axes = plt.subplots(1, 2, figsize=(10, 4))

# Left: prevalence by sex
axes[0].plot(sim.timevec, sim.results.ng.prevalence_f, label='Female')
axes[0].plot(sim.timevec, sim.results.ng.prevalence_m, label='Male')
axes[0].set_xlabel('Year')
axes[0].set_ylabel('Prevalence')
axes[0].set_title('NG prevalence by sex')
axes[0].legend()

# Right: annual new infections (resampled)
annual_inf = sim.results.ng.new_infections.resample('year', use_years=True)
axes[1].bar(annual_inf.timevec, annual_inf.values)
axes[1].set_xlabel('Year')
axes[1].set_ylabel('New infections')
axes[1].set_title('Annual NG infections')

plt.tight_layout()
fig
fig, axes = plt.subplots(1, 2, figsize=(10, 4))

# Left: prevalence by sex
axes[0].plot(sim.timevec, sim.results.ng.prevalence_f, label='Female')
axes[0].plot(sim.timevec, sim.results.ng.prevalence_m, label='Male')
axes[0].set_xlabel('Year')
axes[0].set_ylabel('Prevalence')
axes[0].set_title('NG prevalence by sex')
axes[0].legend()

# Right: annual new infections (resampled)
annual_inf = sim.results.ng.new_infections.resample('year', use_years=True)
axes[1].bar(annual_inf.timevec, annual_inf.values)
axes[1].set_xlabel('Year')
axes[1].set_ylabel('New infections')
axes[1].set_title('Annual NG infections')

plt.tight_layout()
fig

Out[12]:

Custom results¶

STIsim's built-in results cover the most common quantities (prevalence, incidence, counts by sex and age group). But you'll often need something specific to your analysis -- coinfection prevalence in a subgroup, treatment coverage by pathway, network contact rates, etc.

Any module can define custom results: analyzers, interventions, networks, connectors, and diseases all use the same define_results pattern. The result then automatically appears in sim.results, sim.to_df(), and sim.plot() -- and can be used as a calibration target.

Here's a minimal example using an analyzer that tracks gonorrhea prevalence among people aged 15-24:

In [13]:

Copied!





import starsim as ss

class YouthPrev(ss.Analyzer):
    """Track gonorrhea prevalence among 15-24 year olds."""

    def init_pre(self, sim):
        super().init_pre(sim)
        # define_results registers a result with the sim's results system.
        # After sim.run(), it appears in sim.results.youthprev.ng_prev_15_24
        # and in sim.to_df() as 'youthprev.ng_prev_15_24'.
        self.define_results(
            ss.Result('ng_prev_15_24', dtype=float, scale=False, label='NG prev (15-24)'),
        )

    def step(self):
        ppl = self.sim.people
        youth = (ppl.age >= 15) & (ppl.age < 25)       # Boolean mask for 15-24 year olds
        n_youth = youth.count()                          # How many are in this group
        if n_youth > 0:
            infected = self.sim.diseases.ng.infected     # Boolean: who is infected
            prev = (infected & youth).count() / n_youth  # Prevalence = infected / total
            self.results['ng_prev_15_24'][self.ti] = prev
import starsim as ss

class YouthPrev(ss.Analyzer):
    """Track gonorrhea prevalence among 15-24 year olds."""

    def init_pre(self, sim):
        super().init_pre(sim)
        # define_results registers a result with the sim's results system.
        # After sim.run(), it appears in sim.results.youthprev.ng_prev_15_24
        # and in sim.to_df() as 'youthprev.ng_prev_15_24'.
        self.define_results(
            ss.Result('ng_prev_15_24', dtype=float, scale=False, label='NG prev (15-24)'),
        )

    def step(self):
        ppl = self.sim.people
        youth = (ppl.age >= 15) & (ppl.age < 25)       # Boolean mask for 15-24 year olds
        n_youth = youth.count()                          # How many are in this group
        if n_youth > 0:
            infected = self.sim.diseases.ng.infected     # Boolean: who is infected
            prev = (infected & youth).count() / n_youth  # Prevalence = infected / total
            self.results['ng_prev_15_24'][self.ti] = prev

How it works:

init_pre is called during sim.init(). Call self.define_results(...) to register your results. Each ss.Result gets a pre-allocated array matching the simulation's number of timesteps. The result is stored under sim.results.<module_name>.<result_name> -- in this case, sim.results.youthprev.ng_prev_15_24.
step is called every timestep. Access the sim via self.sim, compute your quantity, and store it in self.results[name][self.ti] where self.ti is the current timestep index.
After sim.run(), the result is available everywhere: sim.results, sim.to_df(), sim.plot(), and as a calibration target (see Calibration tutorial).

This same pattern works in any module -- interventions, connectors, networks, etc. Anywhere you can write define_results in init_pre and self.results[name][self.ti] = value in step, you get a tracked result.

`ss.Result` options¶

The most commonly used arguments to ss.Result:

Argument	Default	Description
`name`	(required)	Key used to access the result, e.g. `'ng_prev_15_24'`
`dtype`	`float`	Data type (`float` for rates, `int` for counts)
`scale`	`True`	Whether to scale the result when changing `n_agents`. Set `True` for counts (e.g., new infections), `False` for rates/proportions (e.g., prevalence)
`auto_plot`	`True`	Whether to include in `sim.plot()` by default. Set `False` for results you only need in `to_df()`
`label`	`None`	Human-readable label for plot axes

Let's try it:

In [14]:

Copied!





# Run a sim with our custom analyzer
sim = sti.Sim(diseases='ng', n_agents=2000, start=2010, stop=2030, analyzers=[YouthPrev()])
sim.run(verbose=0)

# The result is now accessible like any built-in result
print(sim.results.youthprev.ng_prev_15_24)

# And it shows up in to_df()
df = sim.to_df(resample='year', use_years=True, sep='.')
print(df[['timevec', 'youthprev.ng_prev_15_24']].head())
# Run a sim with our custom analyzer
sim = sti.Sim(diseases='ng', n_agents=2000, start=2010, stop=2030, analyzers=[YouthPrev()])
sim.run(verbose=0)

# The result is now accessible like any built-in result
print(sim.results.youthprev.ng_prev_15_24)

# And it shows up in to_df()
df = sim.to_df(resample='year', use_years=True, sep='.')
print(df[['timevec', 'youthprev.ng_prev_15_24']].head())

Initializing sim with 2000 agents

Result(YouthPrev.ng_prev_15_24: min=0, mean=0.0314935, max=0.0779221)

   timevec  youthprev.ng_prev_15_24
0     2010                 0.006844
1     2011                 0.033992
2     2012                 0.044170
3     2013                 0.056500
4     2014                 0.032168

Summary¶

Task	Method
List all result keys	`sim.results.ng.keys()`
Access a result's values	`sim.results.ng.prevalence.values`
Auto-plot high-level results	`sim.plot()`
Plot specific results	`sim.plot(key=['ng.prevalence', 'ng.new_infections'])`
Plot hidden results	`sim.plot(key='ng.prevalence_f')`
Export all to DataFrame	`sim.to_df()`
Export one result	`sim.results.ng.prevalence.to_df()`
Resample to annual	`result.resample('year')`
Annual DataFrame	`sim.to_df(resample='year', use_years=True)`
Add custom result	Write an `ss.Analyzer` with `define_results`

Exercises¶

Run a sim with both 'ng' and 'ct'. Use sim.plot(key=...) to compare the prevalence of the two diseases on the same figure.
Export the annual new infections for gonorrhea to a DataFrame and save it to a CSV file.
Make a Matplotlib figure that plots monthly symptomatic prevalence (ng.symp_prevalence) alongside total prevalence (ng.prevalence). What fraction of infections are symptomatic?
Write an analyzer that tracks the number of symptomatic gonorrhea cases in males aged 25-49. Run the sim with it and verify the result appears in sim.to_df().