Passed
Pull Request — dev (#905)
by
unknown
01:33
created

cts_buildings()   B

Complexity

Conditions 2

Size

Total Lines 188
Code Lines 105

Duplication

Lines 0
Ratio 0 %

Importance

Changes 0
Metric Value
eloc 105
dl 0
loc 188
rs 7
c 0
b 0
f 0
cc 2
nop 0

How to fix   Long Method   

Long Method

Small methods make your code easier to understand, in particular if combined with a good name. Besides, if your method is small, finding a good name is usually much easier.

For example, if you find yourself adding comments to a method's body, this is usually a good sign to extract the commented part to a new method, and use the comment as a starting point when coming up with a good name for this new method.

Commonly applied refactorings include:

1
"""
2
CTS electricity and heat demand time series for scenarios in 2035 and 2050
3
assigned to OSM-buildings.
4
5
Disaggregation of cts heat & electricity demand time series from MV Substation
6
to census cells via annual demand and then to OSM buildings via
7
amenity tags or randomly if no sufficient OSM-data is available in the
8
respective census cell. If no OSM-buildings or synthetic residential buildings
9
are available new synthetic 5x5m buildings are generated.
10
11
The resulting data is stored in separate tables
12
13
* `openstreetmap.osm_buildings_synthetic`:
14
    Lists generated synthetic building with id, zensus_population_id and
15
    building type. This table is already created within
16
    :func:`hh_buildings.map_houseprofiles_to_buildings()`
17
* `openstreetmap.egon_cts_buildings`:
18
    Table of all selected cts buildings with id, census cell id, geometry and
19
    amenity count in building. This table is created within
20
    :func:`cts_buildings()`
21
* `demand.egon_cts_electricity_demand_building_share`:
22
    Table including the mv substation electricity profile share of all selected
23
    cts buildings for scenario eGon2035 and eGon100RE. This table is created
24
    within :func:`cts_electricity()`
25
* `demand.egon_cts_heat_demand_building_share`:
26
    Table including the mv substation heat profile share of all selected
27
    cts buildings for scenario eGon2035 and eGon100RE. This table is created
28
    within :func:`cts_heat()`
29
* `demand.egon_building_peak_loads`:
30
    Mapping of demand time series and buildings including cell_id, building
31
    area and peak load. This table is already created within
32
    :func:`hh_buildings.get_building_peak_loads()`
33
34
**The following datasets from the database are mainly used for creation:**
35
36
* `openstreetmap.osm_buildings_filtered`:
37
    Table of OSM-buildings filtered by tags to selecting residential and cts
38
    buildings only.
39
* `openstreetmap.osm_amenities_shops_filtered`:
40
    Table of OSM-amenities filtered by tags to select cts only.
41
* `openstreetmap.osm_amenities_not_in_buildings_filtered`:
42
    Table of amenities which do not intersect with any building from
43
    `openstreetmap.osm_buildings_filtered`
44
* `openstreetmap.osm_buildings_synthetic`:
45
    Table of synthetic residential buildings
46
* `boundaries.egon_map_zensus_buildings_filtered_all`:
47
    Mapping table of census cells and buildings filtered even if population
48
    in census cell = 0.
49
* `demand.egon_demandregio_zensus_electricity`:
50
    Table of annual electricity load demand for residential and cts at census
51
    cell level. Residential load demand is derived from aggregated residential
52
    building profiles. DemandRegio CTS load demand at NUTS3 is distributed to
53
    census cells linearly to heat demand from peta5.
54
* `demand.egon_peta_heat`:
55
    Table of annual heat load demand for residential and cts at census cell
56
    level from peta5.
57
* `demand.egon_etrago_electricity_cts`:
58
    Scaled cts electricity time series for every MV substation. Derived from
59
    DemandRegio SLP for selected economic sectors at nuts3. Scaled with annual
60
    demand from `demand.egon_demandregio_zensus_electricity`
61
* `demand.egon_etrago_heat_cts`:
62
    Scaled cts heat time series for every MV substation. Derived from
63
    DemandRegio SLP Gas for selected economic sectors at nuts3. Scaled with
64
    annual demand from `demand.egon_peta_heat`.
65
66
**What is the goal?**
67
68
To disaggregate cts heat and electricity time series from MV substation level
69
to geo-referenced buildings. DemandRegio and Peta5 is used to identify census
70
cells with load demand. Openstreetmap data is used and filtered via tags to
71
identify buildings and count amenities within. The number of amenities serve
72
to assign the appropriate load demand share to the building.
73
74
**What is the challenge?**
75
76
The OSM, DemandRegio and Peta5 dataset differ from each other. The OSM dataset
77
is a community based dataset which is extended throughout and does not claim to
78
be complete. Therefore not all census cells which have a demand assigned by
79
DemandRegio or Peta5 methodology also have buildings with respective tags or no
80
buildings at all. Merging these datasets inconsistencies need
81
to be addressed. For example: not yet tagged buildings or amenities in OSM
82
83
**How are these datasets combined?**
84
85
------>>>>>> continue
86
87
Firstly, all cts buildings are selected. Buildings which have cts amenities
88
inside.
89
90
91
**What are central assumptions during the data processing?**
92
93
* Mapping census to OSM data is not trivial. Discrepancies are substituted.
94
* Missing OSM buildings are generated by census building count.
95
* If no census building count data is available, the number of buildings is
96
derived by an average rate of households/buildings applied to the number of
97
households.
98
99
**Drawbacks and limitations of the data**
100
101
* Missing OSM buildings in cells without census building count are derived by
102
an average rate of households/buildings applied to the number of households.
103
As only whole houses can exist, the substitute is ceiled to the next higher
104
integer. Ceiling is applied to avoid rounding to amount of 0 buildings.
105
106
* As this datasets is a cascade after profile assignement at census cells
107
also check drawbacks and limitations in hh_profiles.py.
108
109
110
111
Example Query
112
-----
113
114
115
Notes
116
-----
117
118
This module docstring is rather a dataset documentation. Once, a decision
119
is made in ... the content of this module docstring needs to be moved to
120
docs attribute of the respective dataset class.
121
"""
122
123
from geoalchemy2 import Geometry
124
from geoalchemy2.shape import to_shape
125
from sqlalchemy import REAL, Column, Integer, String, func
126
from sqlalchemy.ext.declarative import declarative_base
127
import geopandas as gpd
128
import numpy as np
129
import pandas as pd
130
import saio
131
132
from egon.data import db
133
from egon.data import logger as log
134
from egon.data.datasets import Dataset
135
from egon.data.datasets.electricity_demand import (
136
    EgonDemandRegioZensusElectricity,
137
)
138
from egon.data.datasets.electricity_demand.temporal import (
139
    EgonEtragoElectricityCts,
140
)
141
from egon.data.datasets.electricity_demand_timeseries.hh_buildings import (
142
    BuildingElectricityPeakLoads,
143
    OsmBuildingsSynthetic,
144
)
145
from egon.data.datasets.electricity_demand_timeseries.tools import (
146
    random_ints_until_sum,
147
    random_point_in_square,
148
    specific_int_until_sum,
149
    write_table_to_postgis,
150
    write_table_to_postgres,
151
)
152
from egon.data.datasets.heat_demand import EgonPetaHeat
153
from egon.data.datasets.heat_demand_timeseries import EgonEtragoHeatCts
154
from egon.data.datasets.zensus_mv_grid_districts import MapZensusGridDistricts
155
from egon.data.datasets.zensus_vg250 import DestatisZensusPopulationPerHa
156
157
engine = db.engine()
158
Base = declarative_base()
159
160
# import db tables
161
saio.register_schema("openstreetmap", engine=engine)
162
saio.register_schema("boundaries", engine=engine)
163
164
165
class EgonCtsElectricityDemandBuildingShare(Base):
166
    __tablename__ = "egon_cts_electricity_demand_building_share"
167
    __table_args__ = {"schema": "demand"}
168
169
    building_id = Column(Integer, primary_key=True)
170
    scenario = Column(String, primary_key=True)
171
    bus_id = Column(Integer, index=True)
172
    profile_share = Column(REAL)
173
174
175
class EgonCtsHeatDemandBuildingShare(Base):
176
    __tablename__ = "egon_cts_heat_demand_building_share"
177
    __table_args__ = {"schema": "demand"}
178
179
    building_id = Column(Integer, primary_key=True)
180
    scenario = Column(String, primary_key=True)
181
    bus_id = Column(Integer, index=True)
182
    profile_share = Column(REAL)
183
184
185
class CtsBuildings(Base):
186
    __tablename__ = "egon_cts_buildings"
187
    __table_args__ = {"schema": "openstreetmap"}
188
189
    serial = Column(Integer, primary_key=True)
190
    id = Column(Integer, index=True)
191
    zensus_population_id = Column(Integer, index=True)
192
    geom_building = Column(Geometry("Polygon", 3035))
193
    n_amenities_inside = Column(Integer)
194
    source = Column(String)
195
196
197
class BuildingHeatPeakLoads(Base):
198
    __tablename__ = "egon_building_heat_peak_loads"
199
    __table_args__ = {"schema": "demand"}
200
201
    building_id = Column(Integer, primary_key=True)
202
    scenario = Column(String, primary_key=True)
203
    sector = Column(String, primary_key=True)
204
    peak_load_in_w = Column(REAL)
205
206
207
class CtsDemandBuildings(Dataset):
208
    def __init__(self, dependencies):
209
        super().__init__(
210
            name="CtsDemandBuildings",
211
            version="0.0.0",
212
            dependencies=dependencies,
213
            tasks=(
214
                cts_buildings,
215
                {cts_electricity, cts_heat},
216
                {get_cts_electricity_peak_load, get_cts_heat_peak_load},
217
            ),
218
        )
219
220
221
def amenities_without_buildings():
222
    """
223
    Amenities which have no buildings assigned and are in
224
    a cell with cts demand are determined.
225
226
    Returns
227
    -------
228
    pd.DataFrame
229
        Table of amenities without buildings
230
    """
231
    from saio.openstreetmap import osm_amenities_not_in_buildings_filtered
232
233
    with db.session_scope() as session:
234
        cells_query = (
235
            session.query(
236
                DestatisZensusPopulationPerHa.id.label("zensus_population_id"),
237
                osm_amenities_not_in_buildings_filtered.geom_amenity,
238
                osm_amenities_not_in_buildings_filtered.egon_amenity_id,
239
            )
240
            .filter(
241
                func.st_within(
242
                    osm_amenities_not_in_buildings_filtered.geom_amenity,
243
                    DestatisZensusPopulationPerHa.geom,
244
                )
245
            )
246
            .filter(
247
                DestatisZensusPopulationPerHa.id
248
                == EgonDemandRegioZensusElectricity.zensus_population_id
249
            )
250
            .filter(
251
                EgonDemandRegioZensusElectricity.sector == "service",
252
                EgonDemandRegioZensusElectricity.scenario == "eGon2035",
253
            )
254
        )
255
256
    df_amenities_without_buildings = gpd.read_postgis(
257
        cells_query.statement,
258
        cells_query.session.bind,
259
        geom_col="geom_amenity",
260
    )
261
    return df_amenities_without_buildings
262
263
264
def place_buildings_with_amenities(df, amenities=None, max_amenities=None):
265
    """
266
    Building centroids are placed randomly within census cells.
267
    The Number of buildings is derived from n_amenity_inside, the selected
268
    method and number of amenities per building.
269
270
    Returns
271
    -------
272
    df: gpd.GeoDataFrame
273
        Table of buildings centroids
274
    """
275
    if isinstance(max_amenities, int):
276
        # amount of amenities is randomly generated within bounds
277
        # (max_amenities, amenities per cell)
278
        df["n_amenities_inside"] = df["n_amenities_inside"].apply(
279
            random_ints_until_sum, args=[max_amenities]
280
        )
281
    if isinstance(amenities, int):
282
        # Specific amount of amenities per building
283
        df["n_amenities_inside"] = df["n_amenities_inside"].apply(
284
            specific_int_until_sum, args=[amenities]
285
        )
286
287
    # Unnest each building
288
    df = df.explode(column="n_amenities_inside")
289
290
    # building count per cell
291
    df["building_count"] = df.groupby(["zensus_population_id"]).cumcount() + 1
292
293
    # generate random synthetic buildings
294
    edge_length = 5
295
    # create random points within census cells
296
    points = random_point_in_square(geom=df["geom"], tol=edge_length / 2)
297
298
    df.reset_index(drop=True, inplace=True)
299
    # Store center of polygon
300
    df["geom_point"] = points
301
    # Drop geometry of census cell
302
    df = df.drop(columns=["geom"])
303
304
    return df
305
306
307
def create_synthetic_buildings(df, points=None, crs="EPSG:3035"):
308
    """
309
    Synthetic buildings are generated around points.
310
311
    Parameters
312
    ----------
313
    df: pd.DataFrame
314
        Table of census cells
315
    points: gpd.GeoSeries or str
316
        List of points to place buildings around or column name of df
317
    crs: str
318
        CRS of result table
319
320
    Returns
321
    -------
322
    df: gpd.GeoDataFrame
323
        Synthetic buildings
324
    """
325
326
    if isinstance(points, str) and points in df.columns:
327
        points = df[points]
328
    elif isinstance(points, gpd.GeoSeries):
329
        pass
330
    else:
331
        raise ValueError("Points are of the wrong type")
332
333
    # Create building using a square around point
334
    edge_length = 5
335
    df["geom_building"] = points.buffer(distance=edge_length / 2, cap_style=3)
336
337
    if "geom_point" not in df.columns:
338
        df["geom_point"] = df["geom_building"].centroid
339
340
    df = gpd.GeoDataFrame(
341
        df,
342
        crs=crs,
343
        geometry="geom_building",
344
    )
345
346
    # TODO remove after #772 implementation of egon_building_id
347
    df.rename(columns={"id": "egon_building_id"}, inplace=True)
348
349
    # get max number of building ids from synthetic residential table
350
    with db.session_scope() as session:
351
        max_synth_residential_id = session.execute(
352
            func.max(OsmBuildingsSynthetic.id)
353
        ).scalar()
354
    max_synth_residential_id = int(max_synth_residential_id)
355
356
    # create sequential ids
357
    df["egon_building_id"] = range(
358
        max_synth_residential_id + 1,
359
        max_synth_residential_id + df.shape[0] + 1,
360
    )
361
362
    df["area"] = df["geom_building"].area
363
    # set building type of synthetic building
364
    df["building"] = "cts"
365
    # TODO remove in #772
366
    df = df.rename(
367
        columns={
368
            # "zensus_population_id": "cell_id",
369
            "egon_building_id": "id",
370
        }
371
    )
372
    return df
373
374
375
def buildings_with_amenities():
376
    """
377
    Amenities which are assigned to buildings are determined
378
    and grouped per building and zensus cell. Buildings
379
    covering multiple cells therefore exists multiple times
380
    but in different zensus cells. This is necessary to cover
381
    all cells with a cts demand. If buildings exist in multiple
382
    substations, their amenities are summed and assigned and kept in
383
    one substation only. If as a result, a census cell is uncovered,
384
    a synthetic amenity is placed. The buildings are aggregated
385
    afterwards during the calculation of the profile_share.
386
387
    Returns
388
    -------
389
    df_buildings_with_amenities: gpd.GeoDataFrame
390
        Contains all buildings with amenities per zensus cell.
391
    df_lost_cells: gpd.GeoDataFrame
392
        Contains synthetic amenities in lost cells. Might be empty
393
    """
394
395
    from saio.openstreetmap import osm_amenities_in_buildings_filtered
396
397
    with db.session_scope() as session:
398
        cells_query = (
399
            session.query(
400
                osm_amenities_in_buildings_filtered,
401
                MapZensusGridDistricts.bus_id,
402
            )
403
            .filter(
404
                MapZensusGridDistricts.zensus_population_id
405
                == osm_amenities_in_buildings_filtered.zensus_population_id
406
            )
407
            .filter(
408
                EgonDemandRegioZensusElectricity.zensus_population_id
409
                == osm_amenities_in_buildings_filtered.zensus_population_id
410
            )
411
            .filter(
412
                EgonDemandRegioZensusElectricity.sector == "service",
413
                EgonDemandRegioZensusElectricity.scenario == "eGon2035",
414
            )
415
        )
416
    df_amenities_in_buildings = pd.read_sql(
417
        cells_query.statement, cells_query.session.bind, index_col=None
418
    )
419
420
    df_amenities_in_buildings["geom_building"] = df_amenities_in_buildings[
421
        "geom_building"
422
    ].apply(to_shape)
423
    df_amenities_in_buildings["geom_amenity"] = df_amenities_in_buildings[
424
        "geom_amenity"
425
    ].apply(to_shape)
426
427
    df_amenities_in_buildings["n_amenities_inside"] = 1
428
429
    # add identifier column for buildings in multiple substations
430
    df_amenities_in_buildings[
431
        "duplicate_identifier"
432
    ] = df_amenities_in_buildings.groupby(["id", "bus_id"])[
433
        "n_amenities_inside"
434
    ].transform(
435
        "cumsum"
436
    )
437
    df_amenities_in_buildings = df_amenities_in_buildings.sort_values(
438
        ["id", "duplicate_identifier"]
439
    )
440
    # sum amenities of buildings with multiple substations
441
    df_amenities_in_buildings[
442
        "n_amenities_inside"
443
    ] = df_amenities_in_buildings.groupby(["id", "duplicate_identifier"])[
444
        "n_amenities_inside"
445
    ].transform(
446
        "sum"
447
    )
448
449
    # create column to always go for bus_id with max amenities
450
    df_amenities_in_buildings[
451
        "max_amenities"
452
    ] = df_amenities_in_buildings.groupby(["id", "bus_id"])[
453
        "n_amenities_inside"
454
    ].transform(
455
        "sum"
456
    )
457
    # sort to go for
458
    df_amenities_in_buildings.sort_values(
459
        ["id", "max_amenities"], ascending=False, inplace=True
460
    )
461
462
    # identify lost zensus cells
463
    df_lost_cells = df_amenities_in_buildings.loc[
464
        df_amenities_in_buildings.duplicated(
465
            subset=["id", "duplicate_identifier"], keep="first"
466
        )
467
    ]
468
    df_lost_cells.drop_duplicates(
469
        subset=["zensus_population_id"], inplace=True
470
    )
471
472
    # drop buildings with multiple substation and lower max amenity
473
    df_amenities_in_buildings.drop_duplicates(
474
        subset=["id", "duplicate_identifier"], keep="first", inplace=True
475
    )
476
477
    # check if lost zensus cells are already covered
478
    if not df_lost_cells.empty:
479
        if not (
480
            df_amenities_in_buildings["zensus_population_id"]
481
            .isin(df_lost_cells["zensus_population_id"])
482
            .empty
483
        ):
484
            # query geom data for cell if not
485
            with db.session_scope() as session:
486
                cells_query = session.query(
487
                    DestatisZensusPopulationPerHa.id,
488
                    DestatisZensusPopulationPerHa.geom,
489
                ).filter(
490
                    DestatisZensusPopulationPerHa.id.in_(
491
                        df_lost_cells["zensus_population_id"]
492
                    )
493
                )
494
495
            df_lost_cells = gpd.read_postgis(
496
                cells_query.statement,
497
                cells_query.session.bind,
498
                geom_col="geom",
499
            )
500
            # TODO maybe adapt method
501
            # place random amenity in cell
502
            df_lost_cells["n_amenities_inside"] = 1
503
            df_lost_cells.rename(
504
                columns={
505
                    "id": "zensus_population_id",
506
                },
507
                inplace=True,
508
            )
509
            df_lost_cells = place_buildings_with_amenities(
510
                df_lost_cells, amenities=1
511
            )
512
            df_lost_cells.rename(
513
                columns={
514
                    # "id": "zensus_population_id",
515
                    "geom_point": "geom_amenity",
516
                },
517
                inplace=True,
518
            )
519
            df_lost_cells.drop(
520
                columns=["building_count", "n_amenities_inside"], inplace=True
521
            )
522
        else:
523
            df_lost_cells = None
524
    else:
525
        df_lost_cells = None
526
527
    # drop helper columns
528
    df_amenities_in_buildings.drop(
529
        columns=["duplicate_identifier"], inplace=True
530
    )
531
532
    # sum amenities per building and cell
533
    df_amenities_in_buildings[
534
        "n_amenities_inside"
535
    ] = df_amenities_in_buildings.groupby(["zensus_population_id", "id"])[
536
        "n_amenities_inside"
537
    ].transform(
538
        "sum"
539
    )
540
    # drop duplicated buildings
541
    df_buildings_with_amenities = df_amenities_in_buildings.drop_duplicates(
542
        ["id", "zensus_population_id"]
543
    )
544
    df_buildings_with_amenities.reset_index(inplace=True, drop=True)
545
546
    df_buildings_with_amenities = df_buildings_with_amenities[
547
        ["id", "zensus_population_id", "geom_building", "n_amenities_inside"]
548
    ]
549
    df_buildings_with_amenities.rename(
550
        columns={
551
            # "zensus_population_id": "cell_id",
552
            "egon_building_id": "id"
553
        },
554
        inplace=True,
555
    )
556
557
    return df_buildings_with_amenities, df_lost_cells
558
559
560
def buildings_without_amenities():
561
    """
562
    Buildings (filtered and synthetic) in cells with
563
    cts demand but no amenities are determined.
564
565
    Returns
566
    -------
567
    df_buildings_without_amenities: gpd.GeoDataFrame
568
        Table of buildings without amenities in zensus cells
569
        with cts demand.
570
    """
571
    from saio.boundaries import egon_map_zensus_buildings_filtered_all
572
    from saio.openstreetmap import (
573
        osm_amenities_shops_filtered,
574
        osm_buildings_filtered,
575
        osm_buildings_synthetic,
576
    )
577
578
    # buildings_filtered in cts-demand-cells without amenities
579
    with db.session_scope() as session:
580
581
        # Synthetic Buildings
582
        q_synth_buildings = session.query(
583
            osm_buildings_synthetic.cell_id.cast(Integer).label(
584
                "zensus_population_id"
585
            ),
586
            osm_buildings_synthetic.id.cast(Integer).label("id"),
587
            osm_buildings_synthetic.area.label("area"),
588
            osm_buildings_synthetic.geom_building.label("geom_building"),
589
            osm_buildings_synthetic.geom_point.label("geom_point"),
590
        )
591
592
        # Buildings filtered
593
        q_buildings_filtered = session.query(
594
            egon_map_zensus_buildings_filtered_all.zensus_population_id,
595
            osm_buildings_filtered.id,
596
            osm_buildings_filtered.area,
597
            osm_buildings_filtered.geom_building,
598
            osm_buildings_filtered.geom_point,
599
        ).filter(
600
            osm_buildings_filtered.id
601
            == egon_map_zensus_buildings_filtered_all.id
602
        )
603
604
        # Amenities + zensus_population_id
605
        q_amenities = (
606
            session.query(
607
                DestatisZensusPopulationPerHa.id.label("zensus_population_id"),
608
            )
609
            .filter(
610
                func.st_within(
611
                    osm_amenities_shops_filtered.geom_amenity,
612
                    DestatisZensusPopulationPerHa.geom,
613
                )
614
            )
615
            .distinct(DestatisZensusPopulationPerHa.id)
616
        )
617
618
        # Cells with CTS demand but without amenities
619
        q_cts_without_amenities = (
620
            session.query(
621
                EgonDemandRegioZensusElectricity.zensus_population_id,
622
            )
623
            .filter(
624
                EgonDemandRegioZensusElectricity.sector == "service",
625
                EgonDemandRegioZensusElectricity.scenario == "eGon2035",
626
            )
627
            .filter(
628
                EgonDemandRegioZensusElectricity.zensus_population_id.notin_(
629
                    q_amenities
630
                )
631
            )
632
            .distinct()
633
        )
634
635
        # Buildings filtered + synthetic buildings residential in
636
        # cells with CTS demand but without amenities
637
        cells_query = q_synth_buildings.union(q_buildings_filtered).filter(
638
            egon_map_zensus_buildings_filtered_all.zensus_population_id.in_(
639
                q_cts_without_amenities
640
            )
641
        )
642
643
    # df_buildings_without_amenities = pd.read_sql(
644
    #     cells_query.statement, cells_query.session.bind, index_col=None)
645
    df_buildings_without_amenities = gpd.read_postgis(
646
        cells_query.statement,
647
        cells_query.session.bind,
648
        geom_col="geom_building",
649
    )
650
651
    df_buildings_without_amenities = df_buildings_without_amenities.rename(
652
        columns={
653
            # "zensus_population_id": "cell_id",
654
            "egon_building_id": "id",
655
        }
656
    )
657
658
    return df_buildings_without_amenities
659
660
661
def select_cts_buildings(df_buildings_wo_amenities, max_n):
662
    """
663
    N Buildings (filtered and synthetic) in each cell with
664
    cts demand are selected. Only the first n buildings
665
    are taken for each cell. The buildings are sorted by surface
666
    area.
667
668
    Returns
669
    -------
670
    df_buildings_with_cts_demand: gpd.GeoDataFrame
671
        Table of buildings
672
    """
673
674
    df_buildings_wo_amenities.sort_values(
675
        "area", ascending=False, inplace=True
676
    )
677
    # select first n ids each census cell if available
678
    df_buildings_with_cts_demand = (
679
        df_buildings_wo_amenities.groupby("zensus_population_id")
680
        .nth(list(range(max_n)))
681
        .reset_index()
682
    )
683
    df_buildings_with_cts_demand.reset_index(drop=True, inplace=True)
684
685
    return df_buildings_with_cts_demand
686
687
688
def cells_with_cts_demand_only(df_buildings_without_amenities):
689
    """
690
    Cells with cts demand but no amenities or buildilngs
691
    are determined.
692
693
    Returns
694
    -------
695
    df_cells_only_cts_demand: gpd.GeoDataFrame
696
        Table of cells with cts demand but no amenities or buildings
697
    """
698
    from saio.openstreetmap import osm_amenities_shops_filtered
699
700
    # cells mit amenities
701
    with db.session_scope() as session:
702
        sub_query = (
703
            session.query(
704
                DestatisZensusPopulationPerHa.id.label("zensus_population_id"),
705
            )
706
            .filter(
707
                func.st_within(
708
                    osm_amenities_shops_filtered.geom_amenity,
709
                    DestatisZensusPopulationPerHa.geom,
710
                )
711
            )
712
            .distinct(DestatisZensusPopulationPerHa.id)
713
        )
714
715
        cells_query = (
716
            session.query(
717
                EgonDemandRegioZensusElectricity.zensus_population_id,
718
                EgonDemandRegioZensusElectricity.scenario,
719
                EgonDemandRegioZensusElectricity.sector,
720
                EgonDemandRegioZensusElectricity.demand,
721
                DestatisZensusPopulationPerHa.geom,
722
            )
723
            .filter(
724
                EgonDemandRegioZensusElectricity.sector == "service",
725
                EgonDemandRegioZensusElectricity.scenario == "eGon2035",
726
            )
727
            .filter(
728
                EgonDemandRegioZensusElectricity.zensus_population_id.notin_(
729
                    sub_query
730
                )
731
            )
732
            .filter(
733
                EgonDemandRegioZensusElectricity.zensus_population_id
734
                == DestatisZensusPopulationPerHa.id
735
            )
736
        )
737
738
    df_cts_cell_without_amenities = gpd.read_postgis(
739
        cells_query.statement,
740
        cells_query.session.bind,
741
        geom_col="geom",
742
        index_col=None,
743
    )
744
745
    # TODO maybe remove
746
    df_buildings_without_amenities = df_buildings_without_amenities.rename(
747
        columns={"cell_id": "zensus_population_id"}
748
    )
749
750
    # Census cells with only cts demand
751
    df_cells_only_cts_demand = df_cts_cell_without_amenities.loc[
752
        ~df_cts_cell_without_amenities["zensus_population_id"].isin(
753
            df_buildings_without_amenities["zensus_population_id"].unique()
754
        )
755
    ]
756
757
    df_cells_only_cts_demand.reset_index(drop=True, inplace=True)
758
759
    return df_cells_only_cts_demand
760
761
762
def calc_census_cell_share(scenario, sector):
763
    """
764
    The profile share for each census cell is calculated by it's
765
    share of annual demand per substation bus. The annual demand
766
    per cell is defined by DemandRegio/Peta5. The share is for both
767
    scenarios identical as the annual demand is linearly scaled.
768
769
    Parameters
770
    ----------
771
    scenario: str
772
        Scenario for which the share is calculated: "eGon2035" or "eGon100RE"
773
    sector: str
774
        Scenario for which the share is calculated: "electricity" or "heat"
775
776
    Returns
777
    -------
778
    df_census_share: pd.DataFrame
779
    """
780
    if sector == "electricity":
781
        with db.session_scope() as session:
782
            cells_query = (
783
                session.query(
784
                    EgonDemandRegioZensusElectricity,
785
                    MapZensusGridDistricts.bus_id,
786
                )
787
                .filter(EgonDemandRegioZensusElectricity.sector == "service")
788
                .filter(EgonDemandRegioZensusElectricity.scenario == scenario)
789
                .filter(
790
                    EgonDemandRegioZensusElectricity.zensus_population_id
791
                    == MapZensusGridDistricts.zensus_population_id
792
                )
793
            )
794
795
    elif sector == "heat":
796
        with db.session_scope() as session:
797
            cells_query = (
798
                session.query(EgonPetaHeat, MapZensusGridDistricts.bus_id)
799
                .filter(EgonPetaHeat.sector == "service")
800
                .filter(EgonPetaHeat.scenario == scenario)
801
                .filter(
802
                    EgonPetaHeat.zensus_population_id
803
                    == MapZensusGridDistricts.zensus_population_id
804
                )
805
            )
806
807
    df_demand = pd.read_sql(
808
        cells_query.statement,
0 ignored issues
show
introduced by
The variable cells_query does not seem to be defined for all execution paths.
Loading history...
809
        cells_query.session.bind,
810
        index_col="zensus_population_id",
811
    )
812
813
    # get demand share of cell per bus
814
    df_census_share = df_demand["demand"] / df_demand.groupby("bus_id")[
815
        "demand"
816
    ].transform("sum")
817
    df_census_share = df_census_share.rename("cell_share")
818
819
    df_census_share = pd.concat(
820
        [
821
            df_census_share,
822
            df_demand[["bus_id", "scenario"]],
823
        ],
824
        axis=1,
825
    )
826
827
    df_census_share.reset_index(inplace=True)
828
    return df_census_share
829
830
831
def calc_building_demand_profile_share(
832
    df_cts_buildings, scenario="eGon2035", sector="electricity"
833
):
834
    """
835
    Share of cts electricity demand profile per bus for every selected building
836
    is calculated. Building-amenity share is multiplied with census cell share
837
    to get the substation bus profile share for each building. The share is
838
    grouped and aggregated per building as some buildings exceed the shape of
839
    census cells and have amenities assigned from multiple cells. Building
840
    therefore get the amenity share of all census cells.
841
842
    Parameters
843
    ----------
844
    df_cts_buildings: gpd.GeoDataFrame
845
        Table of all buildings with cts demand assigned
846
    scenario: str
847
        Scenario for which the share is calculated.
848
    sector: str
849
        Sector for which the share is calculated.
850
851
    Returns
852
    -------
853
    df_building_share: pd.DataFrame
854
        Table of bus profile share per building
855
856
    """
857
858
    def calc_building_amenity_share(df_cts_buildings):
859
        """
860
        Calculate the building share by the number amenities per building
861
        within a census cell. Building ids can exist multiple time but with
862
        different zensus_population_ids.
863
        """
864
        df_building_amenity_share = df_cts_buildings[
865
            "n_amenities_inside"
866
        ] / df_cts_buildings.groupby("zensus_population_id")[
867
            "n_amenities_inside"
868
        ].transform(
869
            "sum"
870
        )
871
        df_building_amenity_share = pd.concat(
872
            [
873
                df_building_amenity_share.rename("building_amenity_share"),
874
                df_cts_buildings[["zensus_population_id", "id"]],
875
            ],
876
            axis=1,
877
        )
878
        return df_building_amenity_share
879
880
    df_building_amenity_share = calc_building_amenity_share(df_cts_buildings)
881
882
    df_census_cell_share = calc_census_cell_share(
883
        scenario=scenario, sector=sector
884
    )
885
886
    df_demand_share = pd.merge(
887
        left=df_building_amenity_share,
888
        right=df_census_cell_share,
889
        left_on="zensus_population_id",
890
        right_on="zensus_population_id",
891
    )
892
    df_demand_share["profile_share"] = df_demand_share[
893
        "building_amenity_share"
894
    ].multiply(df_demand_share["cell_share"])
895
896
    # TODO bus_id fix
897
    df_demand_share = df_demand_share[
898
        ["id", "bus_id", "scenario", "profile_share"]
899
    ]
900
901
    # Group and aggregate per building for multi cell buildings
902
    df_demand_share = (
903
        df_demand_share.groupby(["scenario", "id", "bus_id"])
904
        .sum()
905
        .reset_index()
906
    )
907
    if df_demand_share.duplicated("id", keep=False).any():
908
        print(
909
            df_demand_share.loc[df_demand_share.duplicated("id", keep=False)]
910
        )
911
    return df_demand_share
912
913
914
def calc_cts_building_profiles(
915
    egon_building_ids,
916
    bus_ids,
917
    scenario,
918
    sector,
919
):
920
    """
921
    Calculate the cts demand profile for each building. The profile is
922
    calculated by the demand share of the building per substation bus.
923
924
    Parameters
925
    ----------
926
    egon_building_ids: list of int
927
        Ids of the building for which the profile is calculated.
928
    bus_ids: list of int
929
        Ids of the substation for which selected building profiles are
930
        calculated.
931
    scenario: str
932
        Scenario for which the share is calculated: "eGon2035" or "eGon100RE"
933
    sector: str
934
        Sector for which the share is calculated: "electricity" or "heat"
935
936
    Returns
937
    -------
938
    df_building_profiles: pd.DataFrame
939
        Table of demand profile per building
940
    """
941
    if sector == "electricity":
942
        # Get cts building electricity demand share of selected buildings
943
        with db.session_scope() as session:
944
            cells_query = (
945
                session.query(
946
                    EgonCtsElectricityDemandBuildingShare,
947
                )
948
                .filter(
949
                    EgonCtsElectricityDemandBuildingShare.scenario == scenario
950
                )
951
                .filter(
952
                    EgonCtsElectricityDemandBuildingShare.building_id.in_(
953
                        egon_building_ids
954
                    )
955
                )
956
            )
957
958
        df_demand_share = pd.read_sql(
959
            cells_query.statement, cells_query.session.bind, index_col=None
960
        )
961
962
        # Get substation cts electricity load profiles of selected bus_ids
963
        with db.session_scope() as session:
964
            cells_query = (
965
                session.query(EgonEtragoElectricityCts).filter(
966
                    EgonEtragoElectricityCts.scn_name == scenario
967
                )
968
            ).filter(EgonEtragoElectricityCts.bus_id.in_(bus_ids))
969
970
        df_cts_profiles = pd.read_sql(
971
            cells_query.statement,
972
            cells_query.session.bind,
973
        )
974
        df_cts_profiles = pd.DataFrame.from_dict(
975
            df_cts_profiles.set_index("bus_id")["p_set"].to_dict(),
976
            orient="index",
977
        )
978
        # df_cts_profiles = calc_load_curves_cts(scenario)
979
980
    elif sector == "heat":
981
        # Get cts building heat demand share of selected buildings
982
        with db.session_scope() as session:
983
            cells_query = (
984
                session.query(
985
                    EgonCtsHeatDemandBuildingShare,
986
                )
987
                .filter(EgonCtsHeatDemandBuildingShare.scenario == scenario)
988
                .filter(
989
                    EgonCtsHeatDemandBuildingShare.building_id.in_(
990
                        egon_building_ids
991
                    )
992
                )
993
            )
994
995
        df_demand_share = pd.read_sql(
996
            cells_query.statement, cells_query.session.bind, index_col=None
997
        )
998
999
        # Get substation cts heat load profiles of selected bus_ids
1000
        with db.session_scope() as session:
1001
            cells_query = (
1002
                session.query(EgonEtragoHeatCts).filter(
1003
                    EgonEtragoHeatCts.scn_name == scenario
1004
                )
1005
            ).filter(EgonEtragoHeatCts.bus_id.in_(bus_ids))
1006
1007
        df_cts_profiles = pd.read_sql(
1008
            cells_query.statement,
1009
            cells_query.session.bind,
1010
        )
1011
        df_cts_profiles = pd.DataFrame.from_dict(
1012
            df_cts_profiles.set_index("bus_id")["p_set"].to_dict(),
1013
            orient="index",
1014
        )
1015
1016
    # TODO remove later
1017
    df_demand_share.rename(columns={"id": "building_id"}, inplace=True)
0 ignored issues
show
introduced by
The variable df_demand_share does not seem to be defined for all execution paths.
Loading history...
1018
1019
    # get demand profile for all buildings for selected demand share
1020
    df_building_profiles = pd.DataFrame()
1021
    for bus_id, df in df_demand_share.groupby("bus_id"):
1022
        shares = df.set_index("building_id", drop=True)["profile_share"]
1023
        profile_ts = df_cts_profiles.loc[bus_id]
0 ignored issues
show
introduced by
The variable df_cts_profiles does not seem to be defined for all execution paths.
Loading history...
1024
        building_profiles = np.outer(profile_ts, shares)
1025
        building_profiles = pd.DataFrame(
1026
            building_profiles, index=profile_ts.index, columns=shares.index
1027
        )
1028
        df_building_profiles = pd.concat(
1029
            [df_building_profiles, building_profiles], axis=1
1030
        )
1031
1032
    return df_building_profiles
1033
1034
1035
def delete_synthetic_cts_buildings():
1036
    """
1037
    All synthetic cts buildings are deleted from the DB. This is necessary if
1038
    the task is run multiple times as the existing synthetic buildings
1039
    influence the results.
1040
    """
1041
    # import db tables
1042
    from saio.openstreetmap import osm_buildings_synthetic
1043
1044
    # cells mit amenities
1045
    with db.session_scope() as session:
1046
        session.query(osm_buildings_synthetic).filter(
1047
            osm_buildings_synthetic.building == "cts"
1048
        ).delete()
1049
1050
1051
def remove_double_bus_id(df_cts_buildings):
1052
    """This is an backup adhoc fix if there should still be a building which
1053
    is assigned to 2 substations. In this case one of the buildings is just
1054
    dropped. As this currently accounts for only one building with one amenity
1055
    the deviation is neglectable."""
1056
    # assign bus_id via census cell of amenity
1057
    with db.session_scope() as session:
1058
        cells_query = session.query(
1059
            MapZensusGridDistricts.zensus_population_id,
1060
            MapZensusGridDistricts.bus_id,
1061
        )
1062
1063
    df_egon_map_zensus_buildings_buses = pd.read_sql(
1064
        cells_query.statement,
1065
        cells_query.session.bind,
1066
        index_col=None,
1067
    )
1068
    df_cts_buildings = pd.merge(
1069
        left=df_cts_buildings,
1070
        right=df_egon_map_zensus_buildings_buses,
1071
        on="zensus_population_id",
1072
    )
1073
1074
    substation_per_building = df_cts_buildings.groupby("id")[
1075
        "bus_id"
1076
    ].nunique()
1077
    building_id = substation_per_building.loc[
1078
        substation_per_building > 1
1079
    ].index
1080
    df_duplicates = df_cts_buildings.loc[
1081
        df_cts_buildings["id"].isin(building_id)
1082
    ]
1083
    for unique_id in df_duplicates["id"].unique():
1084
        drop_index = df_duplicates[df_duplicates["id"] == unique_id].index[0]
1085
        print(
1086
            f"Buildings {df_cts_buildings.loc[drop_index, 'id']}"
1087
            f" dropped because of double substation"
1088
        )
1089
        df_cts_buildings.drop(index=drop_index, inplace=True)
1090
1091
    df_cts_buildings.drop(columns="bus_id", inplace=True)
1092
1093
    return df_cts_buildings
1094
1095
1096
def cts_buildings():
1097
    """
1098
    Assigns CTS demand to buildings and calculates the respective demand
1099
    profiles. The demand profile per substation are disaggregated per
1100
    annual demand share of each census cell and by the number of amenities
1101
    per building within the cell. If no building data is available,
1102
    synthetic buildings are generated around the amenities. If no amenities
1103
    but cts demand is available, buildings are randomly selected. If no
1104
    building nor amenity is available, random synthetic buildings are
1105
    generated. The demand share is stored in the database.
1106
1107
    Note:
1108
    -----
1109
    Cells with CTS demand, amenities and buildings do not change within
1110
    the scenarios, only the demand itself. Therefore scenario eGon2035
1111
    can be used universally to determine the cts buildings but not for
1112
    he demand share.
1113
    """
1114
1115
    log.info("Start logging!")
1116
    # Buildings with amenities
1117
    df_buildings_with_amenities, df_lost_cells = buildings_with_amenities()
1118
    log.info("Buildings with amenities selected!")
1119
1120
    # Median number of amenities per cell
1121
    median_n_amenities = int(
1122
        df_buildings_with_amenities.groupby("zensus_population_id")[
1123
            "n_amenities_inside"
1124
        ]
1125
        .sum()
1126
        .median()
1127
    )
1128
    log.info(f"Median amenity value: {median_n_amenities}")
1129
1130
    # Remove synthetic CTS buildings if existing
1131
    delete_synthetic_cts_buildings()
1132
    log.info("Old synthetic cts buildings deleted!")
1133
1134
    # Amenities not assigned to buildings
1135
    df_amenities_without_buildings = amenities_without_buildings()
1136
    log.info("Amenities without buildlings selected!")
1137
1138
    # Append lost cells due to duplicated ids, to cover all demand cells
1139
    if not df_lost_cells.empty:
1140
1141
        df_lost_cells["amenities"] = median_n_amenities
1142
        # create row for every amenity
1143
        df_lost_cells["amenities"] = (
1144
            df_lost_cells["amenities"].astype(int).apply(range)
1145
        )
1146
        df_lost_cells = df_lost_cells.explode("amenities")
1147
        df_lost_cells.drop(columns="amenities", inplace=True)
1148
        df_amenities_without_buildings = df_amenities_without_buildings.append(
1149
            df_lost_cells, ignore_index=True
1150
        )
1151
        log.info("Lost cells due to substation intersection appended!")
1152
1153
    # One building per amenity
1154
    df_amenities_without_buildings["n_amenities_inside"] = 1
1155
    # Create synthetic buildings for amenites without buildings
1156
    df_synthetic_buildings_with_amenities = create_synthetic_buildings(
1157
        df_amenities_without_buildings, points="geom_amenity"
1158
    )
1159
    log.info("Synthetic buildings created!")
1160
1161
    # TODO write to DB and remove renaming
1162
    write_table_to_postgis(
1163
        df_synthetic_buildings_with_amenities.rename(
1164
            columns={
1165
                "zensus_population_id": "cell_id",
1166
                "egon_building_id": "id",
1167
            }
1168
        ),
1169
        OsmBuildingsSynthetic,
1170
        engine=engine,
1171
        drop=False,
1172
    )
1173
    log.info("Synthetic buildings exported to DB!")
1174
1175
    # Cells without amenities but CTS demand and buildings
1176
    df_buildings_without_amenities = buildings_without_amenities()
1177
    log.info("Buildings without amenities in demand cells identified!")
1178
1179
    # Backup Bugfix for duplicated buildings which occure in SQL-Querry
1180
    # drop building ids which have already been used
1181
    mask = df_buildings_without_amenities.loc[
1182
        df_buildings_without_amenities["id"].isin(
1183
            df_buildings_with_amenities["id"]
1184
        )
1185
    ].index
1186
    df_buildings_without_amenities = df_buildings_without_amenities.drop(
1187
        index=mask
1188
    ).reset_index(drop=True)
1189
    log.info(f"{len(mask)} duplicated ids removed!")
1190
1191
    # select median n buildings per cell
1192
    df_buildings_without_amenities = select_cts_buildings(
1193
        df_buildings_without_amenities, max_n=median_n_amenities
1194
    )
1195
    df_buildings_without_amenities["n_amenities_inside"] = 1
1196
    log.info(f"{median_n_amenities} buildings per cell selected!")
1197
1198
    # Create synthetic amenities and buildings in cells with only CTS demand
1199
    df_cells_with_cts_demand_only = cells_with_cts_demand_only(
1200
        df_buildings_without_amenities
1201
    )
1202
    log.info("Cells with only demand identified!")
1203
1204
    # Median n Amenities per cell
1205
    df_cells_with_cts_demand_only["amenities"] = median_n_amenities
1206
    # create row for every amenity
1207
    df_cells_with_cts_demand_only["amenities"] = (
1208
        df_cells_with_cts_demand_only["amenities"].astype(int).apply(range)
1209
    )
1210
    df_cells_with_cts_demand_only = df_cells_with_cts_demand_only.explode(
1211
        "amenities"
1212
    )
1213
    df_cells_with_cts_demand_only.drop(columns="amenities", inplace=True)
1214
1215
    # Only 1 Amenity per Building
1216
    df_cells_with_cts_demand_only["n_amenities_inside"] = 1
1217
    df_cells_with_cts_demand_only = place_buildings_with_amenities(
1218
        df_cells_with_cts_demand_only, amenities=1
1219
    )
1220
    df_synthetic_buildings_without_amenities = create_synthetic_buildings(
1221
        df_cells_with_cts_demand_only, points="geom_point"
1222
    )
1223
    log.info(f"{median_n_amenities} synthetic buildings per cell created")
1224
1225
    # TODO remove (backup) renaming after #871
1226
    write_table_to_postgis(
1227
        df_synthetic_buildings_without_amenities.rename(
1228
            columns={
1229
                "zensus_population_id": "cell_id",
1230
                "egon_building_id": "id",
1231
            }
1232
        ),
1233
        OsmBuildingsSynthetic,
1234
        engine=engine,
1235
        drop=False,
1236
    )
1237
    log.info("Synthetic buildings exported to DB")
1238
1239
    # Concat all buildings
1240
    columns = [
1241
        "zensus_population_id",
1242
        "id",
1243
        "geom_building",
1244
        "n_amenities_inside",
1245
        "source",
1246
    ]
1247
1248
    df_buildings_with_amenities["source"] = "bwa"
1249
    df_synthetic_buildings_with_amenities["source"] = "sbwa"
1250
    df_buildings_without_amenities["source"] = "bwoa"
1251
    df_synthetic_buildings_without_amenities["source"] = "sbwoa"
1252
1253
    df_cts_buildings = pd.concat(
1254
        [
1255
            df_buildings_with_amenities[columns],
1256
            df_synthetic_buildings_with_amenities[columns],
1257
            df_buildings_without_amenities[columns],
1258
            df_synthetic_buildings_without_amenities[columns],
1259
        ],
1260
        axis=0,
1261
        ignore_index=True,
1262
    )
1263
    df_cts_buildings = remove_double_bus_id(df_cts_buildings)
1264
    log.info("Double bus_id checked")
1265
1266
    # TODO maybe remove after #772
1267
    df_cts_buildings["id"] = df_cts_buildings["id"].astype(int)
1268
1269
    # Write table to db for debugging
1270
    # TODO remove later? Check if cts-builings are querried in other functions
1271
    df_cts_buildings = gpd.GeoDataFrame(
1272
        df_cts_buildings, geometry="geom_building", crs=3035
1273
    )
1274
    df_cts_buildings = df_cts_buildings.reset_index().rename(
1275
        columns={"index": "serial"}
1276
    )
1277
    write_table_to_postgis(
1278
        df_cts_buildings,
1279
        CtsBuildings,
1280
        engine=engine,
1281
        drop=True,
1282
    )
1283
    log.info("CTS buildings exported to DB!")
1284
1285
1286 View Code Duplication
def cts_electricity():
0 ignored issues
show
Duplication introduced by
This code seems to be duplicated in your project.
Loading history...
1287
    """
1288
    Calculate cts electricity demand share of hvmv substation profile
1289
     for buildings.
1290
    """
1291
    log.info("Start logging!")
1292
    with db.session_scope() as session:
1293
        cells_query = session.query(CtsBuildings)
1294
1295
    df_cts_buildings = pd.read_sql(
1296
        cells_query.statement, cells_query.session.bind, index_col=None
1297
    )
1298
    log.info("CTS buildings from DB imported!")
1299
    df_demand_share_2035 = calc_building_demand_profile_share(
1300
        df_cts_buildings, scenario="eGon2035", sector="electricity"
1301
    )
1302
    log.info("Profile share for egon2035 calculated!")
1303
1304
    df_demand_share_100RE = calc_building_demand_profile_share(
1305
        df_cts_buildings, scenario="eGon100RE", sector="electricity"
1306
    )
1307
    log.info("Profile share for egon100RE calculated!")
1308
1309
    df_demand_share = pd.concat(
1310
        [df_demand_share_2035, df_demand_share_100RE],
1311
        axis=0,
1312
        ignore_index=True,
1313
    )
1314
    df_demand_share.rename(columns={"id": "building_id"}, inplace=True)
1315
1316
    write_table_to_postgres(
1317
        df_demand_share,
1318
        EgonCtsElectricityDemandBuildingShare,
1319
        engine=engine,
1320
        drop=True,
1321
    )
1322
    log.info("Profile share exported to DB!")
1323
1324
1325 View Code Duplication
def cts_heat():
0 ignored issues
show
Duplication introduced by
This code seems to be duplicated in your project.
Loading history...
1326
    """
1327
    Calculate cts electricity demand share of hvmv substation profile
1328
     for buildings.
1329
    """
1330
    log.info("Start logging!")
1331
    with db.session_scope() as session:
1332
        cells_query = session.query(CtsBuildings)
1333
1334
    df_cts_buildings = pd.read_sql(
1335
        cells_query.statement, cells_query.session.bind, index_col=None
1336
    )
1337
    log.info("CTS buildings from DB imported!")
1338
1339
    df_demand_share_2035 = calc_building_demand_profile_share(
1340
        df_cts_buildings, scenario="eGon2035", sector="heat"
1341
    )
1342
    log.info("Profile share for egon2035 calculated!")
1343
    df_demand_share_100RE = calc_building_demand_profile_share(
1344
        df_cts_buildings, scenario="eGon100RE", sector="heat"
1345
    )
1346
    log.info("Profile share for egon100RE calculated!")
1347
    df_demand_share = pd.concat(
1348
        [df_demand_share_2035, df_demand_share_100RE],
1349
        axis=0,
1350
        ignore_index=True,
1351
    )
1352
1353
    write_table_to_postgres(
1354
        df_demand_share,
1355
        EgonCtsHeatDemandBuildingShare,
1356
        engine=engine,
1357
        drop=True,
1358
    )
1359
    log.info("Profile share exported to DB!")
1360
1361
1362 View Code Duplication
def get_cts_electricity_peak_load():
0 ignored issues
show
Duplication introduced by
This code seems to be duplicated in your project.
Loading history...
1363
    """
1364
    Get electricity peak load of all CTS buildings for both scenarios and
1365
    store in DB.
1366
    """
1367
    log.info("Start logging!")
1368
1369
    BuildingElectricityPeakLoads.__table__.create(bind=engine, checkfirst=True)
1370
1371
    # Delete rows with cts demand
1372
    with db.session_scope() as session:
1373
        session.query(BuildingElectricityPeakLoads).filter(
1374
            BuildingElectricityPeakLoads.sector == "cts"
1375
        ).delete()
1376
    log.info("Cts electricity peak load removed from DB!")
1377
1378
    for scenario in ["eGon2035", "eGon100RE"]:
1379
1380
        with db.session_scope() as session:
1381
            cells_query = session.query(
1382
                EgonCtsElectricityDemandBuildingShare
1383
            ).filter(
1384
                EgonCtsElectricityDemandBuildingShare.scenario == scenario
1385
            )
1386
1387
        df_demand_share = pd.read_sql(
1388
            cells_query.statement, cells_query.session.bind, index_col=None
1389
        )
1390
1391
        with db.session_scope() as session:
1392
            cells_query = session.query(EgonEtragoElectricityCts).filter(
1393
                EgonEtragoElectricityCts.scn_name == scenario
1394
            )
1395
1396
        df_cts_profiles = pd.read_sql(
1397
            cells_query.statement,
1398
            cells_query.session.bind,
1399
        )
1400
        df_cts_profiles = pd.DataFrame.from_dict(
1401
            df_cts_profiles.set_index("bus_id")["p_set"].to_dict(),
1402
            orient="columns",
1403
        )
1404
1405
        df_peak_load = pd.merge(
1406
            left=df_cts_profiles.max().astype(float).rename("max"),
1407
            right=df_demand_share,
1408
            left_index=True,
1409
            right_on="bus_id",
1410
        )
1411
1412
        # Convert unit from MWh to W
1413
        df_peak_load["max"] = df_peak_load["max"] * 1e6
1414
        df_peak_load["peak_load_in_w"] = (
1415
            df_peak_load["max"] * df_peak_load["profile_share"]
1416
        )
1417
        log.info(f"Peak load for {scenario} determined!")
1418
1419
        # TODO remove later
1420
        df_peak_load.rename(columns={"id": "building_id"}, inplace=True)
1421
        df_peak_load["sector"] = "cts"
1422
1423
        # # Write peak loads into db
1424
        write_table_to_postgres(
1425
            df_peak_load,
1426
            BuildingElectricityPeakLoads,
1427
            engine=engine,
1428
            drop=False,
1429
            index=False,
1430
            if_exists="append",
1431
        )
1432
1433
        log.info(f"Peak load for {scenario} exported to DB!")
1434
1435
1436 View Code Duplication
def get_cts_heat_peak_load():
0 ignored issues
show
Duplication introduced by
This code seems to be duplicated in your project.
Loading history...
1437
    """
1438
    Get heat peak load of all CTS buildings for both scenarios and store in DB.
1439
    """
1440
    log.info("Start logging!")
1441
1442
    BuildingHeatPeakLoads.__table__.create(bind=engine, checkfirst=True)
1443
1444
    # Delete rows with cts demand
1445
    with db.session_scope() as session:
1446
        session.query(BuildingHeatPeakLoads).filter(
1447
            BuildingHeatPeakLoads.sector == "cts"
1448
        ).delete()
1449
    log.info("Cts heat peak load removed from DB!")
1450
1451
    for scenario in ["eGon2035", "eGon100RE"]:
1452
1453
        with db.session_scope() as session:
1454
            cells_query = session.query(
1455
                EgonCtsElectricityDemandBuildingShare
1456
            ).filter(
1457
                EgonCtsElectricityDemandBuildingShare.scenario == scenario
1458
            )
1459
1460
        df_demand_share = pd.read_sql(
1461
            cells_query.statement, cells_query.session.bind, index_col=None
1462
        )
1463
        log.info(f"Retrieved demand share for scenario: {scenario}")
1464
1465
        with db.session_scope() as session:
1466
            cells_query = session.query(EgonEtragoHeatCts).filter(
1467
                EgonEtragoHeatCts.scn_name == scenario
1468
            )
1469
1470
        df_cts_profiles = pd.read_sql(
1471
            cells_query.statement,
1472
            cells_query.session.bind,
1473
        )
1474
        log.info(f"Retrieved substation profiles for scenario: {scenario}")
1475
1476
        df_cts_profiles = pd.DataFrame.from_dict(
1477
            df_cts_profiles.set_index("bus_id")["p_set"].to_dict(),
1478
            orient="columns",
1479
        )
1480
1481
        df_peak_load = pd.merge(
1482
            left=df_cts_profiles.max().astype(float).rename("max"),
1483
            right=df_demand_share,
1484
            left_index=True,
1485
            right_on="bus_id",
1486
        )
1487
1488
        # Convert unit from MWh to W
1489
        df_peak_load["max"] = df_peak_load["max"] * 1e6
1490
        df_peak_load["peak_load_in_w"] = (
1491
            df_peak_load["max"] * df_peak_load["profile_share"]
1492
        )
1493
        log.info(f"Peak load for {scenario} determined!")
1494
1495
        # TODO remove later
1496
        df_peak_load.rename(columns={"id": "building_id"}, inplace=True)
1497
        df_peak_load["sector"] = "cts"
1498
1499
        # # Write peak loads into db
1500
        write_table_to_postgres(
1501
            df_peak_load,
1502
            BuildingHeatPeakLoads,
1503
            engine=engine,
1504
            drop=False,
1505
            index=False,
1506
            if_exists="append",
1507
        )
1508
1509
        log.info(f"Peak load for {scenario} exported to DB!")
1510