BinPhenotype
Bases: Phenotype
BinPhenotype converts numeric values into categorical bin labels. To use, pass it a numeric valued phenotype such as AgePhenotype, MeasurementPhenotype, ArithmeticPhenotype, or ScorePhenotype.
Takes a phenotype that returns numeric values (like age, measurements, etc.) and converts the VALUE column into bin labels like "[10-20)", "[20-30)", etc.
DATE: The event date selected from the input phenotype VALUE: A categorical variable representing the bin label that the numeric value falls into
Parameters:
Name | Type | Description | Default |
---|---|---|---|
name
|
The name of the phenotype. |
required | |
phenotype
|
Phenotype
|
The phenotype that returns numeric values of interest (AgePhenotype, MeasurementPhenotype, etc.) |
required |
bins
|
List of bin edges. Default is [0, 10, 20, ..., 100] for age ranges. |
list(range(0, 101, 10))
|
Example
# Create an age phenotype
age = AgePhenotype()
# Create bins for age groups: [0-10), [10-20), [20-30), etc.
binned_age = BinPhenotype(
name="age_groups",
phenotype=age,
bins=[0, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
)
tables = {"PERSON": example_person_table}
result_table = binned_age.execute(tables)
# Result will have VALUE column with labels like "[20-30)", "[30-40)", etc.
display(result_table)
Source code in phenex/phenotypes/bin_phenotype.py
15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 |
|
dependencies
property
Recursively collect all dependencies of a node (including dependencies of dependencies).
Returns:
Type | Description |
---|---|
Set[Node]
|
List[Node]: A list of Node objects on which this Node depends. |
dependency_graph
property
namespaced_table
property
A PhenotypeTable has generic column names 'person_id', 'boolean', 'event_date', and 'value'. The namespaced_table prepends the phenotype name to all of these columns. This is useful when joining multiple phenotype tables together.
Returns:
Name | Type | Description |
---|---|---|
table |
Table
|
The namespaced table for the current phenotype. |
reverse_dependency_graph
property
execute(tables=None, con=None, overwrite=False, lazy_execution=False, n_threads=1)
Executes the Node computation for the current node and its dependencies. Supports lazy execution using hash-based change detection to avoid recomputing Node's that have already executed.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
tables
|
Dict[str, Table]
|
A dictionary mapping domains to Table objects. |
None
|
con
|
Optional[object]
|
Connection to database for materializing outputs. If provided, outputs from the node and all children nodes will be materialized (written) to the database using the connector. |
None
|
overwrite
|
bool
|
If True, will overwrite any existing tables found in the database while writing. If False, will throw an error when an existing table is found. Has no effect if con is not passed. |
False
|
lazy_execution
|
bool
|
If True, only re-executes if the node's definition has changed. Defaults to False. You should pass overwrite=True with lazy_execution as lazy_execution is intended precisely for iterative updates to a node definition. You must pass a connector (to cache results) for lazy_execution to work. |
False
|
n_threads
|
int
|
Max number of Node's to execute simultaneously when this node has multiple children. |
1
|
Returns:
Name | Type | Description |
---|---|---|
Table |
Table
|
The resulting table for this node. Also accessible through self.table after calling self.execute(). |
Source code in phenex/node.py
visualize_dependencies()
Create a text visualization of the dependency graph for this node and its dependencies.
Returns:
Name | Type | Description |
---|---|---|
str |
str
|
A text representation of the dependency graph |