Data Lake¶
AIDO Datalake endpoints for querying biological reference data (AIDO_DATALAKE_URL).
Gene¶
genbio.data.aido_datalake_apis.gene_histone ¶
gene_histone(cell_type: str, mark: str, chrom: str, start: int, end: int) -> list[dict[str, Any]]
Return raw bigWig signal intervals for a histone mark over a genomic region.
Queries the AIDO Datalake for histone ChIP-seq signal stored as bigWig files.
Accepts explicit chromosomal coordinates only. To query by gene symbol, first
resolve coordinates via gene_interval (GET /data/gene/interval).
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
cell_type
|
str
|
The cell type identifier (e.g. |
required |
mark
|
str
|
The histone mark to query (e.g. |
required |
chrom
|
str
|
Chromosome name (e.g. |
required |
start
|
int
|
0-based genomic start coordinate (inclusive). |
required |
end
|
int
|
0-based genomic end coordinate (exclusive). |
required |
Returns:
| Type | Description |
|---|---|
list[dict[str, Any]]
|
A list of interval dicts, each with the following keys: |
list[dict[str, Any]]
|
|
list[dict[str, Any]]
|
|
list[dict[str, Any]]
|
|
Raises:
| Type | Description |
|---|---|
HTTPError
|
On 400 (invalid mark or chromosome), 404 (cell type / mark not found), or 503 (bigWig file missing on the server), with the response body included in the error message. |
Example::
intervals = gene_histone(
cell_type="K562",
mark="H3K27ac",
chrom="chr1",
start=1_000_000,
end=1_010_000,
)
for iv in intervals:
print(iv["start"], iv["end"], iv["value"])