Mainly localized to the nucleoplasm. In addition localized to the cytosol.
RNA cell category
The cell lines in the Human Protein Atlas have been analyzed by RNA-seq to estimate the transcript abundance of each protein-coding gene. The RNA-seq data was then used to classify all genes according to their cell line-specific expression into one of six different categories, defined based on the total set of all TPM values in all analyzed cell lines.
Protein evidence scores are generated from several independent sources and are classified as evidence at i) protein level, ii) transcript level, iii) no evidence, or iv) not available.
Evidence at protein level
Main location
The main location is characterized by presence in all tested cell lines and/or increased intensity compared to other locations. It is highlighted in the illustration to the right. If available, links to overrepresentation analyses in Reactome, a free, open-source, curated and peer reviewed biological pathway database, are provided. An analysis is done for the corresponding gene set of the proteome localizing to the main and additional locations of the protein on this page, respectively.
Localized to the Nucleoplasm (supported)
Additional location
Additional locations are characterized either by a markedly lower staining intensity than the main location or by being observed in only a subset of the cell lines. They are highlighted in the illustration to the right. If available, links to overrepresentation analyses in Reactome, a free, open-source, curated and peer-reviewed biological pathway database, are provided. An analysis is done for the corresponding gene set of the proteome localizing to the main and additional locations of the protein on this page, respectively.
In addition localized to the Cytosol (approved)
DATA RELIABILITY
Reliability score
A reliability score is set for all genes and indicates the level of reliability of the analyzed protein expression pattern based on available protein/RNA/gene characterization data. The reliability of the annotated protein expression data is also scored depending on similarity in immunostaining patterns and consistency with available experimental gene/protein characterization data in the UniProtKB/Swiss-Prot database.
Below is an overview of RNA expression data generated in the HPA project. The analyzed cell lines are divided into 12 color-coded groups according to the organ they were obtained from. The toolbar in the top right corner sorts the cell lines in the chart by different criteria: the organ or the origin that the cell line was obtained from, the category of the cell line according to Cellosaurus, alphabetically, or by descending RNA expression. Detailed information about a specific cell line can be accessed by hovering over the corresponding bar in the chart. The RNA-sequencing results generated in the HPA are reported as transcripts per million (TPM). In the Human Protein Atlas, a TPM value of 1.0 is defined as the threshold for expression of the corresponding protein.
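As a rough illustration of how TPM values relate to the 1.0 threshold, the sketch below normalizes read counts to TPM using the standard definition (reads per kilobase of transcript, rescaled so that all values sum to one million) and flags genes at or above the threshold. The gene names, counts, and lengths are invented for the example, and treating a value of exactly 1.0 as expressed is an assumption; this is not the HPA pipeline.

```python
def tpm(counts, lengths_bp):
    """Convert raw read counts to TPM using the standard definition."""
    # Reads per kilobase of transcript for each gene.
    rpk = {gene: counts[gene] / (lengths_bp[gene] / 1000.0) for gene in counts}
    # Rescale so that the values across all genes sum to one million.
    scale = sum(rpk.values()) / 1_000_000.0
    return {gene: value / scale for gene, value in rpk.items()}


def is_expressed(tpm_value, threshold=1.0):
    """Assumption: a gene counts as expressed at or above the threshold."""
    return tpm_value >= threshold


# Hypothetical input data, for illustration only.
counts = {"GENE_A": 500, "GENE_B": 1500, "GENE_C": 0}
lengths_bp = {"GENE_A": 1000, "GENE_B": 3000, "GENE_C": 2000}

values = tpm(counts, lengths_bp)
expressed = {gene: is_expressed(v) for gene, v in values.items()}
```

Because TPM is a relative measure within one sample, the same raw count can fall above or below the threshold depending on the rest of the library.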
Cell lines sorted by organ of phenotypic resemblance.
Cell lines sorted by the biological source from which they were established.
Cell lines sorted by cell line category according to Cellosaurus.
Cell lines sorted by descending RNA expression.
Cell lines sorted alphabetically.
HUMAN CELLS
The "human cells" section gives an overview of the subcellular location of the protein of interest, obtained by indirect immunofluorescence microscopy, an antibody-based protein-visualization technique. The immunofluorescent analysis is carried out in three different cell lines, one of them always being U-2 OS. A selection of immunofluorescent images is displayed below. Three different organelle probes are displayed as different channels in the multicolor images: nucleus stained in blue, microtubules in red, and ER in yellow. The antibody staining targeting the protein of interest is shown in green. The toggle channel buttons turn the different channels on and off. To select the images to compare, use the checkboxes next to the images at the bottom. Three images can be compared at a time. All images are clickable for an enlarged view. The selected image will appear in large size, and miniature images with all other staining results for this gene will be listed at the top left of the image. The selected miniature image has an orange overlay. For cell structure reference, visit the cell dictionary.
Summary
Summary of the immunofluorescent analysis in all studied cell lines with all tested antibodies.
Mainly localized to the nucleoplasm. In addition localized to the cytosol.
Main location
The main location is characterized by presence in all tested cell lines and/or increased intensity compared to other locations.
Nucleoplasm (supported)
Additional location
Additional locations are characterized either by a markedly lower staining intensity than the main location or by being observed in only a subset of the cell lines.
Cytosol (approved)
Toggle channels
Three different organelle probes are displayed as different channels in the multicolor images: nucleus stained in blue, microtubules in red, and ER in yellow. The antibody staining targeting the protein of interest is shown in green. The "toggle channels" buttons turn the different channels on and off. The intensity toggle shows the pixel intensity range in 16 different colors for the selected channel. The object toggle shows the computational segmentation of the cells used for further analysis in the HPA project. For samples where a correlation assay suggests that the protein is cell cycle dependent, the predicted cell cycle position of each cell is displayed when using the object toggle.
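The intensity toggle can be thought of as binning pixel values into 16 levels. The sketch below shows one way to do that, assuming 8-bit pixel values (0 to 255) and equal-width bins. The function name is hypothetical, and the bit depth and color map actually used by the viewer are not stated here.

```python
def intensity_bin(pixel, n_bins=16, max_value=255):
    """Map a pixel value to one of n_bins equal-width intensity levels.

    Assumption: pixel values are integers in the range 0..max_value.
    """
    if not 0 <= pixel <= max_value:
        raise ValueError("pixel value out of range")
    # Integer arithmetic keeps the top value (255) inside the last bin (15).
    return pixel * n_bins // (max_value + 1)


# Each of the 16 levels would then be drawn in its own color.
levels = [intensity_bin(p) for p in (0, 15, 16, 128, 255)]
```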
Legend: pixel intensity from Low to High; predicted cell cycle position G1, S, G2, M, or N/A.
Thumbnail
Representative images for the assay. Three images can be compared at the same time. To change which images to compare, use the checkboxes next to the images below. All images are clickable for an enlarged view. The selected image will appear in large size and miniature images with all other staining results for this gene will be listed at the top left of the image. The selected miniature image has an orange overlay.
Antibody
Antibody used for analysis. Clicking the antibody ID links to the antibody validation page.
Cell line
Cell line used for analysis. Read more about the cell lines in the Human Protein Atlas.
Location
Location(s) annotated in the corresponding cell line.
Single-cell variation
As the images in the Cell Atlas provide single cell resolution, variations in protein expression patterns from cell to cell can be observed. A single-cell variation can either be observed in the intensity of the immunofluorescent signal or in the spatial distribution pattern of the protein. This column contains information about whether and for which of the annotated locations a single-cell variation pattern was manually annotated.
Cell cycle dependent variation
A likely cause for single-cell variation in the immunofluorescent images is cell cycle dependency. This column contains information about whether the manually observed cell-to-cell variation pattern correlates with cell cycle progression.
For the genes where the mouse and human genes are orthologues, the mouse cell line NIH 3T3 is also stained. The main subcellular location of the encoded proteins and any additional locations are reported as well as staining characteristics, staining intensity and validation score. Example images are shown below. To change which images to compare, use the checkboxes next to the images at the bottom. Three images can be compared at a time. All images are clickable for an enlarged view. The selected image will appear in large size and miniature images with all other staining results for this gene will be listed at the top left of the image. The selected miniature image has an orange overlay. For cell structure reference, visit the cell dictionary.
Main location
The main location is characterized by a higher intensity compared to other locations observed.
Gene information from Ensembl and Entrez, as well as links to available gene identifiers are displayed here. Information was retrieved from Ensembl if not indicated otherwise.
Gene name
KMT2A (HGNC Symbol)
Synonyms
ALL-1, CXXC7, HRX, HTRX1, MLL, MLL1A, TRX1
Description
Lysine methyltransferase 2A (HGNC Symbol)
Entrez gene summary
This gene encodes a transcriptional coactivator that plays an essential role in regulating gene expression during early development and hematopoiesis. The encoded protein contains multiple conserved functional domains. One of these domains, the SET domain, is responsible for its histone H3 lysine 4 (H3K4) methyltransferase activity which mediates chromatin modifications associated with epigenetic transcriptional activation. This protein is processed by the enzyme Taspase 1 into two fragments, MLL-C and MLL-N. These fragments reassociate and further assemble into different multiprotein complexes that regulate the transcription of specific target genes, including many of the HOX genes. Multiple chromosomal translocations involving this gene are the cause of certain acute lymphoid leukemias and acute myeloid leukemias. Alternate splicing results in multiple transcript variants. [provided by RefSeq, Oct 2010]
The protein browser displays the antigen location on the target protein(s) and the features of the target protein. The tabs at the top of the protein view section can be used to switch between the different splice variants to which an antigen has been mapped.
At the top of the view, the position of the antigen (identified by the corresponding HPA identifier) is shown as a green bar. A yellow triangle on the bar indicates a <100% sequence identity to the protein target.
Under the antigens, the maximum percent sequence identity of the protein to all other proteins from other human genes is displayed, using a sliding window of 10 aa residues (HsID 10) or 50 aa residues (HsID 50). The region with the lowest possible identity is always selected for antigen design, with a maximum identity of 60% allowed for designing a single-target antigen.
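To make the sliding-window idea concrete, the sketch below computes percent identity between two equal-length, already aligned sequences in windows of a given size and checks the result against the 60% limit. It is a deliberate simplification: the comparison described above runs against all other human proteins, whereas this example compares one pair without gaps. The function names and toy sequences are hypothetical.

```python
def window_identities(query, other, window):
    """Percent identity for every window position along two aligned sequences."""
    if len(query) != len(other):
        raise ValueError("sequences must be aligned to equal length")
    if not 1 <= window <= len(query):
        raise ValueError("window must fit inside the sequence")
    identities = []
    for start in range(len(query) - window + 1):
        pairs = zip(query[start:start + window], other[start:start + window])
        matches = sum(a == b for a, b in pairs)
        identities.append(100.0 * matches / window)
    return identities


def passes_identity_limit(query, other, window, max_identity=60.0):
    """True if no window exceeds the allowed identity (60% per the text)."""
    return max(window_identities(query, other, window)) <= max_identity


# Toy sequences: the first six residues match, the rest do not.
query = "ACDEFGHIKLMN"
other = "ACDEFGAAAAAA"
profile = window_identities(query, other, window=10)
```

In a fuller version, the value plotted at each position would be the maximum identity over every other protein compared, not the identity to a single sequence.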
The curve in blue displays the predicted antigenicity, i.e. the tendency for different regions of the protein to generate an immune response, with peak regions being predicted to be more antigenic. The curve shows average values based on a sliding window approach using an in-house propensity scale.
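The sliding-window averaging behind a curve like this can be sketched as follows. The in-house propensity scale is not given here, so the per-residue scores below are a made-up placeholder, and the window size is likewise an assumption.

```python
def sliding_average(scores, window):
    """Mean of each consecutive run of `window` scores."""
    if not 1 <= window <= len(scores):
        raise ValueError("window must fit inside the score list")
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


# Hypothetical per-residue propensity values (NOT the HPA in-house scale).
TOY_SCALE = {"A": 1.0, "K": 3.0, "G": 2.0}


def antigenicity_profile(sequence, window, scale=TOY_SCALE, default=0.0):
    """Look up a score per residue, then smooth with a sliding window."""
    scores = [scale.get(residue, default) for residue in sequence]
    return sliding_average(scores, window)


curve = antigenicity_profile("AKGA", window=2)
```

Peaks in the resulting list correspond to stretches of residues with high average propensity, which is what the plotted curve highlights.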
A signal peptide (turquoise) is displayed if it is predicted by a majority of the signal peptide predictors SPOCTOPUS, SignalP 4.0, and Phobius. Transmembrane regions (orange) are displayed if they are predicted by MDM.
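The majority rule for signal peptides amounts to a simple vote across the three predictors. The sketch below assumes each predictor's result has already been reduced to a yes/no call; the function name and the example calls are hypothetical, and no predictor is actually run.

```python
def majority_predicts(calls):
    """True if more than half of the predictors report a signal peptide.

    `calls` maps a predictor name to its boolean call.
    """
    if not calls:
        raise ValueError("at least one predictor call is required")
    votes = sum(1 for called in calls.values() if called)
    return votes > len(calls) / 2


# Hypothetical calls for one protein, for illustration only.
example_calls = {"SPOCTOPUS": True, "SignalP 4.0": True, "Phobius": False}
has_signal_peptide = majority_predicts(example_calls)
```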
Low complexity regions are shown in yellow and InterPro regions in green. Common (purple) and unique (grey) regions between different splice variants of the gene are also displayed, and at the bottom of the protein view is the protein scale.
KMT2A-001
KMT2A-002
KMT2A-006
KMT2A-007
KMT2A-010
KMT2A-015
KMT2A-016
KMT2A-017
PROTEIN INFORMATION
The protein information section displays alternative protein-coding transcripts (splice variants) encoded by this gene according to the Ensembl database.
The ENSP identifier links to the Ensembl website protein summary, while the ENST identifier links to the Ensembl website transcript summary for the selected splice variant. The data in the UniProt column can be expanded to show links to all matching UniProt identifiers for this protein.
The protein classes assigned to this protein are shown when the data in the protein class column is expanded. Parent protein classes are in bold font and subclasses are listed under the parent class.
The Gene Ontology terms assigned to this protein are listed when the Gene ontology column is expanded. The length of the protein (amino acid residues according to Ensembl), molecular mass (kDa), predicted signal peptide (according to a majority of the signal peptide predictors SPOCTOPUS, SignalP 4.0, and Phobius) and the number of predicted transmembrane region(s) (according to MDM) are also reported.
Enzymes ENZYME proteins Transferases THUMBUP predicted membrane proteins Predicted intracellular proteins Plasma proteins Transcription factors Zinc-coordinating DNA-binding domains Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Disease related genes Potential drug targets Protein evidence (Kim et al 2014) Protein evidence (Ezkurdia et al 2014)
GO:0001046 [core promoter sequence-specific DNA binding] GO:0003677 [DNA binding] GO:0003680 [AT DNA binding] GO:0003682 [chromatin binding] GO:0003700 [transcription factor activity, sequence-specific DNA binding] GO:0005515 [protein binding] GO:0005634 [nucleus] GO:0005654 [nucleoplasm] GO:0005829 [cytosol] GO:0006306 [DNA methylation] GO:0006351 [transcription, DNA-templated] GO:0006355 [regulation of transcription, DNA-templated] GO:0006366 [transcription from RNA polymerase II promoter] GO:0006461 [protein complex assembly] GO:0006915 [apoptotic process] GO:0008168 [methyltransferase activity] GO:0008270 [zinc ion binding] GO:0008285 [negative regulation of cell proliferation] GO:0008542 [visual learning] GO:0009416 [response to light stimulus] GO:0009791 [post-embryonic development] GO:0009952 [anterior/posterior pattern specification] GO:0010468 [regulation of gene expression] GO:0016569 [covalent chromatin modification] GO:0016740 [transferase activity] GO:0018024 [histone-lysine N-methyltransferase activity] GO:0018026 [peptidyl-lysine monomethylation] GO:0032259 [methylation] GO:0032411 [positive regulation of transporter activity] GO:0032922 [circadian regulation of gene expression] GO:0035097 [histone methyltransferase complex] GO:0035162 [embryonic hemopoiesis] GO:0035640 [exploration behavior] GO:0035864 [response to potassium ion] GO:0042800 [histone methyltransferase activity (H3-K4 specific)] GO:0042802 [identical protein binding] GO:0042803 [protein homodimerization activity] GO:0043984 [histone H4-K16 acetylation] GO:0044212 [transcription regulatory region DNA binding] GO:0044648 [histone H3-K4 dimethylation] GO:0045322 [unmethylated CpG binding] GO:0045893 [positive regulation of transcription, DNA-templated] GO:0045944 [positive regulation of transcription from RNA polymerase II promoter] GO:0046872 [metal ion binding] GO:0048172 [regulation of short-term neuronal synaptic plasticity] GO:0048511 [rhythmic process] GO:0048536 [spleen development] GO:0048873 [homeostasis of number of cells within a tissue] GO:0050890 [cognition] GO:0051568 [histone H3-K4 methylation] GO:0051569 [regulation of histone H3-K4 methylation] GO:0051571 [positive regulation of histone H3-K4 methylation] GO:0051899 [membrane depolarization] GO:0060216 [definitive hemopoiesis] GO:0070577 [lysine-acetylated histone binding] GO:0071339 [MLL1 complex] GO:0071440 [regulation of histone H3-K14 acetylation] GO:0080182 [histone H3-K4 trimethylation] GO:1901674 [regulation of histone H3-K27 acetylation] GO:2000615 [regulation of histone H3-K9 acetylation] GO:2001040 [positive regulation of cellular response to drug]
Predicted intracellular proteins Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Protein evidence (Ezkurdia et al 2014)
SPOCTOPUS predicted secreted proteins Predicted intracellular proteins Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Protein evidence (Ezkurdia et al 2014)
Predicted intracellular proteins Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Protein evidence (Ezkurdia et al 2014)
SPOCTOPUS predicted secreted proteins Predicted intracellular proteins Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Protein evidence (Ezkurdia et al 2014)
Enzymes ENZYME proteins Transferases THUMBUP predicted membrane proteins Predicted intracellular proteins Plasma proteins Transcription factors Zinc-coordinating DNA-binding domains Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Disease related genes Potential drug targets Protein evidence (Kim et al 2014) Protein evidence (Ezkurdia et al 2014)
GO:0001046 [core promoter sequence-specific DNA binding] GO:0003677 [DNA binding] GO:0003680 [AT DNA binding] GO:0003700 [transcription factor activity, sequence-specific DNA binding] GO:0005515 [protein binding] GO:0005634 [nucleus] GO:0005654 [nucleoplasm] GO:0005829 [cytosol] GO:0006351 [transcription, DNA-templated] GO:0006355 [regulation of transcription, DNA-templated] GO:0006366 [transcription from RNA polymerase II promoter] GO:0006461 [protein complex assembly] GO:0006915 [apoptotic process] GO:0008168 [methyltransferase activity] GO:0008270 [zinc ion binding] GO:0016569 [covalent chromatin modification] GO:0016740 [transferase activity] GO:0018024 [histone-lysine N-methyltransferase activity] GO:0032259 [methylation] GO:0032411 [positive regulation of transporter activity] GO:0032922 [circadian regulation of gene expression] GO:0035097 [histone methyltransferase complex] GO:0035162 [embryonic hemopoiesis] GO:0042800 [histone methyltransferase activity (H3-K4 specific)] GO:0042802 [identical protein binding] GO:0042803 [protein homodimerization activity] GO:0043984 [histone H4-K16 acetylation] GO:0044212 [transcription regulatory region DNA binding] GO:0045322 [unmethylated CpG binding] GO:0045893 [positive regulation of transcription, DNA-templated] GO:0045944 [positive regulation of transcription from RNA polymerase II promoter] GO:0046872 [metal ion binding] GO:0048511 [rhythmic process] GO:0051568 [histone H3-K4 methylation] GO:0051571 [positive regulation of histone H3-K4 methylation] GO:0070577 [lysine-acetylated histone binding] GO:0071339 [MLL1 complex] GO:0071440 [regulation of histone H3-K14 acetylation] GO:0080182 [histone H3-K4 trimethylation] GO:2000615 [regulation of histone H3-K9 acetylation] GO:2001040 [positive regulation of cellular response to drug]
Predicted intracellular proteins Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Protein evidence (Ezkurdia et al 2014)
Predicted intracellular proteins Cancer-related genes Mutated cancer genes Mutational cancer driver genes COSMIC somatic mutations in cancer genes COSMIC Somatic Mutations COSMIC Other Mutations COSMIC Translocations Protein evidence (Ezkurdia et al 2014)