Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2602 |
Symbol | aroH |
ID | 5801074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 2725087 |
End bp | 2726133 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641340471 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001607010 |
Protein GI | 162420455 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.823041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAAGA TAGATGAACT GCGGACCGCT CGCATCGATA GCTTGATTAC ACCGCAGCAA CTGGCTGAAA AGTTACCGAT TTCTGAGGTT ATTGCAGATA ACGTGACGGC GTCACGTAAA CGAATAGAGA AAATACTTAT TGGTGAAGAC CCACGTCTAC TCGTGGTGAT TGGCCCCTGC TCTATTCACG ACCTTGATGC AGCCGTTGAT TATGCCACCC GGCTCAAGGT GCTACGAGAA CGCTATCAAG ACCGGCTGGA AATCGTGATG CGCACCTATT TCGAGAAACC ACGGACTGTA GTGGGTTGGA AGGGGCTGAT TTCTGATCCG GCACTTGACG GCTCATGCCA GGTGAACTTG GGTATTGAAC TGGCACGTAA GCTACTGTTA GCCGTGAATG AACTCGGGCT GCCGACCGCT ACCGAGTTCC TCAATATGGT AACAGGCCAA TATATTGCCG ACCTCATCAG TTGGGGGGCA ATAGGCGCAC GTACCACCGA AAGCCAGATC CACCGAGAGA TGGCCTCGGC ACTCTCCTGC CCCGTGGGTT TCAAAAATGG TACAGATGGC AATGTGCGTA TTGCTATTGA TGCCATTCGC GCCGCACAAG CCAGCCATAT GTTCCTTTCT CCGGATAAAA CCGGCCAAAT GACGATTTAC CAAACCAGTG GTAACCCCTA TGGGCATATT ATTATGCGGG GTGGAAAGCA ACCTAACTAT GATGCCTCTG ATATCGCAGC CGCCTGTGAC AGCTTGCGGG AATTTGATTT GCCAGAACAT CTGGTGGTAG ATTTTAGCCA CGGCAATTGC CAGAAGATGC ATCGCCGCCA GTTGGATGTT GCCGAAAATA TCGGGCTACA GATCCGTGCG GGTTCAACAG CGATTGTCGG TGTTATGGCT GAGAGTTTCC TGATTGAGGG CACACAGAAG ATTGTTGCCG GACAGCCCTT AACTTATGGG CAATCCATCA CTGACCCTTG CCTGAATTGG GATGATACTG AACAACTGTT AAGCCTATTG GCAGATGCAG TAGACAGCCG GTTTTAA
|
Protein sequence | MYKIDELRTA RIDSLITPQQ LAEKLPISEV IADNVTASRK RIEKILIGED PRLLVVIGPC SIHDLDAAVD YATRLKVLRE RYQDRLEIVM RTYFEKPRTV VGWKGLISDP ALDGSCQVNL GIELARKLLL AVNELGLPTA TEFLNMVTGQ YIADLISWGA IGARTTESQI HREMASALSC PVGFKNGTDG NVRIAIDAIR AAQASHMFLS PDKTGQMTIY QTSGNPYGHI IMRGGKQPNY DASDIAAACD SLREFDLPEH LVVDFSHGNC QKMHRRQLDV AENIGLQIRA GSTAIVGVMA ESFLIEGTQK IVAGQPLTYG QSITDPCLNW DDTEQLLSLL ADAVDSRF
|
| |