Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_18901 |
Symbol | aroG |
ID | 4912543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1618274 |
End bp | 1619341 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640161496 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001092114 |
Protein GI | 126697228 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.667922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACAT CATCAAATAA TTCAGCTTTA GAAAAGACAT CAGATTTACA TGTTCTTGAA ACACGTCCAT TAATACCTCC AAGCAGATTA CATAATGATA TACCTTTAGA TCACGACTCT GCTAATACAG TATCTAAAAC AAGAAGATCG ATACAAAATA TTTTGCATCA TAATGATCAG AAGCTTTTAG TCATTGTGGG TCCATGTTCA ATTCATGATC TTGAGGCGGC AAAGGAATAT TCAAAATATA TTCAAAAATT CCGAGAAATG TATAAAGATA AATTAGAAAT AATTATGAGA GTATATTTTG AAAAACCAAG AACAACTATT GGCTGGAAGG GATTGATAAA TGATCCTCAT CTAGATGATT CTTATGATAT TAATACTGGT TTAAGAAGAG CAAGAAGTTT GCTTTCATAT TTAGCAACTC GTGGTATACC TTCTGCTACA GAATTACTAG ATCCAATTGT TCCTCAATAC ATTGCCGATT TAATAAGTTG GACAGCCATA GGTGCGCGGA CTACAGAAAG TCAAACTCAT AGAGAAATGG CATCAGGATT ATCAATGCCT ATAGGCTTTA AAAATGGAAC GGATGGTTCT TTTACTACTG CAATTAATGC AATGCAGTCA GCTTCAAAAT CCCATCACTT CTTAGGTGTA AATGAAAATG GAATGGCTTC TATAGTTAAT ACTACAGGAA ATCCAGATGG ACATATAGTT TTAAGGGGTG GTTCAAAAGG CCCAAATTTC GAAAATGATC ATATACAAAG AATTTCAGCA GAATTGAGGC AATGTAGTCT TCCCCATAAA GTGATGATTG ATTGTAGTCA TGGAAATTCC AATAAAGATT TCCGAAAACA GTCGGAAGTG CTAAAAAATG TGGCTTCTCA AATTAGTAAT GGTGAAAAAA ATATTTTAGG AGTTATGCTT GAGAGTCATT TGAAGGAAGG AAATCAAAAA CTTTTAAAAA AAGAAGATCT CCAGTTTGGT AGAAGCATTA CAGATGCATG TATAGATATA GAAACAACAA AAAAATTAAT AGCTATTTTA TACGATTCAC TTAGCTAG
|
Protein sequence | MMTSSNNSAL EKTSDLHVLE TRPLIPPSRL HNDIPLDHDS ANTVSKTRRS IQNILHHNDQ KLLVIVGPCS IHDLEAAKEY SKYIQKFREM YKDKLEIIMR VYFEKPRTTI GWKGLINDPH LDDSYDINTG LRRARSLLSY LATRGIPSAT ELLDPIVPQY IADLISWTAI GARTTESQTH REMASGLSMP IGFKNGTDGS FTTAINAMQS ASKSHHFLGV NENGMASIVN TTGNPDGHIV LRGGSKGPNF ENDHIQRISA ELRQCSLPHK VMIDCSHGNS NKDFRKQSEV LKNVASQISN GEKNILGVML ESHLKEGNQK LLKKEDLQFG RSITDACIDI ETTKKLIAIL YDSLS
|
| |