Gene Information |
Locus tag | P9211_18411 |
Symbol | aroE |
ID | 5731729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1673837 |
End bp | 1674739 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641286228 |
Product | shikimate / quinate 5-dehydrogenase |
Protein accession | YP_001551726 |
Protein GI | 159904382 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase |
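
The length fields in the table above can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1673837
END_BP = 1674739


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 903 300
```

Under these assumptions the computed values agree with the Gene Length (903 bp) and Protein Length (300 aa) fields.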
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.611208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA CAAAATTTAG CTCCATTACT GGAAAAACAA GCCTTGTGGG GGTTATTGGG GATCCCATTT CCCATTCTCT TTCACCAGTA ATTCAAAATG CAGCACTGCA CAAAATGAAT CTGGACTGGT GTTATCTCGC GATCCCATGT AAGCCTGACA ACCTAGAATC AATAATCAAA TGCTTAAGTA AGATCAATTG CAAAGGTTTA AATATTACAA TTCCTCATAA AACTAATGCT CTCAAGTTTT GTAATCATGT AAGTGAAATA GCAACCAAAG TTGGAGCAAT CAATACTTTA ATTCCAGACA ATGATGATGG GTGGATTGGA ACAAATACTG ATGTTGAAGG GTTTTTAAAA CCTCTTGGGT TAGAAAAGAA TTGGGAAAGC CGAAATGCAG TAATACTAGG TAATGGTGGA AGTGCAAGAG CAGTTTTACT AGGTCTTGAA AAGTTAAAGT TGGCAGAGAT TGCTATCATT GGCCGAAAAG AAACATCAAT AACTAATCTA ATTAATTCCC TGACTAGCCC TAATGGAAAT ATAAAAGGCT TGACTCAAGA TAGTTTGCAA TTAAATGAAT ATATAAAGAA TGCCGATCTA ATCATCAACA CAACTCCAAT AGGGATGCTT CAAACAAATA ATACAAGCTG TATAGATTCT GAGATTCCTT TTGGTGAAAA GATTTGGGAC AACCTAAAAC CAAAAACTAC TCTTTATGAC TTAATTTACA ATCCAAGACC AACAAAGTGG CTAGAAATAG GAAAAGAAAA AGGCTGTATA CCGATTGATG GCTTAGAAAT GTTAATCCAA CAGGGAGCTG CATCCTTAAG ATTATGGAGC GGTATCGAAG AAATTCCTAT AGATATCATG AGGAAATCAG CTAAAGATCA CCTCCTAAAT TAA
Protein sequence | MKETKFSSIT GKTSLVGVIG DPISHSLSPV IQNAALHKMN LDWCYLAIPC KPDNLESIIK CLSKINCKGL NITIPHKTNA LKFCNHVSEI ATKVGAINTL IPDNDDGWIG TNTDVEGFLK PLGLEKNWES RNAVILGNGG SARAVLLGLE KLKLAEIAII GRKETSITNL INSLTSPNGN IKGLTQDSLQ LNEYIKNADL IINTTPIGML QTNNTSCIDS EIPFGEKIWD NLKPKTTLYD LIYNPRPTKW LEIGKEKGCI PIDGLEMLIQ QGAASLRLWS GIEEIPIDIM RKSAKDHLLN
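
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. The 10-base spacing used in the record is stripped before processing.

```python
from itertools import product

# Codon assignments in TCAG order. Table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAAGAAA CAAAATTTAG CTCCATTACT"
print(translate(first_blocks))  # MKETKFSSIT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked in the same way.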