Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_10371 |
Symbol | aroA |
ID | 5730997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 930010 |
End bp | 931347 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641285404 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001550922 |
Protein GI | 159903578 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.955757 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCAAAT TGATCCCAAA ACAACAAAAA TTATCATTGC GGGAAGTCAA GCCTGGGGGC AAGTTAATAG GAAAACTTAA AGTACCTGGC GATAAATCTA TTTCTCATCG AGCATTGCTT TTTGGGTCAA TAGCAGAAGG AGAAACAATA ATTAAAGGTC TTCTGCCAGC TGAAGATCCT ATAAGTACGG CAAACTGCCT AAGGGCAATG GGGGTAAAAA TAAGCTCAAT AGAAAGCGAC AAAGTCGTAT CAATTCAAGG GGTGGGATTA GATGGACTAA AAGAGCCTAG TGAAGTGCTT AATTGTGGAA ATTCAGGAAC AACAATGCGT CTGTTGCTTG GATTATTGGT AGGTCGTAAA GGTCGGCATT TTGTGCTTAA TGGTGACAAA TCATTGAATA AAAGACCAAT GCAAAGAGTT AGGCAACCAC TGAAATTGAT GGGAGCTGAA ATTAATGGTC GTTCAGATGG TGATCTAGCG CCATTATCTA TTGTTGGTCG TAACCTTCAC GGAGCCGTAA TAGGCACTCC AGTGGCAAGT GCCCAAGTTA AGTCTGCAAT ATTACTTGCT GCACTAACAG CAGAAGGGTC AACAAAGGTA ATAGAGCCAA CGAGTTCTAG AGACCATAGT GAGAGAATGT TAAAGGCCTT TGGCGCCAAT CTTGAAGTAA GTGGAGAAAG AGGTCGGCAT ATAAGTGTAT GGCCAGGATC CAAACTATTA GGTCAATCAA TTGTGGTCCC TGGTGATATA AGTTCAGCAG CATTCTGGTT GATAGCAGGA ACAATAATCC CAAATTCAGA GTTGACTATT GAAAATGTAG GGTTAAATCC AACTCGAACT GGGATATTAA AAGTTATGGA ACAAATGGAA GCAAATATTG AATTAATTAA TATCAGAGAC GTTGCAGGCG AGCCTGTAGG GGATATAAAA GTAATTCACA ATGACCAACT AAAACCTTTT AAAATCGATT CGAGCTTAGT ACCCAGTCTT GTTGATGAGA TTCCTATCCT ATCCGTGGCA GCTTGTTTTT GCGATGGCAC GACAAAAATC ACAGGAGCAA GCGAACTTCG TGTCAAAGAA ACGGATCGGC TAAAAGTTAT GACTAGGCAA CTCCTAAGAA TGGGAGCAAA CATCGAAGAG CATCCTGATG GGCTAACCAT ACATGGGGTC GACCATTTAA AAGGTAACCA TCTAGATAGC GAAACAGATC ATAGAGTAGC AATGAGTCTC GCCATTGCAT CTATAGTCGC CAAGGGAACT TCGACGATAG AAAGAAGCAA TGCCGCAGCT GTTTCATATC CAGAATTCTG GAATGATCTA GAGAGACTAA GAGGTTGA
|
Protein sequence | MIKLIPKQQK LSLREVKPGG KLIGKLKVPG DKSISHRALL FGSIAEGETI IKGLLPAEDP ISTANCLRAM GVKISSIESD KVVSIQGVGL DGLKEPSEVL NCGNSGTTMR LLLGLLVGRK GRHFVLNGDK SLNKRPMQRV RQPLKLMGAE INGRSDGDLA PLSIVGRNLH GAVIGTPVAS AQVKSAILLA ALTAEGSTKV IEPTSSRDHS ERMLKAFGAN LEVSGERGRH ISVWPGSKLL GQSIVVPGDI SSAAFWLIAG TIIPNSELTI ENVGLNPTRT GILKVMEQME ANIELINIRD VAGEPVGDIK VIHNDQLKPF KIDSSLVPSL VDEIPILSVA ACFCDGTTKI TGASELRVKE TDRLKVMTRQ LLRMGANIEE HPDGLTIHGV DHLKGNHLDS ETDHRVAMSL AIASIVAKGT STIERSNAAA VSYPEFWNDL ERLRG
|
| |