Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_06391 |
Symbol | aroA |
ID | 4911126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 568062 |
End bp | 569372 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640160220 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001090863 |
Protein GI | 126695977 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATA TCCGCACAAT TAAAGGTGGA GTTAATTTAA AAGGAAAAAT AAAAGTACCA GGAGATAAAT CCATCTCTCA TAGAGCTTTA ATAATTGGGA GTATTGCTGA GGGTGAAACG ACTATTGAGG GGTTTTTATA TTCTGAAGAT CCCCTTTCAA CTGCTGATTG TCTTAGAAAA TTAGGTGTAA ATATACCAGA AATAAAAAAA GATAAGCCTT TTACGATTTC AGGATTGGGT ATTAATGGAT TAAAAGAGCC CAAAGAGATT CTCAATTGCG GGAATTCGGG AACCACCATG AGATTATTAA TGGGGTTACT TGCCGGTCAA GAAGGCAAGA ATTTTATCTT AACTGGTGAT ATTTCTCTTA ATGAAAGGCC AATGGGGAGA GTGGGTAAAC CATTATCATT GATGGGTGGC AAAATTTTTG GTAGAGAAAA AGGGAACAAA GCACCAATCT CAATTAATGG GAATAAACTA AAAGGATGTG TTATGGGAAC TCCAGTAGCG AGTGCTCAAG TAAAATCCGC AATCTTATTG GCAGGCCTCA AAGCTTCTGG AACCACTTCT GTTATTGAAC CAGCCTCTTC AAGAGATCAT ACTGAAAGAA TGTTGAAAGC ATTTGGAGCA GACATCACTA TCAGAGGGGA ATTTGGAAGA AATGTAGTTA TCAAGTCAGG GGGAAGTTTA ATTGGCCAGA AAATATTGAT TCCTGGAGAC ATAAGCTCTG CTTCTTTTTG GATGATTGCT GCATCTATTG TTCCAAATTC AGAGGTTTTA ATTCAGAATG TCGGATTAAA TCCCACTAGA ACAGGGATTT TAAATATTAT GAATTCAATG GGTTGCAATT ATGAGATTTT AGATAAATCG ACTATTGCTG GTGAACCTAT TGGATCTATA AAAGTAAAGA CTTCAAATAA TTTAAAATCA TTCATTATTG AAGGAGATAT TCTCCCAAAA CTCATAGATG AAATTCCTAT CCTTACTGTG GCTGCTTGTT TTTGTAATGG AGTTTCTGAA ATTAAGGATG CACAAGAATT AAGGGTTAAA GAGACAGATA GATTAAAAGT CATGGCACGA CAGTTACAAA AATTCGGTGC TGAAATAACA GAAAAAGAGG ATGGGTTAAT TATTAATGGG CAATCAAAAT TTCATTCTGC GGAGGTAGAT AGTGAGACAG ATCATCGAGT AGCAATGAGT CTTGCTATTG CTTCACTGCT TGCCAAAGGT ACCTCAAAAA TCATGAGAGC AGATGCAGCT AGCGTCTCGT ATCCCACTTT TTGGGAAGAG CTTGCCAAAC TAACTAACTA G
|
Protein sequence | MNNIRTIKGG VNLKGKIKVP GDKSISHRAL IIGSIAEGET TIEGFLYSED PLSTADCLRK LGVNIPEIKK DKPFTISGLG INGLKEPKEI LNCGNSGTTM RLLMGLLAGQ EGKNFILTGD ISLNERPMGR VGKPLSLMGG KIFGREKGNK APISINGNKL KGCVMGTPVA SAQVKSAILL AGLKASGTTS VIEPASSRDH TERMLKAFGA DITIRGEFGR NVVIKSGGSL IGQKILIPGD ISSASFWMIA ASIVPNSEVL IQNVGLNPTR TGILNIMNSM GCNYEILDKS TIAGEPIGSI KVKTSNNLKS FIIEGDILPK LIDEIPILTV AACFCNGVSE IKDAQELRVK ETDRLKVMAR QLQKFGAEIT EKEDGLIING QSKFHSAEVD SETDHRVAMS LAIASLLAKG TSKIMRADAA SVSYPTFWEE LAKLTN
|
| |