Gene Information |
Locus tag | P9303_18891 |
Symbol | aroA |
ID | 4778688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1650365 |
End bp | 1651690 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087398 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001017896 |
Protein GI | 124023589 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
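The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1650365
END_BP = 1651690


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1326 441
```

Both results match the Gene Length (1326 bp) and Protein Length (441 aa) rows. The strand does not affect either length; it only determines which end of the interval holds the start codon.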
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.326891 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGTTA CAAGCACTGC CTCATCGTCA CGTGAGCTTC GAGCAGGTGG TGGGCTTTCT GGAACGGTGC GAGTCCCTGG AGATAAATCG ATTTCCCATC GAGCGCTGTT ATTTGGAGCT ATCGCAGAAG GGATCACCAC CATCGAGGGC CTTTTACCTG CTGAAGATCC CATGAGTACC GCCGCCTGTC TTCGAGCGAT GGGAGCCACG ATCAGCCCGA TTCATGCCGG GCAAATAGTT CGCGTTGAAG GCGTTGGTCT TGATGGACTT CAAGAACCGC AAGATGTACT CGACTGCGGT AACTCCGGCA CAACAATTCG GCTGATGCTA GGGCTGTTGG CAGCAAGCAA GGGCCGCCAT TTCGTGCTTA GCGGTGATTC CTCTTTGCGC CGTAGGCCCA TGAATCGGGT TGGCCAGCCA TTAACAATGA TGGGAGCCAA AATTAAAGGA CGATCCAACG GCGACTTCGC ACCACTTGCC GTGAGTGGTC AGAAACTTCG TGGTGGTGTC ATCGGTACAC CTGTAGCAAG TGCCCAGGTG AAGTCAGCCC TGCTGCTGGC AGCCCTTACC GCCGATGGGG CAACGACTGT GATCGAACCA GCCCATTCTA GAGACCACAG CGAACGCATG TTGCGGGCAT TCGGCGCTGA TCTCGAAGTG GGAGGAGAGA TGGGTCGCCA TATCAGAGTG AGCCCAGGAC AAAAGCTTTA TGGCCAAAAT ATTGTCGTGC CAGGTGATAT CAGTTCCGCT GCTTTTTGGC TTGTCGCAGG AGTGCTTGTG CCTGGAGCTG AGCTCGTGGT GGAAAACGTT GGTTTAAACC CCACCCGTAC CGGCATCCTG GAAGTGTTGC AGCAAATGGA AGCTCGCATT GAGGTGCTCA ACCGCCATGA AGTCGCCGGT GAACCGGTTG GAGACCTACG GGTGAGGCAA GGACCGTTAA AGCCGTTCAG CATCAATGGG GACATCATTC CTCGCCTGGT CGACGAGGTG CCGATCTTGG CGGTGGCTGC TTGTTTCTGT GATGGTGAAA GCAAAATTAC TGACGCCAGT GAACTGCGCG TCAAGGAAAC CGATCGGCTC GCTGTGATGA CCAGGCAGCT CACGGCTATG GGAGCAGATA TCGACGAACA TGCCGATGGG CTTACCATCC GCGGTGGCCG AACACTGCGA GGCACAGAAC TCGATAGTGA AACGGATCAT CGTGTCGCCA TGAGCCTGGC GGTGGCTGCG TTGTTGGCGG AGGGCAATTC TCGCTTGACG GGTAGCGAAG CTGCGGCTGT TTCCTACCCC AACTTCTGGG ACGACCTTGA GCGGCTGCAT CGCTGA
|
Protein sequence | MSVTSTASSS RELRAGGGLS GTVRVPGDKS ISHRALLFGA IAEGITTIEG LLPAEDPMST AACLRAMGAT ISPIHAGQIV RVEGVGLDGL QEPQDVLDCG NSGTTIRLML GLLAASKGRH FVLSGDSSLR RRPMNRVGQP LTMMGAKIKG RSNGDFAPLA VSGQKLRGGV IGTPVASAQV KSALLLAALT ADGATTVIEP AHSRDHSERM LRAFGADLEV GGEMGRHIRV SPGQKLYGQN IVVPGDISSA AFWLVAGVLV PGAELVVENV GLNPTRTGIL EVLQQMEARI EVLNRHEVAG EPVGDLRVRQ GPLKPFSING DIIPRLVDEV PILAVAACFC DGESKITDAS ELRVKETDRL AVMTRQLTAM GADIDEHADG LTIRGGRTLR GTELDSETDH RVAMSLAVAA LLAEGNSRLT GSEAAAVSYP NFWDDLERLH R
|
| |
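The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. Translation table 11 uses the standard amino acid assignments but permits alternative start codons such as TTG, and the initiator codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. It is a minimal example with hypothetical names, and it makes the simplifying assumption that whatever codon comes first is a valid start codon.

```python
from itertools import product

# Standard amino acid assignments (shared by translation table 11),
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[dna[index:index + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a valid initiator and is read as Met.
        residues.append("M" if index == 0 else aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence above.
print(translate_cds("TTGAGCGTTA CAAGCACTGC CTCATCGTCA"))  # MSVTSTASSS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should give the full protein under the same assumptions.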