Gene Information |
Locus tag | A9601_06691 |
Symbol | aroA |
ID | 4717371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 593638 |
End bp | 594948 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640078382 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001009062 |
Protein GI | 123968204 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
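The length fields in the table above can be cross-checked against the coordinates. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS length includes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(593638, 594948)
print(length, protein_length(length))  # prints: 1311 436
```

Under these assumptions the computed values agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.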
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.104277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATA TCCGCACAAT AAAAGGTGGA GTTAATTTAA AAGGAAAAGT AAAAGTACCT GGAGATAAAT CTATTTCTCA TAGAGCTTTA ATAATAGGAA GTATTGCTAA TGGTGAGACG ACTATTGAGG GGTTTTTACA TTCTGAAGAT CCACTTTCAA CTGCTGATTG TCTTAGGAAA TTAGGTGTAA ACATACCAGA AATAAAGAAA AATGAGCCTT TTACGATTTC AGGTTTGGGT CTTGATGGAT TAAAAGAGCC CAAAGAAATT CTAAATTGTG GGAATTCGGG AACCACCATG AGATTATTAA TGGGGTTACT TGCCGGTCAA GAAGACAAGA ATTTTATCTT AACAGGTGAC ATTTCTCTTA ATGAAAGGCC AATGGGGAGA GTGGGCAAAC CATTATCGTT GATGGGTGGC AAGATTTTTG GTAGAGAGAA AGGAAACAAA GCTCCAATCT CAATTGATGG GAATAAACTA AAAGGTTGTG TTATTGGTAC ACCAGTAGCG AGTGCTCAAG TAAAATCTGC AATCTTATTG GCAGGACTCA AAGCTTCTGG GACCACCTCT GTTATTGAAC CAGCATCTTC AAGAGATCAT ACTGAAAGGA TGTTAAAAGC TTTTGGAGCA GATATCAGCG TCAGAGGAGA ATTAGGAAGG AATGTAGTCA TCAAATCAGG GGGGAAGTTA ATTGGCCAGA GAATATTGAT TCCCGGAGAC ATAAGCTCTG CTTCTTTTTG GATGATTGCC GCATCTATTG TTCCAAATTC GGAGGTTTTA ATTCAGAATG TCGGATTAAA TCCTACTAGA ACTGGAATTT TAAATGTAAT GGATTCAATG GGGTGCAATT ATGAGATTTT AGATAAATCG ACCATTGCAG GTGAACCTAT TGGATCTATT AAAGTAAAGT CTTCAAATAA TTTAAAATCA TTCACTATTG AAGGTGATAT CCTCCCAAAA CTTATAGACG AAATTCCTAT TCTTACTGTG GCTGCTTGTT TTTGTAATGG AGTTTCTGAA ATTAAGGATG CCCAAGAATT AAGAGTTAAG GAGACAGATC GATTAAAAGT CATGGCACGA CAGTTACAAA AATTCGGTGC TGAAGTAACA GAGAAAGAGG ATGGGTTAAT TATTAATGGG CAATCAAAAT TTAATTCTGC AGAAGTAGAC AGTGAGACAG ATCATCGAGT AGCAATGAGT CTTGCTATTG CTTCACTTCT TGCTAAAGGT ACCTCAAAAA TCATGAGAGC AGATGCTGCT AGCGTCTCGT ATCCCACTTT TTGGGAAGAC CTTGCCACAC TAACTAACTA G
|
Protein sequence | MNNIRTIKGG VNLKGKVKVP GDKSISHRAL IIGSIANGET TIEGFLHSED PLSTADCLRK LGVNIPEIKK NEPFTISGLG LDGLKEPKEI LNCGNSGTTM RLLMGLLAGQ EDKNFILTGD ISLNERPMGR VGKPLSLMGG KIFGREKGNK APISIDGNKL KGCVIGTPVA SAQVKSAILL AGLKASGTTS VIEPASSRDH TERMLKAFGA DISVRGELGR NVVIKSGGKL IGQRILIPGD ISSASFWMIA ASIVPNSEVL IQNVGLNPTR TGILNVMDSM GCNYEILDKS TIAGEPIGSI KVKSSNNLKS FTIEGDILPK LIDEIPILTV AACFCNGVSE IKDAQELRVK ETDRLKVMAR QLQKFGAEVT EKEDGLIING QSKFNSAEVD SETDHRVAMS LAIASLLAKG TSKIMRADAA SVSYPTFWED LATLTN
|
| |
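The gene sequence and protein sequence above can be related with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG even though the strand is "-") and that it contains only A, C, G and T. Translation table 11 assigns the same residues to codons as the standard code; its additional start codons are not handled here. The helper names `clean`, `translate` and `gc_percent` are hypothetical and not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order (third base varies fastest).
# Table 11 uses the same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


head = "ATGAATAATA TCCGCACAAT AAAAGGTGGA"  # first three blocks of the gene
tail = "CTAACTAACTA G"                     # last blocks, ending in the stop codon
print(translate(head))  # MNNIRTIKGG
print(translate(tail))  # LTN
```

The two fragments reproduce the first ten and last three residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the whole protein and the GC content field to be checked the same way.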