Gene P9301_06391 details

Gene Information

Locus tag: P9301_06391
Symbol: aroA
ID: 4911126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 568062
End bp: 569372
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 37%
IMG OID: 640160220
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001090863
Protein GI: 126695977
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
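
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(568062, 569372)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```

Both results agree with the Gene Length and Protein Length fields of this record.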


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATA TCCGCACAAT TAAAGGTGGA GTTAATTTAA AAGGAAAAAT AAAAGTACCA 
GGAGATAAAT CCATCTCTCA TAGAGCTTTA ATAATTGGGA GTATTGCTGA GGGTGAAACG
ACTATTGAGG GGTTTTTATA TTCTGAAGAT CCCCTTTCAA CTGCTGATTG TCTTAGAAAA
TTAGGTGTAA ATATACCAGA AATAAAAAAA GATAAGCCTT TTACGATTTC AGGATTGGGT
ATTAATGGAT TAAAAGAGCC CAAAGAGATT CTCAATTGCG GGAATTCGGG AACCACCATG
AGATTATTAA TGGGGTTACT TGCCGGTCAA GAAGGCAAGA ATTTTATCTT AACTGGTGAT
ATTTCTCTTA ATGAAAGGCC AATGGGGAGA GTGGGTAAAC CATTATCATT GATGGGTGGC
AAAATTTTTG GTAGAGAAAA AGGGAACAAA GCACCAATCT CAATTAATGG GAATAAACTA
AAAGGATGTG TTATGGGAAC TCCAGTAGCG AGTGCTCAAG TAAAATCCGC AATCTTATTG
GCAGGCCTCA AAGCTTCTGG AACCACTTCT GTTATTGAAC CAGCCTCTTC AAGAGATCAT
ACTGAAAGAA TGTTGAAAGC ATTTGGAGCA GACATCACTA TCAGAGGGGA ATTTGGAAGA
AATGTAGTTA TCAAGTCAGG GGGAAGTTTA ATTGGCCAGA AAATATTGAT TCCTGGAGAC
ATAAGCTCTG CTTCTTTTTG GATGATTGCT GCATCTATTG TTCCAAATTC AGAGGTTTTA
ATTCAGAATG TCGGATTAAA TCCCACTAGA ACAGGGATTT TAAATATTAT GAATTCAATG
GGTTGCAATT ATGAGATTTT AGATAAATCG ACTATTGCTG GTGAACCTAT TGGATCTATA
AAAGTAAAGA CTTCAAATAA TTTAAAATCA TTCATTATTG AAGGAGATAT TCTCCCAAAA
CTCATAGATG AAATTCCTAT CCTTACTGTG GCTGCTTGTT TTTGTAATGG AGTTTCTGAA
ATTAAGGATG CACAAGAATT AAGGGTTAAA GAGACAGATA GATTAAAAGT CATGGCACGA
CAGTTACAAA AATTCGGTGC TGAAATAACA GAAAAAGAGG ATGGGTTAAT TATTAATGGG
CAATCAAAAT TTCATTCTGC GGAGGTAGAT AGTGAGACAG ATCATCGAGT AGCAATGAGT
CTTGCTATTG CTTCACTGCT TGCCAAAGGT ACCTCAAAAA TCATGAGAGC AGATGCAGCT
AGCGTCTCGT ATCCCACTTT TTGGGAAGAG CTTGCCAAAC TAACTAACTA G
 
Protein sequence
MNNIRTIKGG VNLKGKIKVP GDKSISHRAL IIGSIAEGET TIEGFLYSED PLSTADCLRK 
LGVNIPEIKK DKPFTISGLG INGLKEPKEI LNCGNSGTTM RLLMGLLAGQ EGKNFILTGD
ISLNERPMGR VGKPLSLMGG KIFGREKGNK APISINGNKL KGCVMGTPVA SAQVKSAILL
AGLKASGTTS VIEPASSRDH TERMLKAFGA DITIRGEFGR NVVIKSGGSL IGQKILIPGD
ISSASFWMIA ASIVPNSEVL IQNVGLNPTR TGILNIMNSM GCNYEILDKS TIAGEPIGSI
KVKTSNNLKS FIIEGDILPK LIDEIPILTV AACFCNGVSE IKDAQELRVK ETDRLKVMAR
QLQKFGAEIT EKEDGLIING QSKFHSAEVD SETDHRVAMS LAIASLLAKG TSKIMRADAA
SVSYPTFWEE LAKLTN
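
The relationship between the two sequences above can be checked with a short script. This is a minimal sketch, not the tool that produced the record: it strips the 10-base spacing, computes GC content, and translates codon by codon until the first stop. It assumes the amino acid assignments of translation table 11, which match the standard code for internal codons (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). All names in the block are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record above.
first_line = "ATGAATAATA TCCGCACAAT TAAAGGTGGA GTTAATTTAA AAGGAAAAAT AAAAGTACCA"
print(translate(first_line))  # MNNIRTIKGGVNLKGKIKVP
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and the GC content field reported in this record.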