Gene A9601_06691 details

Gene Information

Locus tag: A9601_06691
Symbol: aroA
ID: 4717371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 593638
End bp: 594948
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 37%
IMG OID: 640078382
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001009062
Protein GI: 123968204
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
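
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself; it assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 593638
end_bp = 594948


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                  # 1311
print(protein_length(length))  # 436
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.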


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.104277
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATA TCCGCACAAT AAAAGGTGGA GTTAATTTAA AAGGAAAAGT AAAAGTACCT 
GGAGATAAAT CTATTTCTCA TAGAGCTTTA ATAATAGGAA GTATTGCTAA TGGTGAGACG
ACTATTGAGG GGTTTTTACA TTCTGAAGAT CCACTTTCAA CTGCTGATTG TCTTAGGAAA
TTAGGTGTAA ACATACCAGA AATAAAGAAA AATGAGCCTT TTACGATTTC AGGTTTGGGT
CTTGATGGAT TAAAAGAGCC CAAAGAAATT CTAAATTGTG GGAATTCGGG AACCACCATG
AGATTATTAA TGGGGTTACT TGCCGGTCAA GAAGACAAGA ATTTTATCTT AACAGGTGAC
ATTTCTCTTA ATGAAAGGCC AATGGGGAGA GTGGGCAAAC CATTATCGTT GATGGGTGGC
AAGATTTTTG GTAGAGAGAA AGGAAACAAA GCTCCAATCT CAATTGATGG GAATAAACTA
AAAGGTTGTG TTATTGGTAC ACCAGTAGCG AGTGCTCAAG TAAAATCTGC AATCTTATTG
GCAGGACTCA AAGCTTCTGG GACCACCTCT GTTATTGAAC CAGCATCTTC AAGAGATCAT
ACTGAAAGGA TGTTAAAAGC TTTTGGAGCA GATATCAGCG TCAGAGGAGA ATTAGGAAGG
AATGTAGTCA TCAAATCAGG GGGGAAGTTA ATTGGCCAGA GAATATTGAT TCCCGGAGAC
ATAAGCTCTG CTTCTTTTTG GATGATTGCC GCATCTATTG TTCCAAATTC GGAGGTTTTA
ATTCAGAATG TCGGATTAAA TCCTACTAGA ACTGGAATTT TAAATGTAAT GGATTCAATG
GGGTGCAATT ATGAGATTTT AGATAAATCG ACCATTGCAG GTGAACCTAT TGGATCTATT
AAAGTAAAGT CTTCAAATAA TTTAAAATCA TTCACTATTG AAGGTGATAT CCTCCCAAAA
CTTATAGACG AAATTCCTAT TCTTACTGTG GCTGCTTGTT TTTGTAATGG AGTTTCTGAA
ATTAAGGATG CCCAAGAATT AAGAGTTAAG GAGACAGATC GATTAAAAGT CATGGCACGA
CAGTTACAAA AATTCGGTGC TGAAGTAACA GAGAAAGAGG ATGGGTTAAT TATTAATGGG
CAATCAAAAT TTAATTCTGC AGAAGTAGAC AGTGAGACAG ATCATCGAGT AGCAATGAGT
CTTGCTATTG CTTCACTTCT TGCTAAAGGT ACCTCAAAAA TCATGAGAGC AGATGCTGCT
AGCGTCTCGT ATCCCACTTT TTGGGAAGAC CTTGCCACAC TAACTAACTA G
 
Protein sequence
MNNIRTIKGG VNLKGKVKVP GDKSISHRAL IIGSIANGET TIEGFLHSED PLSTADCLRK 
LGVNIPEIKK NEPFTISGLG LDGLKEPKEI LNCGNSGTTM RLLMGLLAGQ EDKNFILTGD
ISLNERPMGR VGKPLSLMGG KIFGREKGNK APISIDGNKL KGCVIGTPVA SAQVKSAILL
AGLKASGTTS VIEPASSRDH TERMLKAFGA DISVRGELGR NVVIKSGGKL IGQRILIPGD
ISSASFWMIA ASIVPNSEVL IQNVGLNPTR TGILNVMDSM GCNYEILDKS TIAGEPIGSI
KVKSSNNLKS FTIEGDILPK LIDEIPILTV AACFCNGVSE IKDAQELRVK ETDRLKVMAR
QLQKFGAEVT EKEDGLIING QSKFNSAEVD SETDHRVAMS LAIASLLAKG TSKIMRADAA
SVSYPTFWED LATLTN
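
Both sequences are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical. For brevity it uses only the final line of each sequence. The preceding gene sequence lines hold 60 bases each, so the final line is assumed to start on a codon boundary.

```python
def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character block layout."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Final lines of the gene and protein sequences, copied from the record.
last_gene_line = "AGCGTCTCGT ATCCCACTTT TTGGGAAGAC CTTGCCACAC TAACTAACTA G"
last_protein_line = "SVSYPTFWED LATLTN"

nt = strip_blocks(last_gene_line)
aa = strip_blocks(last_protein_line)

print(len(nt), len(aa))             # 51 16
print(nt[-3:])                      # TAG
print(len(nt) // 3 - 1 == len(aa))  # True
print(gc_percent("ATGC"))           # 50.0
```

Passing the full gene sequence to `strip_blocks` and then `gc_percent` gives a value that can be compared with the GC content field in the record.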