Gene EcSMS35_3046 details


Gene Information

Locus tag: EcSMS35_3046
Symbol: serA
ID: 6147285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3134956
End bp: 3136188
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 53%
IMG OID: 641617915
Product: D-3-phosphoglycerate dehydrogenase
Protein accession: YP_001745066
Protein GI: 170683573
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID: [TIGR01327] D-3-phosphoglycerate dehydrogenase
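
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3134956
end_bp = 3136188
protein_length_aa = 410

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the first and the last base.
gene_length_bp = end_bp - start_bp + 1

# A coding sequence for N residues needs N codons plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 1233
print(expected_cds_bp)  # 1233
```

Under these assumptions both values agree with the listed gene length of 1233 bp.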


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000364512
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGG TATCGCTGGA GAAAGACAAG ATTAAGTTTC TGCTGGTAGA AGGCGTGCAC 
CAAAAGGCGC TTGAAAGCCT TCGTGCAGCG GGTTACACCA ACATCGAATT TCACAAAGGC
GCGCTGGATG ATGAACAATT AAAAGAATCC ATCCGCGATG CCCACTTCAT CGGCCTGCGA
TCCCGTACCC ATCTGACTGA AGACGTGATC AACGCCGCAG AAAAACTGGT CGCTATTGGC
TGTTTCTGTA TCGGAACAAA CCAGGTTGAT CTGGATGCGG CGGCAAAACG CGGGATCCCG
GTATTTAACG CACCGTTCTC AAATACGCGC TCTGTTGCGG AGCTGGTGAT TGGCGAACTG
CTGCTGCTAT TGCGCGGCGT GCCGGAAGCC AATGCTAAAG CGCACCGTGG CGTGTGGAAC
AAACTGGCGG CGGGTTCTTT TGAAGCGCGC GGCAAAAAGC TGGGTATCAT CGGCTACGGT
CATATTGGTA CGCAATTGGG CATTCTGGCT GAATCGCTGG GAATGTATGT TTACTTTTAT
GATATTGAAA ACAAACTGCC GCTGGGCAAC GCCACTCAGG TACAGCATCT TTCTGACCTG
CTGAATATGA GCGATGTGGT GAGTCTGCAT GTACCAGAGA ATCCGTCCAC CAAAAATATG
ATGGGTGCGA AAGAGATTTC ACTAATGAAG CCCGGCTCGC TGCTGATTAA TGCTTCGCGC
GGTACTGTGG TGGATATTCC GGCGCTGTGT GACGCGCTGG CGAGCAAACA TCTGGCGGGG
GCGGCAATCG ACGTATTTCC GACGGAACCG GCAACCAATA GCGATCCATT TACCTCTCCG
CTGTGTGAAT TCGACAACGT CCTTCTGACG CCACACATTG GCGGTTCGAC TCAGGAAGCG
CAGGAGAATA TCGGCCTGGA AGTTGCGGGT AAATTGATCA AGTATTCTGA CAATGGCTCA
ACGCTCTCTG CGGTGAACTT CCCGGAAGTC TCGCTGCCAC TGCACGGTGG GCGTCGTCTG
ATGCACATCC ACGAAAACCG TCCGGGCGTG TTAACTGCGC TGAACAAAAT CTTCGCCGAG
CAGGGCGTCA ACATCGCTGC CCAGTATCTG CAAACCTCTG CGCAAATGGG TTATGTAGTT
ATTGATATTG AAGCCGACGA AGACGTTGCC GAAAAAGCGC TGCAGGCAAT GAAAGCTATT
CCGGGTACCA TTCGCGCCCG TCTGCTGTAC TAA
 
Protein sequence
MAKVSLEKDK IKFLLVEGVH QKALESLRAA GYTNIEFHKG ALDDEQLKES IRDAHFIGLR 
SRTHLTEDVI NAAEKLVAIG CFCIGTNQVD LDAAAKRGIP VFNAPFSNTR SVAELVIGEL
LLLLRGVPEA NAKAHRGVWN KLAAGSFEAR GKKLGIIGYG HIGTQLGILA ESLGMYVYFY
DIENKLPLGN ATQVQHLSDL LNMSDVVSLH VPENPSTKNM MGAKEISLMK PGSLLINASR
GTVVDIPALC DALASKHLAG AAIDVFPTEP ATNSDPFTSP LCEFDNVLLT PHIGGSTQEA
QENIGLEVAG KLIKYSDNGS TLSAVNFPEV SLPLHGGRRL MHIHENRPGV LTALNKIFAE
QGVNIAAQYL QTSAQMGYVV IDIEADEDVA EKALQAMKAI PGTIRARLLY
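
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used to produce the record: the `translate` function and `CODON_TABLE` name are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for elongation codons. Only the first and last few codons of the gene sequence above are used as input.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAAAGG TATCGCTGGA GAAAGACAAG"))  # MAKVSLEKDK
# Last 15 bases of the gene sequence above, including the TAA stop codon.
print(translate("CGTCTGCTGTACTAA"))  # RLLY
```

Both outputs match the corresponding start and end of the protein sequence listed above.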