Gene EcolC_0797 details


Gene Information

Locus tag: EcolC_0797
Symbol:
ID: 6066820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 854334
End bp: 855566
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 53%
IMG OID: 641600201
Product: D-3-phosphoglycerate dehydrogenase
Protein accession: YP_001723796
Protein GI: 170018842
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID: [TIGR01327] D-3-phosphoglycerate dehydrogenase
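
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(854334, 855566)
print(length)                     # 1233
print(protein_length_aa(length))  # 410
```

Both results agree with the values listed in the record (1233 bp and 410 aa).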


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0331169
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGG TATCGCTGGA GAAAGACAAG ATTAAGTTTC TGCTGGTAGA AGGCGTGCAC 
CAAAAGGCGC TGGAAAGCCT TCGTGCAGCT GGTTACACCA ACATCGAATT TCACAAAGGC
GCGCTGGATG ATGAACAATT AAAAGAATCC ATCCGCGATG CCCACTTCAT CGGCCTGCGA
TCCCGTACCC ATCTGACTGA AGACGTGATC AACGCCGCAG AAAAACTGGT CGCTATTGGC
TGTTTCTGTA TCGGAACAAA CCAGGTTGAT CTGGATGCGG CGGCAAAGCG CGGGATCCCG
GTATTTAACG CACCGTTCTC AAATACGCGC TCTGTTGCGG AGCTGGTGAT TGGCGAACTG
CTGCTGCTAT TGCGCGGCGT GCCGGAAGCC AATGCTAAAG CGCACCGTGG CGTGTGGAAC
AAACTGGCGG CGGGTTCTTT TGAAGCGCGC GGCAAAAAGC TGGGTATCAT CGGCTACGGT
CATATTGGTA CGCAATTGGG CATTCTGGCT GAATCGCTGG GAATGTATGT TTACTTTTAT
GATATTGAAA ACAAACTGCC GCTGGGCAAC GCCACTCAGG TACAGCATCT TTCTGACCTG
CTGAATATGA GCGATGTGGT GAGTCTGCAT GTACCAGAGA ATCCGTCCAC CAAAAATATG
ATGGGCGCGA AAGAAATTTC ACTAATGAAG CCCGGCTCGC TGCTGATTAA TGCTTCGCGC
GGTACTGTGG TGGATATTCC GGCGCTGTGT GATGCGCTGG CGAGCAAACA TCTGGCGGGG
GCGGCAATCG ACGTATTCCC GACGGAACCG GCGACCAATA GCGATCCATT TACCTCTCCG
CTGTGTGAAT TCGACAACGT CCTTCTGACG CCACACATTG GCGGTTCGAC TCAGGAAGCG
CAGGAGAATA TTGGCCTGGA AGTTGCGGGT AAATTGATCA AGTATTCTGA CAATGGCTCA
ACGCTCTCTG CGGTGAACTT CCCGGAAGTC TCGCTGCCAC TGCACGGTGG GCGTCGTCTG
ATGCACATCC ACGAAAACCG TCCGGGCGTG CTAACTGCGC TGAACAAAAT CTTCGCCGAG
CAGGGCGTCA ACATCGCCGC GCAATATCTA CAAACTTCCG CCCAGATGGG TTATGTGGTT
ATTGATATTG AAGCCGACGA AGACGTTGCC GAAAAAGCGC TGCAGGCAAT GAAAGCTATT
CCGGGTACCA TTCGCGCCCG TCTGCTGTAC TAA
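
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout and compute GC content. The helper names are hypothetical, and only the first line of the listing is used as sample input. The 53% reported in the record refers to the full gene and is presumably rounded, so the sketch does not assert it.

```python
def clean_sequence(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, copied verbatim.
first_line = "ATGGCAAAGG TATCGCTGGA GAAAGACAAG ATTAAGTTTC TGCTGGTAGA AGGCGTGCAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

To check the full gene, pass all lines of the listing to `clean_sequence` as one string.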
 
Protein sequence
MAKVSLEKDK IKFLLVEGVH QKALESLRAA GYTNIEFHKG ALDDEQLKES IRDAHFIGLR 
SRTHLTEDVI NAAEKLVAIG CFCIGTNQVD LDAAAKRGIP VFNAPFSNTR SVAELVIGEL
LLLLRGVPEA NAKAHRGVWN KLAAGSFEAR GKKLGIIGYG HIGTQLGILA ESLGMYVYFY
DIENKLPLGN ATQVQHLSDL LNMSDVVSLH VPENPSTKNM MGAKEISLMK PGSLLINASR
GTVVDIPALC DALASKHLAG AAIDVFPTEP ATNSDPFTSP LCEFDNVLLT PHIGGSTQEA
QENIGLEVAG KLIKYSDNGS TLSAVNFPEV SLPLHGGRRL MHIHENRPGV LTALNKIFAE
QGVNIAAQYL QTSAQMGYVV IDIEADEDVA EKALQAMKAI PGTIRARLLY
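
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own procedure. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons) and that the sequence is read from its first base in frame. `translate` and the table constants are illustrative names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence listing.
print(translate("ATGGCAAAGG TATCGCTGGA GAAAGACAAG"))         # MAKVSLEKDK
print(translate("CCGGGTACCA TTCGCGCCCG TCTGCTGTAC TAA"))     # PGTIRARLLY
```

The two outputs match the first and last blocks of the protein sequence. The final line can be translated on its own because it begins at base 1201, which is in frame.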