Gene EcHS_A3071 details

Gene Information

Locus tag: EcHS_A3071
Symbol: serA
ID: 5594362
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3084689
End bp: 3085921
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 53%
IMG OID: 640922190
Product: D-3-phosphoglycerate dehydrogenase
Protein accession: YP_001459690
Protein GI: 157162372
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID: [TIGR01327] D-3-phosphoglycerate dehydrogenase
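
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3084689, 3085921)
print(length)                  # 1233
print(protein_length(length))  # 410
```

Both values agree with the Gene Length and Protein Length fields of this record.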


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00000000531719
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAGG TATCGCTGGA GAAAGACAAG ATTAAGTTTC TGCTGGTAGA AGGCGTGCAC 
CAAAAGGCGC TGGAAAGCCT TCGTGCAGCT GGTTACACCA ACATCGAATT TCACAAAGGC
GCGCTGGATG ATGAACAATT AAAAGAATCC ATCCGCGATG CCCACTTCAT CGGCCTGCGA
TCCCGTACCC ATCTGACTGA AGACGTGATC AACGCCGCAG AAAAACTGGT CGCTATTGGC
TGTTTCTGTA TCGGAACAAA CCAGGTTGAT CTGGATGCGG CGGCAAAGCG CGGGATCCCG
GTATTTAACG CACCGTTCTC AAATACGCGC TCTGTTGCGG AGCTGGTGAT TGGCGAACTG
CTGCTGCTAT TGCGCGGCGT GCCGGAAGCC AATGCTAAAG CGCACCGTGG CGTGTGGAAC
AAACTGGCGG CGGGTTCTTT TGAAGCGCGC GGCAAAAAGC TGGGTATCAT CGGCTACGGT
CATATTGGTA CGCAATTGGG CATTCTGGCT GAATCGCTGG GAATGTATGT TTACTTTTAT
GATATTGAAA ATAAACTGCC GCTGGGCAAC GCCACTCAGG TACAGCATCT TTCTGACCTG
CTGAATATGA GCGATGTGGT GAGTCTGCAT GTACCAGAGA ATCCGTCCAC CAAAAATATG
ATGGGCGCGA AAGAAATTTC ACTAATGAAG CCCGGCTCGC TGCTGATTAA TGCTTCGCGC
GGTACTGTGG TGGATATTCC GGCGCTGTGT GATGCGCTGG CGAGCAAACA TCTGGCGGGG
GCGGCAATCG ACGTATTCCC GACGGAACCG GCGACCAATA GCGATCCATT TACCTCTCCG
CTGTGTGAAT TCGACAACGT CCTTCTGACG CCACACATTG GCGGTTCGAC TCAGGAAGCG
CAGGAGAATA TCGGCCTGGA AGTTGCGGGT AAATTGATCA AGTATTCTGA CAATGGCTCA
ACGCTCTCTG CGGTGAACTT CCCGGAAGTC TCGCTGCCAC TGCACGGTGG GCGTCGTCTG
ATGCACATCC ACGAAAACCG TCCGGGCGTG CTAACTGCGC TCAACAAAAT TTTTGCCGAG
CAGGGCGTCA ACATCGCCGC GCAATATCTG CAAACTTCCG CCCAGATGGG TTATGTAGTT
ATTGATATTG AAGCCGACGA AGACGTTGCC GAAAAAGCGC TGCAGGCAAT GAAAGCTATT
CCGGGTACCA TTCGCGCCCG TCTGCTGTAC TAA
 
Protein sequence
MAKVSLEKDK IKFLLVEGVH QKALESLRAA GYTNIEFHKG ALDDEQLKES IRDAHFIGLR 
SRTHLTEDVI NAAEKLVAIG CFCIGTNQVD LDAAAKRGIP VFNAPFSNTR SVAELVIGEL
LLLLRGVPEA NAKAHRGVWN KLAAGSFEAR GKKLGIIGYG HIGTQLGILA ESLGMYVYFY
DIENKLPLGN ATQVQHLSDL LNMSDVVSLH VPENPSTKNM MGAKEISLMK PGSLLINASR
GTVVDIPALC DALASKHLAG AAIDVFPTEP ATNSDPFTSP LCEFDNVLLT PHIGGSTQEA
QENIGLEVAG KLIKYSDNGS TLSAVNFPEV SLPLHGGRRL MHIHENRPGV LTALNKIFAE
QGVNIAAQYL QTSAQMGYVV IDIEADEDVA EKALQAMKAI PGTIRARLLY
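
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The following is a minimal Python sketch that uses only the standard library. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 33 bases of the gene sequence listed above.
print(translate("ATGGCAAAGG TATCGCTGGA GAAAGACAAG"))      # MAKVSLEKDK
print(translate("CCGGGTACCA TTCGCGCCCG TCTGCTGTAC TAA"))  # PGTIRARLLY
```

The two outputs match the first and last ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.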