Gene EcHS_A3779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3779 
SymboldlgD 
ID5592987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3772036 
End bp3773034 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content53% 
IMG OID640922893 
Product2,3-diketo-L-gulonate reductase 
Protein accessionYP_001460371 
Protein GI157163053 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTGA CATTTGAGCA GTTAAAAGCA GCCTTTAATC GGGTCTTAAT TTCACGCGGC 
GTTGACAGCG AAACGGCTGA CGCCTGTGCA GAGATGTTCG CCCGCACCAC CGAATCCGGC
GTTTATTCTC ACGGCGTTAA TCGTTTCCCT CGTTTCATTC AACAACTGGA AAACGGCGAT
ATCATTCCTG ATGCCCAACC CAAACGTATA ACCAGCCTCG GCGCAATTGA ACAGTGGGAC
GCCCAGCGTT CGATCGGTAA CCTGACAGCG AAAAAGATGA TGGATCGCGC CATTGAACTG
GCTGCCGATC ACGGTATTGG TCTGGTGGCA CTACGTAATG CCAACCACTG GATGCGCGGC
GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATTG GCATTTGCTG GACCAACTCC
ATCGCCGTAA TGCCGCCGTG GGGCGCAAAA GAGTGTCGCA TCGGCACTAA CCCGCTGATC
GTCGCCATTC CTTCCACGCC GATCACCATG GTCGATATGT CGATGTCGAT GTTCTCTTAC
GGCATGTTAG AAGTTAACCG TCTGGCAGGT CGTCAGCTCC CGGTCGATGG TGGCTTTGAT
GATGAGGGCA ATTTGACCAA AGAACCTGGC GTTATCGAGA AGAATCGCCG CATTTTGCCG
ATGGGCTACT GGAAAGGTTC TGGCATGTCG ATTGTGCTGG ATATGATCGC TACTCTCCTT
TCCGACGGCG CATCCGTTGC CGAAGTCACC CAGGACAACA GCGACGAATA CGGCATTTCA
CAAATTTTTA TTGCCATTGA AGTGGACAAG CTTATCGACG GTCCCACCCG CGATGCCAAG
CTGCAACGCA TCATGGATTA CGTTACTAGT GCCGAGCGTG CTGACGAAAA TCAGGCCATT
CGCTTACCCG GCCATGAATT TACTACCCTG CTGGCCGAAA ACCGCCGTAA CGGTATTACC
GTTGATGACA GCGTGTGGGC CAAAATCCAG GCGTTATAA
 
Protein sequence
MKVTFEQLKA AFNRVLISRG VDSETADACA EMFARTTESG VYSHGVNRFP RFIQQLENGD 
IIPDAQPKRI TSLGAIEQWD AQRSIGNLTA KKMMDRAIEL AADHGIGLVA LRNANHWMRG
GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY
GMLEVNRLAG RQLPVDGGFD DEGNLTKEPG VIEKNRRILP MGYWKGSGMS IVLDMIATLL
SDGASVAEVT QDNSDEYGIS QIFIAIEVDK LIDGPTRDAK LQRIMDYVTS AERADENQAI
RLPGHEFTTL LAENRRNGIT VDDSVWAKIQ AL