Gene EcSMS35_3898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3898 
SymboldlgD 
ID6144299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3968804 
End bp3969802 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content54% 
IMG OID641618724 
Product2,3-diketo-L-gulonate reductase 
Protein accessionYP_001745863 
Protein GI170680670 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTGA CATTTGAGCA GTTAAAAGCA GCCTTTAATC GGGTCTTAAT TTCACGCGGC 
GTTGACAGCG AAACGGCTGA CGCCTGTGCA GAGATGTTCG CCCGCACCAC CGAATCCGGC
GTTTATTCTC ACGGCGTTAA TCGTTTCCCT CGTTTCATTC AACAACTGGA AAACGGCGAT
ATCATTCCTG ATGCCCAACC CAAACGTATA ACCAGCCTCG GCGCAATTGA ACAGTGGGAC
GCCCAGCGTT CGATCGGTAA CCTGACAGCG AAAAAGATGA TGGATCGCGC CATTGAACTG
GCTGCCGATC ACGGTATTGG TCTGGTGGCA CTACGTAATG CCAACCACTG GATGCGCGGC
GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATTG GCATTTGCTG GACCAACTCC
ATCGCCGTAA TGCCGCCGTG GGGCGCAAAA GAGTGTCGCA TCGGCACCAA CCCGCTGATC
GTCGCCATTC CTTCTACCCC AATCACCATG GTCGATATGT CGATGTCGAT GTTCTCTTAC
GGCATGTTAG AAGTTAACCG TCTGGCAGGC CGTCAGCTCC CGGTCGATGG TGGCTTTGAT
GATGAGGGCA ATTTGACCAA AGAACCTGGC GTTATCGAGA AGAATCGCCG CATTTTGCCG
ATGGGCTACT GGAAAGGTTC TGGCATGTCG ATTGTGCTGG ATATGATCGC CACTCTCCTT
TCCGACGGCG CATCGGTTGC CGAAGTCACC CAGGACAACA GCGACGAATA CGGCGTTTCG
CAAATCTTTA TCGCCATCGA AGTGGATAAA TTGATCGACG GCCCCACCCG CGATGCCAAG
CTGCAACGCA TCATGGATTA CGTTACTACC GCTGAACGCG CTGACGAAAA CCAGGCCATC
CGCTTACCCG GCCACGAATT TACTACCCTG CTGGCCGAAA ACCGCCGTAA CGGCATCACC
GTTGATGACA GCGTGTGGGC AAAAATCCAG GCGTTATAA
 
Protein sequence
MKVTFEQLKA AFNRVLISRG VDSETADACA EMFARTTESG VYSHGVNRFP RFIQQLENGD 
IIPDAQPKRI TSLGAIEQWD AQRSIGNLTA KKMMDRAIEL AADHGIGLVA LRNANHWMRG
GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY
GMLEVNRLAG RQLPVDGGFD DEGNLTKEPG VIEKNRRILP MGYWKGSGMS IVLDMIATLL
SDGASVAEVT QDNSDEYGVS QIFIAIEVDK LIDGPTRDAK LQRIMDYVTT AERADENQAI
RLPGHEFTTL LAENRRNGIT VDDSVWAKIQ AL