Gene EcSMS35_0911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0911 
Symboldld 
ID6144145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp916780 
End bp918495 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content52% 
IMG OID641615799 
ProductD-lactate dehydrogenase 
Protein accessionYP_001742991 
Protein GI170681665 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.42749 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.35472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCCA TGACAACAAC TGATAATAAA GCCTTTTTGA ATGAACTTGC CCGTCTGGTC 
GGTCATTCAC ACCTGCTCAC CGATCCCGCA AAAACGGCCC GCTATCGCAA GGGCTTCCGT
TCTGGTCAGG GCGACGCGCT GGCTGTCGTT TTCCCTGGCT CACTACTGGA ATTGTGGCGG
GTGCTGAAAG CCTGCGTCAC CGCCGACAAA ATTATTCTGA TGCAGGCCGC CAATACAGGT
CTGACCGAAG GCTCGACGCC AAACGGTAAC GACTATGATC GCGATATCGT GATCATCAGC
ACCCTGCGTC TCGATAAGTT GCACGTTCTC GGCAAGGGCG AACAAGTGCT GGCCTATCCG
GGCACCACGC TTTATTCGCT GGAAAAAGCA CTCAAACCGC TGGGACGCGA ACCGCACTCG
GTGATTGGAT CATCGTGTAT AGGCGCATCG GTCATCGGCG GTATTTGTAA CAACTCCGGC
GGCTCGCTGG TGCAGCGTGG CCCGGCGTAT ACCGAAATGT CGTTATTCGC GCGTATAAAT
GAAGACGGCA AACTGACGCT GGTGAACCAT CTCGGGATTG ATCTGGGCGA AACGCCGGAG
CAGATCCTTA GCAAGCTGGA TGACGATCGC ATCAAAGATG ACGATGTGCG TCACGATGGT
CGTCACGCCC ACGATTATGA CTATGTCCAC CGCGTTCGTG ATATTGAAGC CAACACGCCC
GCACGTTATA ACGCCGATCC GGATCGGTTA TTTGAATCTT CTGGTTGCGC AGGTAAGCTG
GCCGTCTTTG CGGTACGTCT TGATACCTTC GAAGCGGAAA AAAATCAGCA GGTGTTTTAT
ATCGGCACCA ACCAGCCGGA AGTGCTGACC GAAATCCGCC GTCATATTCT GGCTAATTTC
GAAAATCTGC CGGTTGCCGG GGAATATATG CACCGGGATA TCTACGATAT TGCGGAAAAA
TACGGCAAAG ACACCTTCCT GATGATTGAT AAGTTAGGCA CCGACAAGAT GCCATTCTTC
TTTAATCTCA AGGGGCGCAC CGATGCGATG CTGGAGAAAG TGAAATTCTT CCGTCCCCAT
TTTACCGACC GTGCGATGCA AAAATTCGGT CACCTGTTCC CCAGCCATTT ACCGCCGCGC
ATGAAAAACT GGCGCGATAA ATACGAACAT CATCTGCTGT TAAAAATGGC GGGCGATGGC
GTCGGTGAAG CCAAATCGTG GCTGGTGGAT TATTTCAAAC AGGCCGAAGG CGATTTCTTT
GTCTGTACGC CGGAGGAAGG TAGCAAAGCG TTTTTACACC GTTTCGCCGC TGCGGGCGCA
GCAATTCGTT ATCAGGCGGT GCATTCCGAT GAAGTCGAAG ACATTCTGGC ACTGGATATC
GCTCTGCGGC GTAACGACAC CGAGTGGTAT GAGCATTTAC CGCCGGAGAT CGACAACCAA
CTGGTGCACA AGCTCTATTA CGGCCATTTT ATGTGCTATG TCTTCCATCA GGATTACATC
GTTAAAAAAG GCGTGGATGT GCATGCGTTG AAAGAACAGA TGCTGGAACT GCTGCAGCAG
CGTGGCGCGC AATACCCTGC CGAGCATAAC GTCGGTCATT TGTATAAAGC ACCGGAGACG
TTGCAGAAGT TCTATCGCGA GAACGATCCG ACCAACAGTA TGAATCCGGG GATCGGTAAA
ACCAGTAAGC GGAAAAACTG GCAGGAAGTG GAGTAA
 
Protein sequence
MSSMTTTDNK AFLNELARLV GHSHLLTDPA KTARYRKGFR SGQGDALAVV FPGSLLELWR 
VLKACVTADK IILMQAANTG LTEGSTPNGN DYDRDIVIIS TLRLDKLHVL GKGEQVLAYP
GTTLYSLEKA LKPLGREPHS VIGSSCIGAS VIGGICNNSG GSLVQRGPAY TEMSLFARIN
EDGKLTLVNH LGIDLGETPE QILSKLDDDR IKDDDVRHDG RHAHDYDYVH RVRDIEANTP
ARYNADPDRL FESSGCAGKL AVFAVRLDTF EAEKNQQVFY IGTNQPEVLT EIRRHILANF
ENLPVAGEYM HRDIYDIAEK YGKDTFLMID KLGTDKMPFF FNLKGRTDAM LEKVKFFRPH
FTDRAMQKFG HLFPSHLPPR MKNWRDKYEH HLLLKMAGDG VGEAKSWLVD YFKQAEGDFF
VCTPEEGSKA FLHRFAAAGA AIRYQAVHSD EVEDILALDI ALRRNDTEWY EHLPPEIDNQ
LVHKLYYGHF MCYVFHQDYI VKKGVDVHAL KEQMLELLQQ RGAQYPAEHN VGHLYKAPET
LQKFYRENDP TNSMNPGIGK TSKRKNWQEV E