Gene EcSMS35_3942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3942 
SymbollldD 
ID6147041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4021194 
End bp4022384 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content56% 
IMG OID641618768 
ProductL-lactate dehydrogenase 
Protein accessionYP_001745907 
Protein GI170679931 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATTT CCGCAGCCAG CGATTATCGC GCCGCAGCAC AACGCATTCT GCCGCCGTTC 
CTGTTCCACT ATATGGATGG GGGTGCATAT TCTGAATACA CGCTGCGCCG CAACGTGGAA
GATTTGTCAG AAGTGGCGCT GCGCCAGCGT ATTCTGAAAA ACATGTCTGA CTTAAGCCTG
GAAACGACGC TGTTTAATGA GAAATTGTCG ATGCCGGTGG CGCTGGCTCC GGTGGGTTTG
TGTGGCATGT ATGCGCGACG CGGCGAAGTT CAGGCTGCCA AAGCAGCGGA TGCGCATGGC
ATTCCGTTTA CTCTCTCGAC GGTTTCCGTT TGCCCGATTG AAGAAGTGGC TCCGGCTATC
AAACGTCCGA TGTGGTTCCA GCTTTATGTG CTGCGCGATC GCGGCTTTAT GCGTAACGCA
CTGGAGCGAG CAAAAGCAGC GGGTTGTTCG ACACTGGTTT TCACCGTGGA TATGCCGACA
CCGGGCGCAC GTTACCGTGA TGCGCATTCT GGGATGAGCG GCCCAAATGC GGCAATGCGC
CGCTACTTGC AGGCGGTGAC GCATCCGCAA TGGGCGTGGG ATGTGGGCCT GAACGGTCGT
CCGCATGATT TAGGTAATAT CTCGGCTTAC CTCGGCAAAC CGACCGGACT GGAAGATTAC
ATCGGCTGGC TGGGGAATAA CTTCGATCCG TCCATCTCAT GGAAAGACCT TGAGTGGATC
CGCGATTTCT GGGATGGCCC GATGGTGATC AAAGGGATCC TCGATCCGGA AGATGCGCGC
GATGCAGTAC GTTTTGGTGC TGATGGGATT GTGGTTTCTA ACCACGGTGG CCGCCAGCTG
GACGGTGTAC TCTCTTCCGC TCGTGCATTG CCCGCTATTG CGGATGCGGT GAAAGGTGAT
ATTGCCATTC TGGCGGATAG CGGAATACGT AACGGGCTTG ATGTCGTGCG TATGATTGCG
CTCGGTGCCG ACACCGTACT GCTGGGTCGT GCTTTCCTGT ATGCGCTGGC AACAGCGGGC
CAGGCGGGTG TAGCTAACCT GCTAAATCTG ATCGAAAAAG AGATGAAAGT GGCGATGACG
CTGACTGGCG CGAAATCGAT TAGCGAAATT ACGCAAGATT CGCTGGTGCA AGAGCTGAGT
AAAGCGCCGG CGGCGGCGCT GGCTCCAATG GCGAAAGGGA ATGCAGCTTA A
 
Protein sequence
MIISAASDYR AAAQRILPPF LFHYMDGGAY SEYTLRRNVE DLSEVALRQR ILKNMSDLSL 
ETTLFNEKLS MPVALAPVGL CGMYARRGEV QAAKAADAHG IPFTLSTVSV CPIEEVAPAI
KRPMWFQLYV LRDRGFMRNA LERAKAAGCS TLVFTVDMPT PGARYRDAHS GMSGPNAAMR
RYLQAVTHPQ WAWDVGLNGR PHDLGNISAY LGKPTGLEDY IGWLGNNFDP SISWKDLEWI
RDFWDGPMVI KGILDPEDAR DAVRFGADGI VVSNHGGRQL DGVLSSARAL PAIADAVKGD
IAILADSGIR NGLDVVRMIA LGADTVLLGR AFLYALATAG QAGVANLLNL IEKEMKVAMT
LTGAKSISEI TQDSLVQELS KAPAAALAPM AKGNAA