Gene EcSMS35_1771 details

Gene Information

Locus tag: EcSMS35_1771
Symbol: idhA
ID: 6145272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1786687
End bp: 1787676
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 50%
IMG OID: 641616647
Product: D-lactate dehydrogenase
Protein accession: YP_001743825
Protein GI: 170680565
COG category: [C] Energy production and conversion
              [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1052] Lactate dehydrogenase and related dehydrogenases
TIGRFAM ID:
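
The start, end, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming 1-based inclusive coordinates (implied by the listed gene length) and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1786687, 1787676)
print(length, protein_length(length))  # 990 329
```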


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.914114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCG CCGTTTATAG CACAAAACAG TACGACAAGA AGTACCTGCA ACAGGTGAAC 
GAGTCCTTTG GCTTTGAGCT GGAATTTTTT GACTTTCTGC TGACGGAAAA AACCGCTAAA
ACTGCCAATG GCTGCGAAGC GGTATGTATT TTCGTAAACG ATGACGGCAG CCGCCCGGTG
CTGGAAGAGC TGAAAAAGCA CGGCGTTAAA TATATCGCCC TGCGCTGTGC CGGTTTCAAT
AACGTCGACC TTGACGCGGC AAAAGAACTG GGGCTGAAAG TAGTCCGTGT TCCAGCCTAT
GATCCAGAAG CCGTTGCTGA ACATGCCATC GGTATGATGA TGACGCTGAA CCGCCGTATT
CACCGCGCGT ATCAGCGTAC CCGTGACGCC AACTTCTCTC TGGAAGGTCT GACCGGCTTT
ACTATGTATG GCAAAACGGC AGGCGTTATC GGTACTGGTA AAATCGGTGT GGCGATGCTG
CGCATTCTGA AAGGTTTTGG TATGCGTCTG CTGGCGTTCG ATCCGTATCC AAGTGCGGCG
GCGCTGGAAC TCGGTGTGGA GTATGTCGAT CTGCCAACCC TGTTCTCTGA ATCAGACGTT
ATCTCTCTGC ACTGCCCGCT GACACCGGAA AACTATCATC TGTTGAACGA AGCCGCCTTC
GATCAGATGA AAAATGGCGT GATGATCGTC AATACCAGTC GCGGTGCATT GATTGATTCT
CAGGCAGCAA TTGAAGCGCT GAAAAATCAG AAAATTGGTT CGTTGGGTAT GGACGTGTAT
GAGAACGAAC GCGATCTGTT CTTTGAAGAC AAATCCAACG ACGTGATCCA GGATGACGTA
TTCCGTCGTC TGTCTGCCTG CCACAACGTG CTGTTTACCG GGCACCAGGC ATTCCTGACA
GCAGAAGCAC TGACCAGTAT TTCTCAGACT ACGCTGCAAA ACTTAAGCAA TCTGGAAAAA
GGCGAAACCT GCCCGAACGA ACTGGTTTAA
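
The GC content reported in the gene information is the share of G and C bases in a sequence like the one above. As a rough sketch (the helper name `gc_content` is an assumption, and the record does not say how its figure was rounded), it could be computed in Python as follows:

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to display the sequence in blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two 10-base blocks of the gene sequence only, not the full gene.
print(f"{gc_content('ATGAAACTCG CCGTTTATAG'):.0%}")  # 40%
```

Passing the full 990 bp gene sequence instead of the two-block excerpt gives the value for the whole gene.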
 
Protein sequence
MKLAVYSTKQ YDKKYLQQVN ESFGFELEFF DFLLTEKTAK TANGCEAVCI FVNDDGSRPV 
LEELKKHGVK YIALRCAGFN NVDLDAAKEL GLKVVRVPAY DPEAVAEHAI GMMMTLNRRI
HRAYQRTRDA NFSLEGLTGF TMYGKTAGVI GTGKIGVAML RILKGFGMRL LAFDPYPSAA
ALELGVEYVD LPTLFSESDV ISLHCPLTPE NYHLLNEAAF DQMKNGVMIV NTSRGALIDS
QAAIEALKNQ KIGSLGMDVY ENERDLFFED KSNDVIQDDV FRRLSACHNV LFTGHQAFLT
AEALTSISQT TLQNLSNLEK GETCPNELV
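
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below is one way to reproduce that in plain Python, under stated assumptions: the codon table is built from the standard 64-codon assignment (table 11 uses the same sense and stop codons and differs from the standard code only in which start codons it permits), and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping the trailing stop codon."""
    dna = "".join(cds.split()).upper()  # ignore display spacing
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    # This sketch assumes an ATG start; an alternative start codon
    # (for example GTG or TTG) would need to be forced to M separately.
    return protein.rstrip("*")


print(translate("ATGAAACTCG CCGTTTATAG CACAAAACAG"))  # MKLAVYSTKQ
print(translate("GAACTGGTTTAA"))                      # ELV
```

The two calls use the first thirty and last twelve bases of the gene sequence; their output matches the start and end of the protein sequence shown above.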