Gene ECH74115_2002 details


Gene Information

Locus tag: ECH74115_2002
Symbol: idhA
ID: 6966898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1893946
End bp: 1894935
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 50%
IMG OID: 643385921
Product: D-lactate dehydrogenase
Protein accession: YP_002270410
Protein GI: 209397196
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1052] Lactate dehydrogenase and related dehydrogenases
TIGRFAM ID:
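
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length; neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1893946, 1894935)
print(length, protein_length(length))  # 990 329
```

Under these assumptions the computed values agree with the Gene Length (990 bp) and Protein Length (329 aa) fields.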


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCG CCGTTTATAG CACAAAACAG TACGACAAGA AGTACCTGCA ACAGGTGAAC 
GAGTCCTTTG GCTTTGAGCT GGAATTTTTT GACTTTCTGC TGACGGAAAA AACCGCTAAA
ACTGCCAATG GCTGCGAAGC GGTATGTATT TTCGTAAACG ATGACGGCAG CCGCCCGGTG
CTGGAAGAGC TGAAAAAGCA CGGCGTTAAA TATATCGCCC TGCGCTGTGC CGGTTTCAAT
AACGTCGACC TTGACGCGGC AAAAGAACTG GGGCTGAAAG TAGTCCGTGT TCCAGCCTAT
GATCCAGAGG CCGTTGCTGA ACACGCCATC GGTATGATGA TGACGCTGAA CCGCCGTATT
CACCGCGCGT ATCAGCGTAC CCGTGACGCT AACTTCTCTC TGGAAGGTCT GACCGGCTTT
ACTATGTATG GCAAAACGGC AGGCGTTATC GGTACCGGTA AAATCGGTGT GGCGATGCTG
CGCATTCTGA AAGGTTTTGG TATGCGTCTG CTGGCGTTCG ATCCGTATCC AAGTGCAGCG
GCGCTGGAAC TCGGTGTGGA GTATGTCGAT CTGCCAACCC TGTTCTCTGA ATCAGACGTT
ATCTCTCTGC ACTGCCCGCT GACACCGGAA AACTACCATC TGTTGAACGA AGCCGCCTTC
GATCAGATGA AAAATGGCGT GATGATCGTC AATACCAGTC GCGGTGCATT GATTGATTCT
CAGGCAGCAA TTGAAGCGCT GAAAAATCAG AAAATTGGTT CGTTGGGTAT GGACGTGTAT
GAGAACGAAC GCGATCTGTT CTTTGAAGAT AAATCCAACG ACGTGATCCA GGATGACGTA
TTCCGTCGCC TGTCTGCCTG CCACAACGTG CTGTTTACCG GGCACCAGGC ATTCCTGACA
GCAGAAGCTC TGACCAGTAT TTCTCAGACT ACGCTGCAAA ACTTAAGCAA TCTGGAAAAA
GGTGAAACCT GCCCGAACGA ACTGGTTTAA
 
Protein sequence
MKLAVYSTKQ YDKKYLQQVN ESFGFELEFF DFLLTEKTAK TANGCEAVCI FVNDDGSRPV 
LEELKKHGVK YIALRCAGFN NVDLDAAKEL GLKVVRVPAY DPEAVAEHAI GMMMTLNRRI
HRAYQRTRDA NFSLEGLTGF TMYGKTAGVI GTGKIGVAML RILKGFGMRL LAFDPYPSAA
ALELGVEYVD LPTLFSESDV ISLHCPLTPE NYHLLNEAAF DQMKNGVMIV NTSRGALIDS
QAAIEALKNQ KIGSLGMDVY ENERDLFFED KSNDVIQDDV FRRLSACHNV LFTGHQAFLT
AEALTSISQT TLQNLSNLEK GETCPNELV
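
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any check. The following is a minimal sketch, not the database's own tooling, of how one might strip the spacing, compute GC content, and translate the coding sequence. It relies on the fact that translation table 11 shares the standard codon assignments and differs only in which start codons are permitted; since this gene starts with ATG, alternative starts are not handled. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes an ATG start; alternative table 11 starts are not remapped to M.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence as listed above.
first_row = "ATGAAACTCG CCGTTTATAG CACAAAACAG TACGACAAGA AGTACCTGCA ACAGGTGAAC"
print(translate(first_row))  # MKLAVYSTKQYDKKYLQQVN
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be compared in the same way.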