Gene ECH74115_4978 details


Gene Information

Locus tag: ECH74115_4978
Symbol: lldD
ID: 6966815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4629775
End bp: 4630965
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 55%
IMG OID: 643388660
Product: L-lactate dehydrogenase
Protein accession: YP_002273087
Protein GI: 209396531
COG category: [C] Energy production and conversion
COG ID: [COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases
TIGRFAM ID:
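
The listed lengths can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Both assumptions are consistent with the values in the table but are not stated by the record itself. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 4629775
END_BP = 4630965


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: total codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1191 396
```

Both results agree with the Gene Length (1191 bp) and Protein Length (396 aa) fields.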


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTATTT CCGCAGCCAG CGATTATCGC GCCGCAGCGC AACGCATTCT GCCGCCGTTC 
CTGTTCCACT ATATGGATGG TGGTGCATAT TCTGAATACA CGCTGCGCCG CAACGTGGAA
GATTTGTCAG AAGTGGCACT GCGCCAGCGT ATTCTGAAAA ACATGTCTGA CTTAAGCCTG
GAAACGACGC TGTTTAATGA GAAATTGTCG ATGCCGGTAG CACTGGCACC GGTGGGTTTG
TGTGGCATGT ATGCGCGTCG TGGCGAAGTT CAGGCAGCCA AAGCGGCGGA CGCGCATGGC
ATTCCGTTTA CTCTCTCAAC GGTTTCCGTT TGCCCGATTG AAGAAGTCGC GCCAGCCATC
AAGCGCCCAA TGTGGTTCCA GCTTTATGTA CTGCGCGATC GCGGCTTTAT GCGTAACGCG
CTGGAGCGAG CAAAAGCAGC GGGTTGTTCG ACGCTGGTTT TCACCGTGGA TATGCCGACA
CCGGGAGCGC GTTATCGTGA TGCGCATTCT GGGATGAGCG GCCCAAATGC GGCAATGCGC
CGCTACTTGC AAGCGGTGAC GCATCCGCAA TGGGCGTGGG ATGTGGGCCT GAACGGTCGT
CCGCATGATT TAGGTAATAT TTCGGCTTAC CTCGGCAAAC CGACCGGACT GGAAGATTAC
ATCGGCTGGC TGGGGAATAA CTTCGATCCG TCCATCTCAT GGAAAGACCT TGAGTGGATC
CGCGATTTCT GGGATGGCCC GATGGTGATC AAAGGGATCC TCGATCCGGA AGATGCGCGC
GATGCAGTAC GTTTTGGTGC TGATGGGATT GTAGTTTCTA ACCACGGTGG CCGCCAGCTG
GACGGTGTAC TCTCTTCCGC TCGTGCGTTG CCCGCTATTG CAGATGCGGT GAAAGGTGAT
ATCGCCATTC TGGCGGATAG CGGAATTCGT AACGGGCTTG ATGTCGTGCG TATGATTGCG
CTCGGTGCCG ACACCGTACT GCTGGGTCGT GCTTTCCTGT ATGCGCTGGC AACAGCGGGC
CAGGCGGGTG TAGCTAACCT GCTAAATCTG ATCGAGAAAG AGATGAAAGT GGCAATGACG
CTGACTGGCG CGAAATCGAT CAGTGAAATT ACGCAAGATT CGCTGGTGCA GGGGCTGGGT
AAAGAGTTGC CTACGGCACT GGCTCCGATG GCGAAAGGGA ATGCGGCATA G
 
Protein sequence
MIISAASDYR AAAQRILPPF LFHYMDGGAY SEYTLRRNVE DLSEVALRQR ILKNMSDLSL 
ETTLFNEKLS MPVALAPVGL CGMYARRGEV QAAKAADAHG IPFTLSTVSV CPIEEVAPAI
KRPMWFQLYV LRDRGFMRNA LERAKAAGCS TLVFTVDMPT PGARYRDAHS GMSGPNAAMR
RYLQAVTHPQ WAWDVGLNGR PHDLGNISAY LGKPTGLEDY IGWLGNNFDP SISWKDLEWI
RDFWDGPMVI KGILDPEDAR DAVRFGADGI VVSNHGGRQL DGVLSSARAL PAIADAVKGD
IAILADSGIR NGLDVVRMIA LGADTVLLGR AFLYALATAG QAGVANLLNL IEKEMKVAMT
LTGAKSISEI TQDSLVQGLG KELPTALAPM AKGNAA
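
As a rough illustration of how the two sequence blocks relate, the sketch below translates a gene fragment codon by codon and computes a GC percentage. It uses only the standard library. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard one, which is assumed here to be adequate because translation table 11 differs from it only in which codons may act as start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, as printed above.
print(translate("ATGATTATTT CCGCAGCCAG CGATTATCGC"))  # MIISAASDYR
```

Passing the full gene sequence to `translate` should yield the protein sequence shown, and `gc_percent` on the same input can be compared with the GC content field.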