Gene ECH74115_5849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5849 
Symbol 
ID6969397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5501448 
End bp5502599 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content53% 
IMG OID643389471 
Productputative 2-hydroxyglutaryl-CoA dehydratase, D-component 
Protein accessionYP_002273863 
Protein GI209398248 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1775] Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.981774 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTTG TCACCGATCT ACCCGCCATT TTCGATCAGT TCTCTGAAGC TCGCCAGAAA 
GGCTTTCTCA CCGTCATGGA TCTCAAGGTG CGCGGCATTC CGCTGGTTGG CACTTACTGC
ACCTTTATGC CGCAAGAGAT CCCGATGGCA GCCGGTGCGG TTGTGGTTTC GCTCTGTTCC
ACCTCTGATG AAACCATTGA AGAAGCGGAG AAAGATCTGC CGCGCAACCT CTGCCCGCTG
ATTAAAAGTA GCTACGGCTT CGGCAAAACC GATAAATGCC CCTACTTCTA CTTTTCGGAT
CTGGTGGTCG GTGAAACCAC CTGCGACGGC AAAAAGAAAA TGTATGAATA CATGGCGGAG
TTTAAGCCCG TTCATGTGAT GCAGCTGCCA AACAGCGTTA AAGACGATGC CTCGCGTGCG
TTATGGAAAG CCGAGATGCT GCGCTTACAA AAAGCGGTGG AAGAACGTTT TGGGCACGAA
ATTAGCGAAG ATGCTCTGCG CGATGCCATT GCGCTGAAAA ACCGCGAACG TCGCGCACTG
GCCAATTTTT ATCATCTTGG GCAGTTCAAT CCTCCGGCGC TTAGCGGCAG CGACATTCTG
AAAGTGGTTT ACGGCGCAAC CTTCCGGTTC GATAAAGAGG CGTTGATCAA TGAACTGGAC
GCGATGACCG CCCGCATTCG TCAGCAGTGG GAAGAAGGCC AGCGACTGGA CCCGCGTCCG
CGCATTTTAA TCACCGGCTG CCCGATTGGC GGCGCAGCAG AGAAAGTGGT GCGCGCGATT
GAAGAGAATG GCGGCTGGGT TGTCGGTTAT GAAAACTGCA CCGGGGCGAA AGCGACCGAG
CAATGCGTGG TAGAAACGGG CGATGTCTAC GACGCGCTGG CGGATAAATA TCTGGCGATT
GGCTGCTCCT GTGTTTCGCC GAACGATCAG CGCCTGAAAA TGCTCAGCCA GATGGTGGAA
GAATATCAGG TCGATGGCGT AGTTGATGTG ATTTTGCAGG CGTGCCATAC CTACGCGGTG
GAATCGCTGG CAATTAAACG TCATGTGCGT CAGCAGCACA ACATTCCTTA TATCGCTATT
GAAACAGACT ACTCCACCTC GGATGTTGGG CAGCTCAGTA CCCGTGTCGC GGCCTTTATT
GAGATGCTGT AA
 
Protein sequence
MSLVTDLPAI FDQFSEARQK GFLTVMDLKV RGIPLVGTYC TFMPQEIPMA AGAVVVSLCS 
TSDETIEEAE KDLPRNLCPL IKSSYGFGKT DKCPYFYFSD LVVGETTCDG KKKMYEYMAE
FKPVHVMQLP NSVKDDASRA LWKAEMLRLQ KAVEERFGHE ISEDALRDAI ALKNRERRAL
ANFYHLGQFN PPALSGSDIL KVVYGATFRF DKEALINELD AMTARIRQQW EEGQRLDPRP
RILITGCPIG GAAEKVVRAI EENGGWVVGY ENCTGAKATE QCVVETGDVY DALADKYLAI
GCSCVSPNDQ RLKMLSQMVE EYQVDGVVDV ILQACHTYAV ESLAIKRHVR QQHNIPYIAI
ETDYSTSDVG QLSTRVAAFI EML