Gene ECH74115_3972 details


Gene Information

Locus tag: ECH74115_3972
Symbol: hycE
ID: 6967056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3672891
End bp: 3674621
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 56%
IMG OID: 643387741
Product: formate hydrogenlyase, subunit E
Protein accession: YP_002272184
Protein GI: 209400571
COG category: [C] Energy production and conversion
COG ID: [COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit
        [COG3261] Ni,Fe-hydrogenase III large subunit
TIGRFAM ID:
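
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3672891
end_bp = 3674621
gene_length_bp = 1731
protein_length_aa = 576

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1731 0 576
```

Under those assumptions the reported gene length and protein length agree with the coordinates.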


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATTAAAG AGAGTTTGAG CATGTCTGAA GAAAAATTAG GTCAACATTA TCTCGCCGCG 
CTGAATGAGG CATTTCCGGG CGTCGTGCTG GACCACGCCT GGCAGACCAA AGATCAGCTG
ACTGTCACCG TAAAGGTGAA CTATCTGCCG GAAGTGGTGG AGTTTCTCTA CTACAAGCAG
GGGGGCTGGC TGTCGGTGCT TTTTGGTAAC GACGAACGCA AACTGAATGG TCATTACGCC
GTTTACTACG TGCTGTCGAT GGAGAAGGGG ACTAAGTGCT GGGTAACGGT TCGCGTCGAA
GTTGATGCCA ACAAACCGGA GTATCCTTCC GTGACGCCGC GCGTTCCGGC TGCGGTGTGG
GGCGAGCGTG AAGTACGTGA TATGTACGGT TTGATTCCGG TTGGTTTGCC GGATGAACGT
CGTCTGGTGC TGCCGGATGA CTGGCCGGAT GAACTTTATC CGCTGCGTAA AGACAGCATG
GATTATCGTC AGCGTCCGGC ACCGACCACC GATGCTGAAA CCTACGAGTT CATCAACGAA
CTGGGCGACA AGAAAAACAA CGTCGTGCCG ATTGGTCCGC TGCACGTCAC TTCTGACGAA
CCGGGCCACT TCCGTCTGTT CGTCGATGGC GAAAACATTA TCGACGCCGA CTACCGCCTG
TTCTATGTCC ATCGCGGCAT GGAAAAACTG GCAGAAACCC GCATGGGTTA TAACGAAGTG
ACCTTCCTCT CTGACCGTGT GTGCGGGATC TGCGGTTTTG CTCACAGCAC CGCCTACACC
ACGTCGGTGG AAAACGCGAT GGGTATTCAG GTGCCAGAAC GTGCGCAGAT GATCCGCGCC
ATTCTGCTGG AGGTAGAACG TCTGCACTCG CATCTGCTCA ACCTCGGCCT CGCCTGTCAC
TTTACCGGCT TTGACTCCGG CTTTATGCAG TTCTTCCGCG TGCGTGAAAC CTCCATGAAA
ATGGCAGAGA TCCTTACCGG TGCGCGTAAA ACCTACGGCC TGAACCTGAT CGGCGGGATT
CGTCGCGATC TGCTGAAAGA TGACATGATC CAGACCCGTC AACTGGCGCA ACAGATGCGT
CGTGAAGTGC AGGAGCTGGT GGATGTGCTG CTGAGTACGC CGAACATGGA ACAGCGCACT
GTCGGCATTG GTCGTCTGGA CCCGGAAATC GCTCGCGACT TCAGTAACGT CGGCCCGATG
GTCCGTGCCA GCGGTCACGC CCGCGATACC CGCGCCGATC ACCCGTTTGT TGGTTATGGC
CTGCTGCCAA TGGAAGTCCA CAGCGAGCAG GGCTGCGACG TTATTTCGCG TCTGAAAGTG
CGTATTAACG AAGTCTATAC CGCGCTGAAC ATGATCGACT ACGGTCTGGA TAACCTGCCG
GGCGGCCCGC TGATGGTGGA AGGCTTTACC TACATTCCGC ACCGTTTTGC GCTGGGCTTT
GCCGAAGCGC CGCGCGGCGA TGATATCCAC TGGAGCATGA CCGGCGACAA CCAGAAGCTG
TACCGCTGGC GCTGCCGTGC CGCGACCTAC GCGAACTGGC CGACCCTGCG CTACATGCTG
CGCGGCAACA CCGTTTCTGA TGCGCCGCTG ATTATCGGTA GTCTGGACCC TTGCTACTCC
TGTACCGACC GCATGACCGT GGTCGATGTG CGTAAGAAAA AGAGCAAAGT GGTGCCGTAC
AAAGAACTCG AGCGTTACAG CATTGAGCGT AAAAACTCGC CGCTGAAATA A
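
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. As a minimal sketch, the hypothetical helper below (`gc_percent` is not part of the source database) shows one way to compute a GC percentage from text in this layout. The example call uses only the first 20 bases, so its result is not expected to match the 56% reported for the full gene.

```python
def gc_percent(seq_text):
    """Return the percentage of G and C bases in a whitespace-formatted sequence."""
    # Drop the spaces and newlines used for the 10-base block layout.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100 * gc_count / len(seq)

# First two blocks of the gene sequence above (7 of 20 bases are G or C).
print(gc_percent("GTGATTAAAG AGAGTTTGAG"))  # 35.0
```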
 
Protein sequence
MIKESLSMSE EKLGQHYLAA LNEAFPGVVL DHAWQTKDQL TVTVKVNYLP EVVEFLYYKQ 
GGWLSVLFGN DERKLNGHYA VYYVLSMEKG TKCWVTVRVE VDANKPEYPS VTPRVPAAVW
GEREVRDMYG LIPVGLPDER RLVLPDDWPD ELYPLRKDSM DYRQRPAPTT DAETYEFINE
LGDKKNNVVP IGPLHVTSDE PGHFRLFVDG ENIIDADYRL FYVHRGMEKL AETRMGYNEV
TFLSDRVCGI CGFAHSTAYT TSVENAMGIQ VPERAQMIRA ILLEVERLHS HLLNLGLACH
FTGFDSGFMQ FFRVRETSMK MAEILTGARK TYGLNLIGGI RRDLLKDDMI QTRQLAQQMR
REVQELVDVL LSTPNMEQRT VGIGRLDPEI ARDFSNVGPM VRASGHARDT RADHPFVGYG
LLPMEVHSEQ GCDVISRLKV RINEVYTALN MIDYGLDNLP GGPLMVEGFT YIPHRFALGF
AEAPRGDDIH WSMTGDNQKL YRWRCRAATY ANWPTLRYML RGNTVSDAPL IIGSLDPCYS
CTDRMTVVDV RKKKSKVVPY KELERYSIER KNSPLK