Gene ECH74115_3704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3704 
SymbolhyfB 
ID6968338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3422030 
End bp3424048 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content55% 
IMG OID643387498 
Producthydrogenase 4 subunit B 
Protein accessionYP_002271951 
Protein GI209398249 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.726938 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCCC TGCAATTATT AACCTGGTCG CTGATCCTCT ATCTGTTTGC CAGTCTGGCT 
TCGCTGTTTT TACTCGGTCT GGACAGACTG GCTATTAAGC TTTCCGGCAT CACATCGCTG
GTGGGCGGCG TGATTGGCAT CATCAGCGGA ATTACGCAAT TACATGAAGG CGTAACTTTA
GTTGCCCGTT TTGCCACCCC TTTTGACTTT GCCGATTTAA CCCTGCGAAT GGATAGCCTC
TCGGCATTTA TGGTGCTGGT TATCTCCTTG CTGGTGGTGG TTTGTTCGCT CTATTCATTG
ACTTATATGC GCGAATACGA GGGCAAAGGC GCGGCGGCGA TGGGCTTCTT TATGAATATT
TTCATCGCAT CGATGGTTGC CCTGCTGGTG ATGGACAACG CTTTTTGGTT CATCGTGCTG
TTTGAAATGA TGTCGCTGTC TTCCTGGTTT CTGGTCATTG CCAGGCAGGA TAAAACGTCG
ATCAACGCTG GCATGCTCTA CTTTTTTATC GCCCACGCCG GATCGGTGCT GATTATGATC
GCCTTCTTGC TGATGGGGCG CGAAAGCGGC AGCCTAGATT TTGCCAGTTT CCGCACGCTT
TCACTTTCTC CGGGGCTGGC GTCGGCGGTG TTCCTGCTGG CCTTTTTCGG TTTTGGCGCG
AAAGCCGGGA TGATGCCGTT GCACAGCTGG TTGCCGCGCG CTCACCCTGC CGCACCATCG
CACGCTTCGG CGTTGATGTC TGGCGTAATG GTCAAAATCG GTATTTTCGG CATCCTGAAA
GTGGCGATGG ATCTGCTGGC GCAAACGGGT TTGCCGCTGT GGTGGGGCAT TCTGGTGATG
GCGATCGGCG CAATCTCCGC GCTCCTGGGC GTGCTGTATG CGCTGGCGGA ACAGGATATC
AAACGGCTGC TGGCCTGGAG TACCGTCGAA AACGTCGGCA TTATTTTGCT GGCAGTCGGT
GTGGCGATGG TCGGTCTGTC ACTACACGAC CCGCTGCTCA CCGTGGTTGG ACTGCTCGGC
GCACTGTTTC ATCTGCTCAA CCATGCGCTG TTCAAAGGGC TGCTATTTCT CGGCGCGGGC
GCGATTATTT CGCGTTTGCA TACCCACGAC ATGGAAAAAA TGGGGGCACT GGCGAAACGG
ATGCCGTGGA CAGCCGCAGC ATGCCTGATT GGTTGCCTCG CGATATCAGC CCTTCCTCCG
CTGAATGGTT TTATCAGCGA ATGGTACACC TGGCAGTCGC TGTTCTCACT AAGTCGTGTG
GAAGCCGTAG CGCTACAACT TGCGGGTCCT ATTGCTATGG TGATGCTGGC AGTCACTGGT
GGGCTGGCAG TAATGTGCTT CGTCAAAATG TACGGTATTA CTTTCTGTGG TGCGCCGCGC
AGTACACACG CTGAAGAGGC ACAGGAAGTG CCAAATACGA TGATCGTCGC CATGCTACTG
CTCGCGGCAC TCTGCGTATT AATTGCGCTT AGTGCCAGTT GGCTGGCACC GAAGATAATG
CATATTGCCC ATGCGTTTAC CAATACCCCT CCCGTCACTG TCGCCAGCGG AATAGCACTT
GTACCCGGCA CGTTTCATAC ACAGGTCACC CCCTCATTAC TGTTGCTGTT ACTACTGGCG
ATGCCTTTGC TGCCTGGCCT TTACTGGCTG TGGTGTCGTT CGCGCCGCGC AGCGTTTCGT
CGCACAGGAG ATGCCTGGGC ATGCGGCTAC GGCTGGGAAA ATGCGATGGC CCCGTCAGGC
AATGGCGTGA TGCAGCCGCT GCGTGTGGTC TTCTCTGCGC TATTTCGTCT ACGACAACAG
CTCGACCCTA CGCTGAGGCT AAATAAAGGT CTTGCGCACG TCACCGCCAG GGCTCAGAGC
ACAGAACCCT TCTGGGATGA GCGGGTGATC CGCCCCATCG TGAGCGCCAC CCAACGGCTG
GCCAAAGAAA TACAGCATCT GCAAAGCGGC GACTTTCGTC TCTATTGCCT GTATGTGGTC
GCCGCACTGG TTGTGCTGCT AATCGCTATT GCCGTCTAA
 
Protein sequence
MDALQLLTWS LILYLFASLA SLFLLGLDRL AIKLSGITSL VGGVIGIISG ITQLHEGVTL 
VARFATPFDF ADLTLRMDSL SAFMVLVISL LVVVCSLYSL TYMREYEGKG AAAMGFFMNI
FIASMVALLV MDNAFWFIVL FEMMSLSSWF LVIARQDKTS INAGMLYFFI AHAGSVLIMI
AFLLMGRESG SLDFASFRTL SLSPGLASAV FLLAFFGFGA KAGMMPLHSW LPRAHPAAPS
HASALMSGVM VKIGIFGILK VAMDLLAQTG LPLWWGILVM AIGAISALLG VLYALAEQDI
KRLLAWSTVE NVGIILLAVG VAMVGLSLHD PLLTVVGLLG ALFHLLNHAL FKGLLFLGAG
AIISRLHTHD MEKMGALAKR MPWTAAACLI GCLAISALPP LNGFISEWYT WQSLFSLSRV
EAVALQLAGP IAMVMLAVTG GLAVMCFVKM YGITFCGAPR STHAEEAQEV PNTMIVAMLL
LAALCVLIAL SASWLAPKIM HIAHAFTNTP PVTVASGIAL VPGTFHTQVT PSLLLLLLLA
MPLLPGLYWL WCRSRRAAFR RTGDAWACGY GWENAMAPSG NGVMQPLRVV FSALFRLRQQ
LDPTLRLNKG LAHVTARAQS TEPFWDERVI RPIVSATQRL AKEIQHLQSG DFRLYCLYVV
AALVVLLIAI AV