Gene EcSMS35_2629 details


Gene Information

Locus tag: EcSMS35_2629
Symbol: hyfB
ID: 6142811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2686098
End bp: 2688116
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 56%
IMG OID: 641617500
Product: hydrogenase 4 subunit B
Protein accession: YP_001744665
Protein GI: 170680316
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit
TIGRFAM ID:
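
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that fits the numbers given here) and that the protein length excludes the stop codon. The function and variable names are illustrative and are not part of the record.

```python
# Values copied from the record; names are illustrative only.
start_bp = 2686098
end_bp = 2688116


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2019 672
```

Both results match the Gene Length and Protein Length fields above.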


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCCC TGCAATTATT AACCTGGTCG CTGATCCTCT ATCTGTTTGC CAGTCTGGCT 
TCGCTGTTTT TACTCGGTCT GGACAGACTG GCTATTAAGC TTTCCGGCAT CACATCGCTG
GTGGGCGGCG TGATTGGCAT CATCAGCGGA ATTACGCAAT TACATGCAGG CGTAACTTTA
GTCGCCCGTT TTGCCACGCC TTTTGACTTT GCCGATTTAA CCCTGCGAAT GGATAGCCTC
TCGGCATTTA TGGTGCTGGT TATCTCCTTG CTGGTGGTGG TTTGTTCGCT CTATTCATTG
ACTTATATGC GCGAATACGA GGGCAAAGGC GCGGCGGCGA TGGGCTTCTT TATGAATCTT
TTCATCGCCT CGATGGTTGC CCTGCTGGTG ATGGATAACG CTTTTTGGTT CATCGTGCTG
TTTGAAATGA TGTCGCTGTC TTCCTGGTTT CTGGTCATTG CCAGGCAGGA TAAAACGTCA
ATCAACGCTG GCATGCTCTA CGTTTTTATC GCCCACGCCG GATCGGTGCT GATTATGATC
GCCTTCTTGC TGATGGGGCG CGAAAGCGGC AGCCTCGATT TTGCCAGTTT CCGCACGCTT
TCACTTTCTC CGGGGCTGGC GTCGGCGGTG TTCCTGCTGG CCTTCTTCGG TTTTGGCGCG
AAAGCCGGGA TGATGCCGTT GCACAGCTGG TTGCCGCGCG CTCACCCTGC CGCACCATCG
CACGCTTCGG CGTTGATGTC TGGCGTAATG GTCAAAATCG GTATTTTCGG CATCCTGAAA
GTGGCGATGG ATCTGCTGGC GCAAACGGGT TTGCCGCTGT GGTGGGGCAT TCTGGTGATG
GCGATCGGCG CAATCTCCGC GCTCCTGGGC GTGCTGTATG CGCTGGCGGA ACAGGATATC
AAACGGCTGC TGGCCTGGAG TACCGTCGAA AACGTCGGCA TTATTTTGCT GGCGGTCGGT
GTGTCCATGG TCGGTCTGTC ACTGCACGAC CCGCTGCTCA CCATGGTTGG ACTGCTCGGC
GCACTGTTTC ATCTGCTCAA CCATGCGCTG TTCAAAGGGC TGCTATTTCT CGGCGCGGGT
GCGATTATTT CGCGTTTGCA TACCCACGAC ATGGAAAAAA TGGGGGCACT GGCAAAACGG
ATGCCGTGGA CAGCCGCAGC ATGCCTGATT GGTTGCCTCG CGATATCAGC CATTCCTCCG
CTGAATGGTT TTATCAGCGA ATGGTACACC TGGCAGTCGC TGTTCTCACT AAGTCGTGTG
GAAGCCGTAG CGCTACAACT TGCGGGTCCT ATTGCTATGG TGATGCTGGC AGTCACTGGT
GGGCTGGCAG TAATGTGCTT CGTCAAAATG TACGGCATTA CTTTCTGTGG TGCGCCGCGC
AGTACACACG CTGAAGAGGC ACAGGACGTG CCAAATACGA TGATCGTCGC CATGCTACTG
CTCGCGGCAC TCTGCGTATT CATTGCGCTT AGTGCCAGTT GGCTGGCACC GAAGATAATG
CACATTGCCC ATGCGTTTAC CAATACCCCT CCCGTCACTG TCGCCAGCGG AATAGCACTT
GTACCCGGCA CGTTTCATAC ACGGGTCACT CCCTCATTAC TGTTGCTGTT ACTACTGGCG
ATGCCTTTGC TGCCTGGCCT TTACTGGCTG TGGTGTCGTT CGCGCCGCGC AGCGTTTCGT
CGCACAGGAG ATGCCTGGGC ATGCGGCTAC GGCTGGGATA ATGCGATGGC CCCGTCAGGC
AATGGCGTGA TGCAGCCGCT GCGTGTGGTC TTTTGTGCGC TATTTCGTCT ACGACAACAG
CTCGACCCTA CGCTGAGGCT GAACAAAGGT CTTGCGCACG TCACCGCCAG GGCTCAGAGC
ACAGAACCCT TCTGGGATGA GCGGGTGATC CGCCCCATCG TGAGCGCCAC CCAACGGCTG
GCCAAAGAAA TACAGCATCT GCAAAGCGGC GACTTTCGTC TCTATTGCCT GTATGTGGTC
GCCGCACTGG TTGTGCTGCT AATCGCTATT GCCGTCTAA
 
Protein sequence
MDALQLLTWS LILYLFASLA SLFLLGLDRL AIKLSGITSL VGGVIGIISG ITQLHAGVTL 
VARFATPFDF ADLTLRMDSL SAFMVLVISL LVVVCSLYSL TYMREYEGKG AAAMGFFMNL
FIASMVALLV MDNAFWFIVL FEMMSLSSWF LVIARQDKTS INAGMLYVFI AHAGSVLIMI
AFLLMGRESG SLDFASFRTL SLSPGLASAV FLLAFFGFGA KAGMMPLHSW LPRAHPAAPS
HASALMSGVM VKIGIFGILK VAMDLLAQTG LPLWWGILVM AIGAISALLG VLYALAEQDI
KRLLAWSTVE NVGIILLAVG VSMVGLSLHD PLLTMVGLLG ALFHLLNHAL FKGLLFLGAG
AIISRLHTHD MEKMGALAKR MPWTAAACLI GCLAISAIPP LNGFISEWYT WQSLFSLSRV
EAVALQLAGP IAMVMLAVTG GLAVMCFVKM YGITFCGAPR STHAEEAQDV PNTMIVAMLL
LAALCVFIAL SASWLAPKIM HIAHAFTNTP PVTVASGIAL VPGTFHTRVT PSLLLLLLLA
MPLLPGLYWL WCRSRRAAFR RTGDAWACGY GWDNAMAPSG NGVMQPLRVV FCALFRLRQQ
LDPTLRLNKG LAHVTARAQS TEPFWDERVI RPIVSATQRL AKEIQHLQSG DFRLYCLYVV
AALVVLLIAI AV
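
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which start codons are permitted), and it ignores alternative start codons because this gene begins with ATG. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (the layout NCBI uses for
# its genetic code tables); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGATGCCC TGCAATTATT AACCTGGTCG"
print(translate(first_blocks))  # MDALQLLTWS
```

The output matches the first ten residues of the protein sequence. Passing the whole gene sequence to `translate` should yield the full 672-residue protein under the same assumptions.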