Gene ECH74115_4308 details


Gene Information

Locus tag: ECH74115_4308
Symbol: hybA
ID: 6970299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3987590
End bp: 3988576
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 55%
IMG OID: 643388037
Product: hydrogenase 2 protein HybA
Protein accession: YP_002272475
Protein GI: 209398622
COG category: [C] Energy production and conversion
COG ID: [COG0437] Fe-S-cluster-containing hydrogenase components 1
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
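
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 3987590
end_bp = 3988576

# An inclusive span covers both endpoints, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 987 bp
print(protein_length, "aa")   # 328 aa
```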


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACAGAC GTAATTTTAT TAAAGCAGCC TCCTGCGGGG CATTGCTGAC GGGCGCGCTG 
CCGTCTGTCA GTCATGCGGC TGCTGAAAAC CGCCCGCCAA TTCCGGGATC GCTGGGGATG
TTGTACGACT CGACCTTGTG CGTAGGCTGC CAGGCTTGCG TCACCAAGTG TCAGGATATC
AACTTCCCTG AACGTAACCC GCAAGGGGAA CAGACCTGGT CGAACAACGA CAAACTGTCG
CCGTATACCA ATAACATCAT TCAAGTGTGG ACCAGCGGCA CAGGGGTCAA CAAAGACCAG
GAGGAGAACG GCTACGCGTA CATTAAGAAA CAGTGTATGC ACTGCGTCGA TCCGAACTGT
GTCTCTGTGT GCCCGGTCTC TGCACTGAAA AAAGATCCGA AAACCGGCAT TGTCCATTAC
GACAAAGACG TGTGCACCGG TTGCCGTTAC TGCATGGTCG CCTGTCCGTA CAACGTGCCG
AAGTACGACT ACAACAACCC GTTTGGTGCG CTGCATAAGT GCGAGCTGTG CAACCAGAAA
GGTGTGGAAC GTCTCGATAA AGGCGGTCTG CCTGGCTGCG TAGAAGTGTG CCCGGCGGGC
GCGGTGATTT TTGGTACGCG TGAAGAGCTG ATGGCGGAGG CGAAAAAACG TCTGGCGCTG
AAGCCTGGCA GCGAATACCA CTATCCGCGT CAGACGCTGA AATCTGGCGA CACTTACCTG
CATACGGTGC CGAAATATTA TCCGCATCTG TACGGCGAGA AAGAGGGCGG CGGTACTCAG
GTTCTGGTAC TGACGGGTGT GCCTTATGAA AATCTCGACC TGCCGAAACT GGACGATCTT
TCTACCGGTG CGCGTTCCGA AAATATTCAA CACACCCTGT ATAAAGGCAT GATGCTACCA
CTGGCTGTGC TGGCGGGCTT GACCGTGCTG GTTCGTCGCA ACACCAAAAA CGACCATCAC
GACGGAGGAG ACGATCATGA GTCATGA
 
Protein sequence
MNRRNFIKAA SCGALLTGAL PSVSHAAAEN RPPIPGSLGM LYDSTLCVGC QACVTKCQDI 
NFPERNPQGE QTWSNNDKLS PYTNNIIQVW TSGTGVNKDQ EENGYAYIKK QCMHCVDPNC
VSVCPVSALK KDPKTGIVHY DKDVCTGCRY CMVACPYNVP KYDYNNPFGA LHKCELCNQK
GVERLDKGGL PGCVEVCPAG AVIFGTREEL MAEAKKRLAL KPGSEYHYPR QTLKSGDTYL
HTVPKYYPHL YGEKEGGGTQ VLVLTGVPYE NLDLPKLDDL STGARSENIQ HTLYKGMMLP
LAVLAGLTVL VRRNTKNDHH DGGDDHES
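
The protein sequence corresponds to the gene sequence read with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below shows one way to reproduce that translation in plain Python. The function name `translate_cds` and its `first_codon_is_start` parameter are hypothetical, and forcing the first codon to methionine is a simplification of how alternative start codons such as GTG are handled.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base groups used in
    the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if i == 0 and first_codon_is_start:
            # Assumption: the first codon is an initiator (here GTG),
            # which is read as methionine rather than valine.
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGAACAGAC GTAATTTTAT TAAAGCAGCC"))  # MNRRNFIKAA
```

Passing the last line of the gene sequence with `first_codon_is_start=False` gives `DGGDDHES`, which matches the end of the listed protein; translation stops at the terminal TGA codon.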