Gene EcSMS35_3282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3282 
SymbolhybA 
ID6144588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3359479 
End bp3360465 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content55% 
IMG OID641618112 
Producthydrogenase 2 protein HybA 
Protein accessionYP_001745262 
Protein GI170679790 
COG category[C] Energy production and conversion 
COG ID[COG0437] Fe-S-cluster-containing hydrogenase components 1 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAGAC GTAATTTTAT TAAAGCAGCC TCCTGCGGGG CATTGCTGAC GGGCGCGCTG 
CCGTCTGTCA GTCATGCGGC TGCTGAAAAC CGCCCGCCAA TTCCGGGATC GCTGGGGATG
TTGTACGACT CGACCTTGTG CGTTGGCTGC CAGGCGTGCG TCACCAAGTG TCAGGATATC
AACTTCCCTG AACGTAACCC GCAAGGGGAA CAGACCTGGT CGAACAACGA CAAACTGTCG
CCGTACACCA ATAACATCAT TCAGGTGTGG ACCAGCGGCA CAGGGGTCAA CAAAGACCAG
GAGGAGAATG GCTACGCGTA CATTAAGAAA CAGTGTATGC ATTGCGTCGA TCCGAACTGT
GTCTCTGTGT GCCCGGTCTC TGCACTAAAA AAAGATCCGA AAACCGGCAT TGTCCATTAC
GACAAAGATG TGTGCACCGG CTGCCGTTAC TGCATGGTCG CCTGCCCATA CAATGTGCCG
AAGTACGACT ACAACAACCC GTTTGGTGCG CTGCATAAGT GCGAGCTGTG CAACCAGAAA
GGTGTGGAAC GTCTCGATAA AGGCGGTCTG CCTGGCTGCG TAGAAGTGTG CCCGGCGGGC
GCGGTGATTT TCGGTACGCG TGAAGAGCTG ATGGCGGAGG CGAAAAAACG TCTGGCGCTG
AAGCCTGGCA GCGAATACCA CTATCCGCGT CAGACGCTGA AATCTGGCGA CACTTACCTG
CATACGGTGC CGAAATATTA TCCGCATCTG TACGGCGAGA AAGAGGGCGG CGGTACTCAG
GTTCTGGTAC TGACGGGTGT GCCTTATGAA AATCTCGACC TGCCGAAACT GGACGATCTT
TCTACCGGTG CGCGTTCCGA AAATATTCAA CACACCCTGT ATAAAGGCAT GATGCTACCA
CTGGCTGTGC TGGCGGGCTT AACCGTGCTG GTTCGTCGCA ACACCAAAAA CGACCATCAC
GACGGAGGAG ACGATCATGA GTCATGA
 
Protein sequence
MNRRNFIKAA SCGALLTGAL PSVSHAAAEN RPPIPGSLGM LYDSTLCVGC QACVTKCQDI 
NFPERNPQGE QTWSNNDKLS PYTNNIIQVW TSGTGVNKDQ EENGYAYIKK QCMHCVDPNC
VSVCPVSALK KDPKTGIVHY DKDVCTGCRY CMVACPYNVP KYDYNNPFGA LHKCELCNQK
GVERLDKGGL PGCVEVCPAG AVIFGTREEL MAEAKKRLAL KPGSEYHYPR QTLKSGDTYL
HTVPKYYPHL YGEKEGGGTQ VLVLTGVPYE NLDLPKLDDL STGARSENIQ HTLYKGMMLP
LAVLAGLTVL VRRNTKNDHH DGGDDHES