Gene Sfum_2952 details


Gene Information

Locus tag: Sfum_2952
Symbol:
ID: 4458712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Syntrophobacter fumaroxidans MPOB
Kingdom: Bacteria
Replicon accession: NC_008554
Strand:
Start bp: 3642823
End bp: 3644463
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 61%
IMG OID: 639703724
Product: nickel-dependent hydrogenase, large subunit
Protein accession: YP_847061
Protein GI: 116750374
COG category: [C] Energy production and conversion
COG ID: [COG0374] Ni,Fe-hydrogenase I large subunit
TIGRFAM ID:
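
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3642823
END_BP = 3644463


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one untranslated terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1641 546
```

Under these assumptions the computed values agree with the Gene Length (1641 bp) and Protein Length (546 aa) fields.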


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.437472
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACAGC GAGTAGTCAT CGACCCGATC ACCCGTATCG AGGGTCATCT GCGTATAGAA 
GTGGAAGTGA CCAACGGCAA GGTGTCCAAC GCCTGGAGCA GCTCGACGCT CTTCCGCGGC
CTCGAGATCA TGCTCCAGGG ACGCGATCCC CGCGACGCCT ACCTGTTCAC CCAGAGAGCC
TGCGGCGTGT GCACTTACGT TCACGGCCTG GCATCGGTCC GCGCGGTGGA CGATGCGGCC
AAGATCACGG TGCCCGAGAA TGCACGACTG ATTCGTAACC TGCTCCTCGG CGCCCAGTTC
CTTCATGACC ACATTGTGCA CTTTTATCAC CTGCACGCCC TCGACTGGGT GGATGTGGTC
AGCGCGCTCA AGGCCGATCC CAAGAAGACG GCGGATTTGG CCGCAAAGGT CTCGCCCGCC
CCGGGCAGCG GCATCAGCGA CTTCAAGGAA ACTCAGGGCC GCGTGAAAAA ACTGGTCGAA
AGCGGTCAGT TGGGCATCTT CGCCAACGGT TACTGGGGGC ATCCGGCGTA CAAGCTGCCC
CCCGAGGTCA ACCTGCTGGC GGTCGCGCAT TACCTGCAAG CCTTGCGGCA GCAGGCCCGC
ACCGCTCGAA TGCACGCCAT CTTCGGCGGC AAGAACCCGC ATGTCCAGAG CCTCGTGGTC
GGCGGCGTCA CCTGCGCCAC CGATCTGACC CCGGATCGCC TCGCCGAATT CAAGTACCTT
TACAAAGAGA CCATGGATTT CGTCAAGCAG TACTACATCC CCGATCTGAA AGCCGTGGCG
GGCTTTTACA AAGACTGGGG CAAGATCGGC GGAACCAAGA ATTTCCTCGT CTACGGTGAA
TTCCCTCAGA GCGACAAGGA ACCCGACAGC TTCCTCTTCC CGCGGGGCGC CATTTTCAAG
CGCAACATCG GCCAGGTGCA GGCGGTCGAT ATGGCCCAGG TGCAGGAGCA CGTGAAGCGC
AGCTGGTACG AAGGCGACAA GGCGCTGCAT CCTTCCGAGG GTGAAACCAA GCCCAAGTAC
GAGGCCCTCG ACGTCGAAAA GCGGTACAGC TGGATGAAGG CCCCGCGCTA CAAGGGCGAA
CCCATGGAGG TCGGGCCGCT GGCGCGCGTG CTGGTGGCCT ACGGCAAGGG TCACGCCCCG
ACCAAGAAAG CCGTGGACGG CCTGTTGAAA GAGTTGGGCG TTCCCGTGGA CGCTCTCTTC
TCGACGCTCG GCAGGACTGC GGCCAGAGGT CTGGAAACGG CGATCATCGG CGATTCCATG
GAAGGCTGGC TGAACCAGTT GATCGAGAAC GTCGGCAAAG GCAATACCAA GATATACCAG
GATTACCAGA TGCCCGCCGA AGCCATGGGA GCGGGTCTGA ACGACGTTCC CCGCGGCGCG
CTCGGGCACT GGGTCCAGAT CAAGGACCAG AAGATCGCCA ACTTCCAGCT GGTCGTTCCG
TCCACCTGGA ACCTCGGTCC GCGTTGCGCG CAAAACAAGC CGGGACCGGT CGAGGAAGCC
CTGATGGGAA CCCCGGTGGC CGATCCGAAG CGTCCGGTGG AAATCCTGCG CACCGTTCAT
TCCTTTGACC CGTGCATCGC GTGCGGCGTC CATGTGATCG ATCCGAAGTC CAACGAGGTC
TACAAGTTCA GGGTTGTGTA G
 
Protein sequence
MGQRVVIDPI TRIEGHLRIE VEVTNGKVSN AWSSSTLFRG LEIMLQGRDP RDAYLFTQRA 
CGVCTYVHGL ASVRAVDDAA KITVPENARL IRNLLLGAQF LHDHIVHFYH LHALDWVDVV
SALKADPKKT ADLAAKVSPA PGSGISDFKE TQGRVKKLVE SGQLGIFANG YWGHPAYKLP
PEVNLLAVAH YLQALRQQAR TARMHAIFGG KNPHVQSLVV GGVTCATDLT PDRLAEFKYL
YKETMDFVKQ YYIPDLKAVA GFYKDWGKIG GTKNFLVYGE FPQSDKEPDS FLFPRGAIFK
RNIGQVQAVD MAQVQEHVKR SWYEGDKALH PSEGETKPKY EALDVEKRYS WMKAPRYKGE
PMEVGPLARV LVAYGKGHAP TKKAVDGLLK ELGVPVDALF STLGRTAARG LETAIIGDSM
EGWLNQLIEN VGKGNTKIYQ DYQMPAEAMG AGLNDVPRGA LGHWVQIKDQ KIANFQLVVP
STWNLGPRCA QNKPGPVEEA LMGTPVADPK RPVEILRTVH SFDPCIACGV HVIDPKSNEV
YKFRVV
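
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content can be recomputed the same way. The sketch below is illustrative rather than the database's own method. It assumes the sequence is pasted with the 10-character grouping shown in this record, and it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence as displayed in this record.
first = "ATGGGACAGC GAGTAGTCAT CGACCCGATC ACCCGTATCG AGGGTCATCT GCGTATAGAA"
last = "TACAAGTTCA GGGTTGTGTA G"
print(translate(first))  # prints: MGQRVVIDPITRIEGHLRIE
print(translate(last))   # prints: YKFRVV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence block and the GC content field.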