Gene Sfum_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_0844 
Symbol 
ID4461049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1046260 
End bp1047990 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content59% 
IMG OID639701606 
Producthydrogenases, Fe-only 
Protein accessionYP_844976 
Protein GI116748289 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG1034] NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G)
[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID[TIGR02512] hydrogenases, Fe-only 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA TATCGCTCAT GATCGACAAA CAGAATGTGT CAGTGCCGGA AGGGTCGACT 
ATCCTCCAGG CGGCCGGGAA GATCGGCGTA AGGATACCGA CTCTTTGTTA CCTGAAGGAC
ATCAACGTCA TCGGAGCCTG CCGGATCTGC ATGGTCCAGG TGCAGGGAGC TCGAACCATG
ATGGCGGCCT GCGTCACTCC GGTGACCGAA GGGATGATCG TGCGCACCAA TACCCCGGAG
GTGATCGCCG CGCGCAGAGT GGTTCTCGAA CTGATTCTCT CCGATCATCC GATGGAGTGT
CTCACGTGCG TCCGAAACCA GAACTGTGAG TTACAGAGAC TGGCGGAGGA ATTCAATATC
AAGGGAATCC GCTTCGACGG CGCCAGGACC GAATACGGCA TCGACGATTC CAGCCCGGCC
ATTCAAAGAG ACCCAAGGAA ATGCATTCTC TGCCGGCGCT GTATTTCGGT TTGTTCCGAA
GTTCAGGGCG TGAAAGCGCT CACCACCGGC CATAGGGGCT TCAACACCCT GATCGCCCCC
GCGTTCAACG AGAGACTTGC CAACGTCGAA TGCGTCCAGT GCGGACAGTG CTCGCTCGTC
TGTCCCACGG GGGCGATTCA CGAAGTGGAC GATACGGAAA AAGTCTGGGC GGCTCTGGCC
GATCCGAAAA AACACGTCGT CGTCCAGACC GCACCGGCAA CGCGGGTCCA GGTGGGAGAG
ACAATGGGGG CAGCCCCCGG CAGCATCGTC ACCGGACAGA TGGTCGCAGG GTTGCGGCGA
CTCGGATTCG ACAAGGTCTT TGACACCGAC TTCACCGCCG ACCTGACCAT CCTGGAGGAA
GGCAACGAGT TGCTTCAGCG CCTCCAAAAC GGCGGCGCGC TTCCCATGAT CACTTCGTGC
AGTCCGGGCT GGATCAAGTT CGCCGAGCAC TTCTACCCCG AGTTGCTGCC CCATCTTTCC
ACCTGCAAGT CGCCGCAGCA GATGTTCGGC GCTCTGGCCA AGACCTATTA CGCTCAGAAG
GCGGGGATTG ATCCGTCCGA CGTCTTTGTC GTGTCCGTCA TGCCCTGCAC GGCCAAAAAA
TTCGAATGCA TTCGACCCGA GATGAGGAGC AGCGGCTACC AGGATGTCGA CGTGGTGCTC
ACTTCGCGTG AGTTGGGCCG CATGTTCAAA CAGGCCGGTC TGAGCATGGA AAACCTGCCG
CCCGAGGAAT ACGACGCTCC GCTGGGGATC TCCACGGGCG CAGGCGAGAT ATTCGGAGCT
TCTGGCGGCG TGATGGAGGC CGCCCTGCGC ACCGTGTACG AGGTCGTCAC GGGCCAATCC
CTGGAAACCA TCGAATTCAA GGAATGCAGG GGGCTCGACG GAGTGAAGGA GGCCACCGTC
CAGGTGGGCG CCCTGCCCGT CCGGGTTGCA ATAACCAATG GGCTTGGCAA TGCGAGGAAG
GTGCTGGAAA AGATCCGGGA AGGCTCCTCG GACCATCATT TCATCGAAAT CATGTGCTGC
CCGGGAGGGT GCGTCGGAGG CGGCGGCTCC CCGATCCCGA CCAACAAGGA GATACGCCTC
AACAGGATCG ATGCCGTGTA CCGGGAAGAC GAGCGAATGG CGCTTCGCAA ATCGCACGAC
AACCCGGCCG TGGCGGAACT CTACAAAGAG TTTCTTGAAA AACCGCTGGG ACACAAATCG
CATCAACTGC TTCACACTCA CTACACCGAA AGGGGATTGA ACAAAACTTG A
 
Protein sequence
MKMISLMIDK QNVSVPEGST ILQAAGKIGV RIPTLCYLKD INVIGACRIC MVQVQGARTM 
MAACVTPVTE GMIVRTNTPE VIAARRVVLE LILSDHPMEC LTCVRNQNCE LQRLAEEFNI
KGIRFDGART EYGIDDSSPA IQRDPRKCIL CRRCISVCSE VQGVKALTTG HRGFNTLIAP
AFNERLANVE CVQCGQCSLV CPTGAIHEVD DTEKVWAALA DPKKHVVVQT APATRVQVGE
TMGAAPGSIV TGQMVAGLRR LGFDKVFDTD FTADLTILEE GNELLQRLQN GGALPMITSC
SPGWIKFAEH FYPELLPHLS TCKSPQQMFG ALAKTYYAQK AGIDPSDVFV VSVMPCTAKK
FECIRPEMRS SGYQDVDVVL TSRELGRMFK QAGLSMENLP PEEYDAPLGI STGAGEIFGA
SGGVMEAALR TVYEVVTGQS LETIEFKECR GLDGVKEATV QVGALPVRVA ITNGLGNARK
VLEKIREGSS DHHFIEIMCC PGGCVGGGGS PIPTNKEIRL NRIDAVYRED ERMALRKSHD
NPAVAELYKE FLEKPLGHKS HQLLHTHYTE RGLNKT