Gene Dhaf_4271 details

Gene Information

Locus tag: Dhaf_4271
Symbol:
ID: 7261296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 4520720
End bp: 4523770
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 50%
IMG OID: 643564188
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_002460712
Protein GI: 219670277
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
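
The Start bp, End bp, Gene Length and Protein Length values in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4520720
end_bp = 4523770
reported_gene_length = 3051      # bp
reported_protein_length = 1016   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 3051 0 1016
```

Under these assumptions the computed values agree with the reported Gene Length (3051 bp) and Protein Length (1016 aa).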


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.12828
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGTAA CTCGTCGTGG GTTCCTTAAG CTTTCTGGGG CATCCCTGTT TGCTTTGGCT 
TCCGGTCTTG GCTTTGATCC ACAGATCGCC CAAGCCCAGG GATTTAGTCT TAGAATTGAA
GGAACAACTA AGATTCCTAG CATTTGCCAT TTCTGTTCCG GCGGGTGCGG TTTGCTGCTT
CATATTAAAG ACGAAAAACT CGTTTATCTG GATGGAGACC CGGATAATCC GGTGAATTCC
GGTGCCTTGT GTCCGAAAGG GGCCAGTTTG GGTCATGTGG CCAATGCCAA GGACCGTGTC
ACAAAGCCCA GATACAGAGC TCCCGGGTCC AGCGATTGGC AGGATATTTC CTGGGATGAA
GCGATTAATA AGATCGCCTC AAAGATCAAG GAAGTTCGTG ACACAACCTG GATGGCTACG
GAAGAAATTG GTGGTACAGC CTACAATGTA AACCGCGCTG ATGGCATTGC CGTTCTCGGT
TCAGCCGAAG TGGATAATGA AGAATCCTAC CTGATCAAGA AACTGTCCGA GCTCATCGGA
ACACCTTATA ACGAACACCA GGCCCGGATA TGACACGCTC CCACGGTGGC AAGTTTGTCA
CCTTCATTTG GCCGCGGAGC TATGACCAAT TCCTGGACGG ATATGCAAAA CACGAAATGC
TTCCTGATTG CGGGCAGCAA CTGTGCGGAG AATCATCCCA TAGCTATGCG TTGGATCAAT
AAAGCCAAAG AAAACGGCGC CAAGGTGATC GTGGTGGATC CGCGGTTTAC CCGTACCGCC
TCCCAAGCAG ATATCTTTGC CCAGGTTCGT CCCGGTGCGG ATATTGCTTA TTTAAATGCA
ATCATCAATT ATATCTTAGA GAACAAACTC TATGATCAAG ACTATGTACT CAACCATACC
AATGGGCTGT ACAAGATCAG TAAAGACTTT AAATTTGCCG ATGGACTGTT TTCCGGATTT
GATCCCGAGA CCAAAAAGTA TAATTTTGAC AGCTGGGCTT ATCAGCTTGA TGCCGAAAAC
AAACCGGTGA AAGCGGAAAG TCTGGATGAT CCTGATTGCG TATTTGGCAA GCTTAAAGAG
CATTTCTCCC GTTACACTCT GGAAGTAGGG GCCGATATCA GCGGTATACC TGCTGAAAAA
ATTAAAGAAA TTGCTGATAC CTTCTGCAAT ACCAGACCAG GCAGCATTCT TTATGCACTG
GGCATGACCC AACACACCAC CGGCGTTCAG GGAATCCGCA GTTACGCGAT CATCCAGCTG
CTTTTGGGCA ACGTAGGTAA AGCAGGCAGC GGCATCCAAG CCTTGCGTGG TGAGCCCAAT
GTTCAAGGCT CCACCGATAT GGCCAATCTC TTTAATAACC TGCCGGGTTA CTTGCCGGCC
CCGGTTCATA CGGATAAGGA TCTGCGCAGT TATCTGGTGC GCAGCGGCTC GGCTTTTGAA
AGGCATATTA TCTCTCAGCT CAAAGCTTGG TTTGGCGAAA ATGCCACCAA AGAAAATGAT
TATTGCTTTA ATTACCTGCC TAAGTACAAC TCAGGCAAGA ACTACAGCAT GGTAAAACTT
TGGGAAGCTG CCAATAGCGG ACAATTCAAA ATGCTGCTTA ATTTTGGCTC GAATTCCATG
GTATCCATCC CTAACCGGCA AATCGTCCGT GAAGGCTTGG CAAAGCTGGA TATGCTGGTC
ATCGCGGATG TTTATGAGGT TGAAACTGCC CAATTCTGGC GTGAAAAGGA TCCGACCACA
GGCGAGCTCC TGGTCAATCC GGCCAAAATC AACACAGAAG TCATCCTGCT TCCCGCAGCT
TTCGTCTATG AAAAAGGGGG CACGCTATCC AACTCCGGCC GCTGGATTCA GTGGAAGGAT
GCTGCTCTGA AGCCGCCAGG GGAAGCTAAG CCTGACCTGG ATATTCTGGA TCATATCTAC
CATAAACTTA AAGAACTTTA TGCCGGCAGT ACCGATCCTA AGGATGAACC CATTCTCAAG
GCCCGGTGGG ACTATGGTCA TGAACCGGAT CCGCTGAAAG TTCTCCAGGA GATCAGTGGT
TATGATGAAA CAACTGGTAA AGTGCTGCCT ACCCTGGCTG ATTATTTAAA AGCTCCTATC
GGCTCAGCTT CGTCAGGCTG TTGGATTTAT GCCGGTGTTA CAGGCAATGG CAACCTGGCT
GCCCGCCGGG ACAACAGTGA TCCCTCGGGG CTTGGTCTAT ACCGCAATTG GAGCTTCTCC
TGGCCGGGTA ACATCCGCAT CCTTTATAAT CGCGGCTCCT GTGATATGAA CGGTCAGCCT
CTGGATGAGA ACCGCAAGCT GATTTGGTGG GATGCGGCCA AGAATTCCTG GGAAGGCAAT
GACGGTGCCG ATGTGCCGGA CAAAACCAAA GGCCCGGATA CCCCGGAAGG GAAACAGGTT
TTCCGCATGA ATCCTGAAGG AGTAGGCCGC TTATTCACTG CGAAATATTT CAGTGGAATT
CCTGCCACAC CTGCAGCAGA TGGCTTGCCC CATATTGGCG TTAGACCGGC CGGTCAATGT
AATGACGGTC CTTTGCCGGA GTTTTATGAG CCGGTGGAAA GTCCGACGGT CAATAGTCTG
CATCCGGATG TAAGCTCTAA TCCTACCGTG CCCATCCCGA ACTTCCTGCC TGGTGTTACG
AACCATGGCA GCAAGGAAGA TTTCCCTTAT GTACTAACGA CCTATGCCTT AGTTGAGCAT
TTCTGCGCCG GCGGGATCAC GAGAAATATC CCCATGCTCA ATGAGCTGAT GCCTCAGCCC
TTTGCCGAGA TCAGCAAAAA TCTGGCCCAA AAGATCGGAG TCAAAGAAGG GGACATGGTA
GAAGTATCTT CTGCCCGTGG CAAAGTTCAA GTAGTCGCCC TGGTAACGGA CAGAATTCAG
ACCTTAAAGA TCAACGGTCA GGATTCCGAA ACCATTGGTA TGCCTTGGAG TTGGGGCTTC
GCATCCCTAA GTCCCGGACC GACGACCAAC AACCTGACCA TCAGCGCCAT CGATCCCACT
GCAGGCACCC CGGAATATAA ATGCTGCCTG GTCAACATAA GGAGGGCGTA G
 
Protein sequence
MEVTRRGFLK LSGASLFALA SGLGFDPQIA QAQGFSLRIE GTTKIPSICH FCSGGCGLLL 
HIKDEKLVYL DGDPDNPVNS GALCPKGASL GHVANAKDRV TKPRYRAPGS SDWQDISWDE
AINKIASKIK EVRDTTWMAT EEIGGTAYNV NRADGIAVLG SAEVDNEESY LIKKLSELIG
TPYNEHQARI UHAPTVASLS PSFGRGAMTN SWTDMQNTKC FLIAGSNCAE NHPIAMRWIN
KAKENGAKVI VVDPRFTRTA SQADIFAQVR PGADIAYLNA IINYILENKL YDQDYVLNHT
NGLYKISKDF KFADGLFSGF DPETKKYNFD SWAYQLDAEN KPVKAESLDD PDCVFGKLKE
HFSRYTLEVG ADISGIPAEK IKEIADTFCN TRPGSILYAL GMTQHTTGVQ GIRSYAIIQL
LLGNVGKAGS GIQALRGEPN VQGSTDMANL FNNLPGYLPA PVHTDKDLRS YLVRSGSAFE
RHIISQLKAW FGENATKEND YCFNYLPKYN SGKNYSMVKL WEAANSGQFK MLLNFGSNSM
VSIPNRQIVR EGLAKLDMLV IADVYEVETA QFWREKDPTT GELLVNPAKI NTEVILLPAA
FVYEKGGTLS NSGRWIQWKD AALKPPGEAK PDLDILDHIY HKLKELYAGS TDPKDEPILK
ARWDYGHEPD PLKVLQEISG YDETTGKVLP TLADYLKAPI GSASSGCWIY AGVTGNGNLA
ARRDNSDPSG LGLYRNWSFS WPGNIRILYN RGSCDMNGQP LDENRKLIWW DAAKNSWEGN
DGADVPDKTK GPDTPEGKQV FRMNPEGVGR LFTAKYFSGI PATPAADGLP HIGVRPAGQC
NDGPLPEFYE PVESPTVNSL HPDVSSNPTV PIPNFLPGVT NHGSKEDFPY VLTTYALVEH
FCAGGITRNI PMLNELMPQP FAEISKNLAQ KIGVKEGDMV EVSSARGKVQ VVALVTDRIQ
TLKINGQDSE TIGMPWSWGF ASLSPGPTTN NLTISAIDPT AGTPEYKCCL VNIRRA
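
The protein sequence contains a U (selenocysteine) at residue 191, which corresponds to an in-frame TGA codon in the gene sequence. This is consistent with the COG annotation "typically selenocysteine-containing". The sketch below translates a short in-frame fragment of the gene (codons 187 to 194) with the standard codon assignments, which translation table 11 shares, and shows the effect of reading TGA as selenocysteine. The `translate` function and its `recode_tga` flag are hypothetical helpers written for this illustration. Recoding every in-frame TGA is a simplification: it does not model how the cell decides which TGA codons are read as selenocysteine.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str, recode_tga: bool = False) -> str:
    """Translate an in-frame DNA string; optionally read TGA as U (Sec).

    Hypothetical helper: recoding every TGA is a simplification.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if recode_tga and codon == "TGA":
            protein.append("U")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# In-frame fragment copied from the gene sequence above (codons 187-194).
fragment = "CAGGCCCGGATATGACACGCTCCC"

plain = translate(fragment)
recoded = translate(fragment, recode_tga=True)
print(plain)    # prints: QARI*HAP
print(recoded)  # prints: QARIUHAP
```

The recoded result matches the corresponding stretch of the protein sequence (QARI UHAP), while the plain translation stops at the TGA codon.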