Gene DvMF_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_0643 
Symbol 
ID7172530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp770931 
End bp773969 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content63% 
IMG OID643539143 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_002435068 
Protein GI218885747 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value0.0165738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAGAC GCCAATTCCT GAAGATGTCC TGCGTGCTTG GGGCGGGTGT CGCCCTGTGC 
GGGCTGGGCG TCGACCTCGT GCCCATCGAG GCGTGGGCAG ACGAGATCAA GAAGATCGAC
CGCCTGAAAA GCGCCAGGCA GACCACGTCT GTGTGCTGCT ACTGTTCGGT GGGGTGCGGC
CTGATCTGCG CCACCGACGA GAAGACCGGC AAGATCATCA ACATCGAGGG CGATCCCGAG
CACCCGGTGA GCGAAGGCTC GCTGTGCGCC AAGGGTGCGG GCATGTTCCA GACCACCGAG
GCCAACGAAC ACCGCCTGAC CAAAGTGCTG TACCGCGCCC CCAACTCGGA CAAGTGGGAA
GAAAAGAGCT GGGACTGGGC CATCGACCGC ATCGCGCGGC TGATGAAGGA AGAGCGCGAC
AAGTCCTTCA TCACCCAGAA CGCGGCGGGA CAGGTGGTCA ACCGGCTGGA AACCCTGGCC
CACATGGGCA GTTCCAACCT GGACAACGAG GAATGCTGGA GCATCACCGC CTGGGCGCGG
TCTCTCGGCC TGGTGTACAT AGATCACCAG GCCCGGGTCT GACACGGCCC CACCGTACCG
GCTCTGGCAG AGTCGTTCGG ACGCGGCGCG ATGACCAATC ACTGGATCGA CATCCAGCAC
AGTGATTGCA TTCTCATCCA GGGCAGCAAC GCTGCCGAAA ACCACCCCAT CTCCTTCAAG
TGGGTGATGC GCGCCAAGGA AAGGGGCGCC AAGCTGATCC ACGTGGACCC GCGCTTCACG
CGGACCTCGT CCAAGGCCGA TTTCTACGCC CCCCTGCGCT CGGGCACGGA CATCGCCTTC
CTTGGCGGGA TGATCAAGTA CATCATCGAC AACAACAGGA TATTCCACGA CTACGTGGTC
AACTACACCA ACGCCCCGTT CCTGGTGGAC GGCAAGTTCG GCTTCAAGGA CGGCCTGTTC
ACCGGCTACG ACGCGGACAA GCGCGGCTAC GACAAGGCCA CCTGGACCTA CCAGAAGGAT
GCCAATGGCA TCATCCTGCG CGACGAGACG CTGAAGAACC CGCGTTGCGT CTACCAGATG
CTGAAGGCGC ACTACGCCCG CTACGACCTG AAGAAGGTCT CTTCCATCAC CGGCACGCCC
GAAAAGGACC TGAAGACCGT GTACGAGATG TTCTCGTCCA CCGGCACCAA GGACCGTGCG
GGCACCGTCA TGTACGCCCT GGGGCAGACC CACCACACCG TGGGGGTGCA GAACATCCGC
GCCCTGGCCA TCGTCCAGCT GCTGCTGGGC AACATCGGGG TGTGCGGCGG CGGCGTCAAC
GCACTGCGCG GCGAACCCAA CGTGCAGGGA TCCACGGACC ACGCCCTGCT GTACCACTAC
CTGCCGGGCT ACCTTTCGGC CCCGCGCACC TCGCAGCAGA CGCTGCAATC GTACATCAAG
AAGATCACCC CCTCGACCAC GGAAGCCAAA TCCGTCAACT GGCAGTCCAA CTACGGCAAG
TACGCGGTAA GCCTGCTGAA GTCGTGGTAC GGCGATGCCG CCACCAAGGA CAACGAGTTC
GGCTACGCCT GGGTGCCCAA GGCCGACGAC GGCAAGGACT ACTCGGTCAT GACCCTGTTC
GACGTGATGT ACGCCGGCAA GATCAAGGGC ATGACCGTGT TCGGCCAGAA CCCCGCGTGC
AGCATCCCCA ACTCGAACAA GGTGCGCGCC GCCTTCGCCA AGCTGGACTG GATGGTGCAC
CTGAACATCT TCGACAACGA GACGGCGTCA TTCTGGAAGG GGCCGGGCAT GGACCCCGCC
AAGGTGAAAA CCGAGGTCTT CCTGCTGCCC GCCTCTGCCT CCGTCGAAAA GGCCGGCAGC
CAGACCAACA GCGGCCGCTG GATACAGTGG CGCTACGAGG CCACCAAGGC TCCCGGCGAC
TGCATCTACG CGGGCGAGGC CATCATCCGC ATCCAAAACA AGCTGAAGGA ACTGTACAGC
AAGGAAGGCG GCACCTTCCC CGACCCCATC CTGAACATGA CCTCCGACGG TCTGGCAGAA
AAGGGCGGCT ACGACGCGGA AAAGGTGGCC AAGCTCATCA ACGGCTACTT CCTTGCCGAC
GTGGAGATAG GCGGCAAGCA GTACAAGAAG GGAGACTGCG TGCCCTCGTT CGCCCTGTTG
CAGGCCGACG GCTCCACCTC GTCGGGCAAC TGGATCTGCG CGGGCAGCTT CAGCCAGGAT
GGCAAGAACC TGATGCAGCG ACGCGGCAAG GCTGACCCCA CGGGCCTTGG CCTGTACCCC
GAATGGTCGT TCGCCTGGCC GGTGAACCGC CGGGTGCTGT ACAACCGCGC CTCGTGCGAC
CCCAGCGGAA AGCCCTACAA CCCCAAGCGC GCCGTGCTGG AATGGAAGGA CGGCAAGTGG
GTGGGCGACG TGCCCGACGG CGCCCCGCCG CCGCTGGCGG CGGAAGGCGG CAAGCTGCCG
TTCATCATGA AGCCCGACGG GGTCAGCTCC CTGTTCGGGC CGGGTCTTGG CGACGGCCCC
TTCCCGGAAC ACTACGAACC GCTGGAAAGC CCGCTGGCCA AGAACCTGAT GTCACCGCAG
CAGAACAACC CCGCCATCCT GCTGTACAAG AGCGACAAGG ACATGGTGGC CAGCGCCGAC
GCGCGCTTCC CCATCGTCAT GACCACCGAC TCGTCCACCG AGCACTGGTG CACCGGCGCC
TTCACCCGCT GGCAATCGTG GCTTACCGAG GCCATGCCCC AGGCATACGT GGAAATGAGC
GAGGAACTGG CCAAGGAAAA GGGCATCAAG AACGGCGACA AGGTGCGCGT GGAATCGGCT
CGCGGCAAGC TGGAGTGCGT GGCCATGGTC ACGCCGCGCT TCCGTCCCTT CCTGGTCAAC
GGCAAGACGC TGCATCAGGT GGGCATGCCT TACAACTACG GCTGGCGCTT CCCCGCCGAC
AACGGCGACA GTGCCAACCT GCTCACTCCC ACGGTGGGCG ACGCCAACGC CATGACGCCG
GAATACAAGG CGTTCATGGT CAACGTGGTC AAGGTATAG
 
Protein sequence
MRRRQFLKMS CVLGAGVALC GLGVDLVPIE AWADEIKKID RLKSARQTTS VCCYCSVGCG 
LICATDEKTG KIINIEGDPE HPVSEGSLCA KGAGMFQTTE ANEHRLTKVL YRAPNSDKWE
EKSWDWAIDR IARLMKEERD KSFITQNAAG QVVNRLETLA HMGSSNLDNE ECWSITAWAR
SLGLVYIDHQ ARVUHGPTVP ALAESFGRGA MTNHWIDIQH SDCILIQGSN AAENHPISFK
WVMRAKERGA KLIHVDPRFT RTSSKADFYA PLRSGTDIAF LGGMIKYIID NNRIFHDYVV
NYTNAPFLVD GKFGFKDGLF TGYDADKRGY DKATWTYQKD ANGIILRDET LKNPRCVYQM
LKAHYARYDL KKVSSITGTP EKDLKTVYEM FSSTGTKDRA GTVMYALGQT HHTVGVQNIR
ALAIVQLLLG NIGVCGGGVN ALRGEPNVQG STDHALLYHY LPGYLSAPRT SQQTLQSYIK
KITPSTTEAK SVNWQSNYGK YAVSLLKSWY GDAATKDNEF GYAWVPKADD GKDYSVMTLF
DVMYAGKIKG MTVFGQNPAC SIPNSNKVRA AFAKLDWMVH LNIFDNETAS FWKGPGMDPA
KVKTEVFLLP ASASVEKAGS QTNSGRWIQW RYEATKAPGD CIYAGEAIIR IQNKLKELYS
KEGGTFPDPI LNMTSDGLAE KGGYDAEKVA KLINGYFLAD VEIGGKQYKK GDCVPSFALL
QADGSTSSGN WICAGSFSQD GKNLMQRRGK ADPTGLGLYP EWSFAWPVNR RVLYNRASCD
PSGKPYNPKR AVLEWKDGKW VGDVPDGAPP PLAAEGGKLP FIMKPDGVSS LFGPGLGDGP
FPEHYEPLES PLAKNLMSPQ QNNPAILLYK SDKDMVASAD ARFPIVMTTD SSTEHWCTGA
FTRWQSWLTE AMPQAYVEMS EELAKEKGIK NGDKVRVESA RGKLECVAMV TPRFRPFLVN
GKTLHQVGMP YNYGWRFPAD NGDSANLLTP TVGDANAMTP EYKAFMVNVV KV