Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0643 |
Symbol | |
ID | 7172530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 770931 |
End bp | 773969 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643539143 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002435068 |
Protein GI | 218885747 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.0165738 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGAC GCCAATTCCT GAAGATGTCC TGCGTGCTTG GGGCGGGTGT CGCCCTGTGC GGGCTGGGCG TCGACCTCGT GCCCATCGAG GCGTGGGCAG ACGAGATCAA GAAGATCGAC CGCCTGAAAA GCGCCAGGCA GACCACGTCT GTGTGCTGCT ACTGTTCGGT GGGGTGCGGC CTGATCTGCG CCACCGACGA GAAGACCGGC AAGATCATCA ACATCGAGGG CGATCCCGAG CACCCGGTGA GCGAAGGCTC GCTGTGCGCC AAGGGTGCGG GCATGTTCCA GACCACCGAG GCCAACGAAC ACCGCCTGAC CAAAGTGCTG TACCGCGCCC CCAACTCGGA CAAGTGGGAA GAAAAGAGCT GGGACTGGGC CATCGACCGC ATCGCGCGGC TGATGAAGGA AGAGCGCGAC AAGTCCTTCA TCACCCAGAA CGCGGCGGGA CAGGTGGTCA ACCGGCTGGA AACCCTGGCC CACATGGGCA GTTCCAACCT GGACAACGAG GAATGCTGGA GCATCACCGC CTGGGCGCGG TCTCTCGGCC TGGTGTACAT AGATCACCAG GCCCGGGTCT GACACGGCCC CACCGTACCG GCTCTGGCAG AGTCGTTCGG ACGCGGCGCG ATGACCAATC ACTGGATCGA CATCCAGCAC AGTGATTGCA TTCTCATCCA GGGCAGCAAC GCTGCCGAAA ACCACCCCAT CTCCTTCAAG TGGGTGATGC GCGCCAAGGA AAGGGGCGCC AAGCTGATCC ACGTGGACCC GCGCTTCACG CGGACCTCGT CCAAGGCCGA TTTCTACGCC CCCCTGCGCT CGGGCACGGA CATCGCCTTC CTTGGCGGGA TGATCAAGTA CATCATCGAC AACAACAGGA TATTCCACGA CTACGTGGTC AACTACACCA ACGCCCCGTT CCTGGTGGAC GGCAAGTTCG GCTTCAAGGA CGGCCTGTTC ACCGGCTACG ACGCGGACAA GCGCGGCTAC GACAAGGCCA CCTGGACCTA CCAGAAGGAT GCCAATGGCA TCATCCTGCG CGACGAGACG CTGAAGAACC CGCGTTGCGT CTACCAGATG CTGAAGGCGC ACTACGCCCG CTACGACCTG AAGAAGGTCT CTTCCATCAC CGGCACGCCC GAAAAGGACC TGAAGACCGT GTACGAGATG TTCTCGTCCA CCGGCACCAA GGACCGTGCG GGCACCGTCA TGTACGCCCT GGGGCAGACC CACCACACCG TGGGGGTGCA GAACATCCGC GCCCTGGCCA TCGTCCAGCT GCTGCTGGGC AACATCGGGG TGTGCGGCGG CGGCGTCAAC GCACTGCGCG GCGAACCCAA CGTGCAGGGA TCCACGGACC ACGCCCTGCT GTACCACTAC CTGCCGGGCT ACCTTTCGGC CCCGCGCACC TCGCAGCAGA CGCTGCAATC GTACATCAAG AAGATCACCC CCTCGACCAC GGAAGCCAAA TCCGTCAACT GGCAGTCCAA CTACGGCAAG TACGCGGTAA GCCTGCTGAA GTCGTGGTAC GGCGATGCCG CCACCAAGGA CAACGAGTTC GGCTACGCCT GGGTGCCCAA GGCCGACGAC GGCAAGGACT ACTCGGTCAT GACCCTGTTC GACGTGATGT ACGCCGGCAA GATCAAGGGC ATGACCGTGT TCGGCCAGAA CCCCGCGTGC AGCATCCCCA ACTCGAACAA GGTGCGCGCC GCCTTCGCCA AGCTGGACTG GATGGTGCAC CTGAACATCT TCGACAACGA GACGGCGTCA TTCTGGAAGG GGCCGGGCAT GGACCCCGCC AAGGTGAAAA CCGAGGTCTT CCTGCTGCCC GCCTCTGCCT CCGTCGAAAA GGCCGGCAGC CAGACCAACA GCGGCCGCTG GATACAGTGG CGCTACGAGG CCACCAAGGC TCCCGGCGAC TGCATCTACG CGGGCGAGGC CATCATCCGC ATCCAAAACA AGCTGAAGGA ACTGTACAGC AAGGAAGGCG GCACCTTCCC CGACCCCATC CTGAACATGA CCTCCGACGG TCTGGCAGAA AAGGGCGGCT ACGACGCGGA AAAGGTGGCC AAGCTCATCA ACGGCTACTT CCTTGCCGAC GTGGAGATAG GCGGCAAGCA GTACAAGAAG GGAGACTGCG TGCCCTCGTT CGCCCTGTTG CAGGCCGACG GCTCCACCTC GTCGGGCAAC TGGATCTGCG CGGGCAGCTT CAGCCAGGAT GGCAAGAACC TGATGCAGCG ACGCGGCAAG GCTGACCCCA CGGGCCTTGG CCTGTACCCC GAATGGTCGT TCGCCTGGCC GGTGAACCGC CGGGTGCTGT ACAACCGCGC CTCGTGCGAC CCCAGCGGAA AGCCCTACAA CCCCAAGCGC GCCGTGCTGG AATGGAAGGA CGGCAAGTGG GTGGGCGACG TGCCCGACGG CGCCCCGCCG CCGCTGGCGG CGGAAGGCGG CAAGCTGCCG TTCATCATGA AGCCCGACGG GGTCAGCTCC CTGTTCGGGC CGGGTCTTGG CGACGGCCCC TTCCCGGAAC ACTACGAACC GCTGGAAAGC CCGCTGGCCA AGAACCTGAT GTCACCGCAG CAGAACAACC CCGCCATCCT GCTGTACAAG AGCGACAAGG ACATGGTGGC CAGCGCCGAC GCGCGCTTCC CCATCGTCAT GACCACCGAC TCGTCCACCG AGCACTGGTG CACCGGCGCC TTCACCCGCT GGCAATCGTG GCTTACCGAG GCCATGCCCC AGGCATACGT GGAAATGAGC GAGGAACTGG CCAAGGAAAA GGGCATCAAG AACGGCGACA AGGTGCGCGT GGAATCGGCT CGCGGCAAGC TGGAGTGCGT GGCCATGGTC ACGCCGCGCT TCCGTCCCTT CCTGGTCAAC GGCAAGACGC TGCATCAGGT GGGCATGCCT TACAACTACG GCTGGCGCTT CCCCGCCGAC AACGGCGACA GTGCCAACCT GCTCACTCCC ACGGTGGGCG ACGCCAACGC CATGACGCCG GAATACAAGG CGTTCATGGT CAACGTGGTC AAGGTATAG
|
Protein sequence | MRRRQFLKMS CVLGAGVALC GLGVDLVPIE AWADEIKKID RLKSARQTTS VCCYCSVGCG LICATDEKTG KIINIEGDPE HPVSEGSLCA KGAGMFQTTE ANEHRLTKVL YRAPNSDKWE EKSWDWAIDR IARLMKEERD KSFITQNAAG QVVNRLETLA HMGSSNLDNE ECWSITAWAR SLGLVYIDHQ ARVUHGPTVP ALAESFGRGA MTNHWIDIQH SDCILIQGSN AAENHPISFK WVMRAKERGA KLIHVDPRFT RTSSKADFYA PLRSGTDIAF LGGMIKYIID NNRIFHDYVV NYTNAPFLVD GKFGFKDGLF TGYDADKRGY DKATWTYQKD ANGIILRDET LKNPRCVYQM LKAHYARYDL KKVSSITGTP EKDLKTVYEM FSSTGTKDRA GTVMYALGQT HHTVGVQNIR ALAIVQLLLG NIGVCGGGVN ALRGEPNVQG STDHALLYHY LPGYLSAPRT SQQTLQSYIK KITPSTTEAK SVNWQSNYGK YAVSLLKSWY GDAATKDNEF GYAWVPKADD GKDYSVMTLF DVMYAGKIKG MTVFGQNPAC SIPNSNKVRA AFAKLDWMVH LNIFDNETAS FWKGPGMDPA KVKTEVFLLP ASASVEKAGS QTNSGRWIQW RYEATKAPGD CIYAGEAIIR IQNKLKELYS KEGGTFPDPI LNMTSDGLAE KGGYDAEKVA KLINGYFLAD VEIGGKQYKK GDCVPSFALL QADGSTSSGN WICAGSFSQD GKNLMQRRGK ADPTGLGLYP EWSFAWPVNR RVLYNRASCD PSGKPYNPKR AVLEWKDGKW VGDVPDGAPP PLAAEGGKLP FIMKPDGVSS LFGPGLGDGP FPEHYEPLES PLAKNLMSPQ QNNPAILLYK SDKDMVASAD ARFPIVMTTD SSTEHWCTGA FTRWQSWLTE AMPQAYVEMS EELAKEKGIK NGDKVRVESA RGKLECVAMV TPRFRPFLVN GKTLHQVGMP YNYGWRFPAD NGDSANLLTP TVGDANAMTP EYKAFMVNVV KV
|
| |