Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_2533 |
Symbol | |
ID | 7174469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 3196674 |
End bp | 3199706 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643541063 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002436940 |
Protein GI | 218887619 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 83 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA ACCGGCGGGG CTTCATCAAG CTCGCTGCCA CGGGTGCAGT GGCCACCGCG TTCGGGGGGC TGGGCATCAG CCTTGCCCCG GTCGCGGTCC ATGCCGAAGG GTTGGCCATC GACAAGGCCA CCCTGACCAC GTCCGTCTGT TGTTACTGCG CCGTGGGCTG CGGCCTGCTG GTCTGGACCG ACCCCGCGAC CGGACGCACC ATCAATATCG AAGGCAACCC GGACCACCCC ACCAACGAAG GCACCCTGTG CCCCAAGGGG TCGTCCATCT GGCAGACCGC GGAGCAGAGC CAGCGCATCA CCAAGGTGCT GTATCGCGCC CCCGGCAGCG ACAAGTGGGA AGAGAAGTCG TGGGACTGGG CGCTGCCGCG CATAGCCCGC AAGATCAAGG ACACCCGCGA CGCCTCGTTC GAGGCCGTCA ACGACAAGGG CCAGCCGGTC AACCGTACCC GCGCCATCGC CTCCGTCGGT TCCGCCGCCA TCGACAACGA GGAGGGCTGG TTGCTGCAGG CCATGCACCG CTCGCTCGGC CTGGTGTACC TGGAGAGCCA CGCCCGCATC TGACACAGTT CCACTGTGGG GGCTCTGGCA GAGTCCTACG GACGCGGCGC GATGACGAAT CACTGGATCG ACCTCAAGAA CAGCGACGTG TTGCTTATCA TGGGCAGCAA CGCGGCGGAA AACCACCCCA TCTCGTTCAA GTGGGTCACC CGTGCCCAGC AGCGCGGAGC AACGCTGATC CACGTGGACC CGCGCTTCAC GCGCACCTCG GCCAAGGCGG ACCTGTACGC GCCCATCCGC TCGGGCACGG ACCTGGTGTT CTTTGGCGGG CTGATCAAGC ACATTCTGGA TAACGAGCTG TATTTCAAGC AGTACGTGGT GGACTACACC AACGCCTCGT ACCTTGTGGG GCCGGACTAC GACTTCAAGG ACGGCCTGTT CTCCGGCTTC AACCCGGAGA CGGCCAGCTA CGACCGCAGG AAATGGGCCT TCGTCATCGA CGACAAGGGC GTGACCCTGA AGGATCCCAC CCTGAAGGAC CCGCGCTGCG TCTTGCAGGT GATGCGCCGG CACTACGCCC GCTACGACCT GAAGACCGTG GTGGACGTGA CCGGCATGCC GGAAGACAAG GTGCTGGCGG TGTGGAATTC CTTCGCCTCC ACCGGCAAGC CGGACAAGGC GGGCACCATC CTGTACGCCA TGGGCCAGTG CCAGCACACC GTGGGCGTGC AGAACATCCG CGCCCTGTCC ATGATCCAGA TGCTGCTGGG CAACATCGGC ATAGCGGGCG GCGGGGTCAA CGCGCTGCGC GGCGAATCCA ACGTGCAGGG CACCACGGAC ATCGCCCTGT TGTGCGACAA CCTGCCCGGC TACCTGCCCA TTCCCCGCGC CACCTGGGCA GACTACGACG CCTACGTGAA GGCGGGCACC CCGGTCACGG CGGATCCGCA AAGCGCCAAC TGGTGGTCAA ACATCGACAA GTACGCGGCA TCCCTGATGA AGGCCATGTA CCCCACGGTG GACCACAAGG AAGCGTACAC CTGGCTGCCC AAGATCGACG ACACCAAGGT GGTGGAATAC TCCTGGCTGT CGCTGTTCGA GCGCATGTAC AACGGCGGCT TCAAGGGCGC GTTCGTGTGG GGGCAGAACC CCTGCGCGGG CGGGGCCAAC GCGGGCAAGA ACCGCAAGGC CATGGCCAAG CTGGACTGGG CGGTGATGGT CAACCTGTTC GAGAACGAGA CCTCGCTGTT CTGGAAGGGT CCGGGCGTGA ACCCCAAGGA CGTCAAGACC GAGGTGTTCT TTCTGCCCGC GTGCATGAGC GTGGAAAAGG ACGGCTCGGT GGCCAACTCC GGCCGCTGGC TGCAATGGCG CGAGAAGAGC GCCAGGTTCA TGGGCGATTC CCTGAGCGAC GGCGACATCG TCATCCGGCT GTTCGAGGAA GTGCGCAAGC TGTACAAGGC AGAGGGCGGC AAGTTCCCGG AACCCATCCT GAACCTTGAC ACCGCCTACC TGAAGGACGG CGCATACGAC GCCAGCGCGC TGGCCAAGCG CCTCAACGGC ACCTTCCTGA AGGACGTGAC CATCGCCGAC AAGCAGTGGA AGGCGGGTCA GCAGGTGCCC GGCTTTGCCG CGTTGCAGGC CGACGGTTCC ACCGCGTGCG GCTGCTGGAT ATTCTCCGGC TGCTACACCG AGCAGGGCAA CATGATGGCC CGGCACGACC GCACCCAGAC CCCGGAACAG GCGGCCATCG GCCTGTTCCC CAACTGGTCG TACGCCTGGC CCGCCAACCG GCGCATCCTG TACAACCGCG CCGGGGTGGA CCAGACGGGC AAGCCCTTCG ACCCCAAGCG CGCGGTCATT GCCTGGAACG GCGAAAAGTG GGTGGGCGAC GTGCCCGACG GCGGGTGGAA GCCCGGCGAA AAGCTGCCGT TCATCATGGT GCGCGAAGGG CGCGGCCAGT TGTTCGGCCC CGGGCGCGTG GACGGCCCCC TGCCGGAACA CTACGAACCC TTCGAAAGCC CGCTGGCGGG CAACCCCCTG TCGCCGCAGC GGGTCAACCC CACGGCCCTG CACTTCGCCC ACGAGGAAAA GGCCGTGCGC GACCCGCGCT TCCCCTACGT GTGCACCACC TACCGGGTGA CCGAGCAGTG GCAGTCCGGC ACCATGACCC GCAAGACCGC GTGGCTGAAG GAAATGCAGC CCGACGGCTT CTGCGAGATG AGCCGCGAGC TGGCGGCGCA ACTGGGCGTG CAGAACGGCG ACCAGGTGGT GCTGGAATCG GTGCGCGGCA AGGTGCAGGT GGTGGCCATC GTCACCCCGC GCCTGAAGCC GTTCACCGTC ATGGGCGAAA CCGTGCACCA GGTGGGCATT CCCTGGCAGT TCGGCTGGGG CCAGAAGAAG AACGCCACGT TCGATTCGGC CAACCTGCTG TCGCCTTCGG TGGGCGACCC GAACACGGGC ATTCCGGAGA CGAAAGTCTT CATGGTGAAC GTGAGAAAAG CCCAGTCCGG GAAGCAAGGG TAG
|
Protein sequence | MKMNRRGFIK LAATGAVATA FGGLGISLAP VAVHAEGLAI DKATLTTSVC CYCAVGCGLL VWTDPATGRT INIEGNPDHP TNEGTLCPKG SSIWQTAEQS QRITKVLYRA PGSDKWEEKS WDWALPRIAR KIKDTRDASF EAVNDKGQPV NRTRAIASVG SAAIDNEEGW LLQAMHRSLG LVYLESHARI UHSSTVGALA ESYGRGAMTN HWIDLKNSDV LLIMGSNAAE NHPISFKWVT RAQQRGATLI HVDPRFTRTS AKADLYAPIR SGTDLVFFGG LIKHILDNEL YFKQYVVDYT NASYLVGPDY DFKDGLFSGF NPETASYDRR KWAFVIDDKG VTLKDPTLKD PRCVLQVMRR HYARYDLKTV VDVTGMPEDK VLAVWNSFAS TGKPDKAGTI LYAMGQCQHT VGVQNIRALS MIQMLLGNIG IAGGGVNALR GESNVQGTTD IALLCDNLPG YLPIPRATWA DYDAYVKAGT PVTADPQSAN WWSNIDKYAA SLMKAMYPTV DHKEAYTWLP KIDDTKVVEY SWLSLFERMY NGGFKGAFVW GQNPCAGGAN AGKNRKAMAK LDWAVMVNLF ENETSLFWKG PGVNPKDVKT EVFFLPACMS VEKDGSVANS GRWLQWREKS ARFMGDSLSD GDIVIRLFEE VRKLYKAEGG KFPEPILNLD TAYLKDGAYD ASALAKRLNG TFLKDVTIAD KQWKAGQQVP GFAALQADGS TACGCWIFSG CYTEQGNMMA RHDRTQTPEQ AAIGLFPNWS YAWPANRRIL YNRAGVDQTG KPFDPKRAVI AWNGEKWVGD VPDGGWKPGE KLPFIMVREG RGQLFGPGRV DGPLPEHYEP FESPLAGNPL SPQRVNPTAL HFAHEEKAVR DPRFPYVCTT YRVTEQWQSG TMTRKTAWLK EMQPDGFCEM SRELAAQLGV QNGDQVVLES VRGKVQVVAI VTPRLKPFTV MGETVHQVGI PWQFGWGQKK NATFDSANLL SPSVGDPNTG IPETKVFMVN VRKAQSGKQG
|
| |