Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0646 |
Symbol | |
ID | 7172533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | + |
Start bp | 776678 |
End bp | 779719 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643539146 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002435071 |
Protein GI | 218885750 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 0.166325 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACTA CGCGGCGGAG TTTTCTCAAG CTCATGAGCG TGAGCGCTGT CGGCCTCTCG CTCGGCCAGC TTGGGTTCGA TCTTGCTGAA GCGCAGGCGT ATGCCAGCAA GCTCAAGATC GAAGGCGCAA AGGAAGTGGG CAGCGTCTGC CCCTTCTGTT CCGTCTGTTG TCAGATCATC GCCTACGTGC GCAACGGCAA GCTCGTTTCG ACCGAAGGCG ACCCCGACTT TCCGGTAAAC GAAGGGGCGC TGTGCGCCAA GGGCGCGGCC CTCTTCTCCA TGTACACCAA CCCGCATCGC CTGACGAAGC CGCTGTACCG CGCTCCGCAC AGCGACAAGT GGGTGGAGAA GGACTGGGAC TGGACCCTGA ACCAGATCGC GCGCCGCGTT AAGGACACCC GCGACAAGGA CATGGTCCTC AAGAACGAAA AGGGCCAGAC CGTCAACCGC GTGGATTCGC TTTTCATGAT GGGCACCTCG CACGCCTCCA ACGAGGAATG CGCCGTCATC CATCAAGCCA TACGAAGCCT GGGTATGGTC CAAATGGACC ACCAGGCCCG GGTCTGACAC AGTCCCACTG TTGCGGCTCT GGCAGAGTCG TTCGGTCGCG GGGCTATGAC GAACCACTGG ATCGATATCA AGAATAGCGA TGCAGTGCTT ATTATCGGCA GCAATGCCGC AGAACACCAT CCTGTCGCCT TCAAGTGGGT CATGCGCGCC AAGGAAAACG GCGCCGTGCT CATGCACGTG GACCCCAAGT TCTCCCGCAC CTCGGCACGC TGCGATTTCC ACGTGCCCCT GCGTTCGGGC ACGGACATCC CCTTCCTGGG CGGCATGATC AACTACATCC TTGAAAAGGA ACTCTTTTTC AAGGAGTACG TCTTCAACTA CACCAACTTC GCCTTTGTCG TCGGCAAGGA CTACGAGTTC AACGACGGCC TCTTCAGCGG GTACGACCCC AAGACCCGCA AGTACGACCA GTCCAAGTGG GCCTTCGAAA AGGGTCCCGA CGGCGGCCCG GTCATCGACG AAACCCTCAA GAACGAGCGC TGCGTCTTCA ACCTGATGAA GAAGCACTAC TCGCGCTACA CGCTGAAGAC CGTCTCCGAC GTCACCGGCG TGTCCGAGGA AAACCTGCTG CGCGTGTACG ATGCGTTCTG CGCCACCGGC AAGCCGGACA AGGCAGGCAC CATCATGTAC GCCCTGGGCT GGACCCAGCA CACCGTGGGC GTGCAGAACA TCCGCGCCTC GTCGCTCATC CAGCTGCTGC TGGGCAACAT CGGCATCGCG GGCGGCGGCA TCAACGCCCT GCGCGGCGAG CCCAACGTGC AGGGTTCCAC CGACCACGGC CTGCTGTACA GCTCGCTGCC GGGCTATCAC AACCTGCCCG TGTCCACCTG GCAGACCCTG GCCGACTACA ACAAGGCCAA CACCCCGGTC ACCACGCTGA AGAACAGCGC CAACTGGTGG AGCAACCGTC CCAAGTACGT GGCCAGCCTG CTGAAGGGCT GGTACGGCGA CGCGGCCACG CCCGAGAACG ACTTCTGCTA CGAATACCTG CCGAAGCTCG AACCCGGCGA GGACTGCTCG TACATGTACG TCATGGACAA GATGTACCAT GGCAAGATCA AGGGCGGGTT CATCTTCGGG GTCAACCCGA TGAACAGCTT CTCCAACACC AACAAGATGC GCGCGGCGCT GGACAAGCTG GACTGGCTGG TCTGCTCCGA ACTGCACAAC TCGGAAACCA CGGACAACTG GCAGCGCCCC GGCGTTGATC CCAAGACCAA GAAGACCGAG GTGTTCCTGC TGCCGTCGGC CCACCGCATC GAAAAGGCGG GCACCATCAG CAACAGCGGT CGCTGGTTGC AGTGGTTCGA CAAGGCGGTG GAACCGGGCG GACAGGCCCG CAACTTCGCC GACATCTTCG TGCCGCTGAT CAACAAGATC CGCGACCTGT ACAGGAAGGA AGGCGGCACG CTGCCCGAGC CCCTGCTGAA GATGCACTGG ACCGAAAAGT TCGAACCGGA AGAATGGACC CGCCGCATCA ACGGCTTCTT CTGGGCCGAC ACCAAGGTGG GCGACAAGGA ATACAAGCGC GGCCAGATGG TGCCCGCCTT CGCCCAACTG AAGGACGACG GCTCCACCTC GTCGCTGAAC TGGATCTACG CAGGCAGCTA CACCGAAGAG GCCGGCAACA AGTCCAAGAA GCGCGACGCC AGCCAGACGC CCATGCAGGC CAACATCGGC CTGTTCCCCA ACTGGTCGTG GTGCTGGCCG GTGAACCGAC GCATCCTGTA CAACCGCGCT TCGGTTGACC TGAACGGCAA GCCGTTCAAC CCCAAGAAGG CCGTCATCGA ATGGGACGGC GCGAAGTGGA TCGGCGACGT GCCCGACGGC CCGTGGCCCC CCATGGCCGA CAAGGAAAAG GGCAAGCTGC CCTTCATCAT GACCAAGGAC GGCTATGCCC AGTTCTACGG CACCGGCCGC GTGGACGGTC CCTTCCCCGA GCACTACGAA CCTGCGGAAA CCCCGCTGGA CAGCCATCCG TTCTCCAAGC AGCTGTCCAG CCCGGTGTAC AAGTTCCACA CCTCGGACAT GGACCAGTTC GCCAAGGCGG CCGACCCCAA CTACCCGTAC GTGCTCACCA CCTACAGCCT GACGGAACAC TGGTGCGGCG GCGGCGAAAC GCGCAACGTG CCCAACCTGC TGGAAACCGA GCCTCAGCTC TACATAGAGA TGAGCCACGA GCTGGCCAAG GAAAAGGGCA TCAACAACGG CGACGGCGTG ATCGTGGAAA GTGCGCGCGG GCGCGTGGAG GCCATAGCCA TGGTCACCGT GCGCATCCGG CCCTTCACCA TCATGGGCAA GACGGTGCAC CTCGTGGGCA TGCCTTTCGC CTACGGCTGG ACCACGCCCA AGTGCGGTGA CTCCACCAAC CGCCTTACCG TCGGCGCGTA CGACCCGAAC ACCACCATTC CCGAAAGCAA GGCCTGTCTC GTCAACTTGC GCAAGGCCGA CAAGCTGACC GAAATAGCCT AA
|
Protein sequence | MNTTRRSFLK LMSVSAVGLS LGQLGFDLAE AQAYASKLKI EGAKEVGSVC PFCSVCCQII AYVRNGKLVS TEGDPDFPVN EGALCAKGAA LFSMYTNPHR LTKPLYRAPH SDKWVEKDWD WTLNQIARRV KDTRDKDMVL KNEKGQTVNR VDSLFMMGTS HASNEECAVI HQAIRSLGMV QMDHQARVUH SPTVAALAES FGRGAMTNHW IDIKNSDAVL IIGSNAAEHH PVAFKWVMRA KENGAVLMHV DPKFSRTSAR CDFHVPLRSG TDIPFLGGMI NYILEKELFF KEYVFNYTNF AFVVGKDYEF NDGLFSGYDP KTRKYDQSKW AFEKGPDGGP VIDETLKNER CVFNLMKKHY SRYTLKTVSD VTGVSEENLL RVYDAFCATG KPDKAGTIMY ALGWTQHTVG VQNIRASSLI QLLLGNIGIA GGGINALRGE PNVQGSTDHG LLYSSLPGYH NLPVSTWQTL ADYNKANTPV TTLKNSANWW SNRPKYVASL LKGWYGDAAT PENDFCYEYL PKLEPGEDCS YMYVMDKMYH GKIKGGFIFG VNPMNSFSNT NKMRAALDKL DWLVCSELHN SETTDNWQRP GVDPKTKKTE VFLLPSAHRI EKAGTISNSG RWLQWFDKAV EPGGQARNFA DIFVPLINKI RDLYRKEGGT LPEPLLKMHW TEKFEPEEWT RRINGFFWAD TKVGDKEYKR GQMVPAFAQL KDDGSTSSLN WIYAGSYTEE AGNKSKKRDA SQTPMQANIG LFPNWSWCWP VNRRILYNRA SVDLNGKPFN PKKAVIEWDG AKWIGDVPDG PWPPMADKEK GKLPFIMTKD GYAQFYGTGR VDGPFPEHYE PAETPLDSHP FSKQLSSPVY KFHTSDMDQF AKAADPNYPY VLTTYSLTEH WCGGGETRNV PNLLETEPQL YIEMSHELAK EKGINNGDGV IVESARGRVE AIAMVTVRIR PFTIMGKTVH LVGMPFAYGW TTPKCGDSTN RLTVGAYDPN TTIPESKACL VNLRKADKLT EIA
|
| |