Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_1217 |
Symbol | |
ID | 7173120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 1498036 |
End bp | 1501062 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643539726 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_002435636 |
Protein GI | 218886315 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.0663541 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGTCA ATCGCAGGCA ATTTCTCAAG CTGAGCGCGG GCGCCACCCT GGCAAGCGCG TTCGGCGGGC TGGGGATCAG CCTCGCGCCC TCGGTGGCGC GGGCCGAACT CCAGAAGCTT CAGTGGGCAA AGCAGACCAC CTCGGTTTGC TGCTACTGCG CGGTGGGCTG CGGGCTCATC GTCCATACCG CCAAGAATGG CGAGGGCCGC GCCGTCAACG TGGAAGGCGA CCCGGACCAT CCGATCAACG AAGGTTCGCT GTGCCCCAAG GGCGCGTCCA TCTTCCAATT GGGCGAAAAC AACGCCCGCT CGCCCAAGCC CCTGTACCGC GGGCCCAACA GCGGCGAGTG GAAGGAAGTG GAATGGGACT GGGCGCTGAC CGAAATCGCC AAGCGCGTCA AGAAGACCCG CGACGAATCC TTCCAACTGG CCAATGCCGC CGGTGAAAAG GTGAACCGGA CGGAAGCCAT CGCCTCCTTC GGCTCCGCCG CCATGGACAA CGAGGAATGC TGGGCCTACC AGGTCATCCT CAGAAGCCTC GGCCTGGTGT TCATCGAACA CCAGGCGCGG ATCTGACACA GCCCCACTGT ACCGGCTCTG GCAGAGTCGT TCGGTCGCGG TGCTATGACG AATCACTGGA ACGATCTTGC GAACAGTGAT TGTGTGTTGA TCATGGGCAG CAACGCTGCC GAAAACCACC CCATTTCCTT CAAGTGGGTG CTGCGCGCGC AGGACAAGGG CGCCACGCTG ATCCACGTGG ACCCGCGCTT CACGCGCACT TCCGCCAAGT GTGACATCTA CGCCCCCATC CGGTCGGGCG CGGACATCCC CTTCCTTGGC GGCCTCATCA AGTACATCCT CGAGAACAAG CTGTATTTCG AGGAGTACGT GCGCGAGTAC ACCAACGCCT CGCTCATCGT GGGCGAAAAG TTCTCGTTCA AGGACGGCCT GTTCAGCGGC TACGATGCGG ACAAGCGCAA GTACGACAAG TCGCAGTGGG CCTTCGAACT TGACGAGAAC GGCGTGCCCA GGCGCGACCC GTCGCTGAAG CACCCCCGGT GCGTCTTCAA CCTGATGAAG AAGCACTACG AGCGCTACAC CGTGGACAAG GTGGCCGACA TCACCGGCAC GCCCAAGGAC CTGATCCTGA AGGTCTACAA GGCCTACGCG GCCACGGGCA AGCCGGACAA GGCGGGCACC ATCATGTACG CCATGGGCTG GACGCAGCAC TCCGTGGGCG TGCAGAACAT CCGCGCCATG GCCATGATCC AGCTTCTGCT GGGCAACATC GGCGTGGCGG GCGGCGGCGT CAACGCGCTG CGCGGCGAAT CCAACGTGCA GGGCTCCACC GACCAGGGCC TCCTGGCCCA CATCTGGCCC GCCTACAACC CCGCGCCCAA CAGCAAGCAG ACCACGCTCG ACGCCTACAA TGCGGCCACC CCGCAGTCCA AGGATCCCAT GAGCGTGAAC TGGTGGCAGA ACCGGCCCAA GTACGTGGCC AGCTACCTGA AGGCGCTGTA CCCCGACCTT GCCCCGGCGG ACGCCTACGA CATCATGCCC CGGCTTGATG CGTCCAAGCC CGCCACCTAC TACTTCTGGC TGAACATCTT CGACAAGATG GACAAGGGCG ATGTGAAGGG CTGCTTCGCG TGGGGCATGA ACCCCGCCTG CGGCGGCGCC AACGCCAACA AGAACCGGCG TGCCCTGGGC AAGCTGGACT GGCTGGTGAA CGTCAACATC TTCGAGAACG AAACCTCTTC GTTCTGGAAG GGCCCGGGCA TGAAGCCGGA GGAAATCGGC ACGGAAGTGT TCTTCCTGCC GTGCGCCGTG TCCATCGAAA AGGAAGGCTC GGTCGCCAAC TCCGGCCGCT GGATGCAGTG GCGCTATCGC GGGCCCAAGC CGTGGGGCCA GACCAAGCCC GACGGCGACA TCATGCTGGA AATGATGCAC AAGATCCGCG ACCTGTACGC CAAGGAAGGC GGCGTGCACG CCGACCCCAT CCTGAAGCTG AACATCAAGG ACTGGGAAGA GCACAACGAG TTCTCCCCGG CCAAGACCGC CAAGCTGATG AACGGCTACT TCCTGAAGGA CACGGAAGTG GGCGGCAAGC AGTTCAAGGC CGGGCAGCAG GTGCCCTCGT TCGCCTTCCT GACGGCGGAC GGCTCCACCT GTTCCGGCAA CTGGCTGCAT GCCGGGTCGT TCACCGATGC GGGCAACATG ATGGCCCGCC GCGACACCGC GCAGACGCCG GAACAGGCGC GCATCGGCCT GTTCCCCAAC TGGTCGTTCT GCTGGCCGGT GAACCGGCGC ATCATCTACA ACCGCGCTTC CGTGGACAAG ACCGGCAAGC CGTGGAACCC GGCCAAGGCC GTCATCGAAT GGAAGGACGG CAAGTGGGTG GGCGACGTGG TTGACGGCGG CGGCGACCCC GGCACCAAGC ACCCGTTCAT CATGCAGACG CACGGTTTCG GCGCGCTGTA CGGCCCCGGG CGAGAGGAAG GCCCCTTCCC CGAGCACTAC GAACCGCTGG AGTGCCCGGT TTCCAAGAAC CCGTTCTCGA AGCAGCTGCA CAACCCGGTG GCCTTCAAGA TCGAGGGCGA AAAGGCGGCG GTGTGCGATC CGAAGTTCCC CTTCATCGGC ACCACCTACC GCGTCACCGA ACACTGGCAG ACCGGCCTGA TGACCCGCCG TTGCGCCTGG CTGGTGGAAG CGGAGCCCGA GATCTTCGCC GAAGTCAGCA AGGAACTGGC CAAGCTGCGC GGCATCAAGA ACGGCGACCG GGTCAAGGTC TCCAGCCTGC GTGGCTCGCT GGAGGCGGTG GCCATCGTCA CCGAGCGCAT CAAGCCCTAC AAGGTCATGG GGGCGGAAAT CCACATGGTG GGCCTGCCCT GGCATTACGG CTGGATGGTG CCCAGAAACG GCGGCGACAC GGCCAACCTG CTCACGCCGT CTGCGGGCGA CCCGAACACC GGCATCCCCG AGACCAAGGC GTTCATGGTC GATATCCGCA AGGTGGGAGG TAAGTAG
|
Protein sequence | MTVNRRQFLK LSAGATLASA FGGLGISLAP SVARAELQKL QWAKQTTSVC CYCAVGCGLI VHTAKNGEGR AVNVEGDPDH PINEGSLCPK GASIFQLGEN NARSPKPLYR GPNSGEWKEV EWDWALTEIA KRVKKTRDES FQLANAAGEK VNRTEAIASF GSAAMDNEEC WAYQVILRSL GLVFIEHQAR IUHSPTVPAL AESFGRGAMT NHWNDLANSD CVLIMGSNAA ENHPISFKWV LRAQDKGATL IHVDPRFTRT SAKCDIYAPI RSGADIPFLG GLIKYILENK LYFEEYVREY TNASLIVGEK FSFKDGLFSG YDADKRKYDK SQWAFELDEN GVPRRDPSLK HPRCVFNLMK KHYERYTVDK VADITGTPKD LILKVYKAYA ATGKPDKAGT IMYAMGWTQH SVGVQNIRAM AMIQLLLGNI GVAGGGVNAL RGESNVQGST DQGLLAHIWP AYNPAPNSKQ TTLDAYNAAT PQSKDPMSVN WWQNRPKYVA SYLKALYPDL APADAYDIMP RLDASKPATY YFWLNIFDKM DKGDVKGCFA WGMNPACGGA NANKNRRALG KLDWLVNVNI FENETSSFWK GPGMKPEEIG TEVFFLPCAV SIEKEGSVAN SGRWMQWRYR GPKPWGQTKP DGDIMLEMMH KIRDLYAKEG GVHADPILKL NIKDWEEHNE FSPAKTAKLM NGYFLKDTEV GGKQFKAGQQ VPSFAFLTAD GSTCSGNWLH AGSFTDAGNM MARRDTAQTP EQARIGLFPN WSFCWPVNRR IIYNRASVDK TGKPWNPAKA VIEWKDGKWV GDVVDGGGDP GTKHPFIMQT HGFGALYGPG REEGPFPEHY EPLECPVSKN PFSKQLHNPV AFKIEGEKAA VCDPKFPFIG TTYRVTEHWQ TGLMTRRCAW LVEAEPEIFA EVSKELAKLR GIKNGDRVKV SSLRGSLEAV AIVTERIKPY KVMGAEIHMV GLPWHYGWMV PRNGGDTANL LTPSAGDPNT GIPETKAFMV DIRKVGGK
|
| |