Gene DvMF_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvMF_1217 
Symbol 
ID7173120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris str. 'Miyazaki F' 
KingdomBacteria 
Replicon accessionNC_011769 
Strand
Start bp1498036 
End bp1501062 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content64% 
IMG OID643539726 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_002435636 
Protein GI218886315 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.0663541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTCA ATCGCAGGCA ATTTCTCAAG CTGAGCGCGG GCGCCACCCT GGCAAGCGCG 
TTCGGCGGGC TGGGGATCAG CCTCGCGCCC TCGGTGGCGC GGGCCGAACT CCAGAAGCTT
CAGTGGGCAA AGCAGACCAC CTCGGTTTGC TGCTACTGCG CGGTGGGCTG CGGGCTCATC
GTCCATACCG CCAAGAATGG CGAGGGCCGC GCCGTCAACG TGGAAGGCGA CCCGGACCAT
CCGATCAACG AAGGTTCGCT GTGCCCCAAG GGCGCGTCCA TCTTCCAATT GGGCGAAAAC
AACGCCCGCT CGCCCAAGCC CCTGTACCGC GGGCCCAACA GCGGCGAGTG GAAGGAAGTG
GAATGGGACT GGGCGCTGAC CGAAATCGCC AAGCGCGTCA AGAAGACCCG CGACGAATCC
TTCCAACTGG CCAATGCCGC CGGTGAAAAG GTGAACCGGA CGGAAGCCAT CGCCTCCTTC
GGCTCCGCCG CCATGGACAA CGAGGAATGC TGGGCCTACC AGGTCATCCT CAGAAGCCTC
GGCCTGGTGT TCATCGAACA CCAGGCGCGG ATCTGACACA GCCCCACTGT ACCGGCTCTG
GCAGAGTCGT TCGGTCGCGG TGCTATGACG AATCACTGGA ACGATCTTGC GAACAGTGAT
TGTGTGTTGA TCATGGGCAG CAACGCTGCC GAAAACCACC CCATTTCCTT CAAGTGGGTG
CTGCGCGCGC AGGACAAGGG CGCCACGCTG ATCCACGTGG ACCCGCGCTT CACGCGCACT
TCCGCCAAGT GTGACATCTA CGCCCCCATC CGGTCGGGCG CGGACATCCC CTTCCTTGGC
GGCCTCATCA AGTACATCCT CGAGAACAAG CTGTATTTCG AGGAGTACGT GCGCGAGTAC
ACCAACGCCT CGCTCATCGT GGGCGAAAAG TTCTCGTTCA AGGACGGCCT GTTCAGCGGC
TACGATGCGG ACAAGCGCAA GTACGACAAG TCGCAGTGGG CCTTCGAACT TGACGAGAAC
GGCGTGCCCA GGCGCGACCC GTCGCTGAAG CACCCCCGGT GCGTCTTCAA CCTGATGAAG
AAGCACTACG AGCGCTACAC CGTGGACAAG GTGGCCGACA TCACCGGCAC GCCCAAGGAC
CTGATCCTGA AGGTCTACAA GGCCTACGCG GCCACGGGCA AGCCGGACAA GGCGGGCACC
ATCATGTACG CCATGGGCTG GACGCAGCAC TCCGTGGGCG TGCAGAACAT CCGCGCCATG
GCCATGATCC AGCTTCTGCT GGGCAACATC GGCGTGGCGG GCGGCGGCGT CAACGCGCTG
CGCGGCGAAT CCAACGTGCA GGGCTCCACC GACCAGGGCC TCCTGGCCCA CATCTGGCCC
GCCTACAACC CCGCGCCCAA CAGCAAGCAG ACCACGCTCG ACGCCTACAA TGCGGCCACC
CCGCAGTCCA AGGATCCCAT GAGCGTGAAC TGGTGGCAGA ACCGGCCCAA GTACGTGGCC
AGCTACCTGA AGGCGCTGTA CCCCGACCTT GCCCCGGCGG ACGCCTACGA CATCATGCCC
CGGCTTGATG CGTCCAAGCC CGCCACCTAC TACTTCTGGC TGAACATCTT CGACAAGATG
GACAAGGGCG ATGTGAAGGG CTGCTTCGCG TGGGGCATGA ACCCCGCCTG CGGCGGCGCC
AACGCCAACA AGAACCGGCG TGCCCTGGGC AAGCTGGACT GGCTGGTGAA CGTCAACATC
TTCGAGAACG AAACCTCTTC GTTCTGGAAG GGCCCGGGCA TGAAGCCGGA GGAAATCGGC
ACGGAAGTGT TCTTCCTGCC GTGCGCCGTG TCCATCGAAA AGGAAGGCTC GGTCGCCAAC
TCCGGCCGCT GGATGCAGTG GCGCTATCGC GGGCCCAAGC CGTGGGGCCA GACCAAGCCC
GACGGCGACA TCATGCTGGA AATGATGCAC AAGATCCGCG ACCTGTACGC CAAGGAAGGC
GGCGTGCACG CCGACCCCAT CCTGAAGCTG AACATCAAGG ACTGGGAAGA GCACAACGAG
TTCTCCCCGG CCAAGACCGC CAAGCTGATG AACGGCTACT TCCTGAAGGA CACGGAAGTG
GGCGGCAAGC AGTTCAAGGC CGGGCAGCAG GTGCCCTCGT TCGCCTTCCT GACGGCGGAC
GGCTCCACCT GTTCCGGCAA CTGGCTGCAT GCCGGGTCGT TCACCGATGC GGGCAACATG
ATGGCCCGCC GCGACACCGC GCAGACGCCG GAACAGGCGC GCATCGGCCT GTTCCCCAAC
TGGTCGTTCT GCTGGCCGGT GAACCGGCGC ATCATCTACA ACCGCGCTTC CGTGGACAAG
ACCGGCAAGC CGTGGAACCC GGCCAAGGCC GTCATCGAAT GGAAGGACGG CAAGTGGGTG
GGCGACGTGG TTGACGGCGG CGGCGACCCC GGCACCAAGC ACCCGTTCAT CATGCAGACG
CACGGTTTCG GCGCGCTGTA CGGCCCCGGG CGAGAGGAAG GCCCCTTCCC CGAGCACTAC
GAACCGCTGG AGTGCCCGGT TTCCAAGAAC CCGTTCTCGA AGCAGCTGCA CAACCCGGTG
GCCTTCAAGA TCGAGGGCGA AAAGGCGGCG GTGTGCGATC CGAAGTTCCC CTTCATCGGC
ACCACCTACC GCGTCACCGA ACACTGGCAG ACCGGCCTGA TGACCCGCCG TTGCGCCTGG
CTGGTGGAAG CGGAGCCCGA GATCTTCGCC GAAGTCAGCA AGGAACTGGC CAAGCTGCGC
GGCATCAAGA ACGGCGACCG GGTCAAGGTC TCCAGCCTGC GTGGCTCGCT GGAGGCGGTG
GCCATCGTCA CCGAGCGCAT CAAGCCCTAC AAGGTCATGG GGGCGGAAAT CCACATGGTG
GGCCTGCCCT GGCATTACGG CTGGATGGTG CCCAGAAACG GCGGCGACAC GGCCAACCTG
CTCACGCCGT CTGCGGGCGA CCCGAACACC GGCATCCCCG AGACCAAGGC GTTCATGGTC
GATATCCGCA AGGTGGGAGG TAAGTAG
 
Protein sequence
MTVNRRQFLK LSAGATLASA FGGLGISLAP SVARAELQKL QWAKQTTSVC CYCAVGCGLI 
VHTAKNGEGR AVNVEGDPDH PINEGSLCPK GASIFQLGEN NARSPKPLYR GPNSGEWKEV
EWDWALTEIA KRVKKTRDES FQLANAAGEK VNRTEAIASF GSAAMDNEEC WAYQVILRSL
GLVFIEHQAR IUHSPTVPAL AESFGRGAMT NHWNDLANSD CVLIMGSNAA ENHPISFKWV
LRAQDKGATL IHVDPRFTRT SAKCDIYAPI RSGADIPFLG GLIKYILENK LYFEEYVREY
TNASLIVGEK FSFKDGLFSG YDADKRKYDK SQWAFELDEN GVPRRDPSLK HPRCVFNLMK
KHYERYTVDK VADITGTPKD LILKVYKAYA ATGKPDKAGT IMYAMGWTQH SVGVQNIRAM
AMIQLLLGNI GVAGGGVNAL RGESNVQGST DQGLLAHIWP AYNPAPNSKQ TTLDAYNAAT
PQSKDPMSVN WWQNRPKYVA SYLKALYPDL APADAYDIMP RLDASKPATY YFWLNIFDKM
DKGDVKGCFA WGMNPACGGA NANKNRRALG KLDWLVNVNI FENETSSFWK GPGMKPEEIG
TEVFFLPCAV SIEKEGSVAN SGRWMQWRYR GPKPWGQTKP DGDIMLEMMH KIRDLYAKEG
GVHADPILKL NIKDWEEHNE FSPAKTAKLM NGYFLKDTEV GGKQFKAGQQ VPSFAFLTAD
GSTCSGNWLH AGSFTDAGNM MARRDTAQTP EQARIGLFPN WSFCWPVNRR IIYNRASVDK
TGKPWNPAKA VIEWKDGKWV GDVVDGGGDP GTKHPFIMQT HGFGALYGPG REEGPFPEHY
EPLECPVSKN PFSKQLHNPV AFKIEGEKAA VCDPKFPFIG TTYRVTEHWQ TGLMTRRCAW
LVEAEPEIFA EVSKELAKLR GIKNGDRVKV SSLRGSLEAV AIVTERIKPY KVMGAEIHMV
GLPWHYGWMV PRNGGDTANL LTPSAGDPNT GIPETKAFMV DIRKVGGK