Gene Dvul_2365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_2365 
Symbol 
ID4662145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2758185 
End bp2761202 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table11 
GC content63% 
IMG OID639820613 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_967808 
Protein GI120603408 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.1698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTCA CACGCAGACA TTTCCTCAAA TTGAGCGCCG GGGCCGCCGT GGCAGGTGCT 
TTCACGGGGC TCGGACTCAG CCTCGCGCCC ACGGTGGCGC GGGCCGAGTT GCAGAAACTC
CAGTGGGCGA AACAGACCAC ATCCATATGC TGCTACTGTG CGGTGGGTTG CGGTCTCATC
GTCCATACCG CCAAGGACGG ACAGGGCCGC GCCGTGAACG TCGAGGGCGA CCCCGACCAC
CCCATCAACG AAGGTTCGCT CTGTCCCAAG GGCGCATCCA TCTTCCAGCT GGGCGAGAAC
GACCAGCGCG GCACGCAGCC GCTCTACCGC GCCCCCTTCA GTGATACATG GAAGCCGGTG
ACCTGGGACT TCGCCCTCAC CGAGATCGCC AAGCGCATCA AGAAGACCCG TGACGCCTCG
TTCACCGAGA AGAACGCGGC TGGAGACTTG GTCAACCGCA CCGAGGCCAT CGCCTCGTTC
GGTTCGGCCG CCATGGACAA CGAGGAGTGC TGGGCCTACG GGAACATCCT CCGCAGCCTC
GGCCTGGTGT ACATCGAGCA CCAGGCGCGT ATCTGACACA GCCCCACTGT ACCGGCTCTG
GCAGAGTCGT TCGGTCGCGG TGCAATGACG AATCACTGGA ACGATCTCGC GAACAGTGAT
TGTATTCTCA TCATGGGCAG CAATGCTGCC GAAAACCACC CCATCGCCTT CAAGTGGGTG
CTGCGCGCCA AGGACAAGGG CGCCACGCTC ATCCACGTGG ACCCGCGCTT CACGCGCACC
TCGGCACGTT GCGATGTCTA CGCGCCCATC CGTAGCGGCG CGGACATCCC GTTCCTCGGC
GGTCTCATCA AGTACATTCT CGACAACAAG CTCTATTTCA CGGACTACGT GCGCGAGTAC
ACCAACGCCT CGCTCATCGT GGGCGAGAAG TTCTCGTTCA AGGACGGGCT CTTCTCCGGC
TACGACGCGG CGAACAAGAA GTACGACAAG AGCATGTGGG CCTTCGAACT CGATGCCAAC
GGCGTGCCCA AGCGCGACCC GGCACTCAAG CACCCGCGCT GCGTCATCAA CCTGCTGAAG
AAGCACTACG AGCGGTACAA CCTCGACAAG GTCGCCGCCA TCACCGGCAC GTCGAAGGAA
CAGCTGCAGC AGGTCTACAA GGCCTATGCC GCCACCGGCA AGCCCGACAA GGCGGGCACC
ATCATGTACG CCATGGGCTG GACGCAGCAC TCCGTCGGTG TGCAGAACAT CCGCGCCATG
GCCATGATAC AGCTGCTGCT GGGCAACATC GGCGTGGCAG GGGGCGGCGT CAACGCGCTG
CGCGGCGAGT CCAACGTGCA GGGTTCCACC GACCAGGGCC TGCTGGCCCA CATATGGCCC
GGTTACAACC CCGTGCCCAA CAGCAAGGCC GCCACGCTTG AGCTGTACAA TGCCGCCACG
CCCCAGTCCA AGGACCCCAT GAGCGTCAAC TGGTGGCAGA ACAGGCCCAA GTATGTGGCC
AGCTACCTCA AGGCGCTGTA CCCGGACGAA GAACCCGCGG CGGCCTACGA CTACCTGCCG
CGCATCGACG CCGGCAGGAA GCTCACCGAC TACTTCTGGC TGAACATCTT CGAGAAGATG
GACAAGGGCG AGTTCAAGGG CCTTTTCGCG TGGGGCATGA ACCCCGCATG CGGCGGCGCC
AACGCCAACA AGAACCGCAA GGCCATGGGC AAACTCGAAT GGCTGGTCAA CGTGAACCTC
TTCGAGAACG AGACCAGTTC GTTCTGGAAG GGGCCGGGCA TGAACCCCGC CGAGATAGGC
ACCGAGGTCT TCTTCCTGCC GTGCTGCGTC TCCATCGAGA AGGAAGGTTC GGTGGCGAAC
TCGGGCCGCT GGATGCAGTG GCGCTATCGC GGGCCCAAGC CCTACGCCGA GACCAGGCCC
GACGGCGACA TCATGCTCGA CATGTTCAAG AAGGTGCGTG AGCTCTACGC CAAGGAAGGG
GGAGCCTACC CCGCACCGAT CGCGAAGCTG AACATTGCCG ACTGGGAAGA GCACAACGAG
TTCTCGCCCA CCAAGGTGGC GAAACTCATG AACGGCTACT TCCTGAAGGA TACCGAAGTG
GGCGGCAAGC AGTTCAAGAA GGGCCAGCAG GTGCCCAGCT TCGCCTTCCT CACCGCCGAC
GGTTCGACCT GTTCGGGCAA CTGGCTGCAT GCCGGTTCGT TCACCGACGC GGGCAACCTG
ATGGCCCGCC GTGACAAGAC CCAGACGCCG GAACAGGCGC GCATCGGCCT GTTCCCCAAC
TGGTCGTTCT GCTGGCCCGT CAACCGTCGC ATCCTCTACA ACCGTGCCTC CGTGGACAAG
ACCGGCAAGC CGTGGAATCC GGCCAAGGCC GTCATCGAAT GGAAGGACGG CAAGTGGGTG
GGCGACGTGG TGGACGGTGG CGGCGACCCC GGCACCAAGC ATCCCTTCAT CATGCAGACG
CATGGCTTCG GCGCACTGTA CGGCCCCGGT CGTGAAGAGG GTCCCTTCCC CGAGCATTAC
GAACCCCTCG AGTGCCCGGT GTCCAAGAAC CCCTTCTCGA AGCAGCTGCA CAACCCCGTG
GCGTTCCAGA TCGAAGGCGA GAAGAAGGCG GTGTGCGATC CGCGCTACCC CTTCATCGGC
ACGACCTATC GCGTCACGGA GCACTGGCAG ACCGGCCTCA TGACCCGCCG TTGCGCGTGG
CTCGTCGAAG CCGAACCCCA GATCTTCTGC GAGATCAGCA AGGAACTGGC GAAGCTGCGC
GGCATAGGCA ACGGCGACAC CGTCAAGGTG TCGAGCCTGC GCGGTGCGCT TGAAGCCGTC
GCCATCGTCA CGGAGCGCAT CAGACCCTTC AAGATCGAAG GTGTCGATGT CCACATGGTG
GGCCTGCCGT GGCATTACGG CTGGATGGTG CCGAAGAACG GCGGCGACAC GGCCAACCTG
CTGACCCCGT CTGCGGGCGA CCCGAACACC GGCATTCCGG AAACCAAGGC GTTCATGGTG
GACGTCCGCA AGGTCTAG
 
Protein sequence
MTVTRRHFLK LSAGAAVAGA FTGLGLSLAP TVARAELQKL QWAKQTTSIC CYCAVGCGLI 
VHTAKDGQGR AVNVEGDPDH PINEGSLCPK GASIFQLGEN DQRGTQPLYR APFSDTWKPV
TWDFALTEIA KRIKKTRDAS FTEKNAAGDL VNRTEAIASF GSAAMDNEEC WAYGNILRSL
GLVYIEHQAR IUHSPTVPAL AESFGRGAMT NHWNDLANSD CILIMGSNAA ENHPIAFKWV
LRAKDKGATL IHVDPRFTRT SARCDVYAPI RSGADIPFLG GLIKYILDNK LYFTDYVREY
TNASLIVGEK FSFKDGLFSG YDAANKKYDK SMWAFELDAN GVPKRDPALK HPRCVINLLK
KHYERYNLDK VAAITGTSKE QLQQVYKAYA ATGKPDKAGT IMYAMGWTQH SVGVQNIRAM
AMIQLLLGNI GVAGGGVNAL RGESNVQGST DQGLLAHIWP GYNPVPNSKA ATLELYNAAT
PQSKDPMSVN WWQNRPKYVA SYLKALYPDE EPAAAYDYLP RIDAGRKLTD YFWLNIFEKM
DKGEFKGLFA WGMNPACGGA NANKNRKAMG KLEWLVNVNL FENETSSFWK GPGMNPAEIG
TEVFFLPCCV SIEKEGSVAN SGRWMQWRYR GPKPYAETRP DGDIMLDMFK KVRELYAKEG
GAYPAPIAKL NIADWEEHNE FSPTKVAKLM NGYFLKDTEV GGKQFKKGQQ VPSFAFLTAD
GSTCSGNWLH AGSFTDAGNL MARRDKTQTP EQARIGLFPN WSFCWPVNRR ILYNRASVDK
TGKPWNPAKA VIEWKDGKWV GDVVDGGGDP GTKHPFIMQT HGFGALYGPG REEGPFPEHY
EPLECPVSKN PFSKQLHNPV AFQIEGEKKA VCDPRYPFIG TTYRVTEHWQ TGLMTRRCAW
LVEAEPQIFC EISKELAKLR GIGNGDTVKV SSLRGALEAV AIVTERIRPF KIEGVDVHMV
GLPWHYGWMV PKNGGDTANL LTPSAGDPNT GIPETKAFMV DVRKV