Gene Dvul_0761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_0761 
Symbol 
ID4664364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp930646 
End bp933657 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content63% 
IMG OID639818979 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_966211 
Protein GI120601811 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.109779 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.059567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAATGC CTCGCAGAAC GTTCATCAAG CTCGCTTCGG CTTCCGCAGG CGCTCTCGCC 
TTCGCCGGGC TGGGGCAGAG CCTCGCGCCC ACGGTGGCGC GGGCTGCGGA ACTCAAGATC
GCCAAGGCGA AGGTCACACC CTCTGTCTGC TGCTTCTGTG CGGTGGGATG CGGGTTGCTG
GTCTACACCG ACACCCAGAC CGGACGCTCC ATCAACATCG AGGGCGACCC CGACCATCCC
ACCAACGAAG GGACGCTGTG CCCCAAGGGT GCGTCCATCT GGCAGACCAC CGAACGCAGC
AAGCGTGTCA CCAAGGTGCT GTACCGTGCA CCCGGTGCCG CGGCGTGGGA GGAGAAGTCG
TGGGACTGGG CCCTGCCGCG CATCGCCCGC AAGATCAAGG AGACGCGCGA CGCCACCTTC
GAACTCACCA ATGACAAGGG GCAGACCGTC AACCGCACCC GGGGCATAGC CTCGGTGGGC
TCGGCCGCCG TCGACAACGA GGAGGGCTGG CTCATGCAAG CCATGATGCG CGCGCTCGGC
CTCGTGTACA TCGAGAGCCA CGCCCGTATC TGACACAGCT CGACTGTGGG GGCTCTGGCA
GAGTCCTACG GACGCGGTGC GATGACGAAT CACTGGATCG ACCTCAAGAA CAGCGACGTC
ATTCTCATCA TGGGGAGTAA CGCCGCCGAG AACCATCCCA TATCCTTCAA GTGGGTGACG
CGTGCGCAGG AACGCGGCGC AACGCTCATC CATGTGGACC CCCGCTTCAC GCGCACCTCC
GCCAAGGCAG ACATCCACGC CCACATCAGG TCGGGTACGG ACATTGCCTT CTTCGGCGGC
CTCATCCGAT ATATCCTCGA AAACGAGCTG TTCTTCCGGC AGTACGTGGT CGACTACACC
AACGCCTCGT ACATCGTCGG CCCGGACTTC GGTTTCGCCG ATGGTCTGTT CACCGGTTTC
GACCCGGAGA AGGGAACCTA CAACACCAAG AAATGGGCCT TCGCCGCCGA CGAGAACGGC
ATGACCCTGA AAGACCCCAC GCTGAACGAC CCGCGTTGCG TCTTTCAACT CATGAAGGCC
CATTACGCTC GCTATGACAG GAAGACCGTC TCAGACGTGA CGGGCATCTC CGAAGAGCAG
CTGGCGACGC TGTGGAGTAC CTTCGCCTCG ACGGGCAAGC CCGACAGGGC GGGAACCATC
CTCTATGCCA TGGGGCAGTG CCAGCACACC GTGGGCGTGC AGAACATCCG TGCGCTTTCG
ATGATACAGC TGCTGCTTGG CAACATCGGC ATCGCCGGGG GCGGGGTCAA CGCGCTGCGC
GGCGAATCCA ACGTGCAGGG CACCACCGAC ATCTCGCTGC TGTGCGACAA CCTCTCCGGC
TACCTGCCCA CCCCCAAGGC TTCGTGGGCG ACCTTCGACG ACTATGTGAA GGGCACGACG
CCGGTGGACA AGGACCCGAA GAGCGCCAAC TGGTGGTCGA ACCGCGGCAA GTATCTCGCC
TCGTACATGA AGTCGGTGTA TCCCACGGCC AGCCATCAGG ACGGCTACCT CTGGCACCCC
AAGGTCGATG ACGGGAAGAT AACCGACTAC TCGTGGTTGC AGATATTCGA GCGCATGAGC
AAGGGCGGCT TCAAGGGTGC CTTCGTGTGG GGGCAGAACC CCTGTGCGGG CGGGGCCAAC
GCGGGCAAGA ACCGCAAGGC CATGGAGACG CTGGACTGGA TGGTGGTGGT CAACCTCTTC
GAGAACGAAA GTTCGCTCTT CTGGAAGGGG CCGGGGGTAG ACCCCGCCAA GGTCAAGACC
GAGGTGTTCT TCCTGCCTGC GTGCATGAGC GTCGAGAAGG GCGGTTCCAT CGCCAATTCC
GGTCGCTGGC TGCAATGGCG TGAACCGGGG CCGAAACCCA TGGGCGACAG CCGTTCGGAC
GGCGACATCG TCCTCGACCT CTATGACGAG ATACGCAAAC TCTACCGCGA GGAGAAGGGA
GCTTTCCCCG AACCCGTGCT GGCGCTGGAC ACCGACTACC GTACCGACGG CAAGTACGAT
CACCACAAGG TGGCGAAGAC GCTGAACGGC AAGTTCCTCG CTGATGTGAC CATAGGCGAC
AAGACCTACA AGGCGGGGCA GCAGGTACCC GGCTTCGCCA TGTTGCAGGC CGACGGTTCG
ACGACGTCCG GGTGTTGGAT ATTCACCGGG TGCTATACCG ACGCCGGTAA CATGATGGCG
CGCCGAGACC GTACGCAGAC CCCGGAACAG GCCGCCATAG GGCTGTTCCC CAACTGGTCG
TATGCATGGC CCGCCAACCG TCGCATCCTG TACAACCGTG CGGCGGTGGA CATGACCGGC
AAGCCCTTCG ACCCCAAGCG TGCCGTCATC GCATGGAACG GCGAGAAGTG GGTGGGCGAC
GTGCCTGACG GCGGCTGGAA GCCCGGAGAG AAGTTGCCCT TCATCATGAT ACGCGAGGGG
CGCGGTCAGT TGTTCGGCCC CGGCAGGGTC GACGGGCCTT TCCCGGAGCA CTACGAACCG
TTCGAGAGTC CTCTCGAAAG CCATCCCTTC TCGAAGCAGC GGGTCAACCC CACTGCGCTG
GCGTTCAGCC ACGAACCCAA GGCGGTGCGC GACAAGCGCT ACCCCTTCAT CTGCACCACC
TACCGCGTCA CCGAACAGTG GCAGTCGGGC ACGATGACCC GCAACACCGG GTGGCTCAAG
GAGATGCAGC CGGAGGGCTT CTGCGAGATA AGCCGCGAAC TGGCCAAGGA ACTCGGCATC
GCCAACGGCG ACGCCGTGGT GCTCGAATCG CTGCGGGGCA AGGTGCAGGT GGTCGCCATC
GTCACGCCAC GTCTCAAGCC CTTCAAGGTC ATGGGCGAGG TCATGCACGA GGTGGGCATA
CCGTGGCAGT TCGGCTGGGG GCAGCATGTG GGCAAGGGCG ACTCTGCCAA CCTGCTCTCG
CCTTCGGTGG GCGACCCCAA CACTGGCATT CCCGAGACCA AGGTCTTCAT GGTCAACCTG
CGCAAGGCCT AG
 
Protein sequence
MRMPRRTFIK LASASAGALA FAGLGQSLAP TVARAAELKI AKAKVTPSVC CFCAVGCGLL 
VYTDTQTGRS INIEGDPDHP TNEGTLCPKG ASIWQTTERS KRVTKVLYRA PGAAAWEEKS
WDWALPRIAR KIKETRDATF ELTNDKGQTV NRTRGIASVG SAAVDNEEGW LMQAMMRALG
LVYIESHARI UHSSTVGALA ESYGRGAMTN HWIDLKNSDV ILIMGSNAAE NHPISFKWVT
RAQERGATLI HVDPRFTRTS AKADIHAHIR SGTDIAFFGG LIRYILENEL FFRQYVVDYT
NASYIVGPDF GFADGLFTGF DPEKGTYNTK KWAFAADENG MTLKDPTLND PRCVFQLMKA
HYARYDRKTV SDVTGISEEQ LATLWSTFAS TGKPDRAGTI LYAMGQCQHT VGVQNIRALS
MIQLLLGNIG IAGGGVNALR GESNVQGTTD ISLLCDNLSG YLPTPKASWA TFDDYVKGTT
PVDKDPKSAN WWSNRGKYLA SYMKSVYPTA SHQDGYLWHP KVDDGKITDY SWLQIFERMS
KGGFKGAFVW GQNPCAGGAN AGKNRKAMET LDWMVVVNLF ENESSLFWKG PGVDPAKVKT
EVFFLPACMS VEKGGSIANS GRWLQWREPG PKPMGDSRSD GDIVLDLYDE IRKLYREEKG
AFPEPVLALD TDYRTDGKYD HHKVAKTLNG KFLADVTIGD KTYKAGQQVP GFAMLQADGS
TTSGCWIFTG CYTDAGNMMA RRDRTQTPEQ AAIGLFPNWS YAWPANRRIL YNRAAVDMTG
KPFDPKRAVI AWNGEKWVGD VPDGGWKPGE KLPFIMIREG RGQLFGPGRV DGPFPEHYEP
FESPLESHPF SKQRVNPTAL AFSHEPKAVR DKRYPFICTT YRVTEQWQSG TMTRNTGWLK
EMQPEGFCEI SRELAKELGI ANGDAVVLES LRGKVQVVAI VTPRLKPFKV MGEVMHEVGI
PWQFGWGQHV GKGDSANLLS PSVGDPNTGI PETKVFMVNL RKA