Gene Dvul_0501 details

Gene Information

Locus tag: Dvul_0501
Symbol:
ID: 4662715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfovibrio vulgaris DP4
Kingdom: Bacteria
Replicon accession: NC_008751
Strand:
Start bp: 641004
End bp: 644042
Gene Length: 3039 bp
Protein Length: 1012 aa
Translation table: 11
GC content: 61%
IMG OID: 639818711
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_965951
Protein GI: 120601551
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
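The start, end, gene length, and protein length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS length counts the terminal stop codon; the record does not state either convention, but the listed values are consistent with both. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 641004
END_BP = 644042


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3039
print(protein_length(length_bp))  # 1012
```

Both results match the Gene Length and Protein Length fields of the record.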


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.662776
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAA CGCGGCGGAG TTTTCTCAAG CTCGTCGGGG TGAGTGTCGT CGGACTCTCT 
CTCGGGCAGC TGGGGTTCGA TCTTGAAGAC GCGCAGGCTT ATGCCGTCAA ACTCAAGATC
GAAGGCGCGA AGGAAGTGGG CACCGTGTGT CCCTTCTGTT CCGTCTGCTG TCAGGTCATC
GCCTACGTGC GCAACGGCAA GCTCGTCTCC ACCGAGGGCG ACCCCGATTT CCCGGTCAAC
GAAGGTGCGC TGTGTGCCAA GGGTGCCGCG CTCTTCTCCA TGTACACCAA CCCCCATCGC
CTGACGAAGC CGCTCTACCG TGCGCCTCAC AGCGACAAGT GGGTGGAGAA GGACTGGGAC
TGGACACTGA ACCAGATCGC GCGTCGCGTG AAGGACGCCC GCGACAAGGA CATGATCCTC
AAGAACGACA AGGGGCAGAC GGTCAACCGT CTTGAGAGCA TCTTCATGAT GGGTACCTCG
CACGCCTCCA ACGAGGAATG TGCCGTCATC CATCAAGCCA TGAGAGGCCT GGGCGTCGTC
CATATGGACC ACCAGGCCCG GGTCTGACAC AGTCCCACTG TTGCGGCTCT GGCAGAGTCG
TTCGGTCGAG GGGCTATGAC CAACCACTGG ATCGATATCA AGAATACCGA TGCAGTGCTC
ATCATCGGCA GCAATGCCGC AGAACATCAT CCTGTCGCCT TCAAGTGGAT CATGCGGGCG
CGCGACAACG GCGCCGTGCT CATGCATGTC GACCCCAAGT TCTCGCGCAC GTCGGCACGT
TGCGACTTCC ATGTGCCCCT GCGTTCGGGC ACCGACATCG CCTTCCTCGG CGGTATGGTG
AACCACATCA TCGCGAAGGA CCTCTACTTC AAGGACTACG TCGCCAACTA CACCAACGCC
GCGTTCGTCG TGGGCAAGGA CTATGCCTTC GAGGACGGCA TCTTCAGCGG GTACGACCCC
AAGACCCGCA CCTACGACCG CAGCAAATGG GAGTTCGAGA AGGGACCCGA CGGTGGCCCG
GTCATGGACC CCACCCTCAA GAACGAGCGG TGCGTCTTCA ACCTGATGAA GAAGCACTAT
GAGCGCTATA CGCTGAAGAA CGTCTCCGAC GTCACCGGCG TCTCCGAAGA GAACCTGCTG
CGCGTCTATG ATGCCTTCTG TGCCACGGGC AGACCCGACA AGGCGGGCAC CATCCTGTAC
GCCCTTGGCT GGACGCAGCA TACCGTGGGC GTGCAGAACA TCCGTACCTC GACCCTCATC
CAGCTGCTTC TGGGCAACAT CGGTGTGGCG GGCGGCGGTA TCAACGCGCT GCGTGGTGAA
CCCAACGTTC AGGGTTCCAC CGACCACGCG CTGCTGTACC ATATCCTGCC GGGCTACAAC
GCCATGCCGG TGGCACAGTG GCAGACGCTG GCAGACTACA ACAAGGCCAA CACGCCCGTC
ACCACGCTGA AGAACAGCGC CAACTGGTGG AGCAACAGGC CGAAGTACGT CGCCAGCCTG
CTGAAGGGCT GGTTCGGTGA CGCGGCGACC CCCGAGAACG ACTTCTGCTA CGAGTATCTG
CCCAAGCTCG AAAAGGGCGA GGACTACTCG TACATGTACG TCATGGACCG CATGTATCAT
GGCAAGCTGA AAGGGGGCTT CATCTTCGGC GTGAACCCCA TGAACAGCTT CCCCAACACC
AACAAGATGC GTGCGGCGCT CGACAAGCTC GACTGGCTGG TGTGCTCCGA GTTGCACAAC
TCCGAGACCA CGGACAACTG GAAGCGTCCC GGCGTCGACC CCAAGGCGTG CAAGACCGAG
GTGTTCCTGC TGCCCTCCGC CCACCGCGTC GAGAAGGCTG GCACCATCAG CAACAGCGGA
CGCTGGCTGC AATGGTTCGA CAAGGCCGTG GAGCCGGGAC AGGCGCGCAA CTTCGCCGAC
ATCTTCGTGC CGTTGGTGAA CAAGATTCGC GCCCTGTACA AGGCCGAGGG CGGTACGCTG
CCCGACCCGG TGCTGAAGCT GCACTGGACC GACAAGTTCG ACCCCGAAGA ATGGACACGC
CGCATCAACG GTTTCTTCTG GGCCGACACC AAGGTGGGCG ACAAGGAATA CAAGCGCGGC
CAACTGGTGC CTGCGTTCGT CCATCTGAAG GATGACGGCA GCACCTCGTC GTTGAACTGG
GTATACACCG GAAGCTACAC CGAAGAGGAC GGCAACAAGT CCAAACGCCG CGATGCCCGG
CAGACGCCCA TGCAGGCGGC CATCGGCCTG TTCCCCAACT GGTCGTGGTG CTGGCCTGTG
AACCGGCGCA TCCTCTACAA CCGTGCGTCG GTGGACGTGA ACGGCAAGCC GTGGAACCCG
AAGAAGGCCG TCATCGAATG GGACGGTGCC AAGTGGGTGG GCGACGTGCC CGACGGCCCA
TGGCCGCCCA TGGCGGACAA GGAGAAGGGT AAGCTGCCGT TCATCATGAA CAAGGACGGC
TTCGCCCAGT TCTATGGCAC CGGCCGCATG GACGGGCCTT TCCCCGAGCA TTACGAACCT
GCCGAGACGC CGCTCGACAG CCATCCGTTC TCGAAGCAGC TGTCCAGCCC CGTCTACAAG
TTCCATACCT CCGATATGGA TCAGATAGCC AAGGCAGCCG ACCCCAAATA CCCCATCGTG
CTGACCACCT ACAGCCTGAC CGAGCACTGG TGCGGCGGCG GTGAGACCCG CAACGTGCCC
AACCTGCTCG AAACGGAGCC TCAGCTCTAT ATCGAGATGA GTCCCGAACT GGCTGAAGAG
AAAGGTATCA AGAACGGTGA CGGCGTCATC GTCGAGAGCA TCCGTGGCAG GGCCGAGGCC
ATCGCCATGG TCACGGTGCG CATCAGGCCG TTCACGGTCA TGGGCAAGAC CGTCCACCTC
GTGGGGATGC CCTTCGCCTA TGGCTGGACG ACGCCCAAAT GCGGCGACTC GACCAACCGC
CTGACGGTGG GTGCCTACGA TCCGAACACC ACCATTCCTG AAAGCAAGGC CTGCCTCGTC
AACGTGCGCA AGGCCGACAA GCTGACCGAA ATAGCCTAA
 
Protein sequence
MKTTRRSFLK LVGVSVVGLS LGQLGFDLED AQAYAVKLKI EGAKEVGTVC PFCSVCCQVI 
AYVRNGKLVS TEGDPDFPVN EGALCAKGAA LFSMYTNPHR LTKPLYRAPH SDKWVEKDWD
WTLNQIARRV KDARDKDMIL KNDKGQTVNR LESIFMMGTS HASNEECAVI HQAMRGLGVV
HMDHQARVUH SPTVAALAES FGRGAMTNHW IDIKNTDAVL IIGSNAAEHH PVAFKWIMRA
RDNGAVLMHV DPKFSRTSAR CDFHVPLRSG TDIAFLGGMV NHIIAKDLYF KDYVANYTNA
AFVVGKDYAF EDGIFSGYDP KTRTYDRSKW EFEKGPDGGP VMDPTLKNER CVFNLMKKHY
ERYTLKNVSD VTGVSEENLL RVYDAFCATG RPDKAGTILY ALGWTQHTVG VQNIRTSTLI
QLLLGNIGVA GGGINALRGE PNVQGSTDHA LLYHILPGYN AMPVAQWQTL ADYNKANTPV
TTLKNSANWW SNRPKYVASL LKGWFGDAAT PENDFCYEYL PKLEKGEDYS YMYVMDRMYH
GKLKGGFIFG VNPMNSFPNT NKMRAALDKL DWLVCSELHN SETTDNWKRP GVDPKACKTE
VFLLPSAHRV EKAGTISNSG RWLQWFDKAV EPGQARNFAD IFVPLVNKIR ALYKAEGGTL
PDPVLKLHWT DKFDPEEWTR RINGFFWADT KVGDKEYKRG QLVPAFVHLK DDGSTSSLNW
VYTGSYTEED GNKSKRRDAR QTPMQAAIGL FPNWSWCWPV NRRILYNRAS VDVNGKPWNP
KKAVIEWDGA KWVGDVPDGP WPPMADKEKG KLPFIMNKDG FAQFYGTGRM DGPFPEHYEP
AETPLDSHPF SKQLSSPVYK FHTSDMDQIA KAADPKYPIV LTTYSLTEHW CGGGETRNVP
NLLETEPQLY IEMSPELAEE KGIKNGDGVI VESIRGRAEA IAMVTVRIRP FTVMGKTVHL
VGMPFAYGWT TPKCGDSTNR LTVGAYDPNT TIPESKACLV NVRKADKLTE IA
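The protein sequence contains a single U (selenocysteine) at the position where the gene sequence has an in-frame TGA codon, which fits the COG annotation "typically selenocysteine-containing". The sketch below shows one way to reproduce that correspondence. It is not the database's own translation procedure: the `translate` function and its `recode_tga` flag are hypothetical names introduced here, and the codon table is the standard one, whose amino acid assignments translation table 11 shares.

```python
# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str, recode_tga: bool = False) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon.

    Whitespace (as in the 10-base blocks above) is ignored.
    If recode_tga is True, every TGA codon is written as 'U'
    (selenocysteine) instead of '*'. This flag is an assumption
    made for illustration, not a feature of the record.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if recode_tga and codon == "TGA":
            protein.append("U")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAACAA CGCGGCGGAG TTTTCTCAAG"))  # MKTTRRSFLK

# In-frame fragment around the TGA codon (codons 187 to 191).
print(translate("CGGGTCTGACACAGT"))                   # RV*HS
print(translate("CGGGTCTGACACAGT", recode_tga=True))  # RVUHS
```

The recoded fragment matches the RVUHS stretch of the protein sequence, while the gene's final codon, TAA, is an ordinary stop and is unaffected by the flag.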