Gene Information |
Locus tag | Dvul_0501 |
Symbol | |
ID | 4662715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 641004 |
End bp | 644042 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639818711 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_965951 |
Protein GI | 120601551 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
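The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, but both agree with the listed values. The variable names are illustrative only.

```python
# Coordinates copied from the Gene Information table.
start_bp = 641004
end_bp = 644042

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the "Gene Length" and "Protein Length" rows.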
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.662776 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA CGCGGCGGAG TTTTCTCAAG CTCGTCGGGG TGAGTGTCGT CGGACTCTCT CTCGGGCAGC TGGGGTTCGA TCTTGAAGAC GCGCAGGCTT ATGCCGTCAA ACTCAAGATC GAAGGCGCGA AGGAAGTGGG CACCGTGTGT CCCTTCTGTT CCGTCTGCTG TCAGGTCATC GCCTACGTGC GCAACGGCAA GCTCGTCTCC ACCGAGGGCG ACCCCGATTT CCCGGTCAAC GAAGGTGCGC TGTGTGCCAA GGGTGCCGCG CTCTTCTCCA TGTACACCAA CCCCCATCGC CTGACGAAGC CGCTCTACCG TGCGCCTCAC AGCGACAAGT GGGTGGAGAA GGACTGGGAC TGGACACTGA ACCAGATCGC GCGTCGCGTG AAGGACGCCC GCGACAAGGA CATGATCCTC AAGAACGACA AGGGGCAGAC GGTCAACCGT CTTGAGAGCA TCTTCATGAT GGGTACCTCG CACGCCTCCA ACGAGGAATG TGCCGTCATC CATCAAGCCA TGAGAGGCCT GGGCGTCGTC CATATGGACC ACCAGGCCCG GGTCTGACAC AGTCCCACTG TTGCGGCTCT GGCAGAGTCG TTCGGTCGAG GGGCTATGAC CAACCACTGG ATCGATATCA AGAATACCGA TGCAGTGCTC ATCATCGGCA GCAATGCCGC AGAACATCAT CCTGTCGCCT TCAAGTGGAT CATGCGGGCG CGCGACAACG GCGCCGTGCT CATGCATGTC GACCCCAAGT TCTCGCGCAC GTCGGCACGT TGCGACTTCC ATGTGCCCCT GCGTTCGGGC ACCGACATCG CCTTCCTCGG CGGTATGGTG AACCACATCA TCGCGAAGGA CCTCTACTTC AAGGACTACG TCGCCAACTA CACCAACGCC GCGTTCGTCG TGGGCAAGGA CTATGCCTTC GAGGACGGCA TCTTCAGCGG GTACGACCCC AAGACCCGCA CCTACGACCG CAGCAAATGG GAGTTCGAGA AGGGACCCGA CGGTGGCCCG GTCATGGACC CCACCCTCAA GAACGAGCGG TGCGTCTTCA ACCTGATGAA GAAGCACTAT GAGCGCTATA CGCTGAAGAA CGTCTCCGAC GTCACCGGCG TCTCCGAAGA GAACCTGCTG CGCGTCTATG ATGCCTTCTG TGCCACGGGC AGACCCGACA AGGCGGGCAC CATCCTGTAC GCCCTTGGCT GGACGCAGCA TACCGTGGGC GTGCAGAACA TCCGTACCTC GACCCTCATC CAGCTGCTTC TGGGCAACAT CGGTGTGGCG GGCGGCGGTA TCAACGCGCT GCGTGGTGAA CCCAACGTTC AGGGTTCCAC CGACCACGCG CTGCTGTACC ATATCCTGCC GGGCTACAAC GCCATGCCGG TGGCACAGTG GCAGACGCTG GCAGACTACA ACAAGGCCAA CACGCCCGTC ACCACGCTGA AGAACAGCGC CAACTGGTGG AGCAACAGGC CGAAGTACGT CGCCAGCCTG CTGAAGGGCT GGTTCGGTGA CGCGGCGACC CCCGAGAACG ACTTCTGCTA CGAGTATCTG CCCAAGCTCG AAAAGGGCGA GGACTACTCG TACATGTACG TCATGGACCG CATGTATCAT GGCAAGCTGA AAGGGGGCTT CATCTTCGGC GTGAACCCCA TGAACAGCTT CCCCAACACC AACAAGATGC GTGCGGCGCT CGACAAGCTC GACTGGCTGG TGTGCTCCGA GTTGCACAAC TCCGAGACCA CGGACAACTG GAAGCGTCCC GGCGTCGACC CCAAGGCGTG CAAGACCGAG 
GTGTTCCTGC TGCCCTCCGC CCACCGCGTC GAGAAGGCTG GCACCATCAG CAACAGCGGA CGCTGGCTGC AATGGTTCGA CAAGGCCGTG GAGCCGGGAC AGGCGCGCAA CTTCGCCGAC ATCTTCGTGC CGTTGGTGAA CAAGATTCGC GCCCTGTACA AGGCCGAGGG CGGTACGCTG CCCGACCCGG TGCTGAAGCT GCACTGGACC GACAAGTTCG ACCCCGAAGA ATGGACACGC CGCATCAACG GTTTCTTCTG GGCCGACACC AAGGTGGGCG ACAAGGAATA CAAGCGCGGC CAACTGGTGC CTGCGTTCGT CCATCTGAAG GATGACGGCA GCACCTCGTC GTTGAACTGG GTATACACCG GAAGCTACAC CGAAGAGGAC GGCAACAAGT CCAAACGCCG CGATGCCCGG CAGACGCCCA TGCAGGCGGC CATCGGCCTG TTCCCCAACT GGTCGTGGTG CTGGCCTGTG AACCGGCGCA TCCTCTACAA CCGTGCGTCG GTGGACGTGA ACGGCAAGCC GTGGAACCCG AAGAAGGCCG TCATCGAATG GGACGGTGCC AAGTGGGTGG GCGACGTGCC CGACGGCCCA TGGCCGCCCA TGGCGGACAA GGAGAAGGGT AAGCTGCCGT TCATCATGAA CAAGGACGGC TTCGCCCAGT TCTATGGCAC CGGCCGCATG GACGGGCCTT TCCCCGAGCA TTACGAACCT GCCGAGACGC CGCTCGACAG CCATCCGTTC TCGAAGCAGC TGTCCAGCCC CGTCTACAAG TTCCATACCT CCGATATGGA TCAGATAGCC AAGGCAGCCG ACCCCAAATA CCCCATCGTG CTGACCACCT ACAGCCTGAC CGAGCACTGG TGCGGCGGCG GTGAGACCCG CAACGTGCCC AACCTGCTCG AAACGGAGCC TCAGCTCTAT ATCGAGATGA GTCCCGAACT GGCTGAAGAG AAAGGTATCA AGAACGGTGA CGGCGTCATC GTCGAGAGCA TCCGTGGCAG GGCCGAGGCC ATCGCCATGG TCACGGTGCG CATCAGGCCG TTCACGGTCA TGGGCAAGAC CGTCCACCTC GTGGGGATGC CCTTCGCCTA TGGCTGGACG ACGCCCAAAT GCGGCGACTC GACCAACCGC CTGACGGTGG GTGCCTACGA TCCGAACACC ACCATTCCTG AAAGCAAGGC CTGCCTCGTC AACGTGCGCA AGGCCGACAA GCTGACCGAA ATAGCCTAA
|
Protein sequence | MKTTRRSFLK LVGVSVVGLS LGQLGFDLED AQAYAVKLKI EGAKEVGTVC PFCSVCCQVI AYVRNGKLVS TEGDPDFPVN EGALCAKGAA LFSMYTNPHR LTKPLYRAPH SDKWVEKDWD WTLNQIARRV KDARDKDMIL KNDKGQTVNR LESIFMMGTS HASNEECAVI HQAMRGLGVV HMDHQARVUH SPTVAALAES FGRGAMTNHW IDIKNTDAVL IIGSNAAEHH PVAFKWIMRA RDNGAVLMHV DPKFSRTSAR CDFHVPLRSG TDIAFLGGMV NHIIAKDLYF KDYVANYTNA AFVVGKDYAF EDGIFSGYDP KTRTYDRSKW EFEKGPDGGP VMDPTLKNER CVFNLMKKHY ERYTLKNVSD VTGVSEENLL RVYDAFCATG RPDKAGTILY ALGWTQHTVG VQNIRTSTLI QLLLGNIGVA GGGINALRGE PNVQGSTDHA LLYHILPGYN AMPVAQWQTL ADYNKANTPV TTLKNSANWW SNRPKYVASL LKGWFGDAAT PENDFCYEYL PKLEKGEDYS YMYVMDRMYH GKLKGGFIFG VNPMNSFPNT NKMRAALDKL DWLVCSELHN SETTDNWKRP GVDPKACKTE VFLLPSAHRV EKAGTISNSG RWLQWFDKAV EPGQARNFAD IFVPLVNKIR ALYKAEGGTL PDPVLKLHWT DKFDPEEWTR RINGFFWADT KVGDKEYKRG QLVPAFVHLK DDGSTSSLNW VYTGSYTEED GNKSKRRDAR QTPMQAAIGL FPNWSWCWPV NRRILYNRAS VDVNGKPWNP KKAVIEWDGA KWVGDVPDGP WPPMADKEKG KLPFIMNKDG FAQFYGTGRM DGPFPEHYEP AETPLDSHPF SKQLSSPVYK FHTSDMDQIA KAADPKYPIV LTTYSLTEHW CGGGETRNVP NLLETEPQLY IEMSPELAEE KGIKNGDGVI VESIRGRAEA IAMVTVRIRP FTVMGKTVHL VGMPFAYGWT TPKCGDSTNR LTVGAYDPNT TIPESKACLV NVRKADKLTE IA
|
| |
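The protein sequence contains a single U (selenocysteine) at residue 189, which lines up with an in-frame TGA codon at nucleotides 565-567 of the gene sequence. The sketch below illustrates that correspondence on a short fragment copied from the gene sequence. It is a minimal pure-Python illustration, not the annotation pipeline's method: the `translate` helper and its `sec_codons` parameter are hypothetical names introduced here, and the codon table is the standard set of amino-acid assignments, which translation table 11 shares with table 1.

```python
from itertools import product

# Standard amino-acid assignments (shared by translation tables 1 and 11),
# listed with T, C, A, G cycling fastest at the third codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, sec_codons=frozenset()):
    """Translate dna in frame 0.

    sec_codons holds 0-based codon indexes at which a TGA codon is read
    as selenocysteine ('U') instead of a stop ('*'). Hypothetical helper.
    """
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if codon == "TGA" and i // 3 in sec_codons:
            protein.append("U")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# Nucleotides 541-576 of the gene sequence above (codons 181-192).
fragment = "CATATGGACCACCAGGCCCGGGTCTGACACAGTCCC"

plain = translate(fragment)                    # TGA read as a stop
recoded = translate(fragment, sec_codons={8})  # TGA at codon index 8 read as U

print(plain)
print(recoded)
```

With the TGA recoded, the fragment translates to the HMDHQARVUHSP stretch of the listed protein sequence. Without recoding, the same codon would be read as a stop.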