Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_2365 |
Symbol | |
ID | 4662145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | - |
Start bp | 2758185 |
End bp | 2761202 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639820613 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_967808 |
Protein GI | 120603408 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.1698 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGTCA CACGCAGACA TTTCCTCAAA TTGAGCGCCG GGGCCGCCGT GGCAGGTGCT TTCACGGGGC TCGGACTCAG CCTCGCGCCC ACGGTGGCGC GGGCCGAGTT GCAGAAACTC CAGTGGGCGA AACAGACCAC ATCCATATGC TGCTACTGTG CGGTGGGTTG CGGTCTCATC GTCCATACCG CCAAGGACGG ACAGGGCCGC GCCGTGAACG TCGAGGGCGA CCCCGACCAC CCCATCAACG AAGGTTCGCT CTGTCCCAAG GGCGCATCCA TCTTCCAGCT GGGCGAGAAC GACCAGCGCG GCACGCAGCC GCTCTACCGC GCCCCCTTCA GTGATACATG GAAGCCGGTG ACCTGGGACT TCGCCCTCAC CGAGATCGCC AAGCGCATCA AGAAGACCCG TGACGCCTCG TTCACCGAGA AGAACGCGGC TGGAGACTTG GTCAACCGCA CCGAGGCCAT CGCCTCGTTC GGTTCGGCCG CCATGGACAA CGAGGAGTGC TGGGCCTACG GGAACATCCT CCGCAGCCTC GGCCTGGTGT ACATCGAGCA CCAGGCGCGT ATCTGACACA GCCCCACTGT ACCGGCTCTG GCAGAGTCGT TCGGTCGCGG TGCAATGACG AATCACTGGA ACGATCTCGC GAACAGTGAT TGTATTCTCA TCATGGGCAG CAATGCTGCC GAAAACCACC CCATCGCCTT CAAGTGGGTG CTGCGCGCCA AGGACAAGGG CGCCACGCTC ATCCACGTGG ACCCGCGCTT CACGCGCACC TCGGCACGTT GCGATGTCTA CGCGCCCATC CGTAGCGGCG CGGACATCCC GTTCCTCGGC GGTCTCATCA AGTACATTCT CGACAACAAG CTCTATTTCA CGGACTACGT GCGCGAGTAC ACCAACGCCT CGCTCATCGT GGGCGAGAAG TTCTCGTTCA AGGACGGGCT CTTCTCCGGC TACGACGCGG CGAACAAGAA GTACGACAAG AGCATGTGGG CCTTCGAACT CGATGCCAAC GGCGTGCCCA AGCGCGACCC GGCACTCAAG CACCCGCGCT GCGTCATCAA CCTGCTGAAG AAGCACTACG AGCGGTACAA CCTCGACAAG GTCGCCGCCA TCACCGGCAC GTCGAAGGAA CAGCTGCAGC AGGTCTACAA GGCCTATGCC GCCACCGGCA AGCCCGACAA GGCGGGCACC ATCATGTACG CCATGGGCTG GACGCAGCAC TCCGTCGGTG TGCAGAACAT CCGCGCCATG GCCATGATAC AGCTGCTGCT GGGCAACATC GGCGTGGCAG GGGGCGGCGT CAACGCGCTG CGCGGCGAGT CCAACGTGCA GGGTTCCACC GACCAGGGCC TGCTGGCCCA CATATGGCCC GGTTACAACC CCGTGCCCAA CAGCAAGGCC GCCACGCTTG AGCTGTACAA TGCCGCCACG CCCCAGTCCA AGGACCCCAT GAGCGTCAAC TGGTGGCAGA ACAGGCCCAA GTATGTGGCC AGCTACCTCA AGGCGCTGTA CCCGGACGAA GAACCCGCGG CGGCCTACGA CTACCTGCCG CGCATCGACG CCGGCAGGAA GCTCACCGAC TACTTCTGGC TGAACATCTT CGAGAAGATG GACAAGGGCG AGTTCAAGGG CCTTTTCGCG TGGGGCATGA ACCCCGCATG CGGCGGCGCC AACGCCAACA AGAACCGCAA GGCCATGGGC AAACTCGAAT GGCTGGTCAA CGTGAACCTC TTCGAGAACG AGACCAGTTC GTTCTGGAAG GGGCCGGGCA TGAACCCCGC CGAGATAGGC ACCGAGGTCT TCTTCCTGCC GTGCTGCGTC TCCATCGAGA AGGAAGGTTC GGTGGCGAAC TCGGGCCGCT GGATGCAGTG GCGCTATCGC GGGCCCAAGC CCTACGCCGA GACCAGGCCC GACGGCGACA TCATGCTCGA CATGTTCAAG AAGGTGCGTG AGCTCTACGC CAAGGAAGGG GGAGCCTACC CCGCACCGAT CGCGAAGCTG AACATTGCCG ACTGGGAAGA GCACAACGAG TTCTCGCCCA CCAAGGTGGC GAAACTCATG AACGGCTACT TCCTGAAGGA TACCGAAGTG GGCGGCAAGC AGTTCAAGAA GGGCCAGCAG GTGCCCAGCT TCGCCTTCCT CACCGCCGAC GGTTCGACCT GTTCGGGCAA CTGGCTGCAT GCCGGTTCGT TCACCGACGC GGGCAACCTG ATGGCCCGCC GTGACAAGAC CCAGACGCCG GAACAGGCGC GCATCGGCCT GTTCCCCAAC TGGTCGTTCT GCTGGCCCGT CAACCGTCGC ATCCTCTACA ACCGTGCCTC CGTGGACAAG ACCGGCAAGC CGTGGAATCC GGCCAAGGCC GTCATCGAAT GGAAGGACGG CAAGTGGGTG GGCGACGTGG TGGACGGTGG CGGCGACCCC GGCACCAAGC ATCCCTTCAT CATGCAGACG CATGGCTTCG GCGCACTGTA CGGCCCCGGT CGTGAAGAGG GTCCCTTCCC CGAGCATTAC GAACCCCTCG AGTGCCCGGT GTCCAAGAAC CCCTTCTCGA AGCAGCTGCA CAACCCCGTG GCGTTCCAGA TCGAAGGCGA GAAGAAGGCG GTGTGCGATC CGCGCTACCC CTTCATCGGC ACGACCTATC GCGTCACGGA GCACTGGCAG ACCGGCCTCA TGACCCGCCG TTGCGCGTGG CTCGTCGAAG CCGAACCCCA GATCTTCTGC GAGATCAGCA AGGAACTGGC GAAGCTGCGC GGCATAGGCA ACGGCGACAC CGTCAAGGTG TCGAGCCTGC GCGGTGCGCT TGAAGCCGTC GCCATCGTCA CGGAGCGCAT CAGACCCTTC AAGATCGAAG GTGTCGATGT CCACATGGTG GGCCTGCCGT GGCATTACGG CTGGATGGTG CCGAAGAACG GCGGCGACAC GGCCAACCTG CTGACCCCGT CTGCGGGCGA CCCGAACACC GGCATTCCGG AAACCAAGGC GTTCATGGTG GACGTCCGCA AGGTCTAG
|
Protein sequence | MTVTRRHFLK LSAGAAVAGA FTGLGLSLAP TVARAELQKL QWAKQTTSIC CYCAVGCGLI VHTAKDGQGR AVNVEGDPDH PINEGSLCPK GASIFQLGEN DQRGTQPLYR APFSDTWKPV TWDFALTEIA KRIKKTRDAS FTEKNAAGDL VNRTEAIASF GSAAMDNEEC WAYGNILRSL GLVYIEHQAR IUHSPTVPAL AESFGRGAMT NHWNDLANSD CILIMGSNAA ENHPIAFKWV LRAKDKGATL IHVDPRFTRT SARCDVYAPI RSGADIPFLG GLIKYILDNK LYFTDYVREY TNASLIVGEK FSFKDGLFSG YDAANKKYDK SMWAFELDAN GVPKRDPALK HPRCVINLLK KHYERYNLDK VAAITGTSKE QLQQVYKAYA ATGKPDKAGT IMYAMGWTQH SVGVQNIRAM AMIQLLLGNI GVAGGGVNAL RGESNVQGST DQGLLAHIWP GYNPVPNSKA ATLELYNAAT PQSKDPMSVN WWQNRPKYVA SYLKALYPDE EPAAAYDYLP RIDAGRKLTD YFWLNIFEKM DKGEFKGLFA WGMNPACGGA NANKNRKAMG KLEWLVNVNL FENETSSFWK GPGMNPAEIG TEVFFLPCCV SIEKEGSVAN SGRWMQWRYR GPKPYAETRP DGDIMLDMFK KVRELYAKEG GAYPAPIAKL NIADWEEHNE FSPTKVAKLM NGYFLKDTEV GGKQFKKGQQ VPSFAFLTAD GSTCSGNWLH AGSFTDAGNL MARRDKTQTP EQARIGLFPN WSFCWPVNRR ILYNRASVDK TGKPWNPAKA VIEWKDGKWV GDVVDGGGDP GTKHPFIMQT HGFGALYGPG REEGPFPEHY EPLECPVSKN PFSKQLHNPV AFQIEGEKKA VCDPRYPFIG TTYRVTEHWQ TGLMTRRCAW LVEAEPQIFC EISKELAKLR GIGNGDTVKV SSLRGALEAV AIVTERIRPF KIEGVDVHMV GLPWHYGWMV PKNGGDTANL LTPSAGDPNT GIPETKAFMV DVRKV
|
| |