Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dvul_0761 |
Symbol | |
ID | 4664364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 930646 |
End bp | 933657 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639818979 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_966211 |
Protein GI | 120601811 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.109779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.059567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATGC CTCGCAGAAC GTTCATCAAG CTCGCTTCGG CTTCCGCAGG CGCTCTCGCC TTCGCCGGGC TGGGGCAGAG CCTCGCGCCC ACGGTGGCGC GGGCTGCGGA ACTCAAGATC GCCAAGGCGA AGGTCACACC CTCTGTCTGC TGCTTCTGTG CGGTGGGATG CGGGTTGCTG GTCTACACCG ACACCCAGAC CGGACGCTCC ATCAACATCG AGGGCGACCC CGACCATCCC ACCAACGAAG GGACGCTGTG CCCCAAGGGT GCGTCCATCT GGCAGACCAC CGAACGCAGC AAGCGTGTCA CCAAGGTGCT GTACCGTGCA CCCGGTGCCG CGGCGTGGGA GGAGAAGTCG TGGGACTGGG CCCTGCCGCG CATCGCCCGC AAGATCAAGG AGACGCGCGA CGCCACCTTC GAACTCACCA ATGACAAGGG GCAGACCGTC AACCGCACCC GGGGCATAGC CTCGGTGGGC TCGGCCGCCG TCGACAACGA GGAGGGCTGG CTCATGCAAG CCATGATGCG CGCGCTCGGC CTCGTGTACA TCGAGAGCCA CGCCCGTATC TGACACAGCT CGACTGTGGG GGCTCTGGCA GAGTCCTACG GACGCGGTGC GATGACGAAT CACTGGATCG ACCTCAAGAA CAGCGACGTC ATTCTCATCA TGGGGAGTAA CGCCGCCGAG AACCATCCCA TATCCTTCAA GTGGGTGACG CGTGCGCAGG AACGCGGCGC AACGCTCATC CATGTGGACC CCCGCTTCAC GCGCACCTCC GCCAAGGCAG ACATCCACGC CCACATCAGG TCGGGTACGG ACATTGCCTT CTTCGGCGGC CTCATCCGAT ATATCCTCGA AAACGAGCTG TTCTTCCGGC AGTACGTGGT CGACTACACC AACGCCTCGT ACATCGTCGG CCCGGACTTC GGTTTCGCCG ATGGTCTGTT CACCGGTTTC GACCCGGAGA AGGGAACCTA CAACACCAAG AAATGGGCCT TCGCCGCCGA CGAGAACGGC ATGACCCTGA AAGACCCCAC GCTGAACGAC CCGCGTTGCG TCTTTCAACT CATGAAGGCC CATTACGCTC GCTATGACAG GAAGACCGTC TCAGACGTGA CGGGCATCTC CGAAGAGCAG CTGGCGACGC TGTGGAGTAC CTTCGCCTCG ACGGGCAAGC CCGACAGGGC GGGAACCATC CTCTATGCCA TGGGGCAGTG CCAGCACACC GTGGGCGTGC AGAACATCCG TGCGCTTTCG ATGATACAGC TGCTGCTTGG CAACATCGGC ATCGCCGGGG GCGGGGTCAA CGCGCTGCGC GGCGAATCCA ACGTGCAGGG CACCACCGAC ATCTCGCTGC TGTGCGACAA CCTCTCCGGC TACCTGCCCA CCCCCAAGGC TTCGTGGGCG ACCTTCGACG ACTATGTGAA GGGCACGACG CCGGTGGACA AGGACCCGAA GAGCGCCAAC TGGTGGTCGA ACCGCGGCAA GTATCTCGCC TCGTACATGA AGTCGGTGTA TCCCACGGCC AGCCATCAGG ACGGCTACCT CTGGCACCCC AAGGTCGATG ACGGGAAGAT AACCGACTAC TCGTGGTTGC AGATATTCGA GCGCATGAGC AAGGGCGGCT TCAAGGGTGC CTTCGTGTGG GGGCAGAACC CCTGTGCGGG CGGGGCCAAC GCGGGCAAGA ACCGCAAGGC CATGGAGACG CTGGACTGGA TGGTGGTGGT CAACCTCTTC GAGAACGAAA GTTCGCTCTT CTGGAAGGGG CCGGGGGTAG ACCCCGCCAA GGTCAAGACC GAGGTGTTCT TCCTGCCTGC GTGCATGAGC GTCGAGAAGG GCGGTTCCAT CGCCAATTCC GGTCGCTGGC TGCAATGGCG TGAACCGGGG CCGAAACCCA TGGGCGACAG CCGTTCGGAC GGCGACATCG TCCTCGACCT CTATGACGAG ATACGCAAAC TCTACCGCGA GGAGAAGGGA GCTTTCCCCG AACCCGTGCT GGCGCTGGAC ACCGACTACC GTACCGACGG CAAGTACGAT CACCACAAGG TGGCGAAGAC GCTGAACGGC AAGTTCCTCG CTGATGTGAC CATAGGCGAC AAGACCTACA AGGCGGGGCA GCAGGTACCC GGCTTCGCCA TGTTGCAGGC CGACGGTTCG ACGACGTCCG GGTGTTGGAT ATTCACCGGG TGCTATACCG ACGCCGGTAA CATGATGGCG CGCCGAGACC GTACGCAGAC CCCGGAACAG GCCGCCATAG GGCTGTTCCC CAACTGGTCG TATGCATGGC CCGCCAACCG TCGCATCCTG TACAACCGTG CGGCGGTGGA CATGACCGGC AAGCCCTTCG ACCCCAAGCG TGCCGTCATC GCATGGAACG GCGAGAAGTG GGTGGGCGAC GTGCCTGACG GCGGCTGGAA GCCCGGAGAG AAGTTGCCCT TCATCATGAT ACGCGAGGGG CGCGGTCAGT TGTTCGGCCC CGGCAGGGTC GACGGGCCTT TCCCGGAGCA CTACGAACCG TTCGAGAGTC CTCTCGAAAG CCATCCCTTC TCGAAGCAGC GGGTCAACCC CACTGCGCTG GCGTTCAGCC ACGAACCCAA GGCGGTGCGC GACAAGCGCT ACCCCTTCAT CTGCACCACC TACCGCGTCA CCGAACAGTG GCAGTCGGGC ACGATGACCC GCAACACCGG GTGGCTCAAG GAGATGCAGC CGGAGGGCTT CTGCGAGATA AGCCGCGAAC TGGCCAAGGA ACTCGGCATC GCCAACGGCG ACGCCGTGGT GCTCGAATCG CTGCGGGGCA AGGTGCAGGT GGTCGCCATC GTCACGCCAC GTCTCAAGCC CTTCAAGGTC ATGGGCGAGG TCATGCACGA GGTGGGCATA CCGTGGCAGT TCGGCTGGGG GCAGCATGTG GGCAAGGGCG ACTCTGCCAA CCTGCTCTCG CCTTCGGTGG GCGACCCCAA CACTGGCATT CCCGAGACCA AGGTCTTCAT GGTCAACCTG CGCAAGGCCT AG
|
Protein sequence | MRMPRRTFIK LASASAGALA FAGLGQSLAP TVARAAELKI AKAKVTPSVC CFCAVGCGLL VYTDTQTGRS INIEGDPDHP TNEGTLCPKG ASIWQTTERS KRVTKVLYRA PGAAAWEEKS WDWALPRIAR KIKETRDATF ELTNDKGQTV NRTRGIASVG SAAVDNEEGW LMQAMMRALG LVYIESHARI UHSSTVGALA ESYGRGAMTN HWIDLKNSDV ILIMGSNAAE NHPISFKWVT RAQERGATLI HVDPRFTRTS AKADIHAHIR SGTDIAFFGG LIRYILENEL FFRQYVVDYT NASYIVGPDF GFADGLFTGF DPEKGTYNTK KWAFAADENG MTLKDPTLND PRCVFQLMKA HYARYDRKTV SDVTGISEEQ LATLWSTFAS TGKPDRAGTI LYAMGQCQHT VGVQNIRALS MIQLLLGNIG IAGGGVNALR GESNVQGTTD ISLLCDNLSG YLPTPKASWA TFDDYVKGTT PVDKDPKSAN WWSNRGKYLA SYMKSVYPTA SHQDGYLWHP KVDDGKITDY SWLQIFERMS KGGFKGAFVW GQNPCAGGAN AGKNRKAMET LDWMVVVNLF ENESSLFWKG PGVDPAKVKT EVFFLPACMS VEKGGSIANS GRWLQWREPG PKPMGDSRSD GDIVLDLYDE IRKLYREEKG AFPEPVLALD TDYRTDGKYD HHKVAKTLNG KFLADVTIGD KTYKAGQQVP GFAMLQADGS TTSGCWIFTG CYTDAGNMMA RRDRTQTPEQ AAIGLFPNWS YAWPANRRIL YNRAAVDMTG KPFDPKRAVI AWNGEKWVGD VPDGGWKPGE KLPFIMIREG RGQLFGPGRV DGPFPEHYEP FESPLESHPF SKQRVNPTAL AFSHEPKAVR DKRYPFICTT YRVTEQWQSG TMTRNTGWLK EMQPEGFCEI SRELAKELGI ANGDAVVLES LRGKVQVVAI VTPRLKPFKV MGEVMHEVGI PWQFGWGQHV GKGDSANLLS PSVGDPNTGI PETKVFMVNL RKA
|
| |