Gene Information |
Locus tag | Dtox_3503 |
Symbol | |
ID | 8430498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 3706692 |
End bp | 3709394 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645035727 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_003192845 |
Protein GI | 258516623 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
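The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check that uses only values copied from this record. It assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 3706692
end_bp = 3709394
gene_length_bp = 2703
protein_length_aa = 900

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("codons minus stop match protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks hold for this record.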
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGGGTTCTG TTACATTAAC CATTAATGAT CAACAGGTTA CGGTTCCTAA GGGTACTTCC GTCTTGTACG CTGCCAGAAA AATAGGAATT GATATTCCTA CTTTCTGCCA TGACCAGGAG CTTGCCAGAT TCGGAGCTTG CCGTATCTGT GTTGTGGAAG TACCGGGCAT GCGCAACCTG CCTGCTTCCT GTGTTACCGA AGCAACGGAC GGTATGGTGG TATACACTGA GTCCGAAACG GTAGTGGAGG CACGCAAAAC AATTCTCGAA TTGATGTTGG CCAATCACCC GGCTGACTGC CTGACTTGCA GCAAAAACGG TGACTGCAGA TTGCAAGATT ATGCCTATCG TTATAATATC AGGGGAGACG TTTTCTTCGG GGAGAAGCAT AATTATCCTA TCGAAGACAG CAACCCGTTC ATTATAAGAG ATATGAATAA ATGCATTCTG TGCGGCAAAT GTGTTCGCGC TTGTGCAGAG GTGCAGGGCC GCGGTGTGAT TGATTTTGCT TACAGGGGTT TTAATGCTAA AGTGGCCACC GCCATGGATT TGCCTTTGAT TGAATCGGAA TGTGTATTCT GCGGCAGTTG TGTGGCTGTT TGCCCGGTGG GTGCCCTGAC TGAGAAAGCT ATGAGCGGTA AAGCACGTAT CTGGGATATT AAAAAGGTGC GCACCACTTG CCCGTTCTGT GGAGTAGGCT GCAACTTCGA CCTGAATGTT GCCGACGGTA AAGTTATCGG TTCTACCTCC AATCCCGACA GCCCGGTTAA TGGCCGTCAT TTGTGCGTAA AAGGCCGTTT TGGGATAGAT TATATTCATA ACCCGAAACG TTTAACCACG CCTCTAATTA AGAAAAACGG TGAATTTGTT GAAGCCGGTT GGGATGAGGC TCTTGACCTG GTTGCTTCTA AATTGACTGA AGTGAAGAAT AAGTACGGCA GTGATGCTGT AGCCGCACTT TCTTCAGCCC GTTGTACCAA TGAAGACAAT TATGTACTAC AAAAACTTCT TCGCGCGGCA ATCGGCACCA ATAACGTCGA TCACTGCGCC CGTACCTGAC ACGCTCCCAC AGTAGCTGGT CTGGCTACAA GTTTTGGTAG TGGCGCAATG ACAAACTCTT TTAGCGATAT ATTGAAAACT GATTTGCTGT TTGTAATCGG TTCAAACGCA ACTGAAGCAC ACCCGATGGC AGGTGCTAAA ATGCTTCAGG CAGTTCAGAA GGGCATTAAG ATGGTAGTAG TTGACCCTCG CCGCATTGAA CTGGCAGAAA AGGCTGATTA CTGGCTGCAG CTTAAGCCCG GTACAGATAT TCCGTTGTTA AACGGTTTAA TGCATATTAT TATCAAGGAA GATCTCTACG ATAAGAAGTT CGTGGAAGAG CGAACCGAAG GTTTTGAGGA ACTCAAGGCT ACGGTAGAAA ATTACCCGCC GGAAAAAGTT TCGGAAATGA CAGGTATTCC GGTAGAAGAC TTATATGATG TGGCAAGACT CTATGCAACC TCCTACAATG CGCTTATTTG CTATACACTG GGTATCACCG AGCATATTTG CGGTGTGTTC AACGTTATGA GTATTGCCAA CCTGGCTATG CTTACCGGAC ATATCGGCAG GCCCGGTTCC GGCGTGAACC CGCAGCGCGG TCAGAATAAC GTGCAAGGTG CCTGTGATAT GGGTGCACTG CCTAACGTTT ATCCCGGTTA CCAACCTGTT ATTAACCCGG ATGCTCAGGC CAAGTTTGAA AAGGCATGGG GTGTTCCTTT ATCCGGCAAG CTTGGTTTGA CTATTCCTGA TATGATGGAC 
GCGGCGGTGG AAGGAAAAGT CAAGGCCATG TATATATTGG GCGAAGATCC CGTGCTCACA GATCCTGACG CTCATCATAT CCGCAAGGCT ATGAGTAAAC TGGACTTCCT GGTAGTGCAG GAATTATTCA TGTCAGAGAC AGCCAAATAC GCTGATGTAA TACTGCCCGG AGCAAGTTTT GGTGAAAAAG ACGGTACCTT CTCCAACTCT GAAAGAAGAG TGCAGAGGGT TCGCAAGGCT ATTGATCCGA TAGCAAATAC CAAAGCTGAC TGGCAAATTG TCTGCGAAGT GAGCAACCAT ATGGGGTATC CCATGAATTT TGCTTCGCCG GAAGAAATCT TCAATGAAAT GGCTTCATTG ACTCCCTCCT ATTGTGGGAT GAATTATGAA AGAATAGATG CGAAGGGATT GCAATGGCCT TGCCCGACCC TGGATCACCC GGGTACCCCT GTACTGCATA CGCAGAGCTT TACCAGAGGT AAGGGTTTGT TCAAGGGTAT TGATCATGTT CCACCGGCTG AAATGCCTGA TGCTGAATAT CCGTACCTGT TGTCCACCGG GCGGATACTG TATCACTACA ATATCACAAC CCGTTACTCG CAAGGTTTGG ATGCTCACAG ACCGGAAGAA ATGGCTCAGA TTAATCCGGT TGATGCCTGT AAGTTTGGTG TGGAAACAGG TGGTAAACTT AGAGTTACTT CCCGCCGAGG TTCTGTGGTA ACCAAGGTCG TCGTAACCGA CAAGGTACCG GCAGGATTGA TTTGGATGAG CTTCCACTAC TGGGAGACAC CTACTAACGA GCTTACTGTC GATGCGTTTG ACCCGATCAG TAAGACTGGT GAGTATAAGG TGGCCGCTGT TAAATTAGAA AAAATTCAAG AGTCGAAAGA AATAGGTGCT TAA
Protein sequence | MGSVTLTIND QQVTVPKGTS VLYAARKIGI DIPTFCHDQE LARFGACRIC VVEVPGMRNL PASCVTEATD GMVVYTESET VVEARKTILE LMLANHPADC LTCSKNGDCR LQDYAYRYNI RGDVFFGEKH NYPIEDSNPF IIRDMNKCIL CGKCVRACAE VQGRGVIDFA YRGFNAKVAT AMDLPLIESE CVFCGSCVAV CPVGALTEKA MSGKARIWDI KKVRTTCPFC GVGCNFDLNV ADGKVIGSTS NPDSPVNGRH LCVKGRFGID YIHNPKRLTT PLIKKNGEFV EAGWDEALDL VASKLTEVKN KYGSDAVAAL SSARCTNEDN YVLQKLLRAA IGTNNVDHCA RTUHAPTVAG LATSFGSGAM TNSFSDILKT DLLFVIGSNA TEAHPMAGAK MLQAVQKGIK MVVVDPRRIE LAEKADYWLQ LKPGTDIPLL NGLMHIIIKE DLYDKKFVEE RTEGFEELKA TVENYPPEKV SEMTGIPVED LYDVARLYAT SYNALICYTL GITEHICGVF NVMSIANLAM LTGHIGRPGS GVNPQRGQNN VQGACDMGAL PNVYPGYQPV INPDAQAKFE KAWGVPLSGK LGLTIPDMMD AAVEGKVKAM YILGEDPVLT DPDAHHIRKA MSKLDFLVVQ ELFMSETAKY ADVILPGASF GEKDGTFSNS ERRVQRVRKA IDPIANTKAD WQIVCEVSNH MGYPMNFASP EEIFNEMASL TPSYCGMNYE RIDAKGLQWP CPTLDHPGTP VLHTQSFTRG KGLFKGIDHV PPAEMPDAEY PYLLSTGRIL YHYNITTRYS QGLDAHRPEE MAQINPVDAC KFGVETGGKL RVTSRRGSVV TKVVVTDKVP AGLIWMSFHY WETPTNELTV DAFDPISKTG EYKVAAVKLE KIQESKEIGA
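The relationship between the two sequences can be illustrated with a short translation sketch. It builds the standard codon assignments, which translation table 11 shares with the standard code, and translates two excerpts copied from the gene sequence above: the first 30 bases and the 15 bases around an in-frame TGA. Two points are assumptions rather than statements from the record: the first codon (TTG) is read as M because it is the initiator codon, and the internal TGA is read as U (selenocysteine), which is what the listed protein sequence suggests at that position. The function and parameter names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, first_is_start=False, recode_tga=False):
    """Translate DNA codon by codon; spaces between 10-base blocks are ignored.

    first_is_start: read the first codon as M (assumed initiator codon).
    recode_tga:     read TGA as U (assumed selenocysteine) instead of a stop.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start:
            protein.append("M")
        elif codon == "TGA" and recode_tga:
            protein.append("U")
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# Excerpts copied from the gene sequence in this record.
gene_start = "TTGGGTTCTG TTACATTAAC CATTAATGAT"
around_tga = "CGTACCTGAC ACGCT"

print(translate(gene_start, first_is_start=True))  # MGSVTLTIND
print(translate(around_tga))                       # RT*HA
print(translate(around_tga, recode_tga=True))      # RTUHA
```

The first result matches the opening ten residues of the protein sequence, and the recoded second excerpt matches the RTUHA stretch in it. Without recoding, the same excerpt would terminate at the TGA codon.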