Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtox_1708 |
Symbol | |
ID | 8428674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | + |
Start bp | 1801195 |
End bp | 1802502 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 645034041 |
Product | hydrogenase large subunit domain protein |
Protein accession | YP_003191188 |
Protein GI | 258514966 |
COG category | [R] General function prediction only |
COG ID | [COG4624] Iron only hydrogenase large subunit, C-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAAA ATGATTTTGC TCACCGTATA CTGGTAAATC TGGCTCGCCA AATGAAACTG GGAGAAATAC TTGACAGAAA AAAGCTCTCG GATAAAATCA TTGCCGAGTG TTTCCCTGAA AAAATCGAAG ATGAGCAAAT CTGGCGTGAC AGGATAGACA AGCGGCTGGA TTTTTTGCTG CAACACAGCC GGCAGGAGGA ATCAGAACCG CTGTTGGTAC ACTTGCTTGA AGAAAGCTGT GAGGAATGCA GTGTGGAAAA GCGGCCCTGT GTTCATGCAT GTCCTACAGG AGCTATAACT TATGACCGGC ATGGTAAAGG CAGCATAAAT ACCGCGCTGT GCGTGGAGTG CGGATGGTGT GTGGATACAT GTATTTCGGG TGTGATTATA GCCCGCTCTG AATTCGCACA GGTTGCAACT ATGCTGCTGC AAAGTAAGGT CAACCCAGTA TATGCCATAC TGGCACCCTC TTTTGTAGGG CAGTTTGGGC CCGGTGTAAC CCCGGAAATA TTAAAAGCCG CTCTCAAGGC GCTGGGTTTT AGCGGTGTCT ATGAAGTAGC CATGGCCGCG GATATAGTTG TTCTTGAAGA AGCCAGAGAG TTCTGTGAGC GCATGAAGAG CAGGGAAAAG TTTATGATCA CCTCCTGCTG CTGTCCTGCT TTTATAAAAT TGGTGGAAAA GGTGAGGCCT AAAGTTGCCC ACCTGATTTC TCCTTCCATG TCACCGATGA TTATTATGGG AAAAATGCTT AAGGGGAGGG AGGAAGAATG TCGCGTAGTT TTTATCGGTC CCTGTATAGC TAAAAAAGCA GAAGCTAAAA GACCTGATTT ACAGCCGGCT GTTGATTGCG TATTAACATT TAAGGAAACT AAAGCTTTAC TGGAGGCTGC TGAATTATCA CTTGACGGTT CACTGGGGCA GAGTGAGGTG CAGGATGCAT CGCATGACGG GCGTATTTTT GCACATACCG GTGGTGTTTC CGAGGCTATT CACAGGGCTG TACAGAGGCG TGCGCCGGAT TTAGAGTTCA GGCCGGTTAA AGGCAACGGG TTAAAACAAT GCAGCGAATT GCTGAAGCAG CTGGAAGAAG GCAGGTTGGA TGCCAACTTT ATGGAGGGTA TGGGCTGCCC GGAAGGCTGT GTCGGAGGTC CGGGAACCAA TATCAAAGCT GCCGAGGCGG CGGTTTTGGT CAGAGAATTT GCAGACAGGG CGCCAAAGCA GCAAAGTGAT GACAATATCT TTGCCCTACA ATGGATGAAG GAATATTACA AAGCTGCGGA TACCGAATCT ATCAAGCTGG ATATGTGA
|
Protein sequence | MNKNDFAHRI LVNLARQMKL GEILDRKKLS DKIIAECFPE KIEDEQIWRD RIDKRLDFLL QHSRQEESEP LLVHLLEESC EECSVEKRPC VHACPTGAIT YDRHGKGSIN TALCVECGWC VDTCISGVII ARSEFAQVAT MLLQSKVNPV YAILAPSFVG QFGPGVTPEI LKAALKALGF SGVYEVAMAA DIVVLEEARE FCERMKSREK FMITSCCCPA FIKLVEKVRP KVAHLISPSM SPMIIMGKML KGREEECRVV FIGPCIAKKA EAKRPDLQPA VDCVLTFKET KALLEAAELS LDGSLGQSEV QDASHDGRIF AHTGGVSEAI HRAVQRRAPD LEFRPVKGNG LKQCSELLKQ LEEGRLDANF MEGMGCPEGC VGGPGTNIKA AEAAVLVREF ADRAPKQQSD DNIFALQWMK EYYKAADTES IKLDM
|
| |