Gene Dtox_1708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_1708 
Symbol 
ID8428674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp1801195 
End bp1802502 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content48% 
IMG OID645034041 
Producthydrogenase large subunit domain protein 
Protein accessionYP_003191188 
Protein GI258514966 
COG category[R] General function prediction only 
COG ID[COG4624] Iron only hydrogenase large subunit, C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAA ATGATTTTGC TCACCGTATA CTGGTAAATC TGGCTCGCCA AATGAAACTG 
GGAGAAATAC TTGACAGAAA AAAGCTCTCG GATAAAATCA TTGCCGAGTG TTTCCCTGAA
AAAATCGAAG ATGAGCAAAT CTGGCGTGAC AGGATAGACA AGCGGCTGGA TTTTTTGCTG
CAACACAGCC GGCAGGAGGA ATCAGAACCG CTGTTGGTAC ACTTGCTTGA AGAAAGCTGT
GAGGAATGCA GTGTGGAAAA GCGGCCCTGT GTTCATGCAT GTCCTACAGG AGCTATAACT
TATGACCGGC ATGGTAAAGG CAGCATAAAT ACCGCGCTGT GCGTGGAGTG CGGATGGTGT
GTGGATACAT GTATTTCGGG TGTGATTATA GCCCGCTCTG AATTCGCACA GGTTGCAACT
ATGCTGCTGC AAAGTAAGGT CAACCCAGTA TATGCCATAC TGGCACCCTC TTTTGTAGGG
CAGTTTGGGC CCGGTGTAAC CCCGGAAATA TTAAAAGCCG CTCTCAAGGC GCTGGGTTTT
AGCGGTGTCT ATGAAGTAGC CATGGCCGCG GATATAGTTG TTCTTGAAGA AGCCAGAGAG
TTCTGTGAGC GCATGAAGAG CAGGGAAAAG TTTATGATCA CCTCCTGCTG CTGTCCTGCT
TTTATAAAAT TGGTGGAAAA GGTGAGGCCT AAAGTTGCCC ACCTGATTTC TCCTTCCATG
TCACCGATGA TTATTATGGG AAAAATGCTT AAGGGGAGGG AGGAAGAATG TCGCGTAGTT
TTTATCGGTC CCTGTATAGC TAAAAAAGCA GAAGCTAAAA GACCTGATTT ACAGCCGGCT
GTTGATTGCG TATTAACATT TAAGGAAACT AAAGCTTTAC TGGAGGCTGC TGAATTATCA
CTTGACGGTT CACTGGGGCA GAGTGAGGTG CAGGATGCAT CGCATGACGG GCGTATTTTT
GCACATACCG GTGGTGTTTC CGAGGCTATT CACAGGGCTG TACAGAGGCG TGCGCCGGAT
TTAGAGTTCA GGCCGGTTAA AGGCAACGGG TTAAAACAAT GCAGCGAATT GCTGAAGCAG
CTGGAAGAAG GCAGGTTGGA TGCCAACTTT ATGGAGGGTA TGGGCTGCCC GGAAGGCTGT
GTCGGAGGTC CGGGAACCAA TATCAAAGCT GCCGAGGCGG CGGTTTTGGT CAGAGAATTT
GCAGACAGGG CGCCAAAGCA GCAAAGTGAT GACAATATCT TTGCCCTACA ATGGATGAAG
GAATATTACA AAGCTGCGGA TACCGAATCT ATCAAGCTGG ATATGTGA
 
Protein sequence
MNKNDFAHRI LVNLARQMKL GEILDRKKLS DKIIAECFPE KIEDEQIWRD RIDKRLDFLL 
QHSRQEESEP LLVHLLEESC EECSVEKRPC VHACPTGAIT YDRHGKGSIN TALCVECGWC
VDTCISGVII ARSEFAQVAT MLLQSKVNPV YAILAPSFVG QFGPGVTPEI LKAALKALGF
SGVYEVAMAA DIVVLEEARE FCERMKSREK FMITSCCCPA FIKLVEKVRP KVAHLISPSM
SPMIIMGKML KGREEECRVV FIGPCIAKKA EAKRPDLQPA VDCVLTFKET KALLEAAELS
LDGSLGQSEV QDASHDGRIF AHTGGVSEAI HRAVQRRAPD LEFRPVKGNG LKQCSELLKQ
LEEGRLDANF MEGMGCPEGC VGGPGTNIKA AEAAVLVREF ADRAPKQQSD DNIFALQWMK
EYYKAADTES IKLDM