Gene Dtox_0465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0465 
Symbol 
ID8427400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp478823 
End bp480103 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content45% 
IMG OID645032839 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003190017 
Protein GI258513795 
COG category 
COG ID 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.220677 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGTT ACTGTGCCGC AGTTCGTTGG GCATTTAAAA GACTGCTGGA CGGATGGAAA 
GTACAGGACA TTCGTATAAC TGTACAAGGA AAGTTCAGAC TTAACTCCCG GCAGGCTAAC
GATGCAGTAT ATGATGCCCA GACCACAATC AAAAGCCAAT ATGAATTAGT GCAGATGTAC
TGTGAAAACG CCAAAGCAAA GGTTGAATTT ACAGAAAAGC GTATCGCCAA GGCTAAATCA
CCGGCTAAGG TTGCCAAACT GCAAAAACGG TTAGAAAAGG AACAGCGTAA ACTGGTCTTC
TGGCAAAAGC ACCTGGATAA CAATACCTTT CCGCCTGTTG TATTCGGAGG AAAGAAGCTC
TTTCAAGAAC GCTGCAAAGG TAATATTACC AGGGAAGAGT GGCAGGAAGT CAGAAGTAAC
CGTTATCTGT CACGGGGAGA TAAAACCAAA GGCGGCAACC TAAATACCCG CATATACGAA
GACCAAGACC AAATCTATCT TGATATAGCC GCCGACCCGG TACAGAAAGG GAAATCCGTT
CGGTATAACC GCATAATGGT GCCGGTCTAT TTAGCTCAAA AGCCATCGAA AAAGACCGGC
AAGATTAACG GTATCAACTA CCGGCAAATG GTTTTGGATT ATCTTAAAAC AGGCAGTGCC
TATCAGGTAG AAATCCTCTG CAGAGACGGG AAATATTACG TCCATGTGAG TATTGAAGAA
GAAGTTCCGA TGCCATATAA CCATAAGGGT GCGTTTGGTG TAGACACCAA CCCGGACGGA
TTAGGCGTAA CCCAGGTAGA ATGTCTGGGG CAATACCGGG GCAGTGAATG GCTTGGTCAA
GGCGAATGGA CTTATGCCAG AACAAACCGG AGAGATAACC GAACCTGCGA AATGGCTAAG
AAAGTAATCC TCCAGGCTAA AGAAAAAGGT TACGCCTTGG CGGTAGAGGA CTTGAAGTTT
AAAAATGACA AGTCCGTAAC GGCCAAGTTT AACCGAATGA GTCACAGTTT TGTCCGGTCG
AAGTTTCTAA AAGCAGTTGA CCGGTGTGCT GCCCGTGAGG GAGTGCCGAT ATTAAAGGTA
AAACCGGCTT TTACTTCGGT CATAGGCATC CTAAAATACA AGCACATGTA CGGCATAGCT
GTTCACGAAG CGGCAGGCTA TGTCATAGCC CGGCGTGGCT TGGGCTTTGA TCATGAGAAG
ATACCCAAGA TATTGCTTGA TAAACTGGTT AAAAAGAAAC TTGAATTTAA ACAAAGCTTG
TGTCAAAGTT TGGTAAGTTA A
 
Protein sequence
MDRYCAAVRW AFKRLLDGWK VQDIRITVQG KFRLNSRQAN DAVYDAQTTI KSQYELVQMY 
CENAKAKVEF TEKRIAKAKS PAKVAKLQKR LEKEQRKLVF WQKHLDNNTF PPVVFGGKKL
FQERCKGNIT REEWQEVRSN RYLSRGDKTK GGNLNTRIYE DQDQIYLDIA ADPVQKGKSV
RYNRIMVPVY LAQKPSKKTG KINGINYRQM VLDYLKTGSA YQVEILCRDG KYYVHVSIEE
EVPMPYNHKG AFGVDTNPDG LGVTQVECLG QYRGSEWLGQ GEWTYARTNR RDNRTCEMAK
KVILQAKEKG YALAVEDLKF KNDKSVTAKF NRMSHSFVRS KFLKAVDRCA AREGVPILKV
KPAFTSVIGI LKYKHMYGIA VHEAAGYVIA RRGLGFDHEK IPKILLDKLV KKKLEFKQSL
CQSLVS