Gene Dtox_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2121 
Symbol 
ID8429103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2295725 
End bp2296957 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content47% 
IMG OID645034442 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003191573 
Protein GI258515351 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00142013 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.115245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAAAT GTAAAAATAA CCAATCAAAG AAGTCAAAGT CAAAATGCAT CAACATTTTG 
GTGAATAAAT TTCCTGTATA TCTAACCCCG GAGCAAACTT CCCTGGCCTG TACCCTGCAA
AGAGAGGCAT CTAAAGTATG GAACACAACT TGCACTGTTC ACCGTACAAT CTATATAAAA
CATCACTGCT GGCTCGACGA AGGTGCCATG AAAGCATTCG TTAAAGGCAA ATACGGTGTT
CATTCCCAGT CGGCGCAGGC TATAGTGGAA ACTTACTTTG AGTGCTGTGA GCGCACCGGG
AAGCTGCGCG AACAAGGGGT TACAGATTGG CGCTATCCCC ATCGCAGAAA ACGTTTTTTC
ACCGTAACCT GGAAGCCACT TGGTATAACT TACGAAGGAA AGATGCTGAC TCTCTCCAAC
GGACGCGGCA GGGAATCACT CATACTTAAC TTACCCAAAA GGCTCTCCGG AGCCGTCATT
AAGCTGGTTC AACTTGTATG GCACCGTAAC CTTTACTGGC TGCATGTAAC GGTAGAAAAA
CCGGCCTTGA AAAAAGTACA GGGCGGCGTT ACAGCAGCCA TTGACCCCGG TGAGGTACAT
GCTGTAGCTA TCACAGACGG TAAGAAATCT TTGGCAGTGA GCGGCAGATT GCTGCGGTCT
CTGCGCCGGC TCAGGAATAA GGTGCTGCGC AGGTTGCAAA AAGCTATTTC TAAAACTAAA
AAAGGCTCAA AACAGCGCAA TAAGCTTTTA GCTGCAAAGT ACCGGTTTTT GAACAATATT
GAGCGCCGAA TTGAGCACGT CATGCATACC ATTTCAGCTA TTGTTTCAAA ATGGTGCTTT
GAGCGTAACG TCAATACCGT CTATATAGGC AATCCAGAAG GCGTGCGCAA GAAGGACTGC
GGTAAAAAGC ACAACCAGCG GATGAGTCAA TGGACTTTCG GTGAATTACG CAGGATGCTG
GAGTATAAGT TAAAGCGTCA TGGCATTAAG CTGATACCAG TGGATGAACG CGGTACTTCG
GGTACTTGTC CGGCTTGTGC AGAGTATACC AAGCAAACAG GCCGCACCTA TAAATGCGGC
AAGTGCGGTT TCGCCGGCCC GCACCGGGAT ATGGTCGGTG CTTCCGGGAT TCTGGATAAA
TCGGTTAACG GTAAATTCAC CAAAGGCCGT AAGTTACCTG AGAAGGTCGA ATATGCACGG
CTGAAGGTGC TGGCACTGAA AAAAACTGCT TAA
 
Protein sequence
MSKCKNNQSK KSKSKCINIL VNKFPVYLTP EQTSLACTLQ REASKVWNTT CTVHRTIYIK 
HHCWLDEGAM KAFVKGKYGV HSQSAQAIVE TYFECCERTG KLREQGVTDW RYPHRRKRFF
TVTWKPLGIT YEGKMLTLSN GRGRESLILN LPKRLSGAVI KLVQLVWHRN LYWLHVTVEK
PALKKVQGGV TAAIDPGEVH AVAITDGKKS LAVSGRLLRS LRRLRNKVLR RLQKAISKTK
KGSKQRNKLL AAKYRFLNNI ERRIEHVMHT ISAIVSKWCF ERNVNTVYIG NPEGVRKKDC
GKKHNQRMSQ WTFGELRRML EYKLKRHGIK LIPVDERGTS GTCPACAEYT KQTGRTYKCG
KCGFAGPHRD MVGASGILDK SVNGKFTKGR KLPEKVEYAR LKVLALKKTA