Gene Information |
Locus tag | Dtox_2121 |
Symbol | |
ID | 8429103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfotomaculum acetoxidans DSM 771 |
Kingdom | Bacteria |
Replicon accession | NC_013216 |
Strand | - |
Start bp | 2295725 |
End bp | 2296957 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645034442 |
Product | transposase, IS605 OrfB family |
Protein accession | YP_003191573 |
Protein GI | 258515351 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
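The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 2295725
end_bp = 2296957


def gene_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, ASSUMING the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1233 410
```

The result matches the Gene Length (1233 bp) and Protein Length (410 aa) rows: 1233 bp is 411 codons, of which one is the terminal stop.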
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00142013 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.115245 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAAAT GTAAAAATAA CCAATCAAAG AAGTCAAAGT CAAAATGCAT CAACATTTTG GTGAATAAAT TTCCTGTATA TCTAACCCCG GAGCAAACTT CCCTGGCCTG TACCCTGCAA AGAGAGGCAT CTAAAGTATG GAACACAACT TGCACTGTTC ACCGTACAAT CTATATAAAA CATCACTGCT GGCTCGACGA AGGTGCCATG AAAGCATTCG TTAAAGGCAA ATACGGTGTT CATTCCCAGT CGGCGCAGGC TATAGTGGAA ACTTACTTTG AGTGCTGTGA GCGCACCGGG AAGCTGCGCG AACAAGGGGT TACAGATTGG CGCTATCCCC ATCGCAGAAA ACGTTTTTTC ACCGTAACCT GGAAGCCACT TGGTATAACT TACGAAGGAA AGATGCTGAC TCTCTCCAAC GGACGCGGCA GGGAATCACT CATACTTAAC TTACCCAAAA GGCTCTCCGG AGCCGTCATT AAGCTGGTTC AACTTGTATG GCACCGTAAC CTTTACTGGC TGCATGTAAC GGTAGAAAAA CCGGCCTTGA AAAAAGTACA GGGCGGCGTT ACAGCAGCCA TTGACCCCGG TGAGGTACAT GCTGTAGCTA TCACAGACGG TAAGAAATCT TTGGCAGTGA GCGGCAGATT GCTGCGGTCT CTGCGCCGGC TCAGGAATAA GGTGCTGCGC AGGTTGCAAA AAGCTATTTC TAAAACTAAA AAAGGCTCAA AACAGCGCAA TAAGCTTTTA GCTGCAAAGT ACCGGTTTTT GAACAATATT GAGCGCCGAA TTGAGCACGT CATGCATACC ATTTCAGCTA TTGTTTCAAA ATGGTGCTTT GAGCGTAACG TCAATACCGT CTATATAGGC AATCCAGAAG GCGTGCGCAA GAAGGACTGC GGTAAAAAGC ACAACCAGCG GATGAGTCAA TGGACTTTCG GTGAATTACG CAGGATGCTG GAGTATAAGT TAAAGCGTCA TGGCATTAAG CTGATACCAG TGGATGAACG CGGTACTTCG GGTACTTGTC CGGCTTGTGC AGAGTATACC AAGCAAACAG GCCGCACCTA TAAATGCGGC AAGTGCGGTT TCGCCGGCCC GCACCGGGAT ATGGTCGGTG CTTCCGGGAT TCTGGATAAA TCGGTTAACG GTAAATTCAC CAAAGGCCGT AAGTTACCTG AGAAGGTCGA ATATGCACGG CTGAAGGTGC TGGCACTGAA AAAAACTGCT TAA
|
Protein sequence | MSKCKNNQSK KSKSKCINIL VNKFPVYLTP EQTSLACTLQ REASKVWNTT CTVHRTIYIK HHCWLDEGAM KAFVKGKYGV HSQSAQAIVE TYFECCERTG KLREQGVTDW RYPHRRKRFF TVTWKPLGIT YEGKMLTLSN GRGRESLILN LPKRLSGAVI KLVQLVWHRN LYWLHVTVEK PALKKVQGGV TAAIDPGEVH AVAITDGKKS LAVSGRLLRS LRRLRNKVLR RLQKAISKTK KGSKQRNKLL AAKYRFLNNI ERRIEHVMHT ISAIVSKWCF ERNVNTVYIG NPEGVRKKDC GKKHNQRMSQ WTFGELRRML EYKLKRHGIK LIPVDERGTS GTCPACAEYT KQTGRTYKCG KCGFAGPHRD MVGASGILDK SVNGKFTKGR KLPEKVEYAR LKVLALKKTA
|
| |
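The gene sequence begins with GTG while the protein begins with M. That fits translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The sketch below is a minimal, dependency-free illustration of translating the listed sequence and computing GC content. The helper names `translate` and `gc_percent` are hypothetical, the `START_CODONS` set is an assumed subset of the table 11 initiators, and the amino-acid assignments follow the standard code, which table 11 shares for elongation codons.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino-acid string shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# ASSUMPTION: a subset of table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; an initiator in the first position is read as M."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
print(translate("GTGAGCAAAT GTAAAAATAA CCAATCAAAG"))  # MSKCKNNQSK
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row, and `gc_percent` on the same string can be compared with the GC content row.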