Gene Dtox_1823 details


Gene Information

Locus tag: Dtox_1823
Symbol:
ID: 8428801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 1931782
End bp: 1933542
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 39%
IMG OID: 645034161
Product: Transposase-like protein
Protein accession: YP_003191296
Protein GI: 258515074
COG category: [L] Replication, recombination and repair
COG ID: [COG5421] Transposase
TIGRFAM ID:
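
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Dtox_1823.
length_bp = gene_length(1931782, 1933542)
print(length_bp)                           # 1761
print(expected_protein_length(length_bp))  # 586
```

Both results agree with the listed gene length (1761 bp) and protein length (586 aa).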


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.615645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCTCA AGAAATCCGT TAAGACAGTC AAAGGGAAAA AATACTCCCA TTATAGTATA 
GTCGAATCAT TTAGAGACAA CGGTAAAGTT AAACACCGCT TAATTTTTGC AATTGGCCCT
CTTGATGATG AAGCAGCCGA TCGGTTACGC TTAACGCTTA ATGCCCACTC TAACCAAGAT
CTTGTTGTGG CCAAATCTGA TGATATTGTT GTCACAAAGC ATGGAGCATA TTTAGATGTA
GCTGTTCTGG TTCATATCTG GCAACAATGG CAGTTTCACG AGTTCTTTCA GGATGACCGC
TGGGTTACAG GTATGGTAAT TAACCGTTGT ATTGACCCGG TTGCGAAATG CAATGTGCAG
GAGTGGATGA CCAAAACAGT ACTGCCTGCC TATATAGATA CAGATCCATT GTCAATGAAT
GCATTTGATA TCTTTCGAGA ACTAGACCGA CTATGCCAGC GAGAAACCGA GCTACAGTCA
TATATGTTTC GTAAAATTCA AGAAAAGCGA CCAAATAGCC TGGATGTGTT TTTTTATGAT
ATTACCTCTA CATATGTAAC AGGAAGTCGC TGCGTACTTA CTAAGTTTGG CTACTCGCGG
GATCACCGTC CCGATTGCGA ACAAATTGTT ATTGCTTTGA TGATTACCCC GGATGGTTAC
CCTTTTTATT GGAAGTTGCT GGAGGGTAAT ACTCAGGATG TTTCCACAGT TTGCGACTTA
ATCCAAAACG TCAAGACTTG TTTTCCCATA CAACACTGCA CCATGGTTTT TGACCGAGGT
ATGGTATCTG CTGATAACCT TAAAACATTA GAGAAAACAA ATTGGGATTA TGTTTCGGCA
ATGGACAGGG ACGAGATTAA CGTATTATCA TTTTTCGAAA CAGCATTGCC AACCCCTCCT
ATGCCGGAGG ACTGGGAACA AGTCTTGGCG ATGCAGGAAT TTCAACCAAT AGATGATGAC
ATATTGTATT ACCGAGAATT TGAAGATGAT AATCGGCGAT ACATTATAAC TTTTGACGTG
GCACGTTTTC TTGATGAGCA CCAAATACAA AGAAATAAGG TAGAACAAAT TAATAGGTGG
CTAATCAAAA AAAATGGGGA TTTGAAACAA GCCAAAAAAT CAAGAAATCG TGACACTCTT
GAGCGAGAAA TCAGCAAAAT CTCGAAACGG TTTCATGTTC ATAAATATTT GTCCGTTCAG
ATTACGCCCT GTTCTCGTAC CGTTACAACT AAAACTGGTA AATCTCGTAC TGTTGAATCA
TTTCAGCTTT CAGATACTAT TGACAACACT GCTTTGCAGA AAGAACAACG TTTGTATGGA
ATCACATGTT TTATCTCCAA TATTACCCAA GAGCGTATAT CTGCTCAGGA AATAGTACAG
TGGTATCGAC GGAAAAATAA AATTGAAGAA GCCTTTAGGG AGATAAAATC ACATCTTGAA
TTACGTCCAA TTTATTTAAC CAGGGAGAAA AGAGTAAGGG CCCATGTTGC TGTTTGCATG
CTAGCCTATT TTCTGAGAAA TGATATTGAG CTCCAACTTA AGGAGCACGG AATTTCCAAT
TCAACTGAGA CGGTTTTAGC CTTATTAGCT GAGTGCAAGG CTAATCGCTG GGTCTTTGAT
AAATCGGAGG CAAAGACACA CTTAAATATC ACAAAGGTCT CCGAAAAGCA ACAACAAATA
TTAAAAGCGC TTGGATGTGA ATCAATTGTG GACGTAAAGC ATGTTAAAAA CATTTTACAA
AAGGCCGAAA ATTGGCTGTA G
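
The GC content field in the gene information can be recomputed from the gene sequence. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole sequence; `gc_percent` is a hypothetical helper, and only the first row of the sequence is pasted here for brevity.

```python
def gc_percent(seq_block: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    seq = "".join(seq_block.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above; paste all rows to check the whole gene.
first_row = "ATGTTTCTCA AGAAATCCGT TAAGACAGTC AAAGGGAAAA AATACTCCCA TTATAGTATA"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

A single row can differ noticeably from the gene-wide figure, so the comparison with the listed 39% is only meaningful when the full sequence is supplied.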
 
Protein sequence
MFLKKSVKTV KGKKYSHYSI VESFRDNGKV KHRLIFAIGP LDDEAADRLR LTLNAHSNQD 
LVVAKSDDIV VTKHGAYLDV AVLVHIWQQW QFHEFFQDDR WVTGMVINRC IDPVAKCNVQ
EWMTKTVLPA YIDTDPLSMN AFDIFRELDR LCQRETELQS YMFRKIQEKR PNSLDVFFYD
ITSTYVTGSR CVLTKFGYSR DHRPDCEQIV IALMITPDGY PFYWKLLEGN TQDVSTVCDL
IQNVKTCFPI QHCTMVFDRG MVSADNLKTL EKTNWDYVSA MDRDEINVLS FFETALPTPP
MPEDWEQVLA MQEFQPIDDD ILYYREFEDD NRRYIITFDV ARFLDEHQIQ RNKVEQINRW
LIKKNGDLKQ AKKSRNRDTL EREISKISKR FHVHKYLSVQ ITPCSRTVTT KTGKSRTVES
FQLSDTIDNT ALQKEQRLYG ITCFISNITQ ERISAQEIVQ WYRRKNKIEE AFREIKSHLE
LRPIYLTREK RVRAHVAVCM LAYFLRNDIE LQLKEHGISN STETVLALLA ECKANRWVFD
KSEAKTHLNI TKVSEKQQQI LKALGCESIV DVKHVKNILQ KAENWL
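
The protein sequence can be checked against the gene sequence by translating the codons. The sketch below assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code for every codon; the helper names are hypothetical. It translates the first and last rows of the gene sequence and compares them with the corresponding ends of the protein sequence.

```python
from itertools import product

# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_block: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(seq_block.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence (60 and 21 bases, both in frame).
first_row = "ATGTTTCTCA AGAAATCCGT TAAGACAGTC AAAGGGAAAA AATACTCCCA TTATAGTATA"
last_row = "AAGGCCGAAA ATTGGCTGTA G"

print(translate(first_row))  # MFLKKSVKTVKGKKYSHYSI
print(translate(last_row))   # KAENWL
```

The two outputs match the first 20 and the last 6 residues of the protein sequence. Translating the full gene sequence the same way should give all 586 residues.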