Gene Dtox_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2234 
Symbol 
ID8429217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2403788 
End bp2405107 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content36% 
IMG OID645034542 
Producthypothetical protein 
Protein accessionYP_003191672 
Protein GI258515450 
COG category[V] Defense mechanisms 
COG ID[COG4268] McrBC 5-methylcytosine restriction system component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000177538 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGGATA GCCAGGTAAT TAGAGCGAAG GATTGCTTAA CTTTTGAGGT TACGTCTATA 
TGTGAAGAAG CAATTCGCCA TTTTGCGCAT TCCTATGATT GCACAAAGAA TGTTTTCAGA
TTTTCACCTA AGCGTTATGC TGACGATGAT GCTTCTAAGG TATTAGAGTA TGATTACTGG
CAAAGGATGT GGCGGGCAGG GCGCTATGTT GGTAACGCAA GTTTCAGTTA TAACAATCGG
AATTATGACA TTGTTATAGA GCCACGGTAT GGAGAACTGT TCCTGTTCAA GATGATAGAA
GAAATATTCA ATGTTAAGTT GGTGACTTCA AATGCAGCAT TACAAAGAAA GAACGATTTT
GGCTTCTTGA TAAGGCGGCT AATCTCTTTT ATTTGGCTGC ATAAGTTGGC AAATGCTAAT
AAGCATGGGC TTCCTCGTCA TAACGTTAAG AAAAATTACA CAGGATATAA TGTTAAAGGA
CGAATCAACG TTAAAAAATC AGTTATTTCT TTGCTTACAA AGGAACAAGT TGTTTCTGAA
TTTTACGAGA AAGAGATAGA CGAAACCATA GCCAGGATAC TAGTACAGGC TTATTGGATC
TTAGTTAAGG ATTATGAACT AGGGATATTA AGCCTTCCGG ATAACGCTCG TGAAATAATC
AACCTTTTAA AAAGCTCTCG GTTCTCAAGT CAGAGTGTTA GCCAAAACGA ATATGACAGA
ATTAACTATA AAGAGATTTA CCAGTCATTT AGAGAAGTAG TAGATTTCTC CTGGGATATA
ATAAATAATA AAACACCATC AAAGTCCGTA TGCACACAAT CACAGAATGG CTTTAGTTTT
TTTATCGATA TGGCAGAAAT CTGGGAACTG TATATTCGTA CAGCATTAAG CAAACATCTT
AAAAAGGATA AATGGAAGGT AATGTTGGAT TATGCTGTAG TTTATGAAGA TACTTTTTTC
AAAAGATGTC TGATACCCGA TATAGTTGTG AAAAGAGGCG CTGATGTTGC TGTTTTTGAT
GCAAAATACA AAGCCATGAA TTATAGCAGT CTGGATGTTG ATAGAAATGA CTTTTTTCAA
ATCCATACCT ATATGAATTA TTACGCACAA GGAAAAAGAT TACTGGCTGG CGGTTTGATT
TACCCCTTAG AGAAATCTCT TCCGTTAAAT TGTAAAAGTG ATTCTCTATT CAGTTCAAAT
GAGACAAATG CTACATTTTT TATCGATGGG TTTAATGTTA CCGATGCAGG AGAGTTCGAA
AAAGAGAAAA TGGAATTTCT ACAACGAATT TCTGCGGTGC TCGCTATTAC AAATACATAA
 
Protein sequence
MLDSQVIRAK DCLTFEVTSI CEEAIRHFAH SYDCTKNVFR FSPKRYADDD ASKVLEYDYW 
QRMWRAGRYV GNASFSYNNR NYDIVIEPRY GELFLFKMIE EIFNVKLVTS NAALQRKNDF
GFLIRRLISF IWLHKLANAN KHGLPRHNVK KNYTGYNVKG RINVKKSVIS LLTKEQVVSE
FYEKEIDETI ARILVQAYWI LVKDYELGIL SLPDNAREII NLLKSSRFSS QSVSQNEYDR
INYKEIYQSF REVVDFSWDI INNKTPSKSV CTQSQNGFSF FIDMAEIWEL YIRTALSKHL
KKDKWKVMLD YAVVYEDTFF KRCLIPDIVV KRGADVAVFD AKYKAMNYSS LDVDRNDFFQ
IHTYMNYYAQ GKRLLAGGLI YPLEKSLPLN CKSDSLFSSN ETNATFFIDG FNVTDAGEFE
KEKMEFLQRI SAVLAITNT