Gene Dtox_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2023 
Symbol 
ID8429005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp2193883 
End bp2195304 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content44% 
IMG OID645034350 
Producthistidine kinase 
Protein accessionYP_003191481 
Protein GI258515259 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.14541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.618926 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTAAACA GAAGTAGCAA GAAAGATATA AAAACCGTAC CCTTATTAAC CTATTGGACA 
CGCTTTTATT TTCTGGTGCT GATGTCCTGC CTGTTGCTGC TGGCTGTCAT TGCCGTTATC
TGGATCAGGC TGACTGTCTA TGATAATCGT TACCAGATTT TAGAAGTAAG GGCTGAAAAG
CTGGCTGAAA TTTATGAAGA TACTGCCGGA CAAAACTTGT TGCCCAGAAA TATGAACCGC
ACAAGAGAAA TACGTTCGGA ACTGTTCCTG GTGCAGATAG TGGATCGCGC CGGAAATATT
CAAACCATCA GGAAAGATAA AGCCAGCAAC AAAAATGCGG CAGCGATAAA CAATCTGTCT
AAACTGAATG GTCAAATATT GGCCGGGCAA AAGATTCGTG AGCAATTTAA AATTGGCAAT
CTCACCTGGC TGCGAGTAGG TGTGCCTTTA TATCAAAAAG GAATAATCAC CGGTGCTTTA
TATATAAGTA TTCCGGCGAG CGGTTTGCTG GCAGATTTAC ACTTTATGCA TATTTCCATA
TTTTTAATTA TTGCTATTAT CGTGCTGGCG GGCTGGCTGG TATTGTATTT TCTATCTCGA
AAGCTTACCC GTCCTCTGCT GGCAATTGCC GGGGCTGCGC AGGCAATTGC CTCCGGTCAT
TATGAACCTC GTCTGCCTCA AAATCTGAAG GAGCGTGAAC TGCAGCAGCT TGTGACGTCT
TTTAGTGATA TGGCCTTTCA ATTAAAGCAG CTGGAGCGCA TGAGAGCGGA TCTCCTGGCA
GGTGTATCAC ATGAACTGCG TACGCCTGTT ACCTCTATCC GGGGGATGAT TCAGGCGGTG
CATGGTAAAG TTGTGACTGG ACCGGAAGCA GACGAGTTTT TGTATATTTC GTTAAATGAA
TCAATTAGAT TGCAGCAAAT GGTTGAGGAT TTACTGGATT TTTCTACATT TGAAGCAGGT
GCGGAACCTA TTGAAAAAGA ATACTTTAAT TTATTAGGCT TATTAGATGA AGTTATATGC
CAGGTCAGTA TTTTAGCAGG TTATGAGCAG ATTAGCTTTG ACAGGAAAGG TTTGGACCAG
GAAGTATGGA TGACAGGTGA CAGGGGCCGC CTGCGGCAGG TTTTTCTGAA TTTATTTAGC
AACAGCCAGA AAGCTTCTGC TACAGTAATT AAAATCGTGC TGCGGGTAGG TGAGGACAAC
ATCGAGATAG ATGTCGAGGA CAACGGAAAA GGGATTGAAG AAGCAGACAG GGAGTATATA
TTCGAAAGGT TTTACCGTGG ACAAGGTAAA AAAATGAAGC CACGGGGGTT GGGATTGGGT
TTAACGGTCA GCAGGATGTT GTCACGGGCC CACGGAGGGG ATCTGTTTTT ATTGAGAACC
TCGTCGGAGG GCAGTGTCTT TTGTTTAACT CTGCCACTAT AG
 
Protein sequence
MLNRSSKKDI KTVPLLTYWT RFYFLVLMSC LLLLAVIAVI WIRLTVYDNR YQILEVRAEK 
LAEIYEDTAG QNLLPRNMNR TREIRSELFL VQIVDRAGNI QTIRKDKASN KNAAAINNLS
KLNGQILAGQ KIREQFKIGN LTWLRVGVPL YQKGIITGAL YISIPASGLL ADLHFMHISI
FLIIAIIVLA GWLVLYFLSR KLTRPLLAIA GAAQAIASGH YEPRLPQNLK ERELQQLVTS
FSDMAFQLKQ LERMRADLLA GVSHELRTPV TSIRGMIQAV HGKVVTGPEA DEFLYISLNE
SIRLQQMVED LLDFSTFEAG AEPIEKEYFN LLGLLDEVIC QVSILAGYEQ ISFDRKGLDQ
EVWMTGDRGR LRQVFLNLFS NSQKASATVI KIVLRVGEDN IEIDVEDNGK GIEEADREYI
FERFYRGQGK KMKPRGLGLG LTVSRMLSRA HGGDLFLLRT SSEGSVFCLT LPL