Gene Dtox_3084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3084 
Symbol 
ID8430078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3279271 
End bp3280761 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content46% 
IMG OID645035340 
Producthistidine kinase 
Protein accessionYP_003192459 
Protein GI258516237 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000309277 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00206907 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTTTCA GAAGTATTGT TGGCAAACTC TGGCTGGCTA TTCTGGTCTT TGTAGTGGTT 
ATTTTAGCAG TTATGAGTAT CGTACAAAAC TTCGTATTAA AGGATATTTA TCTTAACCAG
CAGTCTGAGC AGCTTATTGA GGAAGGCAGA AAACTGGCTG CATATATAGA GGGCCAACCA
TCTCAAGAGC AGTTAGCCGA CAGGATTAAA ACTGTTTCGG AGGTGTTGAA CGCCTCGGTC
ATGGTGGTTG ACCGCAGCGG TATAGTAATT CAAGGGCGGG GCGGCATGCA TTGGGGTATG
ATGAGGCGGC ATTTCCAGAT GTTTAATGAG CGGGGCCAAA TGCAGCAGCA GTTTAGCCTG
CCCATTGACC GGGAAGACCT GAAGCTATTG TTGGACGGTC AAACGGTTGT GCACAGGGGC
TACAACCAGG CTTTTGAGAT GGAGTTGCTC TGGGCTGCTA TCCCTGTTCG GGACAAGGAC
GCAACCCAAT CGGTAGTTTT TATTCATACT TTATTGCAAC CGATCGCTGA ACGAATTAAT
GTTTTGTATA CTGCCAGTTT TTACATATTG GGTGGTGTGC TGCTTTTAGC TGTGATACTC
AGTCTTTTTT TGTCACGCAG CCTGTCGAGG CCATTATTGC AGATGAACCA GACTGCTCAG
GCCATAACCA GAGGCGATTA CAGTCGCAGT GTACCGGTCA GGTCGAAGGA TGAGATTGGC
CTTTTAGCTG CTTCTCTGAA CAGCTTGTCG CAGGAAATAC AGGAAAAAAT TACTGCAATT
GAGAGATTGG ATCAAACCAG GCGGGATTTC GTGGCCAATG TATCTCATGA GTTGAGGACA
CCTTTAACAA TTATACAGGG GTATACAGAA GCCCTTTTAG ATGGTTTTGC CGAGACTGAA
GAGGAACGCC AGCAGTATCT GAACAATGTT TTGGAGGAAT CTCAGAGATT GCAGCGTTTG
GTTAATGATG TGTTGGATTT GCGCAGCTTG GAATCAGGAA AAATGGTTTT ATACAAGCAG
AAGGTTGATT TAACAAAGCT TTTGGCAAAA GTGGCAGGAA TATTTAAGCC TTTTTACGCC
CAGAAACAGG TGTTTTTAAG TCTTTCCGCT GAGGAGGTTT GGGAAGCCGT ATATGCTGAT
CCGGACCGGT TGGAACAGGT ATTTGTCAAT CTAATTGATA ACGCTTTGAG ACATACTCCG
GCAGGCGGCG AAGTAAAGAT TAGTTCAGCA CTTGAGACAG ATAAGGTTTT TGTTACTGTG
AAGGATTCGG GACCGGGAAT TTCTTCGAAA GATTTGCCCT TAATCTGGGA GAGATTTTAT
AAAGTTGATA AAGCCCGTTC TAGAGGCGGG GAAGGCAGCG GCTTAGGCCT GGCCATAACC
AAAGCCATTA TTGACGCTCA TGGAGGATGG GTAGAAGTGA CCAGTGAACC GGGCAGAGGG
ACTGTTATGA CTGTTTGCTT GCCTGTTTTC AGAGAAAATG CGGTTCACTA A
 
Protein sequence
MIFRSIVGKL WLAILVFVVV ILAVMSIVQN FVLKDIYLNQ QSEQLIEEGR KLAAYIEGQP 
SQEQLADRIK TVSEVLNASV MVVDRSGIVI QGRGGMHWGM MRRHFQMFNE RGQMQQQFSL
PIDREDLKLL LDGQTVVHRG YNQAFEMELL WAAIPVRDKD ATQSVVFIHT LLQPIAERIN
VLYTASFYIL GGVLLLAVIL SLFLSRSLSR PLLQMNQTAQ AITRGDYSRS VPVRSKDEIG
LLAASLNSLS QEIQEKITAI ERLDQTRRDF VANVSHELRT PLTIIQGYTE ALLDGFAETE
EERQQYLNNV LEESQRLQRL VNDVLDLRSL ESGKMVLYKQ KVDLTKLLAK VAGIFKPFYA
QKQVFLSLSA EEVWEAVYAD PDRLEQVFVN LIDNALRHTP AGGEVKISSA LETDKVFVTV
KDSGPGISSK DLPLIWERFY KVDKARSRGG EGSGLGLAIT KAIIDAHGGW VEVTSEPGRG
TVMTVCLPVF RENAVH