Gene Dtox_3007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3007 
Symbol 
ID8429997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3208750 
End bp3210273 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content39% 
IMG OID645035260 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_003192383 
Protein GI258516161 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.738719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGATC AAAGAATAAA TATATTAATG GTTGACGACC GTCCGGAAAA CTTGCTGGCT 
TTGGAGGCAG TGCTCATATC TCAGAATTAT AACCTGGTTC GAGCTTATTC AGGGGAAGAA
GCTTTAAAGT GTGTTTTAAA GAATGATTTC GCTTTGATCT TATTGGATGT TCAAATGCCG
GGACTGGATG GATTTGAAAC CGCCAGGTTA ATAAGATCAA GGAAAAAATC TAAAAATGTA
CCGATTGTTT TTATTACCGC AATAAATCAG GCTAATGAGT TTGTCGTCCA GGGATACAAA
CTTGGTGCTA TGGACTATAT TATTAAACCT TTTCATCCTG ATACATTAAA AATAAAAATT
GAATTATATA TTAGTATTTA CCGGGATCGT GAGAACCTTG AGAAGTTGGT GCAGGCGCGT
ACTTCTGATT TATCGGCAGC AAACAAGCGC CTGCAGCAAG AGGTTATGGA ACGAATAGAA
GTCTCGGAAA GGCTGGCAAA ATCAAACGAA AGAATTGCAA GCATTCTTGA AAGTACCACA
GACGCCTTTT TTGCACTGGA TGGCCAGTGG AGGTTTATAT ATGTTAACTC GGCGGCAGAA
CGATTATGGG GTATAACAAG AGAAAAACTA ATTGGTAAAA TCATGTGGGA TGAGTTCCCT
GAAGCCATTG GCACTTCTCT TGAAAGTGAG TATTATAAAG CAGTATTCCG GCAGAAGGCC
GTTCATTTTG AAGCATTTTT ACCTGTAATG CATAGATGGG TAGAGGTTCA CGCTTACCCT
TACCAGGATG GATTATCAGT TTATTTTCAT GATATAAGTG AGCGAAAACG TGCAGAGCGG
GAAATAGCAC GTTTGGATCG GCTGGATTTA GTTGGAGAAA TGGCTGCCGG TATTGCCCAC
GAGATAAGAA ACCCGATGAC TACAGTTAGG GGCTTCCTGC AATTGCTGGG AACTAAAAAG
GAGTGTGTTA AATATCAGGA ATACTATAGC CTAATGATTG ATGAGCTTGA TCGGGCTAAT
TCAATAATAT CTGAATTTCT TTCTTTAGCC AGAAACAATA CAAATGATTT TAAATTACAG
AGTCTTAATT GTATTATAAA TGTGTTAATG CCGCTAATTA CGGCTGATGC CATCAAGTCC
GATAAAAACA TAATAGCACA ATATGGAGAT ATTCAAGATT TATTATTGAA TGAACAGGAG
ATACGACAAA TTGTTCTTAA TCTTACTCGA AATGGACTGG AAGCAATGCC TTCCGGTAAG
ATTTTAACTA TAAAAACGTA TATGGAAGGT GATGAAGTTG TTCTGTCGGT ACAAGACCAG
GGGAAAGAAA TCGCCCCTGA TATACTGAAT AAAATGGGCA CACCTTTTTT TACTACCAAA
AACAATGGAA CGGGTTTAGG TCTGGCAACA TGTTACAGCA TCGCCAATAG ACATAACGCA
GCTATATCAG TAGAAACCGG TCCTGCCGGT ACCACTTTTT TTATACGGTT TAAAATACCA
AATGAACTGC GTCCAGATCT TTAG
 
Protein sequence
MDDQRINILM VDDRPENLLA LEAVLISQNY NLVRAYSGEE ALKCVLKNDF ALILLDVQMP 
GLDGFETARL IRSRKKSKNV PIVFITAINQ ANEFVVQGYK LGAMDYIIKP FHPDTLKIKI
ELYISIYRDR ENLEKLVQAR TSDLSAANKR LQQEVMERIE VSERLAKSNE RIASILESTT
DAFFALDGQW RFIYVNSAAE RLWGITREKL IGKIMWDEFP EAIGTSLESE YYKAVFRQKA
VHFEAFLPVM HRWVEVHAYP YQDGLSVYFH DISERKRAER EIARLDRLDL VGEMAAGIAH
EIRNPMTTVR GFLQLLGTKK ECVKYQEYYS LMIDELDRAN SIISEFLSLA RNNTNDFKLQ
SLNCIINVLM PLITADAIKS DKNIIAQYGD IQDLLLNEQE IRQIVLNLTR NGLEAMPSGK
ILTIKTYMEG DEVVLSVQDQ GKEIAPDILN KMGTPFFTTK NNGTGLGLAT CYSIANRHNA
AISVETGPAG TTFFIRFKIP NELRPDL