Gene Dtox_0945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0945 
Symbol 
ID8427884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp965732 
End bp967093 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content60% 
IMG OID645033285 
ProductRadical SAM domain protein 
Protein accessionYP_003190459 
Protein GI258514237 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00793831 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0982171 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCAAA TCGATCCTCG TCTTCATCTG TTTGAGGTTG ATGGCTCCAA TTACGCGGTC 
AGCACGGAAT ACATGCGGTT TGCCGGGATC GATCCGGAGA CGGCGGCGGT GGTAAGGCTG
GGCGGGCCGG CTGAACCGGT CCTGAGCCGG CTGGGGGTCC CGCTGGAACC TTGCGGACGG
AAGCGGTTCC TGTTTGCGCC GCCGGACACT CTGGTATTGA TGCTGACCTA TGCCTGCAAC
ATGGCCTGCC GGTATTGCTG CCAGGGAGAA ATTCCTGATA TCCGGGAAAA CCAGATGTCC
GAGGCAGTGT CGTGCCGGTC TGTCGACTGG CTGATCCGGA ACTCGGCAGG TTCGAAAACG
GTTAATATCG GATTTTTTGG CGGTGAGCCC TTGCTTAAAT TTTCGCTGAT GCAAAAAATT
GCCGTCTACG CCGAACGGCA AGCGGCAGCC GCCGACAAGC GGGTGCGCTA CGGCATCATG
ACCAACGGCC TGCTGCTGAC CGACCCTGTC ATTGATTTTC TAGCTGAGCG GCAAGTCGAG
GTTACCGTCA GTTTTGACGG GCCGCCTGCC GTACAAGACC GCAACCGGCC GCTGAAAGGC
GGCGGGGGCT CTTTCCATCT TATTGCTGAA AAAATCCGGA AACTGCTGGC GCGCTGCCCG
GACGCCACGC TCCGCCCCAC ACTCTATGCC GGCGCCGATC TTGACGAGGT GCTTCAAACA
GCCCGGCAGC TAGGTTTCCG GCAGTGCCGC ATTGAAAAAG TGTCCAGTTC GTTATTGCCG
GAAGGGAAAA AAAATGACGA AGCGGCATCT TCCGGCCAGT TGGCGGCCCA TTTGCGCCGC
CAGGGCGAAC GTTTTCTAAC CGCTGTCCGG GACCGGGATG CGGCCGGATT GCGGCGGATC
GCCGTGGATG GCGCGTTCAT GGAGGGATTA AGGCAAATGT TCCATGCGGA CTGGGCAGGC
ACGGTCCGGC GGCGTTGGTT TTCCTGCGGC ACCGGCAGGC AGCTGTTGGC GGTGGCGGTT
AACGGCGATT TGTATCCATG CCCGCGCTTC CTGGCCTTGC CGGAATACAG GGTGGGTTCG
GTTGCTGAGG ACGGTTTCCA GGGAGAACTG CACCAAAAAA GCCTTTTGAT CCACAGCGAG
GATTGCCGGT CCTGTTGGGC GCGTTATTTC TGCGGCGGCG CCTGCATTGT GGAACACTTG
GGTGGTACCG GTTCAATTTT CCGGGTCAAC CCGAACACTT GCCGCCTGCG GCGGGCCAGA
ATCGAAACAG CCGTCCGGGT GATTGCCGAA TGCTCAGACG AGGATAAAGC GTTCTTGCAG
GAGACCGGCG TTTTGCCGGG CCGGCTGGCG GTAAAAGCGT GA
 
Protein sequence
MGQIDPRLHL FEVDGSNYAV STEYMRFAGI DPETAAVVRL GGPAEPVLSR LGVPLEPCGR 
KRFLFAPPDT LVLMLTYACN MACRYCCQGE IPDIRENQMS EAVSCRSVDW LIRNSAGSKT
VNIGFFGGEP LLKFSLMQKI AVYAERQAAA ADKRVRYGIM TNGLLLTDPV IDFLAERQVE
VTVSFDGPPA VQDRNRPLKG GGGSFHLIAE KIRKLLARCP DATLRPTLYA GADLDEVLQT
ARQLGFRQCR IEKVSSSLLP EGKKNDEAAS SGQLAAHLRR QGERFLTAVR DRDAAGLRRI
AVDGAFMEGL RQMFHADWAG TVRRRWFSCG TGRQLLAVAV NGDLYPCPRF LALPEYRVGS
VAEDGFQGEL HQKSLLIHSE DCRSCWARYF CGGACIVEHL GGTGSIFRVN PNTCRLRRAR
IETAVRVIAE CSDEDKAFLQ ETGVLPGRLA VKA