Gene Dtox_2994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_2994 
Symbol 
ID8429984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3185218 
End bp3186318 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content43% 
IMG OID645035248 
ProductRadical SAM domain protein 
Protein accessionYP_003192371 
Protein GI258516149 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00145585 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.391108 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGATA AGCAAATTTG TTTCATTGTA AAAGTTACGA CACGCTGCAA TCTTGCTTGC 
AGTTACTGTT ATGAAGAAAA GACCGATGAG GACATGGACC TTTCAGTAGT AGAATCTTTG
ACACAAAAGG CTTTAGCCGC TACTTCCCGT GTGCAGTTTT GCTGGCATGG CGGGGAAACT
TTGCTAAGGG GAATCAGTTT TTATGAAGAG GTGGTTTCCT GCCAGAAGAG ATTCAGGGGT
GAGCACAATA AAATCTTAAA TACTCTGCAG ACTAATGGCT TATTGTTGGA CGAGGACTGG
TATGCCTGGT TTTCCGGGAG CGGGTTCAGA GTGGGGATTT CCTGTGACGG TCCTACCTGC
CATGATATAA ATCGCAAAAC CATTGCCGGT AACGGGTCCT TCAAAAATAT ATTGACTACC
TTATCAAAGA TGCGAGAGAC TAAAAATGAC CGGCTTTGTG GGGGATTACT GGCAGTTGTA
ACTCCTGAAA TGCTGGAACA CAGTGAAAAT TTGTTGGAAG AATTTATTTT GCTTGGGGTT
AAAAAGCTTG ACTTTTTAAG GTACAAAGCA CCGGACGGTG GGCTTTCAAC TGAAGAATAT
TACGGTTTTA TAAGAAGCAT TTTTAACCAG TGGCTTAAAC TGGATGATGC TTCACTCAAA
ATTCGCACCA TCGACAGTAC CTTGAATTAT TTTATCCGTG GCAAATCACG TCTTTGTCGT
TACCTTGGTG ACTGTAAGAG ATTTTTGACC GTAAGGCCAA ATGGCGATGT ATACCCTTGC
GAGTGCCTGC ATGGTAGTTC TATGTATCTT AAGTTAGGAA ATATCCTGCT GGACGATTTA
AATGATATCT ACAGAAAGGC TGGGCAGGTA ATCAAGCAGC ATAAATTACC GCCTTCCTGC
AGCGACTGCT TCTTTACCGC CCTCTGTCGT AATGCATGTG CCGGGGCGGA AAGGCTGGAA
GGAGTGTGTT TAGAGAAAAA AATGTTCTTC CGGGATATCT CTAATTTGGT TCAAACAGTA
CGAGAAAAAA ATGTAAATGC AGAAAGGAGG TCGTGGTTGT GGGTAGCAGC AACATTGAGA
ATGATATTGC AAGGAGATTA G
 
Protein sequence
MDDKQICFIV KVTTRCNLAC SYCYEEKTDE DMDLSVVESL TQKALAATSR VQFCWHGGET 
LLRGISFYEE VVSCQKRFRG EHNKILNTLQ TNGLLLDEDW YAWFSGSGFR VGISCDGPTC
HDINRKTIAG NGSFKNILTT LSKMRETKND RLCGGLLAVV TPEMLEHSEN LLEEFILLGV
KKLDFLRYKA PDGGLSTEEY YGFIRSIFNQ WLKLDDASLK IRTIDSTLNY FIRGKSRLCR
YLGDCKRFLT VRPNGDVYPC ECLHGSSMYL KLGNILLDDL NDIYRKAGQV IKQHKLPPSC
SDCFFTALCR NACAGAERLE GVCLEKKMFF RDISNLVQTV REKNVNAERR SWLWVAATLR
MILQGD