Gene Dtox_2055 details

Gene Information

Locus tag: Dtox_2055
Symbol:
ID: 8429037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2233983
End bp: 2235788
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 40%
IMG OID: 645034376
Product: sulfatase
Protein accession: YP_003191507
Protein GI: 258515285
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
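
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2233983
end_bp = 2235788

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1806
print(protein_length)  # 601
```

Both results agree with the Gene Length and Protein Length fields in the record.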


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACTAATA AATTATTGAA TCAACATATA CATGAAATGC CTCTTTGCCA TAAACCAAAT 
TTTCTTGTTA TTTTAGTAGA TCAACAGCGT TATGCTGTCA GCTATGAGAA CGAAGAAATA
AAGGTATGGA GAAAGACTCG TTTAAAAGCC CAAGAATTTT TAAAAAGCCG GGGATTTGAA
TTCAAGAATC ATTATGCCGG AAGTGCCGCC TGTTGTCCCA GTCGTGCCAC CCTTTATACC
GGGCAATACC CCTCTCTTCA TGGTGTCAGC CAAACAGACG GAGCTGCTAA GGGAGCATAT
GATCCAGACA TGTTCTGGCT TAATCCCAAT ACAGTACCAA CTATGGGAGA TTATTTTAGG
ACAGCAGGTT ACCAAACCTA CTGGAAGGGG AAATGGCATG CCTCTGCCGC AGATATTCTA
GTTCCCGGCA CCCATAAGCC ATTTCTCTCT TATAATCAGG GAAATGGTGT ACCTATTCCG
GATAATGAAA AGCTATATAT TAATGCAAAT GTACTTGCAA GTTTTGGATT TAACGGTTGG
ATAGGGCCCG AGCCCCATGG TGTTAACCCG AGGAATACGG GTTCTTCTGC GGCTGCCGGA
TTAAGCGGAA GAGATGTGGT GTATAGTCAG GATACAGTAG AACTGATTAG AGTATTAGAA
AAAGAATACA ACGAATCAGA CGAGTGCCGG CCAAGGCCAT GGCTTATAAT GTGCTCCTTT
GTAAATCCCC ATGATATAGC CTTGTTTGGG GCTATTAGCG GGTCATTGCC GCAATTTAAT
TTTAAAGTTA ATCTCTCTGT TCCCTACATT TCACCCGCAC CTACGGCATC AGAGTCCTTA
TTAACTAAAC CAAGTGCCCA ATCCAGCTAT AGAAGAATTT ATGCTTATGC CTTTCAGCCG
CTGCTGGATA CTTTGTTCTA TCGCCAGCTC TATTATAGTC TGGAAATGGA AGCAGATACC
CAAATATGCA GGGTTATTAA TGCCCTCAGG GAAACATCCT TTTATAATAA TACTATTATA
ATCTTCACTT CCGATCATGG GGAGCTTCTT GGAGCCCATG GTCTATTTCA AAAATGGTAT
CAGGCTTATG AAGAATCAAT CCATGTACCA TTAATAATTC ATAACCCAAC TCTTTTTGAC
AAACCTGAAT CTACAGATAT GCTGACCAGT CATGTAGATA TTCTGCCCAC AATGTTAGGA
ATAAGCGGGC TGGATACAGG GGCAATTCAT AAAGTTTTAG CTAACAGCCA TACGGAGGTT
CATTCTCTTG TTGGCAGAAA TCTTTCGCCA TTGCTGAAGA GTAAGACAGA TTTTATAGAA
GCCGGTGAGG CAATTTACTT TATGACCGAT GATAACATTA CTAAAGGTCT TAACCAGATA
AGTTTTGCGG GTGTACCTTA TCATTCTGTA GCTCAGCCTA ATTCTATAGA GACAGTAATT
GCGGCTCTTC CTACCGGCAG GGGCGGAACA AAGCAAATAT GGAAGTACTC ACGTTACTTT
GATAACCCTC ATTTTTGGAA CATTTCCGGC AGAAGAGATC AGTTTGTTTA TAATGGACCA
GTGAGAAGAA AATTCAATCC ATGCAATTAT AATGATACTC CCATTAGACC TCAAGCTGAC
CAATATGAAA TATACAATAT TACAACCGAT CCTTTGGAAA TACGAAATGT TTCTTATGAG
TCATATAACA ACCGATATTT TATGCAAATC AGGGAAATAC TCAATGAACT TTTAGAAGAA
CAACGTAAAA AGAAAAGGTT GTACCCTGTT AGCGGAAATG TGCATGGTAA AACCCCCAAT
TACTAA
 
Protein sequence
MTNKLLNQHI HEMPLCHKPN FLVILVDQQR YAVSYENEEI KVWRKTRLKA QEFLKSRGFE 
FKNHYAGSAA CCPSRATLYT GQYPSLHGVS QTDGAAKGAY DPDMFWLNPN TVPTMGDYFR
TAGYQTYWKG KWHASAADIL VPGTHKPFLS YNQGNGVPIP DNEKLYINAN VLASFGFNGW
IGPEPHGVNP RNTGSSAAAG LSGRDVVYSQ DTVELIRVLE KEYNESDECR PRPWLIMCSF
VNPHDIALFG AISGSLPQFN FKVNLSVPYI SPAPTASESL LTKPSAQSSY RRIYAYAFQP
LLDTLFYRQL YYSLEMEADT QICRVINALR ETSFYNNTII IFTSDHGELL GAHGLFQKWY
QAYEESIHVP LIIHNPTLFD KPESTDMLTS HVDILPTMLG ISGLDTGAIH KVLANSHTEV
HSLVGRNLSP LLKSKTDFIE AGEAIYFMTD DNITKGLNQI SFAGVPYHSV AQPNSIETVI
AALPTGRGGT KQIWKYSRYF DNPHFWNISG RRDQFVYNGP VRRKFNPCNY NDTPIRPQAD
QYEIYNITTD PLEIRNVSYE SYNNRYFMQI REILNELLEE QRKKKRLYPV SGNVHGKTPN
Y
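
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below translates the first 60 bases of the gene sequence and the final six bases. The `translate` helper and the reduced `START_CODONS` set are illustrative assumptions, not the database's own tooling, and the set lists only a subset of the table 11 start codons.

```python
# Standard codon-to-amino-acid assignments (shared by translation table 11),
# laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the table 11 initiation codons, enough for this example.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, reading a leading start codon as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record.
first_60 = "TTGACTAATA AATTATTGAA TCAACATATA CATGAAATGC CTCTTTGCCA TAAACCAAAT"
print(translate(first_60))  # MTNKLLNQHIHEMPLCHKPN

# Last six bases of the gene sequence: one residue followed by a stop codon.
print(translate("TACTAA"))  # Y
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final residue.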