Gene Dbac_1085 details


Gene Information

Locus tag: Dbac_1085
Symbol:
ID: 8376747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 1190663
End bp: 1193122
Gene Length: 2460 bp
Protein Length: 819 aa
Translation table: 11
GC content: 61%
IMG OID: 645000321
Product: sulfatase
Protein accession: YP_003157609
Protein GI: 256828881
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
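
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the start and end positions are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1190663
end_bp = 1193122
gene_length_bp = 2460
protein_length_aa = 819

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons, and the last
# codon is a stop codon that does not contribute an amino acid.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions, all three checks hold for the values listed in this record.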


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.141132
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTACAC CACGAAGGGC GCTGACCGCC CTGGGCGGCC TGGGCATGGC CCTGGCCGCG 
TTCTTCATGC AGCAGCCCGT CGTCGCCCTG GCGCAGCAGG CGTCGGATGG CCTGGACCGT
ACGGTTCTGC CCATAACGGA GCCAAAGCGC GAGAGGTCCA AGGAAATCGA TGCGAGCAAA
GCCATTGCCC CGCCCAGGTT TGCCGTCACG CCTCCCAAGG GTGCACCCAA CGTGGTCGTC
GTGCTCATCG ACGATCTCGG CTTTGCCGGG ACCAGCGCCT TCGGCGGGCC CATCGACACG
CCCACCTTCG ACCGCATCGC CGGTGAAGGC GTGTATTACA ACAACTTTCA CACCACGGCG
GTCTCCTCGC CTACGCGGTC GGCCCTCAAA AGCGGTCGCA ACCACCACGT CAACAACATG
GGCGGCATCA CCGAGATGGG CACGGCTTTT CCCGGCAACA CCGGGCAGAT CCCCGGCGAA
GTCGCCCCGG TTGCCGAGAT GCTGCGTCTG AACGGATACA GCACCGCCGC CTTTGGCAAA
TGGCACGAAA CCGCAGCCTG GGAGACCAGC GTGTCAGGGC CGTTCGATCG TTGGCCGACT
CGCCAGGGCT TCGACAAGTT CTACGGGTTC CTGGGCGGCG AGACCAACCA GTGGGCACCG
TTCATCTATG ATGGCACCCA TCAGGTGGAA CTGCCTGATG ATCCTGACTA TCATTTTATG
ACTGACATGA CCGACCAAAC CGTGGCCTGG ATCAAACATC AGAAGGCCCT GACCCCGGAC
AAGCCGTTTT TCGTCTATTT CGCCCCCGGA GCGGTTCATG CGCCGCATCA TGTGCCCCAG
GAATGGATTG CCAAGCAGAA GGGCAAGTTC GATCAGGGCT GGGATACGCT GCGCGAACAA
ATTCTGGCCC GGCAGATTGA GCGCGGCGTG GTGCCCAAGG GCACGAAGCT CGCCCCCAAG
CCCGAGGCGA TTCCGGACTG GAACTCTCTG TCCGATGATG AAAAGCGCCT CTTCAGCCGC
CAGGCCGAAG TCTTTGCGGC TTTTCTGGAC ATGACCGACT ATGAGGTAGG TAGGGTGATC
AAGGCCCTGG AGGAGACGGG CCAACTTGAC AACACGATGG TCATCTTCGT TTACGGCGAC
AACGGCACCA GCGCCGAGGG CGGGCGTAAC GGCATGTTCA GCGAGATGAC GTATTTCAAC
GGCGTGCAGG AAACCGTGCC GGATATGCTC AAATTCATCG ACAAGTGGGG CGGCCCCGAG
ACCTATCCGC ACATGGCGGC CGGTTGGGCC GTGGCCCTCG ACACTCCGTA CCAGTGGACC
AAGCAGGTTG CCTCGGACCA CGGGGGAACC AAGGTCGGCA TGGCCATCCG CTGGCCCGCC
GGTATGAAGG CCAAGGGTGA GCTTCGCAGA CAGTTTCACC ATGTCATCGA CGTGGCTCCG
ACCATTTTGG AGGCCGCCAA TCTGCCCGAA CCAAAGACTG TCAACGGTGT CGAGCAGTAT
CCCATGGACG GCGTGAGCAT GGTTTACAGC TTCGATGACG CCAAGGCCCA GGAACGGCAC
ACGACCCAGT ACTTCGAGAT GTCCGGCAAC CGCGCCATCT ATCACGACGG TTGGTTTGCA
CGGACCATCT TCAAGGCGCC TTGGGAAGCC AAGCCCCGCC GTGACGTGGC GGACGATTCC
GCATGGGAAC TCTACGACAC TCGCACCGAC TTCAGCCTGA TCAATGACCT CTCAAAGCAA
AACCCGCAGA AGCTTAAAGA AATGCAGGCG TTGTTCCTGG TGGAAGCGGA GCGGAATTTT
GCGCTGCCCA TGGAAGGCCG CATCTTCGAG CGTCTCAATG CGGAACTGGT AGGGCGCCCC
GACCTGATGG CCGGCCGGAC GTCCATCACC CTGGCCGGTG GCATGACCGG CATGGGTGAA
AACGTGTTCC TGAACATCAA GAACAAGTCC AAGACCGTCA CCGCCGAGAT CGAGGTGCCC
GAAGACAAGG ATGCCAACGG CATCATCATT GCCCAGGGTG GTCGTTTCGG CGGCTGGGCC
ATGTACGTCA AGGACGGTGT GCCAGCCTAC GACTACAATT TCCTGGGCAT GGAGCGTACG
ACCGTAACCG GGACCGAGAA GCTTAAGGGC GGAAAATACA CGCTGCGCTT CGAGTTCGCC
TACGATGGCG GAGGGTTTGG CAAGGGCGGG ATGGGAACGC TTTATGTGAA CGACAAGAAG
GTCGGAGAGG GTCGCATCGA GCGCACGCAG CCAATGATAT TCTCCGCGGA TGAAACCGCC
GATGTGGGCA TCGACCTGGC GACGCCGGTG GTGGAATCCA TCGGGGCGGA AGCGAGGTCC
CGCTTCAACG GCCGGATTCA GAAGGTGACA GTGGAAGTCA AGGCCGCAAA GCCTGCAGAG
AAGGCCGAAG CCGACGCCGC AGCTACCCTC CTGGCCCATA AGAAGGCCCT GGCGGATTAG
 
Protein sequence
MATPRRALTA LGGLGMALAA FFMQQPVVAL AQQASDGLDR TVLPITEPKR ERSKEIDASK 
AIAPPRFAVT PPKGAPNVVV VLIDDLGFAG TSAFGGPIDT PTFDRIAGEG VYYNNFHTTA
VSSPTRSALK SGRNHHVNNM GGITEMGTAF PGNTGQIPGE VAPVAEMLRL NGYSTAAFGK
WHETAAWETS VSGPFDRWPT RQGFDKFYGF LGGETNQWAP FIYDGTHQVE LPDDPDYHFM
TDMTDQTVAW IKHQKALTPD KPFFVYFAPG AVHAPHHVPQ EWIAKQKGKF DQGWDTLREQ
ILARQIERGV VPKGTKLAPK PEAIPDWNSL SDDEKRLFSR QAEVFAAFLD MTDYEVGRVI
KALEETGQLD NTMVIFVYGD NGTSAEGGRN GMFSEMTYFN GVQETVPDML KFIDKWGGPE
TYPHMAAGWA VALDTPYQWT KQVASDHGGT KVGMAIRWPA GMKAKGELRR QFHHVIDVAP
TILEAANLPE PKTVNGVEQY PMDGVSMVYS FDDAKAQERH TTQYFEMSGN RAIYHDGWFA
RTIFKAPWEA KPRRDVADDS AWELYDTRTD FSLINDLSKQ NPQKLKEMQA LFLVEAERNF
ALPMEGRIFE RLNAELVGRP DLMAGRTSIT LAGGMTGMGE NVFLNIKNKS KTVTAEIEVP
EDKDANGIII AQGGRFGGWA MYVKDGVPAY DYNFLGMERT TVTGTEKLKG GKYTLRFEFA
YDGGGFGKGG MGTLYVNDKK VGEGRIERTQ PMIFSADETA DVGIDLATPV VESIGAEARS
RFNGRIQKVT VEVKAAKPAE KAEADAAATL LAHKKALAD
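
The sequences above are printed in space-separated groups of ten characters, so the spaces and line breaks need to be removed before the sequence is used elsewhere. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first line of the gene sequence rather than the full 2460 bp.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record, used as a short demo.
first_line = "ATGGCTACAC CACGAAGGGC GCTGACCGCC CTGGGCGGCC TGGGCATGGC CCTGGCCGCG"
dna = clean_sequence(first_line)

print(len(dna))                    # number of bases after removing spaces
print(round(gc_fraction(dna), 2))  # GC fraction of this first line only
```

To compare against the GC content field of the record, the full gene sequence block would be passed to `clean_sequence` in place of `first_line`.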