Gene Mjls_3978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3978 
Symbol 
ID4879687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4202925 
End bp4205306 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content67% 
IMG OID640141290 
Productsulfatase 
Protein accessionYP_001072244 
Protein GI126436553 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0573144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGATT CGTCACCCCA GAACACCGGA CAGCAGGCAG CGGCCGCGGT GCGGCGCGAC 
GTGCTTCCGA TCCCGGATCC CCGACACGTC GGGTTGACGA CGTACGACGC GAAGGATCCG
AATACGACGT ATCCGCCGAT CACGCGGCTG CGCCCACCCG ACGGTGCGCC GAATGTGCTG
ATCGTGCTGA TCGACGACGT GGGTTTCGGC GCGAGTTCGG CCTTCGGCGG CCCGTGCCGC
ACACCGGTCG CGGAGCGGCT GGCGGCCAAC GGTGTGAAAC TCAACCGCTT CCACACCACC
GCGCTGTGCT CGCCGACGCG CCAGGCGCTG CTCACCGGAC GCAACCACCA CTCGGTGGGG
ATGGGCGGTG TCACCGAGAT CGCGACGTCC GCGCCGGGAT ACAGCAGCAT CCGGCCGAAG
GACAAGGCGC CGATCTCGGA AACGCTTCGG CTCAACGGAT ACTCGACGAG TCAGTTCGGC
AAGTGCCACG AGGTCCCGGT GTGGGAGGTG TCGCCGGTCG GGCCGTTCGA CCAGTGGCCC
ACCGGGTCGG GGTTCGAGCA CTTCTACGGG TTCGTCGGCG GCGAGGCCAA CCAGTACTAC
CCCGGACTGT ACGAGGGCAC CACACCCGTC GAACCGGCCA AGACGCCCGA GCAGGGTTAC
ACCCTGACCG AGGACCTCGC CGACCGGGCG ATCACCTGGG TGCGCCAACA GCAGGCGCTC
ACCCCGGACA AGCCGTTCTT CATGTATTTC GCGCCCGGTG CCACCCACGC GCCACACCAC
GTGCCCCGGG AGTGGTCCGA CCGGTACCGC GGCGAGTTCG ACGCCGGCTG GGACGTGTTG
CGCGAGCAGA TCTTCGCCCG CCAGAAGGAA CTCGGCGTGA TCCCGCAGGA CGCCGAGCTG
ACCCGGCGCC ACGACGAGAT CCCCGGCTGG GACGACATGC CCGACCGGCT CAAGCCGGTG
CTCGCCCGGC AGATGGAGAT CTATGCCGGC TTCATGGAGC AGACCGACCA CGAGGTCGGG
CGGCTGATCG ACGCGGTCGA CGGCCTCGGC GCACTCGACA ACACGCTGAT CTACTACATC
ATCGGCGACA ACGGCGCGTC GGCGGAGGGC ACGCCGAACG GGTGCTTCAA CGAGATGTGC
ACGCTCAACG GGCTGGCCGG GATCGAGACC GAGGATTTCC TGCTGTCCAA GATCGACGAC
TTCGGCACGC CGGACGCGTA CAACCACTAT GCGGTGGGCT GGGCGCACGC CCTGTGCGCG
CCGTATCAGT GGACCAAGCA GGTCGCCTCC CACTGGGGCG GCACCCGCAA CGGCACGATC
GTGCACTGGC CCGCGGGGAT CGACGCCGTC GGCCAGACCC GCGACCAGTT CCACCACGTG
ATCGACGTCG TGCCAACGAT TCTCGAGGCG GCCGACATCC CCGCACCACT GATGGTCAAC
GGCATCGCGC AGGCGCCCAT CGAGGGCTTC AGCATGATGT CGACGCTGCG GGCCGCCGAT
GCCGCCGAGA CCCACCGGGT GCAGTACTTC GAGATGTTCG GCAACCGCGG CATCTACCAC
GACGGTTGGA CGGCGGTCAC CAAACACCGG ACGCCGTGGC TGTCCGACCA GCCGGCGCTC
GAGGACGACG TCTGGGAGCT GTACGGGCCG GGCGACTGGA CCCAGGCGCA CGACCTGGCG
GCCGATGATC CGGCGAAACT CGCCGAACTA CAGCGTCTCT GGCTCATCGA GGCGGTCAAG
TACGACGTGT TGCCGATCGA CGACAGGTCG TACGAGCGCT TCAACCCGGA CATCTCCGGG
CGGCCGGTGC TCATCACCGG CAGCACCCAG ACGATCTTCC CCGGTATGCG GCTGATGGAG
AGCTGCGTGC TCAACATCAA GAACAAGTCG CATTCGGTGA CGGCCGAGCT GTCGGTGCCC
GAGTCCGGTG CGCAGGGGGT GGTGGTCAGC CAGGGCGGCG GTGTCGGTGG TTGGTGTCTG
TACGCGCACG AGGGCCGGTT GAAATACTGT TACAACTTCC TCGGCATCGA GTACTACTAC
GTCACCGCCG AGGCTCCGCT CTCGGCCGGC GATCACCTCC TGCGTATGGA ATTCGCCTAC
GACGGTGGGG GTTTGGGCAA GGGCGGCACC GTCACGCTCT TCTGTGACGG TGAAGCCGTC
GGGACGGGGC GGGTCGACCA GAGCGAGCCG ATGGCGTTCT CCGCCGACGA GATGTGCGAC
GTCGGCTCGG ACAGCGGCTC GCCGACGTCA CCCGACTACG GGCCGCACGA CAACGCTTTC
ACCGGCCGAA TCGACTGGGT GAAGATCGAT ATCGGCGCCG CCGACCACGA CCACCTGATC
ACCGCGGAGG ACAAGCTGAA CATCGCGATG TCGCGGCAGT GA
 
Protein sequence
MSDSSPQNTG QQAAAAVRRD VLPIPDPRHV GLTTYDAKDP NTTYPPITRL RPPDGAPNVL 
IVLIDDVGFG ASSAFGGPCR TPVAERLAAN GVKLNRFHTT ALCSPTRQAL LTGRNHHSVG
MGGVTEIATS APGYSSIRPK DKAPISETLR LNGYSTSQFG KCHEVPVWEV SPVGPFDQWP
TGSGFEHFYG FVGGEANQYY PGLYEGTTPV EPAKTPEQGY TLTEDLADRA ITWVRQQQAL
TPDKPFFMYF APGATHAPHH VPREWSDRYR GEFDAGWDVL REQIFARQKE LGVIPQDAEL
TRRHDEIPGW DDMPDRLKPV LARQMEIYAG FMEQTDHEVG RLIDAVDGLG ALDNTLIYYI
IGDNGASAEG TPNGCFNEMC TLNGLAGIET EDFLLSKIDD FGTPDAYNHY AVGWAHALCA
PYQWTKQVAS HWGGTRNGTI VHWPAGIDAV GQTRDQFHHV IDVVPTILEA ADIPAPLMVN
GIAQAPIEGF SMMSTLRAAD AAETHRVQYF EMFGNRGIYH DGWTAVTKHR TPWLSDQPAL
EDDVWELYGP GDWTQAHDLA ADDPAKLAEL QRLWLIEAVK YDVLPIDDRS YERFNPDISG
RPVLITGSTQ TIFPGMRLME SCVLNIKNKS HSVTAELSVP ESGAQGVVVS QGGGVGGWCL
YAHEGRLKYC YNFLGIEYYY VTAEAPLSAG DHLLRMEFAY DGGGLGKGGT VTLFCDGEAV
GTGRVDQSEP MAFSADEMCD VGSDSGSPTS PDYGPHDNAF TGRIDWVKID IGAADHDHLI
TAEDKLNIAM SRQ