Gene Mjls_1052 details


Gene Information

Locus tag: Mjls_1052
Symbol:
ID: 4876793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 1135119
End bp: 1137470
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 67%
IMG OID: 640138366
Product: sulfatase
Protein accession: YP_001069351
Protein GI: 126433660
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
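
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon (the record states neither convention; the variable names are illustrative only):

```python
# Values copied from the Gene Information fields.
start_bp = 1135119
end_bp = 1137470

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 2352 783
```

Under these assumptions the result agrees with the reported Gene Length (2352 bp) and Protein Length (783 aa).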


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.845593
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTCACGG AATTCAACGG CAAGATCGAA CTGGACATCC GCGATTCCGA GCCCGACTGG 
GGTCCGTACG CCGCGCCGAC CGCACCCGAG GGCGCGCCCA ACGTGCTGTA CCTCGTGTGG
GACGACACCG GTATCGCGAC ATGGGACTGC TTCGGCGGCC TGGTCGACAT GCCGGCGATG
AGCCGTATCG CCGAACGCGG TGTGCGCCTG TCGCAGGTCC ACACCACGGC GCTGTGCTCG
CCGACCCGTG CCTCCCTGCT CACCGGTCGC AACGCGACGA CCGTCGGGAT GGCCACCATC
GAGGAGTTCA CCGACGGCTT CCCGAACTGC AGCGGCCGCA TCCCGTTCGA CACCGCACTG
ATCTCGGAGG TCCTCGCGGA GAACGGCTAC AACACCTACT GCGTAGGCAA GTGGCACCTG
ACCCCGCTCG AGGAGTCGAA TCTGGCTGCC ACCAAACGAC ACTGGCCGCT GTCTCGTGGG
TTCGAACGGT TCTACGGCTT CATGGGCGGC GAAACCGACC AGTGGTATCC CGAACTCGTC
TACGACAACC ACCCGGTCGC CCCGCCCGGC ACCCCCGAGG ACGGCTACCA CCTGTCGAAG
GACCTCGCGG ACAGGACGAT CGAGTTCATC CGCGACGCCA AGGTGATCGC CCCCGACAAA
CCGTGGTTCT CCTACGTCTG CCCGGGTGCC GGCCACGCCC CGCACCACGT GTTCAAGGAA
TGGGCCGACC GCTATGCGGG CCGCTTCGAC ATGGGCTACG AGGCCTACCG CGAGATCGTG
CTGGAGAACC AGAGACGCCT CGGCATCGTC CCGCCGGACA CCGAGCTCTC ACCGATGAAC
CCGTACGCCG ACGTCACCGG GCCCAAAGGG GAGCCGTGGC CCGTCCAGGA CACGGTGCGG
CCGTGGGATT CCCTGTCCGA CAACGAGAAG CGGCTCTTCT GCCGGATGGC GGAGGTCTTC
GCCGGATTCC TGTCCTACAC CGATGCGCAG ATCGGCCGGA TCCTCGACTA CCTGGAGGAG
TCGGGTCAGC TCGACAACAC GATCATCGTC GTGATCTCCG ACAACGGGGC CAGCGGCGAA
GGCGGACCCG ACGGTTCGGT CAACGAGACG AAGTTCTTCA ACGGGTACAT CGACACTGCG
GAGGAGGGGC TCAAGGTCAT CGACGATCTC GGTGGCCCGC ACACCTACAA CCACTATCCG
ACCGGCTGGG CGATGGCGTT CAACACGCCC TACAAACTGT TCAAGCGCTA CGCCTCCCAT
GAGGGCGGCA TCGCCGACAC CGCGATCATC TCGTGGCCCG ACGGCATCGC CGCGCACGGT
GAGGTGCGGG ACAACTACGT CAACGTCTGC GACATCACCC CGACGGTGTA CGACCTGCTG
GGGCTCACCG CCCCGGCTTC CGTGCGCGGG GTGCCGCAGA AACCGCTCGA CGGTGTGAGT
TTCAAAGTGA CACTGGATAA TCCGACCGCG CCCACCGGCA AGGAGACCCA GTTCTACTCG
ATGCTCGGCA CCCGGGGGAT CTGGCACCAG GGCTGGTTCG CCAACACCGT GCACGCGGCG
TCGCCGGCCG GCTGGTCGCA TTTCGACGAC GACCGCTGGG AGTTGTTCCA CATCGAGGCC
GACCGCGCCC AGGTGCACGA TCTGGCCGCC GAACACCCGG AGAAACTCGA GGAACTCAAG
GCGCTGTGGT TCAGCGAGGC CGCCAAGTAC AACGGGCTGC CGCTCGGCGA TCTCAACATC
TTCGACACCA TCGGGCGGTG GCGGCCCAGC CTGTCGGGTG CGCGGGACGC CTACGTGTAC
TACCCGGGCA CCGCGGATGT CGGAACCGGC GCCGTCGTCG AGGTCCAGGG CCGCTCGTTC
GTGGTGCTGG CCGAGGTCAC CGTCGACGAC GACACCGCAC AGGGTGTGGT GTTCAAACAC
GGTGGCGCAC ACGGTGGCCA CGTGATGTAC GTGCAGGACG GCCGGCTGCA CTACACGTAC
AACTTCCTCG GCGAGACCGA ACAGAAGATG ACGTCGTCGG TGCCGATCAC CCCCGGCCGG
CACACCTTCG GGATCGCCTA CACCCGGACC GGCACCGTCG AGGGCAGCCA CACCCCCCTC
GGTGACGCCG TGCTCTACGT CGACGACGAC GCGGTCGCCT CCTACCCGGG CATGATGAGC
CACCCCGGGA CGTTCGGACT GGCCGGCGCC ACGCTCTCGG TGGGCCGCAA CAGCGGATCC
CCGGTCTCGC GGGCCTACCG GCCGCCGTTC GAGTTCACCG GCGGCACCAT CGCCCAGGTC
TCGTTCGACG TCTCCGGCAA GCCCTACCTC GACCTGGAAC GCGAGTTCGC CCGGGCCTTC
GCGAAGGACT GA
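
The GC content field can be recomputed from the gene sequence. The sketch below is one possible way to do it, assuming GC content means the percentage of G and C bases over the full sequence length; `gc_percent` is a hypothetical helper, not part of any database tooling. It strips the spaces and line breaks used in the 10-base blocks above before counting:

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the block spacing used in the listing) is ignored.
    """
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, as a small worked example.
first_block = "GTGCTCACGG"
print(gc_percent(first_block))  # prints: 70.0
```

Passing the complete gene sequence to the same function gives a value that can be compared with the reported 67%.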
 
Protein sequence
MLTEFNGKIE LDIRDSEPDW GPYAAPTAPE GAPNVLYLVW DDTGIATWDC FGGLVDMPAM 
SRIAERGVRL SQVHTTALCS PTRASLLTGR NATTVGMATI EEFTDGFPNC SGRIPFDTAL
ISEVLAENGY NTYCVGKWHL TPLEESNLAA TKRHWPLSRG FERFYGFMGG ETDQWYPELV
YDNHPVAPPG TPEDGYHLSK DLADRTIEFI RDAKVIAPDK PWFSYVCPGA GHAPHHVFKE
WADRYAGRFD MGYEAYREIV LENQRRLGIV PPDTELSPMN PYADVTGPKG EPWPVQDTVR
PWDSLSDNEK RLFCRMAEVF AGFLSYTDAQ IGRILDYLEE SGQLDNTIIV VISDNGASGE
GGPDGSVNET KFFNGYIDTA EEGLKVIDDL GGPHTYNHYP TGWAMAFNTP YKLFKRYASH
EGGIADTAII SWPDGIAAHG EVRDNYVNVC DITPTVYDLL GLTAPASVRG VPQKPLDGVS
FKVTLDNPTA PTGKETQFYS MLGTRGIWHQ GWFANTVHAA SPAGWSHFDD DRWELFHIEA
DRAQVHDLAA EHPEKLEELK ALWFSEAAKY NGLPLGDLNI FDTIGRWRPS LSGARDAYVY
YPGTADVGTG AVVEVQGRSF VVLAEVTVDD DTAQGVVFKH GGAHGGHVMY VQDGRLHYTY
NFLGETEQKM TSSVPITPGR HTFGIAYTRT GTVEGSHTPL GDAVLYVDDD AVASYPGMMS
HPGTFGLAGA TLSVGRNSGS PVSRAYRPPF EFTGGTIAQV SFDVSGKPYL DLEREFARAF
AKD
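
The protein sequence follows from the gene sequence under translation table 11. The sketch below illustrates this in plain Python, with no external libraries. It rests on labeled assumptions: the codon-to-amino-acid assignments of table 11 are taken to match the standard genetic code, and only ATG, GTG and TTG are treated as start codons read as methionine (a simplification; table 11 permits further alternative starts). `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third
# position varies fastest). Assumed here to apply to table 11 as well.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplifying assumption: only these start codons are read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3
    codons = [bases[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon such as GTG is read as Met at position 1.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence.
print(translate_cds("GTGCTCACGG AATTCAACGG CAAGATCGAA"))  # prints: MLTEFNGKIE
```

The output matches the first ten residues of the protein sequence, including the initial M encoded by the GTG start codon.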