Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mjls_1052 |
Symbol | |
ID | 4876793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. JLS |
Kingdom | Bacteria |
Replicon accession | NC_009077 |
Strand | + |
Start bp | 1135119 |
End bp | 1137470 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640138366 |
Product | sulfatase |
Protein accession | YP_001069351 |
Protein GI | 126433660 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.845593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCACGG AATTCAACGG CAAGATCGAA CTGGACATCC GCGATTCCGA GCCCGACTGG GGTCCGTACG CCGCGCCGAC CGCACCCGAG GGCGCGCCCA ACGTGCTGTA CCTCGTGTGG GACGACACCG GTATCGCGAC ATGGGACTGC TTCGGCGGCC TGGTCGACAT GCCGGCGATG AGCCGTATCG CCGAACGCGG TGTGCGCCTG TCGCAGGTCC ACACCACGGC GCTGTGCTCG CCGACCCGTG CCTCCCTGCT CACCGGTCGC AACGCGACGA CCGTCGGGAT GGCCACCATC GAGGAGTTCA CCGACGGCTT CCCGAACTGC AGCGGCCGCA TCCCGTTCGA CACCGCACTG ATCTCGGAGG TCCTCGCGGA GAACGGCTAC AACACCTACT GCGTAGGCAA GTGGCACCTG ACCCCGCTCG AGGAGTCGAA TCTGGCTGCC ACCAAACGAC ACTGGCCGCT GTCTCGTGGG TTCGAACGGT TCTACGGCTT CATGGGCGGC GAAACCGACC AGTGGTATCC CGAACTCGTC TACGACAACC ACCCGGTCGC CCCGCCCGGC ACCCCCGAGG ACGGCTACCA CCTGTCGAAG GACCTCGCGG ACAGGACGAT CGAGTTCATC CGCGACGCCA AGGTGATCGC CCCCGACAAA CCGTGGTTCT CCTACGTCTG CCCGGGTGCC GGCCACGCCC CGCACCACGT GTTCAAGGAA TGGGCCGACC GCTATGCGGG CCGCTTCGAC ATGGGCTACG AGGCCTACCG CGAGATCGTG CTGGAGAACC AGAGACGCCT CGGCATCGTC CCGCCGGACA CCGAGCTCTC ACCGATGAAC CCGTACGCCG ACGTCACCGG GCCCAAAGGG GAGCCGTGGC CCGTCCAGGA CACGGTGCGG CCGTGGGATT CCCTGTCCGA CAACGAGAAG CGGCTCTTCT GCCGGATGGC GGAGGTCTTC GCCGGATTCC TGTCCTACAC CGATGCGCAG ATCGGCCGGA TCCTCGACTA CCTGGAGGAG TCGGGTCAGC TCGACAACAC GATCATCGTC GTGATCTCCG ACAACGGGGC CAGCGGCGAA GGCGGACCCG ACGGTTCGGT CAACGAGACG AAGTTCTTCA ACGGGTACAT CGACACTGCG GAGGAGGGGC TCAAGGTCAT CGACGATCTC GGTGGCCCGC ACACCTACAA CCACTATCCG ACCGGCTGGG CGATGGCGTT CAACACGCCC TACAAACTGT TCAAGCGCTA CGCCTCCCAT GAGGGCGGCA TCGCCGACAC CGCGATCATC TCGTGGCCCG ACGGCATCGC CGCGCACGGT GAGGTGCGGG ACAACTACGT CAACGTCTGC GACATCACCC CGACGGTGTA CGACCTGCTG GGGCTCACCG CCCCGGCTTC CGTGCGCGGG GTGCCGCAGA AACCGCTCGA CGGTGTGAGT TTCAAAGTGA CACTGGATAA TCCGACCGCG CCCACCGGCA AGGAGACCCA GTTCTACTCG ATGCTCGGCA CCCGGGGGAT CTGGCACCAG GGCTGGTTCG CCAACACCGT GCACGCGGCG TCGCCGGCCG GCTGGTCGCA TTTCGACGAC GACCGCTGGG AGTTGTTCCA CATCGAGGCC GACCGCGCCC AGGTGCACGA TCTGGCCGCC GAACACCCGG AGAAACTCGA GGAACTCAAG GCGCTGTGGT TCAGCGAGGC CGCCAAGTAC AACGGGCTGC CGCTCGGCGA TCTCAACATC TTCGACACCA TCGGGCGGTG GCGGCCCAGC CTGTCGGGTG CGCGGGACGC CTACGTGTAC TACCCGGGCA CCGCGGATGT CGGAACCGGC GCCGTCGTCG AGGTCCAGGG CCGCTCGTTC GTGGTGCTGG CCGAGGTCAC CGTCGACGAC GACACCGCAC AGGGTGTGGT GTTCAAACAC GGTGGCGCAC ACGGTGGCCA CGTGATGTAC GTGCAGGACG GCCGGCTGCA CTACACGTAC AACTTCCTCG GCGAGACCGA ACAGAAGATG ACGTCGTCGG TGCCGATCAC CCCCGGCCGG CACACCTTCG GGATCGCCTA CACCCGGACC GGCACCGTCG AGGGCAGCCA CACCCCCCTC GGTGACGCCG TGCTCTACGT CGACGACGAC GCGGTCGCCT CCTACCCGGG CATGATGAGC CACCCCGGGA CGTTCGGACT GGCCGGCGCC ACGCTCTCGG TGGGCCGCAA CAGCGGATCC CCGGTCTCGC GGGCCTACCG GCCGCCGTTC GAGTTCACCG GCGGCACCAT CGCCCAGGTC TCGTTCGACG TCTCCGGCAA GCCCTACCTC GACCTGGAAC GCGAGTTCGC CCGGGCCTTC GCGAAGGACT GA
|
Protein sequence | MLTEFNGKIE LDIRDSEPDW GPYAAPTAPE GAPNVLYLVW DDTGIATWDC FGGLVDMPAM SRIAERGVRL SQVHTTALCS PTRASLLTGR NATTVGMATI EEFTDGFPNC SGRIPFDTAL ISEVLAENGY NTYCVGKWHL TPLEESNLAA TKRHWPLSRG FERFYGFMGG ETDQWYPELV YDNHPVAPPG TPEDGYHLSK DLADRTIEFI RDAKVIAPDK PWFSYVCPGA GHAPHHVFKE WADRYAGRFD MGYEAYREIV LENQRRLGIV PPDTELSPMN PYADVTGPKG EPWPVQDTVR PWDSLSDNEK RLFCRMAEVF AGFLSYTDAQ IGRILDYLEE SGQLDNTIIV VISDNGASGE GGPDGSVNET KFFNGYIDTA EEGLKVIDDL GGPHTYNHYP TGWAMAFNTP YKLFKRYASH EGGIADTAII SWPDGIAAHG EVRDNYVNVC DITPTVYDLL GLTAPASVRG VPQKPLDGVS FKVTLDNPTA PTGKETQFYS MLGTRGIWHQ GWFANTVHAA SPAGWSHFDD DRWELFHIEA DRAQVHDLAA EHPEKLEELK ALWFSEAAKY NGLPLGDLNI FDTIGRWRPS LSGARDAYVY YPGTADVGTG AVVEVQGRSF VVLAEVTVDD DTAQGVVFKH GGAHGGHVMY VQDGRLHYTY NFLGETEQKM TSSVPITPGR HTFGIAYTRT GTVEGSHTPL GDAVLYVDDD AVASYPGMMS HPGTFGLAGA TLSVGRNSGS PVSRAYRPPF EFTGGTIAQV SFDVSGKPYL DLEREFARAF AKD
|
| |