Gene Information |
Locus tag | Mjls_3978 |
Symbol | |
ID | 4879687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. JLS |
Kingdom | Bacteria |
Replicon accession | NC_009077 |
Strand | - |
Start bp | 4202925 |
End bp | 4205306 |
Gene Length | 2382 bp |
Protein Length | 793 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640141290 |
Product | sulfatase |
Protein accession | YP_001072244 |
Protein GI | 126436553 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
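The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; both are assumptions, though they agree with the values listed in this record. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int, int]:
    """Return (gene length in bp, codon count, protein length in aa).

    Assumes 1-based inclusive coordinates and a CDS that ends with
    one stop codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    protein_len = codons - 1  # drop the stop codon
    return gene_len, codons, protein_len


# Values taken from the Mjls_3978 record.
gene_len, codons, protein_len = cds_lengths(4202925, 4205306)
print(gene_len, codons, protein_len)
```

For this record the result agrees with the listed Gene Length (2382 bp) and Protein Length (793 aa).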
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0573144 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGATT CGTCACCCCA GAACACCGGA CAGCAGGCAG CGGCCGCGGT GCGGCGCGAC GTGCTTCCGA TCCCGGATCC CCGACACGTC GGGTTGACGA CGTACGACGC GAAGGATCCG AATACGACGT ATCCGCCGAT CACGCGGCTG CGCCCACCCG ACGGTGCGCC GAATGTGCTG ATCGTGCTGA TCGACGACGT GGGTTTCGGC GCGAGTTCGG CCTTCGGCGG CCCGTGCCGC ACACCGGTCG CGGAGCGGCT GGCGGCCAAC GGTGTGAAAC TCAACCGCTT CCACACCACC GCGCTGTGCT CGCCGACGCG CCAGGCGCTG CTCACCGGAC GCAACCACCA CTCGGTGGGG ATGGGCGGTG TCACCGAGAT CGCGACGTCC GCGCCGGGAT ACAGCAGCAT CCGGCCGAAG GACAAGGCGC CGATCTCGGA AACGCTTCGG CTCAACGGAT ACTCGACGAG TCAGTTCGGC AAGTGCCACG AGGTCCCGGT GTGGGAGGTG TCGCCGGTCG GGCCGTTCGA CCAGTGGCCC ACCGGGTCGG GGTTCGAGCA CTTCTACGGG TTCGTCGGCG GCGAGGCCAA CCAGTACTAC CCCGGACTGT ACGAGGGCAC CACACCCGTC GAACCGGCCA AGACGCCCGA GCAGGGTTAC ACCCTGACCG AGGACCTCGC CGACCGGGCG ATCACCTGGG TGCGCCAACA GCAGGCGCTC ACCCCGGACA AGCCGTTCTT CATGTATTTC GCGCCCGGTG CCACCCACGC GCCACACCAC GTGCCCCGGG AGTGGTCCGA CCGGTACCGC GGCGAGTTCG ACGCCGGCTG GGACGTGTTG CGCGAGCAGA TCTTCGCCCG CCAGAAGGAA CTCGGCGTGA TCCCGCAGGA CGCCGAGCTG ACCCGGCGCC ACGACGAGAT CCCCGGCTGG GACGACATGC CCGACCGGCT CAAGCCGGTG CTCGCCCGGC AGATGGAGAT CTATGCCGGC TTCATGGAGC AGACCGACCA CGAGGTCGGG CGGCTGATCG ACGCGGTCGA CGGCCTCGGC GCACTCGACA ACACGCTGAT CTACTACATC ATCGGCGACA ACGGCGCGTC GGCGGAGGGC ACGCCGAACG GGTGCTTCAA CGAGATGTGC ACGCTCAACG GGCTGGCCGG GATCGAGACC GAGGATTTCC TGCTGTCCAA GATCGACGAC TTCGGCACGC CGGACGCGTA CAACCACTAT GCGGTGGGCT GGGCGCACGC CCTGTGCGCG CCGTATCAGT GGACCAAGCA GGTCGCCTCC CACTGGGGCG GCACCCGCAA CGGCACGATC GTGCACTGGC CCGCGGGGAT CGACGCCGTC GGCCAGACCC GCGACCAGTT CCACCACGTG ATCGACGTCG TGCCAACGAT TCTCGAGGCG GCCGACATCC CCGCACCACT GATGGTCAAC GGCATCGCGC AGGCGCCCAT CGAGGGCTTC AGCATGATGT CGACGCTGCG GGCCGCCGAT GCCGCCGAGA CCCACCGGGT GCAGTACTTC GAGATGTTCG GCAACCGCGG CATCTACCAC GACGGTTGGA CGGCGGTCAC CAAACACCGG ACGCCGTGGC TGTCCGACCA GCCGGCGCTC GAGGACGACG TCTGGGAGCT GTACGGGCCG GGCGACTGGA CCCAGGCGCA CGACCTGGCG GCCGATGATC CGGCGAAACT CGCCGAACTA CAGCGTCTCT GGCTCATCGA GGCGGTCAAG TACGACGTGT TGCCGATCGA CGACAGGTCG TACGAGCGCT TCAACCCGGA CATCTCCGGG 
CGGCCGGTGC TCATCACCGG CAGCACCCAG ACGATCTTCC CCGGTATGCG GCTGATGGAG AGCTGCGTGC TCAACATCAA GAACAAGTCG CATTCGGTGA CGGCCGAGCT GTCGGTGCCC GAGTCCGGTG CGCAGGGGGT GGTGGTCAGC CAGGGCGGCG GTGTCGGTGG TTGGTGTCTG TACGCGCACG AGGGCCGGTT GAAATACTGT TACAACTTCC TCGGCATCGA GTACTACTAC GTCACCGCCG AGGCTCCGCT CTCGGCCGGC GATCACCTCC TGCGTATGGA ATTCGCCTAC GACGGTGGGG GTTTGGGCAA GGGCGGCACC GTCACGCTCT TCTGTGACGG TGAAGCCGTC GGGACGGGGC GGGTCGACCA GAGCGAGCCG ATGGCGTTCT CCGCCGACGA GATGTGCGAC GTCGGCTCGG ACAGCGGCTC GCCGACGTCA CCCGACTACG GGCCGCACGA CAACGCTTTC ACCGGCCGAA TCGACTGGGT GAAGATCGAT ATCGGCGCCG CCGACCACGA CCACCTGATC ACCGCGGAGG ACAAGCTGAA CATCGCGATG TCGCGGCAGT GA
Protein sequence | MSDSSPQNTG QQAAAAVRRD VLPIPDPRHV GLTTYDAKDP NTTYPPITRL RPPDGAPNVL IVLIDDVGFG ASSAFGGPCR TPVAERLAAN GVKLNRFHTT ALCSPTRQAL LTGRNHHSVG MGGVTEIATS APGYSSIRPK DKAPISETLR LNGYSTSQFG KCHEVPVWEV SPVGPFDQWP TGSGFEHFYG FVGGEANQYY PGLYEGTTPV EPAKTPEQGY TLTEDLADRA ITWVRQQQAL TPDKPFFMYF APGATHAPHH VPREWSDRYR GEFDAGWDVL REQIFARQKE LGVIPQDAEL TRRHDEIPGW DDMPDRLKPV LARQMEIYAG FMEQTDHEVG RLIDAVDGLG ALDNTLIYYI IGDNGASAEG TPNGCFNEMC TLNGLAGIET EDFLLSKIDD FGTPDAYNHY AVGWAHALCA PYQWTKQVAS HWGGTRNGTI VHWPAGIDAV GQTRDQFHHV IDVVPTILEA ADIPAPLMVN GIAQAPIEGF SMMSTLRAAD AAETHRVQYF EMFGNRGIYH DGWTAVTKHR TPWLSDQPAL EDDVWELYGP GDWTQAHDLA ADDPAKLAEL QRLWLIEAVK YDVLPIDDRS YERFNPDISG RPVLITGSTQ TIFPGMRLME SCVLNIKNKS HSVTAELSVP ESGAQGVVVS QGGGVGGWCL YAHEGRLKYC YNFLGIEYYY VTAEAPLSAG DHLLRMEFAY DGGGLGKGGT VTLFCDGEAV GTGRVDQSEP MAFSADEMCD VGSDSGSPTS PDYGPHDNAF TGRIDWVKID IGAADHDHLI TAEDKLNIAM SRQ
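As an illustration of how the gene and protein sequences relate, here is a minimal sketch that translates a stretch of the gene sequence and computes its GC fraction. It assumes the listed gene sequence is already in coding orientation (it starts with ATG and ends with TGA) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation. The helper names `translate` and `gc_fraction` are hypothetical, and only short excerpts of the sequence are used.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and upper-case."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 12 bases of the gene sequence in this record.
head = "ATGTCCGATT CGTCACCCCA GAACACCGGA"
tail = "TCGCGGCAGTGA"
print(translate(head))  # first ten residues
print(translate(tail))  # last residues plus the stop codon
print(round(gc_fraction(head), 3))
```

The translated head, MSDSSPQNTG, matches the first ten residues of the protein sequence, and the tail ends in SRQ followed by a stop. Running `gc_fraction` on the full gene sequence would be the way to compare against the listed GC content of 67%.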