Gene Mjls_0899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_0899 
Symbol 
ID4876641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp971596 
End bp973392 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content71% 
IMG OID640138211 
Productsulfatase 
Protein accessionYP_001069199 
Protein GI126433508 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00729431 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTAACC CCGACATCGT CATCCTGATG ACCGACGAGG AACGCGCGGT CCCGCCGTAC 
GAGTCACCCG AGGTGCTGGC GTGGCGCGAC CGCACCCTGC CGTGCCGCAA GTGGTTCGAC
GACCACGGCG TCAGCTTCGG CAGGCACTAC ACCGGATCGC TGGCGTGCGT GCCGAGCCGG
CCGACGATCT TCACCGGCCA GTACCCGGAT CTGCACGGCG TCACCCAGAC CGACGGCATC
GGCAAGACCT ACGGCGACTC GCGCATGCGG TGGCTGCGCC CGGGTGAGGT GCCGACGCTG
GGCAACTGGT TCCGCGCCGC CGGCTACGAC ACCCATTACG ACGGTAAGTG GCACATCTCC
CACGCCGACG TCACCGACCC GGCCACCGGG CTGCCGCTGG ACACCAACGA CGACGACGGT
GTGGTCGACG CGGATGCGGT GCGGCGCTAC CTCGACGCCG ACTCGTTGGC GCCGTACGGG
TTCTCCGGCT GGGTCGGTCC CGAACCGCAC GGAGCGGCCT TGTCCAACAG CGGGTTTCGC
CGCGACCCGC TGATCGCCGC CCGGGTGGTG GCGTGGCTGG AGGACCGCTA CGCGCGCCGC
CGCGCCGGTG ACCCGCAGGC GTTGCGGCCG TTCCTGCTGG TGGCCAGCTT CGTCAACCCG
CACGACATCG TGCTGTTCCC GCAGTGGGTG CGGCGCAGCC CGGTCAAGCC GTCCCCGCTC
GACCCGCCGC ACGTCCCGGC CGCACCGACC GCCGACGAGG ACCTGTCGAC GAAACCGGCC
GCGCAGATCG CGTTCCGCGA GGCCTACTAC TCCGGATACG GCCCCGCGGC GGTGATGGAG
CGGACCTACC GGCGCAACGC CCAGCAGTAC CGGGATCTGT ACTACCGCCT GCACGCCCAG
GTCGACGGTC CGCTCGAGCG GGTGCGCCGC GCGGTCGTCG AGGGTTCGCA GGATGCGGTG
CTGGTCCGCA CGGCCGACCA CGGCGACCTG CTCGGCGCGC ACGGCGGTCT GCACCAGAAG
TGGTTCAACC TCTACGACGA GGCCACCCGC GTCCCGTTCG TCATCGCCCG CACCGGGGTC
AACGCGACCG CAGCCCGCAC GGTGACCGCC CCCACCTCAC ACGTCGACCT CGTTCCGACC
CTGCTGAGCG CCGCGGGTGT CGACGTCGCC GCCACCGCGG CCACACTCGC CGAGTCCTTC
ACCGAGGTGC ACCCCCTGCC GGGGCGTGAC CTGATGCCGG TGGTCGACGG GGCGGCCCCC
GACGAGGATC GCGCGGTGTA CCTGATGACC CGCGACAACA TGCTCGAAGG TGACAGCGGC
GCATCGGGTC TGGCGCGCAA GCTCAAGCGC ACCGTCAATC CGCCGGGGCC GCTGCGGATC
CGGGTGCCTG CACACGTCGC GTCCAACTTC GAGGGACTCG TGACGCAGGT CGACGGCCAC
CTCTGGAAGC TGGTGCGCAG CTTCGACGAT CCGGCCACCT GGACCGAACC GGGCGTGCGG
CACCTGGCCG CCAACGGTGT CGGCGGGGAG GCCTACCGTT CCAGCCCGCT CGACGACCAG
TGGGAGCTCT ACGACCTCAC CGCCGACCCG ACCGAGGCCG TCAACAGGTG GCCCGACCCC
TCACTCGACG AGTTGCGCGC ACACCTGCGC CGACAACTCA AACACGTCAG GACCGAATCG
ATTCCGGAGC GCAACCAACC GTGGCCGTAC GCCGTCCGCC GCCCACCGAC CGGAGGGGCC
CGGGTGGGCC TCGTCCGACG GGCGCTCGGA CGCCTGGGGG TCGGCGCCGC GGTTTGA
 
Protein sequence
MSNPDIVILM TDEERAVPPY ESPEVLAWRD RTLPCRKWFD DHGVSFGRHY TGSLACVPSR 
PTIFTGQYPD LHGVTQTDGI GKTYGDSRMR WLRPGEVPTL GNWFRAAGYD THYDGKWHIS
HADVTDPATG LPLDTNDDDG VVDADAVRRY LDADSLAPYG FSGWVGPEPH GAALSNSGFR
RDPLIAARVV AWLEDRYARR RAGDPQALRP FLLVASFVNP HDIVLFPQWV RRSPVKPSPL
DPPHVPAAPT ADEDLSTKPA AQIAFREAYY SGYGPAAVME RTYRRNAQQY RDLYYRLHAQ
VDGPLERVRR AVVEGSQDAV LVRTADHGDL LGAHGGLHQK WFNLYDEATR VPFVIARTGV
NATAARTVTA PTSHVDLVPT LLSAAGVDVA ATAATLAESF TEVHPLPGRD LMPVVDGAAP
DEDRAVYLMT RDNMLEGDSG ASGLARKLKR TVNPPGPLRI RVPAHVASNF EGLVTQVDGH
LWKLVRSFDD PATWTEPGVR HLAANGVGGE AYRSSPLDDQ WELYDLTADP TEAVNRWPDP
SLDELRAHLR RQLKHVRTES IPERNQPWPY AVRRPPTGGA RVGLVRRALG RLGVGAAV