Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_0893 |
Symbol | |
ID | 4109734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 987829 |
End bp | 989625 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 638030015 |
Product | sulfatase |
Protein accession | YP_638065 |
Protein GI | 108797868 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.185177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAACC CCGACATCGT CATCCTGATG ACCGACGAGG AACGCGCGGT CCCGCCGTAC GAGACACCCG AGGTGCTGGC GTGGCGCGAC CGCACCCTGC CGTGCCGCAA GTGGTTCGAC GACCACGGCG TCAGCTTCGG CAGGCACTAC ACCGGATCGC TGGCGTGCGT GCCGAGCCGG CCGACGATCT TCACCGGCCA GTACCCGGAT CTGCACGGCG TCACCCAGAC CGACGGCATC GGCAAGACCT ACGGCGACTC GCGCATGCGG TGGCTGCGCC CGGGTGAGGT GCCGACGCTG GGCAACTGGT TCCGCGCCGC CGGCTACGAC ACCCATTACG ACGGTAAGTG GCACATCTCC CACGCCGACG TCACCGACCC GGCCACCGGG CTGCCGCTGG ACACCAACGA CGACGACGGT GTGGTCGACG CGGATGCGGT GCGGCGCTAC CTCGACGCCG ACCCGTTGGC GCCGTACGGC TTCTCCGGCT GGGTCGGTCC CGAACCGCAC GGAGCGGCCT TGTCCAACAG CGGGTTTCGC CGCGACCCGC TGATCGCCGC CCGGGTGGTG GCGTGGCTGG AGGACCGCTA CGCGCGCCGC CGCGCCGGCG ACCCGCAGGC GTTGCGGCCG TTCCTGCTGG TGGCCAGCTT CGTCAACCCG CACGACATCG TGCTGTTCCC GCAGTGGGTG CGGCGCAGCC CGGTCAAGCC GTCCCCGCTC GACCCGCCGC ACGTCCCGGC CGCACCGACC GCCGACGAGG ACCTGTCGAC GAAACCGGCC GCGCAGATCG CGTTCCGCGA GGCCTACTAC TCCGGATACG GCCCCGCGGC GGTGATGGAG CGGACCTACC GGCGCAACGC CCAGCAGTAC CGGGATCTGT ACTACCGCCT GCACGCCCAG GTCGACGGTC CGCTCGAGCG GGTGCGCCGC GCGGTCGTCG AGGGTTCGCA GGATGCGGTG CTGGTCCGCA CGGCCGACCA CGGCGACCTG CTCGGCGCGC ACGGCGGTCT GCACCAGAAG TGGTTCAACC TCTACGACGA GGCCACCCGC GTCCCGTTCG TCATCGCCCG CACCGGCGCC AACGCGACCG CAGCCCGCAC GGTCACCGCC CCCACCTCAC ACGTCGACCT CGTTCCGACC CTGCTGAGCG CCGCGGGTGT CGACGTCGCC GCCGCCGCGG CCACGCTCGC CGAGTCCTTC ACCGAGGTGC ACCCCCTGCC GGGGCGTGAC CTCATGCCGG TGGTCGACGG GGCGGCCCCC GACGAGGATC GCGCGGTGTA CCTGATGACC CGCGACAACA TGCTCGAAGG TGACAGCGGC GCATCGGGTC TGGCGCGCAA GCTCAAGCGC ACCGTCAATC CGCCGGGGCC GCTGCGGATC CGGGTGCCCG CACACGTCGC GTCCAACTTC GAGGGACTCG TGACGCAGGT CGACGGCCAC CTCTGGAAGC TGGTGCGCAG CTTCGACGAT CCGGCCACCT GGACCGAACC GGGCGTGCGG CACCTGGCCG CCAACGGTGT CGGCGGGGAG GCCTACCGTT CCAGCCCGCT CGACGACCAG TGGGAGCTCT ACGACCTCAC CGCCGACCCG ACCGAGGCCG TCAACAGGTG GCCCGACCCC TCACTCGACG AGCTGCGCGC ACACCTGCGC CGACAACTCA AACACGTCAG GACCGAATCG ATTCCGGAGC GCAACCAACC GTGGCCGTAC GCCGTCCGCC GCCCACCGAC CGGAGGGGCC CGGGTGGGCC TCGTCCGACG GGCGCTCGGA CGCCTGGGGG TCGGCGCCGC GGTTTGA
|
Protein sequence | MSNPDIVILM TDEERAVPPY ETPEVLAWRD RTLPCRKWFD DHGVSFGRHY TGSLACVPSR PTIFTGQYPD LHGVTQTDGI GKTYGDSRMR WLRPGEVPTL GNWFRAAGYD THYDGKWHIS HADVTDPATG LPLDTNDDDG VVDADAVRRY LDADPLAPYG FSGWVGPEPH GAALSNSGFR RDPLIAARVV AWLEDRYARR RAGDPQALRP FLLVASFVNP HDIVLFPQWV RRSPVKPSPL DPPHVPAAPT ADEDLSTKPA AQIAFREAYY SGYGPAAVME RTYRRNAQQY RDLYYRLHAQ VDGPLERVRR AVVEGSQDAV LVRTADHGDL LGAHGGLHQK WFNLYDEATR VPFVIARTGA NATAARTVTA PTSHVDLVPT LLSAAGVDVA AAAATLAESF TEVHPLPGRD LMPVVDGAAP DEDRAVYLMT RDNMLEGDSG ASGLARKLKR TVNPPGPLRI RVPAHVASNF EGLVTQVDGH LWKLVRSFDD PATWTEPGVR HLAANGVGGE AYRSSPLDDQ WELYDLTADP TEAVNRWPDP SLDELRAHLR RQLKHVRTES IPERNQPWPY AVRRPPTGGA RVGLVRRALG RLGVGAAV
|
| |