Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_1023 |
Symbol | |
ID | 4109862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 1124232 |
End bp | 1126583 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638030146 |
Product | sulfatase |
Protein accession | YP_638193 |
Protein GI | 108797996 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0349992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCACGG AATTCAACGG CAAGATCGAA CTGGACATCC GCGATTCCGA GCCCGACTGG GGTCCGTACG CCGCGCCGAC CGCACCCGAG GGCGCGCCCA ATGTGCTGTA CCTCGTGTGG GATGACACCG GTATCGCGAC GTGGGACTGC TTCGGCGGCC TGGTCGAGAT GCCGGCGATG AGCCGTATCG CCGAACGCGG TGTGCGCCTG TCGCAGTTCC ACACCACGGC GCTGTGCTCG CCGACCCGTG CCTCCCTGCT CACCGGCCGC AACGCGACGA CCGTCGGGAT GGCCACCATC GAGGAGTTCA CCGACGGCTT CCCGAACTGC AGCGGCCGCA TCCCGTTCGA CACCGCGCTG ATCTCGGAGG TCCTCGCGGA GAACGGCTAC AACACCTACT GCGTCGGCAA GTGGCACCTC ACTCCGCTCG AGGAGTCGAA TCTGGCTGCC ACCAAACGAC ACTGGCCGCT GTCTCGTGGG TTCGAACGGT TCTACGGCTT CATGGGCGGC GAAACCGACC AGTGGTATCC CGAACTCGTC TACGACAACC ACCCGGTCGC CCCGCCCGGC ACCCCCGAGG ACGGCTATCA CCTGTCGAAG GACCTCGCGG ACAAGACGAT CGAGTTCATC CGCGACGCCA AGGTGATCGC GCCCGACAAA CCGTGGTTCT CCTACGTCTG CCCGGGTGCC GGCCACGCCC CACACCACGT GTTCAAGGAA TGGGCCGACC GCTACGCGGG CCGCTTCGAC ATGGGCTACG AGGCCTACCG CGAGATCGTG CTGGAGAACC AGAAACGCCT CGGCATCGTC CCGTCGGACA CCGAACTCTC ACCGATGAAC CCGTACGCCG ACGTCACCGG GCCCAACGGG GAGCCGTGGC CCGTCCAGGA CACGGTGCGG CCGTGGGATT CCCTGTCCGA CAACGAGAAA CGGCTCTTCT GCCGGATGGC GGAGGTCTTC GCCGGATTCC TGTCCTACAC CGATGCGCAG ATCGGCCGGA TCCTCGACTA CCTGGAGGAG TCGGGTCAGC TCGACAACAC GATCATCGTG GTGATCTCCG ACAACGGGGC CAGCGGCGAA GGCGGACCCG ACGGTTCGGT CAACGAGACG AAGTTCTTCA ACGGCTACAT CGACACCGCC GAGGAGGGGC TCAAGGTCAT CGACGATCTC GGTGGCCCGC ACACCTACAA CCACTATCCG ACCGGCTGGG CGATGGCGTT CAACACGCCC TACAAACTGT TCAAGCGCTA CGCCTCCCAT GAGGGGGGCA TCGCCGACAC CGCGATCATC TCGTGGCCCG ACGGCATCGC CGCGCACGGT GAGGTGCGGG ACAACTACGT CAACGTCTGC GACATCACCC CGACGGTGTA CGACCTGCTG GGGCTCACCG CCCCGGCCTC CGTGCGCGGG GTGCCGCAGA AACCGCTCGA CGGTGTGAGT TTCAAAGTGA CACTGGACAA TCCGACCGCG CCCACCGGCA AGGAGACGCA GTTCTACTCG ATGCTCGGCA CCCGGGGGAT CTGGCACCAG GGCTGGTTCG CCAACACCGT GCACGCGGCG TCGCCGGCCG GCTGGTCGCA TTTCGACGAC GACCGCTGGG AGTTGTTCCA CATCGAGGCC GACCGCGCCC AGGTGCACGA TCTGGCCGCC GAACACCCGG AGAAACTCGA GGAACTCAAG GCGCTGTGGT TCAGCGAGGC CGCCAAGTAC AACGGGCTGC CGCTCGGCGA TCTCAACATC TTCGACACCA TCGGGCGGTG GCGGCCCAGC CTGTCGGGTG CGCGGGACGC CTACGTGTAC TACCCGGGCA CCGCGGATGT CGGAACCGGC GCCGTCGTCG AGGTCCAGGG CCGCTCGTTC GTGGTGCTGG CCGAGGTCAC CGTCGACGAC GACACCGCAC AGGGTGTGGT GTTCAAACAC GGTGGCGCAC ACGGTGGTCA CGTGATGTAC GTGCAGGACG GCCGGCTGCA CTACGCGTAC AACTTCCTCG GCGAGACCGA ACAGAAGATG GCGTCGTCGG TGCCGATCAC CCCCGGCCGG CACACCTTCG GGATCGCCTA CACCCGGACC GGCACCGTCG AGGGCAGCCA CACCCCCCTC GGTGACGCCG TGCTCTACGT CGACGACGAC GCGGTCGCCT CCTACCCGGG CATGATGAGC CACCCCGGGA CGTTCGGACT GGCCGGCGCC ACGCTCTCGG TGGGCCGCAA CAGCGGATCC CCGGTCTCGC GGGCCTACCG GCCGCCGTTC GAGTTCACCG GCGGCACCAT CGCCCAGGTC TCGTTCGACG TCTCCGGCAA GCCCTACCTC GACCTGGAAC GCGAGTTCGC CCGGGCCTTC GCGAAGGACT GA
|
Protein sequence | MRTEFNGKIE LDIRDSEPDW GPYAAPTAPE GAPNVLYLVW DDTGIATWDC FGGLVEMPAM SRIAERGVRL SQFHTTALCS PTRASLLTGR NATTVGMATI EEFTDGFPNC SGRIPFDTAL ISEVLAENGY NTYCVGKWHL TPLEESNLAA TKRHWPLSRG FERFYGFMGG ETDQWYPELV YDNHPVAPPG TPEDGYHLSK DLADKTIEFI RDAKVIAPDK PWFSYVCPGA GHAPHHVFKE WADRYAGRFD MGYEAYREIV LENQKRLGIV PSDTELSPMN PYADVTGPNG EPWPVQDTVR PWDSLSDNEK RLFCRMAEVF AGFLSYTDAQ IGRILDYLEE SGQLDNTIIV VISDNGASGE GGPDGSVNET KFFNGYIDTA EEGLKVIDDL GGPHTYNHYP TGWAMAFNTP YKLFKRYASH EGGIADTAII SWPDGIAAHG EVRDNYVNVC DITPTVYDLL GLTAPASVRG VPQKPLDGVS FKVTLDNPTA PTGKETQFYS MLGTRGIWHQ GWFANTVHAA SPAGWSHFDD DRWELFHIEA DRAQVHDLAA EHPEKLEELK ALWFSEAAKY NGLPLGDLNI FDTIGRWRPS LSGARDAYVY YPGTADVGTG AVVEVQGRSF VVLAEVTVDD DTAQGVVFKH GGAHGGHVMY VQDGRLHYAY NFLGETEQKM ASSVPITPGR HTFGIAYTRT GTVEGSHTPL GDAVLYVDDD AVASYPGMMS HPGTFGLAGA TLSVGRNSGS PVSRAYRPPF EFTGGTIAQV SFDVSGKPYL DLEREFARAF AKD
|
| |