Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3501 |
Symbol | |
ID | 7092525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 3846403 |
End bp | 3848199 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643466792 |
Product | sulfatase |
Protein accession | YP_002363752 |
Protein GI | 217979605 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0049893 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACAGAGA AGCCCGACCG CGGCCGCAAT CTGTCGCGGC GCGCTGTCGT CGCTTCCGGG GCCGGCCTTC TCGGCGCCAA TCTTCTGCCC GGCGCCGCCA TGGCTGACGC GTCGACGGGC GAGGCGGCTG GCTCCTCTCC CATCGCGCCG GATTCGCCCC CGGGCGGCTA TAATATCCTG TTCATATTGG TCGATCAGGA GCATTTCTTC GAGGATTGGC CGATGCCCGT CCCCGGCCGC GAGTGGATCA AGACGAACGG GGTCACCTTC GTCAATCATC AGGCGGCGTC CTGCGTCTGC TCGCCGGCGA GGTCGACGAT CTATACCGGG CTGCACATTC AGCACACGGG CATTTTCGAC AACGCAAATT CCCTGTGGCA GGCCGACATG TCGATGGCGG TGAAGACCAT CGGCCACCGC ATGACCGAAC TTGGATATTA CGCTGCTTAT CAGGGCAAAT GGCATCTCAG CGTCAACCTC GATCAGGCCA AGCACGCAAT CGATGCGCCC TTCAGCAAAT ACAGGCAGAT CATCGAGAGC TACGGTTTTA AGGATTTCTT TGGCGTCGGA GATATCAACG ACACGACGCT CGGCGGCTAT AATTTCGACG ACACCACCTC GGCTTTCGTC ACGCGCTGGC TGCGCACCAA GGGCGAAGAG CTGAGGGAGG CGGGCAAGCC CTGGTATCTT GCGGTCAATT TCGTCAATCC GCATGACGTC ATGTATGTCA ATTCGGATCT CCCCGGCGAA ACCGTTCAGG GCAAGGACAC GGCCATGGCG ATCGCCCGGC CGCCCGCCAG CGCAATCTAT CAGGCCGAGT GGGATACGCC CTTGCCAGCC ACGAGAAGCC AGGCGTTTGA TGCGCCGGGG CGCCCGAGCG CGCAGAAAAT CTATCAGGAC GTCCAGGACG TTCTGGTCGG CGCATGGCCG GACGAAGACC GCCGCTGGAG GCTGCTGCGA AACTATTATT ACAACTGCAT CCGCGACTGC GACCAGCAAG TAGTCCGCGT GCTGGATTCG CTCAAAGCCA ATGGCATGGA CAAGAACACA ATCATCGTGT TCACCGCGGA TCATGGCGAA CTCGGCGGCA ATCATCAAAT GCGCGGCAAG GGCAATTCCG CCTACAGGCA ACAGAACCAT TTGCCATTGA TGATCGTCCA CCCCGCTTAT CCGGGCGGAC GAATCTGCAA GGCGGTGACG TCGCAGATCG ATTTGACGCC GACGCTGATG GCTTTGACCG GCGCCGGCGC GCCGAGCCTA AAGGCAGCCG GGGCGGATCT GGTCGGCCGC GATTTCTCGA GGTTGTTGGC TGCTCCGGAG AAGGCGAGTT TTGACTCCCT GCGGCCGGGT TCATTGTACA ATTACAACAT GCTGTCGTTT CAGGATGCGA AATGGGCCAA GAGGATGGAC GAGTTTTTGA AGCACTCGGA CATGCCGCTC GCACAGAAAA TCGCGATTCT GCTGAAAGAG GAGCCGGATT TCCACAATCG CTGCGCGATC CGCAGCGTCT TCGACGGGCG CTACCGCTTC AGCCGCTATT TCTCGCCATT GGCGTTCAAC ACGCCGGCGA GTTTTGAGGA GCTTTTGGCC CAGAACGACC TCGAACTCTA CGATCTTCAG GAGGACGAAG ACGAGGTTAC TAATCTGGCG GCGAAGCCGA AGGCCAATGC GGAGTTGATC ATGGCGATGA ATGAAAAGCT CAACGCCCGA ATCGCGCAAG AAGTGGGCGC GGACGACGGC GCCTTCTTGC CCTTGCGGGA CAGCAAGTGG CGCTTTCCGA GCGCCAGCGA GCGGTAG
|
Protein sequence | MTEKPDRGRN LSRRAVVASG AGLLGANLLP GAAMADASTG EAAGSSPIAP DSPPGGYNIL FILVDQEHFF EDWPMPVPGR EWIKTNGVTF VNHQAASCVC SPARSTIYTG LHIQHTGIFD NANSLWQADM SMAVKTIGHR MTELGYYAAY QGKWHLSVNL DQAKHAIDAP FSKYRQIIES YGFKDFFGVG DINDTTLGGY NFDDTTSAFV TRWLRTKGEE LREAGKPWYL AVNFVNPHDV MYVNSDLPGE TVQGKDTAMA IARPPASAIY QAEWDTPLPA TRSQAFDAPG RPSAQKIYQD VQDVLVGAWP DEDRRWRLLR NYYYNCIRDC DQQVVRVLDS LKANGMDKNT IIVFTADHGE LGGNHQMRGK GNSAYRQQNH LPLMIVHPAY PGGRICKAVT SQIDLTPTLM ALTGAGAPSL KAAGADLVGR DFSRLLAAPE KASFDSLRPG SLYNYNMLSF QDAKWAKRMD EFLKHSDMPL AQKIAILLKE EPDFHNRCAI RSVFDGRYRF SRYFSPLAFN TPASFEELLA QNDLELYDLQ EDEDEVTNLA AKPKANAELI MAMNEKLNAR IAQEVGADDG AFLPLRDSKW RFPSASER
|
| |