Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3123 |
Symbol | |
ID | 7093783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 3429612 |
End bp | 3431261 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643466433 |
Product | sulfatase |
Protein accession | YP_002363394 |
Protein GI | 217979247 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCCA GCGACAAGAA GCGACAGATG GCGGGGAAAA GTATGTCGCC GAATCGACGC AGCCTCCTGG TCGGCGCCAC GGCTCTCGCG GCCGGGACGC TTGCGGCCAG CCGTTCCCTT ATGGCGGCGC AAGATCAAGC GACACCCTCT GCGCCGGCGG GCCAAGCGCC GGGGCGACGG CCGAACATTC TCGTCATCTG GGGCGATGAC ATAGGGCTTT GGAACATCAG CCACAACAGC CGGGGCATGA TGGGCTACCT GACGCCAAAC ATCGACAGAA TCGCTCGCGA GGGACTCGGC TTCACGGACT ATTACGGCCA GCAAAGCTGT ACAGCGGGTA GAGCCGCCTT TCTCGGCGGC AATGTCCCGG TGCGCACAGG CATGACAAAG GTAGGTCTGC CGGGCGCCAC TCAGGGCTGG CAGAAGAGCG ACGTCACCGT GGCTACCGTG CTGAAGAGCC AGGGCTATGC AACAGGCCAG TTCGGCAAAA ATCATCAGGG CGATCGGGAC GAGCATTTGC CGACGATGCA CGGCTTCGAC GAGTTCTTCG GCAATCTCTA CCACCTCAAC GCGGAGGAAG AGCCGGAAAA CGAGGACTAT CCGACGAATC CGGATTTCCG CAAAAAATAC GGTCCGCGCG GCGTGCTTCA CAGTTGGGCG AACCCGGACG GAACGCAGAG GATCGAGAAC ACTGGCCCAC TCTCCAAGAC GCGCATGGAG ACGATCGACG ACGAAACCCT CGCGGCCGCG AAGGATTTCA TCACGCGCCA GGTCAAAGCC GGCAAGCCGT TCTTCACCTG GTGGAACGCC ACCCGCATGC ATTTTCGCAC CCACGTGAAG GCGGAGCATC GCGGCATATC GGGCCAGGAC GAATATTCGG ACGGCATGGT CGAGCACGAC GGTCAAGTCG GCGAACTGCT CAAGCTGATC GACGATCTCG GCCTCGCAAA CGATACGATC GTCATGTACT CGACCGACAA TGGGCCGCAT TTCAACGCTT GGCCGGACGG CGCCACGACG CCGTTCCGAA GCGAGAAGAA CTCGAATTGG GAAGGCGCCT ATCGCGTGCC GGCGTTCGTG CGCTGGCCAG GCAAATTTCC AGCCGGGATC ACGCTCAACG GGATCGTTGC GCATGAGGAC TGGCTGCCGA CCTTCGCGGC GATCGCCGGC GTCCCCGACA TCAAGGAGCA ATTGCTCAAG GGCGTCGAAA TCAACGGGCG CAGCTATCGC AACTACATCG ACGGTTACAA TCTGCTCGAC TATCTCACGG GAAAGACGAA AGATTCGCCT CGCAAGGAGT TTTGGTATGT AAATGACGAG GGCCAGGTCG TCGCGGCGCG CTATTCGGCT TGGAAAGTCG TTTTCCTTGA GAACCGCGCC GAGGGGCTTC AGGTCTGGCG CGAACCCTTC GTCGAATTGC GAGCGCCCCT CCTGTTCAAT CTTCGGCGTG ATCCATTCGA GTTGGCGCAA CATAATTCCA ACACATACAA CGACTGGTAT TTGAGCCGCG TTTTCGTGAT CGTTTCGATC CAGGAAATGG CGGCGAAATT TCTCGCAACG CTCAAAGATT ATCCTCCAAG TCAGTCCCCA GGCTCTTTCA ATCTCTCGAA GATCGAGGCG CAAATCAGAA ACGCCACTGG CGGCGATTAA
|
Protein sequence | MSASDKKRQM AGKSMSPNRR SLLVGATALA AGTLAASRSL MAAQDQATPS APAGQAPGRR PNILVIWGDD IGLWNISHNS RGMMGYLTPN IDRIAREGLG FTDYYGQQSC TAGRAAFLGG NVPVRTGMTK VGLPGATQGW QKSDVTVATV LKSQGYATGQ FGKNHQGDRD EHLPTMHGFD EFFGNLYHLN AEEEPENEDY PTNPDFRKKY GPRGVLHSWA NPDGTQRIEN TGPLSKTRME TIDDETLAAA KDFITRQVKA GKPFFTWWNA TRMHFRTHVK AEHRGISGQD EYSDGMVEHD GQVGELLKLI DDLGLANDTI VMYSTDNGPH FNAWPDGATT PFRSEKNSNW EGAYRVPAFV RWPGKFPAGI TLNGIVAHED WLPTFAAIAG VPDIKEQLLK GVEINGRSYR NYIDGYNLLD YLTGKTKDSP RKEFWYVNDE GQVVAARYSA WKVVFLENRA EGLQVWREPF VELRAPLLFN LRRDPFELAQ HNSNTYNDWY LSRVFVIVSI QEMAAKFLAT LKDYPPSQSP GSFNLSKIEA QIRNATGGD
|
| |