Gene Msil_3123 details


Gene Information

Locus tag: Msil_3123
Symbol:
ID: 7093783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 3429612
End bp: 3431261
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 59%
IMG OID: 643466433
Product: sulfatase
Protein accession: YP_002363394
Protein GI: 217979247
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
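
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive, 1-based coordinates and that the reported gene length includes the terminal stop codon. The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3429612, 3431261)
print(length_bp)                           # 1650
print(expected_protein_length(length_bp))  # 549
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.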


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCCA GCGACAAGAA GCGACAGATG GCGGGGAAAA GTATGTCGCC GAATCGACGC 
AGCCTCCTGG TCGGCGCCAC GGCTCTCGCG GCCGGGACGC TTGCGGCCAG CCGTTCCCTT
ATGGCGGCGC AAGATCAAGC GACACCCTCT GCGCCGGCGG GCCAAGCGCC GGGGCGACGG
CCGAACATTC TCGTCATCTG GGGCGATGAC ATAGGGCTTT GGAACATCAG CCACAACAGC
CGGGGCATGA TGGGCTACCT GACGCCAAAC ATCGACAGAA TCGCTCGCGA GGGACTCGGC
TTCACGGACT ATTACGGCCA GCAAAGCTGT ACAGCGGGTA GAGCCGCCTT TCTCGGCGGC
AATGTCCCGG TGCGCACAGG CATGACAAAG GTAGGTCTGC CGGGCGCCAC TCAGGGCTGG
CAGAAGAGCG ACGTCACCGT GGCTACCGTG CTGAAGAGCC AGGGCTATGC AACAGGCCAG
TTCGGCAAAA ATCATCAGGG CGATCGGGAC GAGCATTTGC CGACGATGCA CGGCTTCGAC
GAGTTCTTCG GCAATCTCTA CCACCTCAAC GCGGAGGAAG AGCCGGAAAA CGAGGACTAT
CCGACGAATC CGGATTTCCG CAAAAAATAC GGTCCGCGCG GCGTGCTTCA CAGTTGGGCG
AACCCGGACG GAACGCAGAG GATCGAGAAC ACTGGCCCAC TCTCCAAGAC GCGCATGGAG
ACGATCGACG ACGAAACCCT CGCGGCCGCG AAGGATTTCA TCACGCGCCA GGTCAAAGCC
GGCAAGCCGT TCTTCACCTG GTGGAACGCC ACCCGCATGC ATTTTCGCAC CCACGTGAAG
GCGGAGCATC GCGGCATATC GGGCCAGGAC GAATATTCGG ACGGCATGGT CGAGCACGAC
GGTCAAGTCG GCGAACTGCT CAAGCTGATC GACGATCTCG GCCTCGCAAA CGATACGATC
GTCATGTACT CGACCGACAA TGGGCCGCAT TTCAACGCTT GGCCGGACGG CGCCACGACG
CCGTTCCGAA GCGAGAAGAA CTCGAATTGG GAAGGCGCCT ATCGCGTGCC GGCGTTCGTG
CGCTGGCCAG GCAAATTTCC AGCCGGGATC ACGCTCAACG GGATCGTTGC GCATGAGGAC
TGGCTGCCGA CCTTCGCGGC GATCGCCGGC GTCCCCGACA TCAAGGAGCA ATTGCTCAAG
GGCGTCGAAA TCAACGGGCG CAGCTATCGC AACTACATCG ACGGTTACAA TCTGCTCGAC
TATCTCACGG GAAAGACGAA AGATTCGCCT CGCAAGGAGT TTTGGTATGT AAATGACGAG
GGCCAGGTCG TCGCGGCGCG CTATTCGGCT TGGAAAGTCG TTTTCCTTGA GAACCGCGCC
GAGGGGCTTC AGGTCTGGCG CGAACCCTTC GTCGAATTGC GAGCGCCCCT CCTGTTCAAT
CTTCGGCGTG ATCCATTCGA GTTGGCGCAA CATAATTCCA ACACATACAA CGACTGGTAT
TTGAGCCGCG TTTTCGTGAT CGTTTCGATC CAGGAAATGG CGGCGAAATT TCTCGCAACG
CTCAAAGATT ATCCTCCAAG TCAGTCCCCA GGCTCTTTCA ATCTCTCGAA GATCGAGGCG
CAAATCAGAA ACGCCACTGG CGGCGATTAA
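
The gene sequence above is laid out in space-separated blocks of ten bases. One way to recompute its length and GC content is to strip the whitespace first. This is a minimal sketch with hypothetical helper names. How the record rounds its GC content value is not stated, so the example does not assert a percentage for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCTGCCA GCGACAAGAA GCGACAGATG GCGGGGAAAA GTATGTCGCC GAATCGACGC"
sequence = clean_sequence(first_line)
print(len(sequence))  # 60
print(gc_percent(sequence))
```

Passing all sequence lines, joined into one string, to `clean_sequence` gives a sequence whose length can be compared with the Gene Length field.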
 
Protein sequence
MSASDKKRQM AGKSMSPNRR SLLVGATALA AGTLAASRSL MAAQDQATPS APAGQAPGRR 
PNILVIWGDD IGLWNISHNS RGMMGYLTPN IDRIAREGLG FTDYYGQQSC TAGRAAFLGG
NVPVRTGMTK VGLPGATQGW QKSDVTVATV LKSQGYATGQ FGKNHQGDRD EHLPTMHGFD
EFFGNLYHLN AEEEPENEDY PTNPDFRKKY GPRGVLHSWA NPDGTQRIEN TGPLSKTRME
TIDDETLAAA KDFITRQVKA GKPFFTWWNA TRMHFRTHVK AEHRGISGQD EYSDGMVEHD
GQVGELLKLI DDLGLANDTI VMYSTDNGPH FNAWPDGATT PFRSEKNSNW EGAYRVPAFV
RWPGKFPAGI TLNGIVAHED WLPTFAAIAG VPDIKEQLLK GVEINGRSYR NYIDGYNLLD
YLTGKTKDSP RKEFWYVNDE GQVVAARYSA WKVVFLENRA EGLQVWREPF VELRAPLLFN
LRRDPFELAQ HNSNTYNDWY LSRVFVIVSI QEMAAKFLAT LKDYPPSQSP GSFNLSKIEA
QIRNATGGD
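
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a sketch only: translation table 11 assigns the same amino acids to codons as the standard table and differs in its permitted start codons, which this example does not model (the gene here starts with ATG). The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted sequences can be passed directly.
    Alternative start codons of table 11 are not handled (assumption: ATG start).
    """
    sequence = "".join(dna.split()).upper()
    if len(sequence) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for position in range(0, len(sequence), 3):
        amino_acid = CODON_TABLE[sequence[position:position + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks and last three codons of the gene sequence in this record.
print(translate_cds("ATGTCTGCCA GCGACAAGAA GCGACAGATG"))  # MSASDKKRQM
print(translate_cds("GGCGATTAA"))                         # GD
```

The two outputs correspond to the first block and the final two residues of the protein sequence listed above.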