Gene Mext_0526 details

Gene Information

Locus tag: Mext_0526
Symbol:
ID: 5833842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 571349
End bp: 573043
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 64%
IMG OID: 641366303
Product: sulfatase
Protein accession: YP_001638012
Protein GI: 163849969
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
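
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced with a single terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 571349
END_BP = 573043
GENE_LENGTH_BP = 1695
PROTEIN_LENGTH_AA = 564


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1695
print(expected_protein_length(GENE_LENGTH_BP))  # 564
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.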


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCACG ACGACACGCA CGAGAAGCAC CAGGAGTGCC CTGGAACGGG AAGCGCCCTC 
AGCCGCCGCA GCATCCTTCT CACCGGGACC TCGGCCCTCG CGGCCGCCGC GCTGGTGTCT
ACCGCCCAGG CCCAGCAGCA AACCCAGCAG CAACCCACAG CAGTCGGGCA GAAGCCGAAT
ATCATCGTCA TCATGGGTGA CGATATCGGC ATCTGGAACA TCGGCGCCTA TCACCGCGGC
ATGATGGCTG GTCGCACGCC CCATATCGAT CAGTTGGCCG CAGAGGGCAT GCTGTTCACC
GACTATTACG CCGAGGCGAG CTGCACGGCG GGGCGCGCCG CCTTCATCAC CGGCGAGCTG
CCGATCCGCA CCGGCATGAC CACGGTTGGC CAGGCCGGCG CCGCCATCGG TATTCCGGCG
GAGGCCGTGA CCATCGCGAC CGCGCTGAAG GGCATGGGCT ACGCCACCGG CCAGTTCGGC
AAGAACCACC TGGGCGACAA GAACGAGTTC CTGCCGACGG TGCACGGCTT CGACGAGTTC
TTCGGCTACC TGTATCACCT CGACGCGATG GAGGACCCGG CGCACCCCGC CTATCCGCAA
GAACTGCTGA ACAGGGTCGG CCCGCGCAAC ATGGTTCATT CGTGGGCGAC GAACGTGGAC
GATCCCACCG ACGATCCGCG CTGGGGCAGG GTCGGCAAGC AGCGGATTGA GGATGCCGGG
ACGCTCTACC CGAAGCGGAT GGAGACAATC GACGACGAAA TCCGCGACCT GGCGCTCGGC
TTCATCGACA AGGCCAAGGC CAACGGCAAG CCTTTTTTCG TCTGGCTCAA CCCGACCCGC
ATGCACGTCA CCACGCACCT ATCGCCGAAG TATCAGGCGA TGCGCAACTC CAAGAACGGC
TGGAGCATCC AGGAAGCCGG CATGGCGCAG ATCGACGATG TCGTTGGCGC GGTGATGAAG
AAGCTGAAGG ACTTAGGGGT CGACGACAAC ACCATCGTGG TCTTCACCAC CGACAACGGC
ACCGAGGTCT TCACCTGGCC AGACGGCGGG CAGACACCGT TCGCGCAGTC GAAGGGCACG
GTCATGGAGG GCGGCTTCCG CGCGCCGGCT ATGGTTCGCT GGCCCGGCAA GGTGCCGGCC
GGCACGGTCG ACAACGGCGT CATCTCGGGC CTCGACTGGT TCCCGACGCT TGTGGCGGCG
GCGGGCAACC CGGACATCGG CGAGGAGCTG AAAAAGGGCA AGCAGATCGC CGACCAGACC
TACAAGGTGC ACCTGGACGG CTACAACCAG CTGGACCTGA TCACCGGCAA GGGACCATCG
AAGCGCAACG AGGTCTGGTA TTTCGGCGAG AGCGAGCTTG GGGCTGTCCG GATCGGCGAC
TACAAGTACC GCTTCATCGA CCAGCCCGGC GGGTGGCTCG GCGACAAAAC CAAGCCTGAC
GTGCCCTACA TCACTAACCT GCGGCTCGAC CCCTTTGAGC GCACGGGGTG GCCCGATAGC
GGGACGAAGA TCGGCACACA GAACTACATG AACTGGTTCT TGTATGAGTT CTGGCGCTTC
ACCTTTGTCC AGCAGGAGGT GGAGAAGCTT GCCATGACGG CGGTCGAGTT CCCGCCGATG
CAGAAGGGCG CGAGCTTCAA TCTCGAAGCG GTCAAGGCGA AGATCGTGGC GGCTAGGTCG
GCAATGGGGA AGTAA
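
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks must be removed before any computation. The sketch below shows one possible way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first line of the sequence is used as input, so the result is not directly comparable to the GC content reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGCCACG ACGACACGCA CGAGAAGCAC CAGGAGTGCC CTGGAACGGG AAGCGCCCTC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))  # 60 65
```

Passing the full text of the gene sequence to `clean_sequence` and then to `gc_fraction` would give the value for the whole gene.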
 
Protein sequence
MSHDDTHEKH QECPGTGSAL SRRSILLTGT SALAAAALVS TAQAQQQTQQ QPTAVGQKPN 
IIVIMGDDIG IWNIGAYHRG MMAGRTPHID QLAAEGMLFT DYYAEASCTA GRAAFITGEL
PIRTGMTTVG QAGAAIGIPA EAVTIATALK GMGYATGQFG KNHLGDKNEF LPTVHGFDEF
FGYLYHLDAM EDPAHPAYPQ ELLNRVGPRN MVHSWATNVD DPTDDPRWGR VGKQRIEDAG
TLYPKRMETI DDEIRDLALG FIDKAKANGK PFFVWLNPTR MHVTTHLSPK YQAMRNSKNG
WSIQEAGMAQ IDDVVGAVMK KLKDLGVDDN TIVVFTTDNG TEVFTWPDGG QTPFAQSKGT
VMEGGFRAPA MVRWPGKVPA GTVDNGVISG LDWFPTLVAA AGNPDIGEEL KKGKQIADQT
YKVHLDGYNQ LDLITGKGPS KRNEVWYFGE SELGAVRIGD YKYRFIDQPG GWLGDKTKPD
VPYITNLRLD PFERTGWPDS GTKIGTQNYM NWFLYEFWRF TFVQQEVEKL AMTAVEFPPM
QKGASFNLEA VKAKIVAARS AMGK
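
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 30 and the last 15 bases of the gene. It is an illustration under stated assumptions, not the database's own procedure: the codon table is built from the standard NCBI amino-acid ordering, whose codon assignments are shared by translation table 11 (table 11 differs in its permitted start codons, which this sketch does not model). The `translate` helper is hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 15 bases of the gene sequence, copied from the record.
print(translate("ATGAGCCACG ACGACACGCA CGAGAAGCAC"))  # MSHDDTHEKH
print(translate("GCAATGGGGA AGTAA"))                  # AMGK*
```

The two outputs match the start (`MSHDDTHEKH`) and the end (`AMGK`, followed by the stop codon) of the protein sequence listed in the record.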