Gene Mpe_A2680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2680 
Symbol 
ID4784042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2856198 
End bp2857952 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content68% 
IMG OID640091251 
Productarylsulfatase 
Protein accessionYP_001021869 
Protein GI124267865 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.246177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTTCTG AGGAGCCCAC GATGAGACTC ACCGCATCAC CGAAACCCAC CCTCGCCGCC 
TTGACGCTGG GCGCCGCCGC ACTCCTGTGC GTCGCGACGG CCGTGCGTGC CGAGGGCACG
GCCCCGCCAA TCCGCCCCAA CATCCTGCTG ATCGTCGCCG ACGACCTGGG CTACTCCGAC
CTCGGCGCCT ACGGCGGCGA GATCGACACG CCTCACCTGG ACGCGATCGC CCGTGACGGC
GTGCGCCTGA CGAGCTTCTA CTCGGCGCCG TTCTGCTCGC CCACGCGCGC CATGCTGATG
TCGGGCACGG ACAACCACCT GGCCGGCTTC GGCGGCATGG CCGAACTGCT CACCCCGGAC
CAGAAGGGCA GGCCGGGCTA CGAGGGCTTC CTCAACGATC GGGTCGTGCC CTTCCCGCAG
TTGCTGCGGG ACAGCGGGTA CCACACCTAC ATGGCTGGCA AGTGGCACCT CGGGGTCACG
CCCGAAGTGA GCCCGGCCCG GCGCGGCTTC GAGCAGTCGT ACGCGATGGT GCAGGGCGGC
GCCGGGCATT TCGACCAGAC CGGCATCATC ACCGGCGACC CAGCCAAGCC CCCGCGCGCG
ATCTATAACG AGAACGGCCA GCTCGTCGAC GTGCCCGCGC GAGGCTTCTA CTCGAGCGAG
TTCTTCGCGC GCCGCATGAT CTCCTACATC GACCGCGGCC GCGGCGACGG CAAGCCCTTC
TTCGGCTACC TCGCCTTCAC CGCGCCGCAC TGGCCCCTGC AGGCCTACGA CGAGACGATC
CGCAAGTACG AGGGCCGCTA CGACGTCGGC TACGACGCGA TCCGCGACCA GCGCACCACG
CGCCAGAAGG CGCTGGGCAT CATCCCGAAG GATGCGCAGG TCTACACCGG CCACCCGCTG
TGGCCGAAGT GGAGCACGCT GACCGCTGCG CAGAAGCAGA CCGAGTCCAA GCGCATGGCG
GTCTATGCCG CGATGGTCGA CGACATGGAT TACTACATCG GCGAAGTCGT GAACTACCTG
AAGAAGACCG GCCAGTACGA CAACACGCTG ATCCTCTTCA TGTCCGACAA CGGCGCGGAC
GGCAACACCG CCCTCGACGA AGGCCGCACG CGCGAGTGGG TGAAGACGCG CATGGACAAC
AGCCTTGCCA ACAGCGGGCG CAAGGGCTCC TACATCGATT ACGGCCCCAA CTGGGCCCAG
GTGGGCTCCA ACCCCTTCCA CCTGTACAAG GGCTTCCTGT ACGAGGGTGG AATCTCCGTG
CCGTTCATCG CCAGCTGGCC GGCACTGGGC CGCAAGGGGC AGATCAGCGA CAGCTTCGCC
CACACGATGG ACATCGCTCC CACGCTGCTG GAGCTGGCGG GCGCCCGCCA TCCGGGCACC
GAGTACCAGG GACGTGCGGT GCTCCCGCTG CGCGGTCGCT CGATGCTCGC GATGCTGACC
GGGCAGCGCG ACAGCGTGCA CCCGGCCGAT CACGTGCACG GCTGGGAACT CGGGGGCCGC
AAGGCGCTGC GCAAGGGGGA CTGGAAGATC GTCTACAGCA ACCGGCAGTG GGGCACCGGC
GAGTGGGAGC TCTACGACCT GTCCAAGGAT CGCAGCGAGC TCAACAACCT GGCAGCGTCC
CAGCCGGCCA AGTTGAGCGA ACTGGTGGCC GAGTACGAGC GCTACGTGCG CGAAGTGGGC
GTGGTGGACA TCCCCGGCCT GGCGGAGCGC AAGGGCTACA GCAACGGCAC ACGCTACTTC
GAGGACATGC AGTGA
 
Protein sequence
MCSEEPTMRL TASPKPTLAA LTLGAAALLC VATAVRAEGT APPIRPNILL IVADDLGYSD 
LGAYGGEIDT PHLDAIARDG VRLTSFYSAP FCSPTRAMLM SGTDNHLAGF GGMAELLTPD
QKGRPGYEGF LNDRVVPFPQ LLRDSGYHTY MAGKWHLGVT PEVSPARRGF EQSYAMVQGG
AGHFDQTGII TGDPAKPPRA IYNENGQLVD VPARGFYSSE FFARRMISYI DRGRGDGKPF
FGYLAFTAPH WPLQAYDETI RKYEGRYDVG YDAIRDQRTT RQKALGIIPK DAQVYTGHPL
WPKWSTLTAA QKQTESKRMA VYAAMVDDMD YYIGEVVNYL KKTGQYDNTL ILFMSDNGAD
GNTALDEGRT REWVKTRMDN SLANSGRKGS YIDYGPNWAQ VGSNPFHLYK GFLYEGGISV
PFIASWPALG RKGQISDSFA HTMDIAPTLL ELAGARHPGT EYQGRAVLPL RGRSMLAMLT
GQRDSVHPAD HVHGWELGGR KALRKGDWKI VYSNRQWGTG EWELYDLSKD RSELNNLAAS
QPAKLSELVA EYERYVREVG VVDIPGLAER KGYSNGTRYF EDMQ