Gene Information |
Locus tag | Mpe_A2680 |
Symbol | |
ID | 4784042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2856198 |
End bp | 2857952 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640091251 |
Product | arylsulfatase |
Protein accession | YP_001021869 |
Protein GI | 124267865 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
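
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for locus tag Mpe_A2680.
length = gene_length(2856198, 2857952)
aa = protein_length(length)
print(length, aa)
```

With the values in this record, the computed lengths should agree with the "Gene Length" and "Protein Length" fields.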
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.246177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTGTTCTG AGGAGCCCAC GATGAGACTC ACCGCATCAC CGAAACCCAC CCTCGCCGCC TTGACGCTGG GCGCCGCCGC ACTCCTGTGC GTCGCGACGG CCGTGCGTGC CGAGGGCACG GCCCCGCCAA TCCGCCCCAA CATCCTGCTG ATCGTCGCCG ACGACCTGGG CTACTCCGAC CTCGGCGCCT ACGGCGGCGA GATCGACACG CCTCACCTGG ACGCGATCGC CCGTGACGGC GTGCGCCTGA CGAGCTTCTA CTCGGCGCCG TTCTGCTCGC CCACGCGCGC CATGCTGATG TCGGGCACGG ACAACCACCT GGCCGGCTTC GGCGGCATGG CCGAACTGCT CACCCCGGAC CAGAAGGGCA GGCCGGGCTA CGAGGGCTTC CTCAACGATC GGGTCGTGCC CTTCCCGCAG TTGCTGCGGG ACAGCGGGTA CCACACCTAC ATGGCTGGCA AGTGGCACCT CGGGGTCACG CCCGAAGTGA GCCCGGCCCG GCGCGGCTTC GAGCAGTCGT ACGCGATGGT GCAGGGCGGC GCCGGGCATT TCGACCAGAC CGGCATCATC ACCGGCGACC CAGCCAAGCC CCCGCGCGCG ATCTATAACG AGAACGGCCA GCTCGTCGAC GTGCCCGCGC GAGGCTTCTA CTCGAGCGAG TTCTTCGCGC GCCGCATGAT CTCCTACATC GACCGCGGCC GCGGCGACGG CAAGCCCTTC TTCGGCTACC TCGCCTTCAC CGCGCCGCAC TGGCCCCTGC AGGCCTACGA CGAGACGATC CGCAAGTACG AGGGCCGCTA CGACGTCGGC TACGACGCGA TCCGCGACCA GCGCACCACG CGCCAGAAGG CGCTGGGCAT CATCCCGAAG GATGCGCAGG TCTACACCGG CCACCCGCTG TGGCCGAAGT GGAGCACGCT GACCGCTGCG CAGAAGCAGA CCGAGTCCAA GCGCATGGCG GTCTATGCCG CGATGGTCGA CGACATGGAT TACTACATCG GCGAAGTCGT GAACTACCTG AAGAAGACCG GCCAGTACGA CAACACGCTG ATCCTCTTCA TGTCCGACAA CGGCGCGGAC GGCAACACCG CCCTCGACGA AGGCCGCACG CGCGAGTGGG TGAAGACGCG CATGGACAAC AGCCTTGCCA ACAGCGGGCG CAAGGGCTCC TACATCGATT ACGGCCCCAA CTGGGCCCAG GTGGGCTCCA ACCCCTTCCA CCTGTACAAG GGCTTCCTGT ACGAGGGTGG AATCTCCGTG CCGTTCATCG CCAGCTGGCC GGCACTGGGC CGCAAGGGGC AGATCAGCGA CAGCTTCGCC CACACGATGG ACATCGCTCC CACGCTGCTG GAGCTGGCGG GCGCCCGCCA TCCGGGCACC GAGTACCAGG GACGTGCGGT GCTCCCGCTG CGCGGTCGCT CGATGCTCGC GATGCTGACC GGGCAGCGCG ACAGCGTGCA CCCGGCCGAT CACGTGCACG GCTGGGAACT CGGGGGCCGC AAGGCGCTGC GCAAGGGGGA CTGGAAGATC GTCTACAGCA ACCGGCAGTG GGGCACCGGC GAGTGGGAGC TCTACGACCT GTCCAAGGAT CGCAGCGAGC TCAACAACCT GGCAGCGTCC CAGCCGGCCA AGTTGAGCGA ACTGGTGGCC GAGTACGAGC GCTACGTGCG CGAAGTGGGC GTGGTGGACA TCCCCGGCCT GGCGGAGCGC AAGGGCTACA GCAACGGCAC ACGCTACTTC GAGGACATGC AGTGA
Protein sequence | MCSEEPTMRL TASPKPTLAA LTLGAAALLC VATAVRAEGT APPIRPNILL IVADDLGYSD LGAYGGEIDT PHLDAIARDG VRLTSFYSAP FCSPTRAMLM SGTDNHLAGF GGMAELLTPD QKGRPGYEGF LNDRVVPFPQ LLRDSGYHTY MAGKWHLGVT PEVSPARRGF EQSYAMVQGG AGHFDQTGII TGDPAKPPRA IYNENGQLVD VPARGFYSSE FFARRMISYI DRGRGDGKPF FGYLAFTAPH WPLQAYDETI RKYEGRYDVG YDAIRDQRTT RQKALGIIPK DAQVYTGHPL WPKWSTLTAA QKQTESKRMA VYAAMVDDMD YYIGEVVNYL KKTGQYDNTL ILFMSDNGAD GNTALDEGRT REWVKTRMDN SLANSGRKGS YIDYGPNWAQ VGSNPFHLYK GFLYEGGISV PFIASWPALG RKGQISDSFA HTMDIAPTLL ELAGARHPGT EYQGRAVLPL RGRSMLAMLT GQRDSVHPAD HVHGWELGGR KALRKGDWKI VYSNRQWGTG EWELYDLSKD RSELNNLAAS QPAKLSELVA EYERYVREVG VVDIPGLAER KGYSNGTRYF EDMQ
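
The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal Python sketch of how such a string might be cleaned and its GC fraction computed, assuming the input contains only the letters A, C, G, T and whitespace. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two 10-base blocks of the gene sequence shown in this record.
first_blocks = "ATGTGTTCTG AGGAGCCCAC"
print(clean_sequence(first_blocks))
print(gc_fraction(first_blocks))
```

Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the "GC content" field, which is reported as a rounded percentage.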