Gene Information |
Locus tag | Mext_0526 |
Symbol | |
ID | 5833842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 571349 |
End bp | 573043 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641366303 |
Product | sulfatase |
Protein accession | YP_001638012 |
Protein GI | 163849969 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
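The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record (Start bp, End bp).
length = gene_length(571349, 573043)
print(length)                  # 1695
print(protein_length(length))  # 564
```

Under these assumptions both results agree with the Gene Length and Protein Length rows.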
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCACG ACGACACGCA CGAGAAGCAC CAGGAGTGCC CTGGAACGGG AAGCGCCCTC AGCCGCCGCA GCATCCTTCT CACCGGGACC TCGGCCCTCG CGGCCGCCGC GCTGGTGTCT ACCGCCCAGG CCCAGCAGCA AACCCAGCAG CAACCCACAG CAGTCGGGCA GAAGCCGAAT ATCATCGTCA TCATGGGTGA CGATATCGGC ATCTGGAACA TCGGCGCCTA TCACCGCGGC ATGATGGCTG GTCGCACGCC CCATATCGAT CAGTTGGCCG CAGAGGGCAT GCTGTTCACC GACTATTACG CCGAGGCGAG CTGCACGGCG GGGCGCGCCG CCTTCATCAC CGGCGAGCTG CCGATCCGCA CCGGCATGAC CACGGTTGGC CAGGCCGGCG CCGCCATCGG TATTCCGGCG GAGGCCGTGA CCATCGCGAC CGCGCTGAAG GGCATGGGCT ACGCCACCGG CCAGTTCGGC AAGAACCACC TGGGCGACAA GAACGAGTTC CTGCCGACGG TGCACGGCTT CGACGAGTTC TTCGGCTACC TGTATCACCT CGACGCGATG GAGGACCCGG CGCACCCCGC CTATCCGCAA GAACTGCTGA ACAGGGTCGG CCCGCGCAAC ATGGTTCATT CGTGGGCGAC GAACGTGGAC GATCCCACCG ACGATCCGCG CTGGGGCAGG GTCGGCAAGC AGCGGATTGA GGATGCCGGG ACGCTCTACC CGAAGCGGAT GGAGACAATC GACGACGAAA TCCGCGACCT GGCGCTCGGC TTCATCGACA AGGCCAAGGC CAACGGCAAG CCTTTTTTCG TCTGGCTCAA CCCGACCCGC ATGCACGTCA CCACGCACCT ATCGCCGAAG TATCAGGCGA TGCGCAACTC CAAGAACGGC TGGAGCATCC AGGAAGCCGG CATGGCGCAG ATCGACGATG TCGTTGGCGC GGTGATGAAG AAGCTGAAGG ACTTAGGGGT CGACGACAAC ACCATCGTGG TCTTCACCAC CGACAACGGC ACCGAGGTCT TCACCTGGCC AGACGGCGGG CAGACACCGT TCGCGCAGTC GAAGGGCACG GTCATGGAGG GCGGCTTCCG CGCGCCGGCT ATGGTTCGCT GGCCCGGCAA GGTGCCGGCC GGCACGGTCG ACAACGGCGT CATCTCGGGC CTCGACTGGT TCCCGACGCT TGTGGCGGCG GCGGGCAACC CGGACATCGG CGAGGAGCTG AAAAAGGGCA AGCAGATCGC CGACCAGACC TACAAGGTGC ACCTGGACGG CTACAACCAG CTGGACCTGA TCACCGGCAA GGGACCATCG AAGCGCAACG AGGTCTGGTA TTTCGGCGAG AGCGAGCTTG GGGCTGTCCG GATCGGCGAC TACAAGTACC GCTTCATCGA CCAGCCCGGC GGGTGGCTCG GCGACAAAAC CAAGCCTGAC GTGCCCTACA TCACTAACCT GCGGCTCGAC CCCTTTGAGC GCACGGGGTG GCCCGATAGC GGGACGAAGA TCGGCACACA GAACTACATG AACTGGTTCT TGTATGAGTT CTGGCGCTTC ACCTTTGTCC AGCAGGAGGT GGAGAAGCTT GCCATGACGG CGGTCGAGTT CCCGCCGATG CAGAAGGGCG CGAGCTTCAA TCTCGAAGCG GTCAAGGCGA AGATCGTGGC GGCTAGGTCG GCAATGGGGA AGTAA
Protein sequence | MSHDDTHEKH QECPGTGSAL SRRSILLTGT SALAAAALVS TAQAQQQTQQ QPTAVGQKPN IIVIMGDDIG IWNIGAYHRG MMAGRTPHID QLAAEGMLFT DYYAEASCTA GRAAFITGEL PIRTGMTTVG QAGAAIGIPA EAVTIATALK GMGYATGQFG KNHLGDKNEF LPTVHGFDEF FGYLYHLDAM EDPAHPAYPQ ELLNRVGPRN MVHSWATNVD DPTDDPRWGR VGKQRIEDAG TLYPKRMETI DDEIRDLALG FIDKAKANGK PFFVWLNPTR MHVTTHLSPK YQAMRNSKNG WSIQEAGMAQ IDDVVGAVMK KLKDLGVDDN TIVVFTTDNG TEVFTWPDGG QTPFAQSKGT VMEGGFRAPA MVRWPGKVPA GTVDNGVISG LDWFPTLVAA AGNPDIGEEL KKGKQIADQT YKVHLDGYNQ LDLITGKGPS KRNEVWYFGE SELGAVRIGD YKYRFIDQPG GWLGDKTKPD VPYITNLRLD PFERTGWPDS GTKIGTQNYM NWFLYEFWRF TFVQQEVEKL AMTAVEFPPM QKGASFNLEA VKAKIVAARS AMGK
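The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below, with hypothetical helper names, strips the whitespace, computes GC content, and translates the coding strand. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA even though the strand is listed as "-"). It uses the standard codon-to-amino-acid assignments, which translation table 11 shares. Table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAGCCACG ACGACACGCA CGAGAAGCAC"))  # MSHDDTHEKH
```

Passing the full gene sequence to `gc_percent` and `translate` should allow a comparison against the GC content and protein sequence rows of the record.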