Gene Information |
Locus tag | Pmen_0064 |
Symbol | |
ID | 5106249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas mendocina ymp |
Kingdom | Bacteria |
Replicon accession | NC_009439 |
Strand | - |
Start bp | 60993 |
End bp | 62501 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640501280 |
Product | sulfatase |
Protein accession | YP_001185570 |
Protein GI | 146305105 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
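The coordinate and length fields above are tied together by simple arithmetic, which the short sketch below checks. The function names are illustrative and not part of any database API. The sketch assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(60993, 62501)
print(length)                     # 1509
print(protein_length_aa(length))  # 502
```

Both values agree with the Gene Length and Protein Length fields of this record.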
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCC CGAACATCCT CTTCATCATG GCCGACCAGA TGGCCGCACC GATCCTGCCG CTGCACGATG CCGCCTCGCC GGTGCAGATG CCCAACCTGA TGAAACTGGC CGAACGGGCC GTGGTGTTCG ACTCGGCCTA CTGCAACAGC CCGCTGTGCG CGCCGTCGCG CTTCACCCTG GTCAGCGGCC GGCTGCCGTC GCAGATCGGC GCCTATGACA ACGCCGCCGA CTTTCCCGCC GACGTGCCCA CCTACGCCCA CTACCTGCGC CGCCTCGGCT ACCGCACCGC GCTGTCGGGC AAGATGCACT TCTGCGGCCC GGACCAGCTG CACGGCTACG AGGAACGCCT GACCAGCGAC ATCTACCCGG CCGACTACGG CTGGGCGGTG AACTGGGACG AACCGGACGT GCGCGCCAGC TGGTACCACA ACATGTCCTC GGTGCTGCAG GCCGGCCCCT GCGTGCGCAG CAACCAGCTG GATTTCGACG AGGAGGTGGT GTTCAAGGCC CGCCAGTACC TCTACGACCA CGTGCGTCTG ACGCCGGAGC AGCCGTTCTG CCTGACCGTG TCGATGACCC ACCCCCATGA CCCCTACACC ATCCCGCGCG AATACTGGGA GCGCTACGAA GGCGTGGACA TTCCCATGCC GCGCCAGCAC ATCGAGCAGG CCGAGCAGGA CCCGCACTCG CAGCGCTTGC TCAAGGTCAT CGACCTGTGG GACAAGCCGC TGCCGGCAGA CAAGATCCGC GACGCCCGCC GCGCCTACTT CGGTGCCTGC AGCTATATCG ACGACAACAT CGGCAAGCTG CTGAAGACCC TGGAGGAATG CGGCCTGGCC GAGGACACCC TGATCGTCTT CTCCGGCGAC CACGGCGACA TGCTTGGCGA GCGCGGCCTC TGGTACAAGA TGCACTGGTT CGAGATGGCC GCCCGTGTGC CGCTGCTGGT GCATGCGCCG CAGCGCTTCG CCGCGCACCG GGTCAGCCAG TCGGTCTCCA CCCTCGATCT GCTGCCCACA CTGGTGGAAC TGGCCGGCGG CCGGGTCGAG GAGGGCCTGG CGCTGGAGGG CCGCTCGCTG CTGGCGCATC TGACTGGCGA AGGCGGACAT GACGAGGTAA TCGGCGAATA TATGGCCGAA GGCACCACCA GCCCGCTGAT GATGATCCGC CGCGGGCCGT GGAAATTCGT CTACTCCGAG CAGGATCCGC TGCTGCTGTT CCATCTCCAG CAGGACCCGC AGGAGCGCGA GAATCTGGCC GGCTCGGCGG ACCACCAGGG CGTGCTGGCC GAGTTTCTCG CCGAGGCGCG GGCGCGCTGG GACATCCCCG CGATCCACGC CGCCACCCTG GCCAGCCAGC GCCGTCGGCG CCTGGTGGCA GAGGCCCTGA GCCAGGGCAA GCTGACCAGC TGGGACCATC AGCCCTGGGT CGACGCCAGC CAGCAGTACA TGCGCAACCA TATCGATCTC GACGACCTGG AACGTCGCGC CCGCTATCCA CAGGTTTGA
|
Protein sequence | MKRPNILFIM ADQMAAPILP LHDAASPVQM PNLMKLAERA VVFDSAYCNS PLCAPSRFTL VSGRLPSQIG AYDNAADFPA DVPTYAHYLR RLGYRTALSG KMHFCGPDQL HGYEERLTSD IYPADYGWAV NWDEPDVRAS WYHNMSSVLQ AGPCVRSNQL DFDEEVVFKA RQYLYDHVRL TPEQPFCLTV SMTHPHDPYT IPREYWERYE GVDIPMPRQH IEQAEQDPHS QRLLKVIDLW DKPLPADKIR DARRAYFGAC SYIDDNIGKL LKTLEECGLA EDTLIVFSGD HGDMLGERGL WYKMHWFEMA ARVPLLVHAP QRFAAHRVSQ SVSTLDLLPT LVELAGGRVE EGLALEGRSL LAHLTGEGGH DEVIGEYMAE GTTSPLMMIR RGPWKFVYSE QDPLLLFHLQ QDPQERENLA GSADHQGVLA EFLAEARARW DIPAIHAATL ASQRRRRLVA EALSQGKLTS WDHQPWVDAS QQYMRNHIDL DDLERRARYP QV
|
| |
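The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips the spacing and translates codons until the first stop. The helper names `clean` and `translate` are hypothetical. The amino acid assignments are those shared by NCBI translation tables 1 and 11. The sketch ignores the alternative start codons of table 11, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; tables 1 and 11 share these assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence, then its last nine bases.
print(translate("ATGAAACGCC CGAACATCCT CTTCATCATG"))  # MKRPNILFIM
print(translate("CAGGTTTGA"))                         # QV
```

The two outputs match the first ten and last two residues of the protein sequence in this record. Passing the full gene sequence to `translate` would be the natural way to check the remaining residues.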