Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PputW619_0096 |
Symbol | |
ID | 6109266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas putida W619 |
Kingdom | Bacteria |
Replicon accession | NC_010501 |
Strand | - |
Start bp | 103988 |
End bp | 105505 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641619852 |
Product | choline-sulfatase |
Protein accession | YP_001746971 |
Protein GI | 170719283 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.574472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0000926567 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCAAC CGAATATCCT GTTCATCATG GCCGACCAGA TGGCCGCGCC GATCTTGCCG ATATATGGCC CTTCACCGAT TCAGATGCCC CATCTGAGCC GCCTGGCCGA TCAGGCGGTG GTGTTCGATT CTGCCTACTG CAACAGCCCG CTGTGCGCAC CCTCGCGCTT TACCCTGGTA AGCGGCCAGT TGCCCAGTCG TATTGGCGCC TACGACAACG CCGCCGACTT CCCGGCCGAT ATCCCGACCT ACGCCCACTA CCTGCGCCGG CTCGGCTACC GCACCGCGCT GTCGGGCAAG ATGCATTTCT GCGGCCCGGA CCAGTTGCAC GGCTACGAAG AACGCTTGAC CAGCGACATC TACCCCGCCG ACTACGGCTG GGCAGTTAAC TGGGATGCCC CGGACAAGCG CGCAAGCTGG TATCACAACA TGTCATCGGT ATTGCAGGCC GGCCCTTGCG TGCGTACTAA CCAGCTGGAT TTCGACGAAG AAGTGGTATT CAAGGCCCGT CAGTACCTTT ACGACCATGT CCGCGAGAAC GATGGTCGGC CGTTCTGCCT CACCGTGTCC ATGACCCACC CCCACGACCC GTACACCATC CCCAAGCGCT ACTGGGATCG CTACGAGGGT GTGGATATCC CCATGCCCCG TACCGAATTC GGCCAGCACG AACTCGACCC GCACTCACAG CGCCTGCTCA AGGTGTACGA CCTGTGGAAC AAGCCCCTGC CTGTGGACAA GATCCGCGAT GCACGCCGTG CCTACTTCGG TGCCTGCAGC TACATCGACG ACAACATCGG CCAGCTGTTG CAGACCCTGG AGGAGTGCAA CCTGGCCGAC GACACCCTGG TCGTTTTCTC TGGCGATCAC GGCGACATGC TTGGCGAGCG CGGGCTCTGG TACAAGATGC ACTGGTTCGA GATGTCGGCG CGGGTGCCGC TGCTGGTCCA CGCGCCCAAG CGCTTCGCCC CGACGCGGGT CAGTGCTTCG GTATCGACCT GCGACCTGCT GCCGACGCTG GTCGAGCTTG CCGGCGGTAC TGTGGATAAA AGCTTGCATC TGGACGGCCA ATCGCTGCTC GGCCACCTGC AAGGGCAGGG TGGCCATGAC CAAGTGATCG GCGAGTACAT GGCCGAGGGT ACCGTCGGAC CATTGATGAT GATCCGCCGT GGCTCGTACA AGTTCGTGTA CAGCGAAGAC GACCCATGTT TACTCTATGA CCTGAGCCGC GACCCGCACG AGCGGGAGAA CCTCACCGGC AGCCCGGACC ATCAGGCGCT GCTGCAGGCA TTTGTCGACG AAGCACGCCA GCGCTGGGAT ATCCCCGGCC TGCGCCAGCA GGTACTGGCC AGCCAGCGGC GCCGCCGCCT GGTTGCCGAA GCGCTGGCCA TCGGCAAACT GAAAAGCTGG GACCACCAAC CGCTGGTGGA TGCCAGCCAA CAGTACATGC GCAACCACAT CGATCTCGAC GACCTCGAGC GCAAGGCACG TTATCCACAG CCCGCACCCC TGGATTGA
|
Protein sequence | MTQPNILFIM ADQMAAPILP IYGPSPIQMP HLSRLADQAV VFDSAYCNSP LCAPSRFTLV SGQLPSRIGA YDNAADFPAD IPTYAHYLRR LGYRTALSGK MHFCGPDQLH GYEERLTSDI YPADYGWAVN WDAPDKRASW YHNMSSVLQA GPCVRTNQLD FDEEVVFKAR QYLYDHVREN DGRPFCLTVS MTHPHDPYTI PKRYWDRYEG VDIPMPRTEF GQHELDPHSQ RLLKVYDLWN KPLPVDKIRD ARRAYFGACS YIDDNIGQLL QTLEECNLAD DTLVVFSGDH GDMLGERGLW YKMHWFEMSA RVPLLVHAPK RFAPTRVSAS VSTCDLLPTL VELAGGTVDK SLHLDGQSLL GHLQGQGGHD QVIGEYMAEG TVGPLMMIRR GSYKFVYSED DPCLLYDLSR DPHERENLTG SPDHQALLQA FVDEARQRWD IPGLRQQVLA SQRRRRLVAE ALAIGKLKSW DHQPLVDASQ QYMRNHIDLD DLERKARYPQ PAPLD
|
| |