Gene Information |
Locus tag | PputGB1_0093 |
Symbol | |
ID | 5867781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas putida GB-1 |
Kingdom | Bacteria |
Replicon accession | NC_010322 |
Strand | - |
Start bp | 100650 |
End bp | 102167 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641545166 |
Product | choline-sulfatase |
Protein accession | YP_001666345 |
Protein GI | 167031114 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
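
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it agrees with the reported 1518 bp) and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 100650
end_bp = 102167

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons; the last codon
# is assumed to be the stop codon and contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1518 0 505
```

Both derived values match the Gene Length and Protein Length fields in the record.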
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000000000895462 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACGCGCC CGAATATCCT GTTCATCATG GCCGACCAGA TGGCCGCACC CTTGCTGCCG ATCTACGCCC CTTCGCCCAT CCAGATGCCG CACCTGAGCC GCCTGGCCGA GCAGGCGGTG GTGTTCGAGT CGGCCTACTG CAACAGCCCG CTGTGCGCAC CGTCCCGCTT TACCCTGGTC AGCGGCCAAC TGCCTAGCCG CATCGGAGCC TACGACAATG CCGCCGATTT CCCTGCCGAT GTGCCGACCT ATGCCCACTA CCTGCGCCGC CTGGGTTACC GCACCGCACT GTCAGGCAAG ATGCACTTCT GCGGCCCGGA CCAACTGCAT GGCTACGAAG AGCGCCTGAC CAGCGATATT TACCCAGCCG ACTATGGCTG GGCAGTGAAC TGGGATGAGC CGGATGTTCG CCCGAGCTGG TACCACAACA TGTCCTCGGT GCTGCAGGCC GGTCCGTGCG TGCGCACCAA CCAGCTGGAT TTCGACGAAG AAGTGGTGTT CAAGGCACGC CAGTACCTGT ACGACCACGT GCGTGATAAC GATGGCCGAC CGTTCTGCCT GACCGTGTCC ATGACCCACC CGCACGACCC CTACACCATC CCCAAGCGTT ACTGGGACCG CTACGAGGGT GTGGATATCC CCATGCCCCG TGCCGAGTTC GGTCAGGCAG AACTCGACCC GCATTCGCAG CGCCTGCTGA AGGTCTATGA CCTGTGGAAC AAGCCGCTGC CTATGGAAAA GATCCGCGAC GCCCGCCGCG CCTACTTCGG CGCTTGCAGC TACATCGACG ACAACATCGG CCAGCTGCTG CAAACCCTGG AGGAGTGCAA CCTCGCCGAC GACACCCTGA TCGTGTTCTC CGGCGACCAC GGCGACATGC TTGGCGAGCG AGGCCTCTGG TACAAGATGC ACTGGTTCGA GATGTCGGCG CGGGTTCCGC TGCTTGTCCA TGCGCCCAAG CGCTTTGCAG CAGGCCGGGT CAGCGCCTCG GTATCGACCT GCGACCTGCT GCCAACCCTG GTCGAACTGG CCGGCGGGGC TGTGGATAAA AGCCTGCACC TGGACGGCCG CTCGCTTGTC GGCCATCTGC AAGGGCAGGG CGGTCACGAT GAAGTGATCG GCGAATACAT GGCCGAAGGC ACCGTCGGCC CGCTGATGAT GATCCGCCGC GGGCCGTACA AGTTCGTGTA CAGCGAGGAT GACCCCAGCC TACTCTATGA CCTGAGCCGC GACCCGCACG AGCGGGAGAA CCTCACCGGC AGCCCGGAGC ATCAGGCGCT GCTGCAGGCA TTTGTCGATG AAGCACAACA GCGCTGGGAT ATCCCCAGCC TGCGCCAGCA GGTACTGGCC AGCCAGCGGC GCCGCCGCCT GGTGGCCGAG GCGCTGGCCA TCGGCACGCT GAAAAGCTGG GACCATCAAC CGCTGGTGGA CGCCAGCCAA CAGTACATGC GCAACCACAT CGATCTCGAC GACCTCGAGC GCAAGGCACG TTATCCACAG CCCGCCCCCC TGGATTGA
Protein sequence | MTRPNILFIM ADQMAAPLLP IYAPSPIQMP HLSRLAEQAV VFESAYCNSP LCAPSRFTLV SGQLPSRIGA YDNAADFPAD VPTYAHYLRR LGYRTALSGK MHFCGPDQLH GYEERLTSDI YPADYGWAVN WDEPDVRPSW YHNMSSVLQA GPCVRTNQLD FDEEVVFKAR QYLYDHVRDN DGRPFCLTVS MTHPHDPYTI PKRYWDRYEG VDIPMPRAEF GQAELDPHSQ RLLKVYDLWN KPLPMEKIRD ARRAYFGACS YIDDNIGQLL QTLEECNLAD DTLIVFSGDH GDMLGERGLW YKMHWFEMSA RVPLLVHAPK RFAAGRVSAS VSTCDLLPTL VELAGGAVDK SLHLDGRSLV GHLQGQGGHD EVIGEYMAEG TVGPLMMIRR GPYKFVYSED DPSLLYDLSR DPHERENLTG SPEHQALLQA FVDEAQQRWD IPSLRQQVLA SQRRRRLVAE ALAIGTLKSW DHQPLVDASQ QYMRNHIDLD DLERKARYPQ PAPLD
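
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code; table 11 differs only in the start codons it permits. The function and constant names are hypothetical, and only the first 30 bases of the record are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
first_30_bases = "ATGACGCGCC CGAATATCCT GTTCATCATG"
print(translate(first_30_bases))  # prints: MTRPNILFIM
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.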