Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PP_0077 |
Symbol | betC |
ID | 1043558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pseudomonas putida KT2440 |
Kingdom | Bacteria |
Replicon accession | NC_002947 |
Strand | - |
Start bp | 87799 |
End bp | 89316 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637143449 |
Product | choline sulfatase |
Protein accession | NP_742247 |
Protein GI | 26986822 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR03417] choline-sulfatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.604146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000459968 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCGTC CGAATATCCT GTTCATCATG GCCGACCAGA TGGCCGCGCC CTTGCTGCCG ATCTACGGCC CTTCGCCCAT CAAGATGCCG CACCTCGGCC GTCTGGCCGA GCAAGCCGTG GTGTTCGACT CGGCCTACTG CAACAGCCCA CTGTGCGCGC CATCACGCTT CACCCTGGTC AGCGGTCAGT TGCCCAGCCG CATTGGCGCC TACGACAACG CGGCCGACTT CCCTGCCGAT GTGCCGACCT ACGCCCATTA CCTGCGTCGC CTGGGCTACC GCACCGCGCT GTCGGGCAAG ATGCACTTCT GCGGCCCGGA CCAGTTGCAC GGCTATGAAG AACGCCTGAC CAGCGACATC TACCCGGCCG ACTACGGTTG GGCGGTGAAC TGGGACGAAC CCGATGTGCG TCCAAGCTGG TACCACAACA TGTCCTCGGT GCTGCAGGCG GGTCCGTGCG TGCGCACCAA TCAGCTGGAT TTCGACGAGG AGGTGGTGTT CAAGGCGCGC CAGTACCTGT ACGACCATGT GCGCGAAAAC GATGGCCGGC CATTTTGCCT GACCGTTTCG ATGACCCACC CGCATGACCC CTACACCATT GCCAAACGCT ACTGGGACCG CTACGAAGGT GTGGATATCC CCATGCCCCG TGCCGAGTTC AGCCAGGCAG AACTCGACCC GCATTCACAG CGCCTGCTGA AGGTCTACGA CCTTTGGAAC AAGCCACTGC CTGTGGATAA GGTTCGCGAT GCCCGCCGCG CCTACTTCGG CGCGTGCAGC TATATCGATG ACAACATCGG CCAATTGCTG CAGACCCTGG AGGAATGCAA CCTGGCCGAT GACACACTGA TCGTGTTTTC CGGCGACCAC GGCGACATGC TTGGCGAGCG TGGCCTCTGG TACAAAATGC ACTGGTTCGA AATGTCGGCG CGGGTGCCGC TGCTGATCCA CGCGCCGAAG CGCTTCGCGG CGGGGCGGGT CACTGCCTCG GTGTCGACCT GCGACCTGTT GCCAACCTTG GTCGAACTGG CTGGCGGCGC TGTGGATAAA GACCTGCAGC TGGACGGCCG CTCACTTCTG GGCCATCTGC AAGGGCAGGG CGGTCACGAC GAGGTGATCG GCGAGTATAT GGCCGAAGGC ACCGTCGGCC CGCTGATGAT GATTCGCCGC GGGCCCTACA AGTTCGTGTA CAGCGAAGAC GACCCATGCC TACTCTATGA CCTGAGCCGC GACCCGCACG AGCGGGAGAA CCTCACCGGC AGCCCGGACC ACCAGGTGCT GCTGCAGGCA TTTGTCGATG AAGCGCAACA GCGCTGGGAC ATCACCAGCC TGCGCCAGCA GGTACTGGCC AGCCAGCGCC GCCGCCGGCT GGTGGCCGAA GCGCTGGCCA TCGGCAAGCT GAAAAGCTGG GACCACCAAC CACTGGTGGA CGCCAGCCAA CAGTACATGC GCAACCACAT CGATCTCGAT GACCTCGAAC GCAAGGCACG TTATCCACAG CCCGCCCCCC TGGATTGA
|
Protein sequence | MTRPNILFIM ADQMAAPLLP IYGPSPIKMP HLGRLAEQAV VFDSAYCNSP LCAPSRFTLV SGQLPSRIGA YDNAADFPAD VPTYAHYLRR LGYRTALSGK MHFCGPDQLH GYEERLTSDI YPADYGWAVN WDEPDVRPSW YHNMSSVLQA GPCVRTNQLD FDEEVVFKAR QYLYDHVREN DGRPFCLTVS MTHPHDPYTI AKRYWDRYEG VDIPMPRAEF SQAELDPHSQ RLLKVYDLWN KPLPVDKVRD ARRAYFGACS YIDDNIGQLL QTLEECNLAD DTLIVFSGDH GDMLGERGLW YKMHWFEMSA RVPLLIHAPK RFAAGRVTAS VSTCDLLPTL VELAGGAVDK DLQLDGRSLL GHLQGQGGHD EVIGEYMAEG TVGPLMMIRR GPYKFVYSED DPCLLYDLSR DPHERENLTG SPDHQVLLQA FVDEAQQRWD ITSLRQQVLA SQRRRRLVAE ALAIGKLKSW DHQPLVDASQ QYMRNHIDLD DLERKARYPQ PAPLD
|
| |