Gene PP_0077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPP_0077 
SymbolbetC 
ID1043558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas putida KT2440 
KingdomBacteria 
Replicon accessionNC_002947 
Strand
Start bp87799 
End bp89316 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content63% 
IMG OID637143449 
Productcholine sulfatase 
Protein accessionNP_742247 
Protein GI26986822 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.604146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000459968 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCGTC CGAATATCCT GTTCATCATG GCCGACCAGA TGGCCGCGCC CTTGCTGCCG 
ATCTACGGCC CTTCGCCCAT CAAGATGCCG CACCTCGGCC GTCTGGCCGA GCAAGCCGTG
GTGTTCGACT CGGCCTACTG CAACAGCCCA CTGTGCGCGC CATCACGCTT CACCCTGGTC
AGCGGTCAGT TGCCCAGCCG CATTGGCGCC TACGACAACG CGGCCGACTT CCCTGCCGAT
GTGCCGACCT ACGCCCATTA CCTGCGTCGC CTGGGCTACC GCACCGCGCT GTCGGGCAAG
ATGCACTTCT GCGGCCCGGA CCAGTTGCAC GGCTATGAAG AACGCCTGAC CAGCGACATC
TACCCGGCCG ACTACGGTTG GGCGGTGAAC TGGGACGAAC CCGATGTGCG TCCAAGCTGG
TACCACAACA TGTCCTCGGT GCTGCAGGCG GGTCCGTGCG TGCGCACCAA TCAGCTGGAT
TTCGACGAGG AGGTGGTGTT CAAGGCGCGC CAGTACCTGT ACGACCATGT GCGCGAAAAC
GATGGCCGGC CATTTTGCCT GACCGTTTCG ATGACCCACC CGCATGACCC CTACACCATT
GCCAAACGCT ACTGGGACCG CTACGAAGGT GTGGATATCC CCATGCCCCG TGCCGAGTTC
AGCCAGGCAG AACTCGACCC GCATTCACAG CGCCTGCTGA AGGTCTACGA CCTTTGGAAC
AAGCCACTGC CTGTGGATAA GGTTCGCGAT GCCCGCCGCG CCTACTTCGG CGCGTGCAGC
TATATCGATG ACAACATCGG CCAATTGCTG CAGACCCTGG AGGAATGCAA CCTGGCCGAT
GACACACTGA TCGTGTTTTC CGGCGACCAC GGCGACATGC TTGGCGAGCG TGGCCTCTGG
TACAAAATGC ACTGGTTCGA AATGTCGGCG CGGGTGCCGC TGCTGATCCA CGCGCCGAAG
CGCTTCGCGG CGGGGCGGGT CACTGCCTCG GTGTCGACCT GCGACCTGTT GCCAACCTTG
GTCGAACTGG CTGGCGGCGC TGTGGATAAA GACCTGCAGC TGGACGGCCG CTCACTTCTG
GGCCATCTGC AAGGGCAGGG CGGTCACGAC GAGGTGATCG GCGAGTATAT GGCCGAAGGC
ACCGTCGGCC CGCTGATGAT GATTCGCCGC GGGCCCTACA AGTTCGTGTA CAGCGAAGAC
GACCCATGCC TACTCTATGA CCTGAGCCGC GACCCGCACG AGCGGGAGAA CCTCACCGGC
AGCCCGGACC ACCAGGTGCT GCTGCAGGCA TTTGTCGATG AAGCGCAACA GCGCTGGGAC
ATCACCAGCC TGCGCCAGCA GGTACTGGCC AGCCAGCGCC GCCGCCGGCT GGTGGCCGAA
GCGCTGGCCA TCGGCAAGCT GAAAAGCTGG GACCACCAAC CACTGGTGGA CGCCAGCCAA
CAGTACATGC GCAACCACAT CGATCTCGAT GACCTCGAAC GCAAGGCACG TTATCCACAG
CCCGCCCCCC TGGATTGA
 
Protein sequence
MTRPNILFIM ADQMAAPLLP IYGPSPIKMP HLGRLAEQAV VFDSAYCNSP LCAPSRFTLV 
SGQLPSRIGA YDNAADFPAD VPTYAHYLRR LGYRTALSGK MHFCGPDQLH GYEERLTSDI
YPADYGWAVN WDEPDVRPSW YHNMSSVLQA GPCVRTNQLD FDEEVVFKAR QYLYDHVREN
DGRPFCLTVS MTHPHDPYTI AKRYWDRYEG VDIPMPRAEF SQAELDPHSQ RLLKVYDLWN
KPLPVDKVRD ARRAYFGACS YIDDNIGQLL QTLEECNLAD DTLIVFSGDH GDMLGERGLW
YKMHWFEMSA RVPLLIHAPK RFAAGRVTAS VSTCDLLPTL VELAGGAVDK DLQLDGRSLL
GHLQGQGGHD EVIGEYMAEG TVGPLMMIRR GPYKFVYSED DPCLLYDLSR DPHERENLTG
SPDHQVLLQA FVDEAQQRWD ITSLRQQVLA SQRRRRLVAE ALAIGKLKSW DHQPLVDASQ
QYMRNHIDLD DLERKARYPQ PAPLD