Gene PputGB1_0093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPputGB1_0093 
Symbol 
ID5867781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas putida GB-1 
KingdomBacteria 
Replicon accessionNC_010322 
Strand
Start bp100650 
End bp102167 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content64% 
IMG OID641545166 
Productcholine-sulfatase 
Protein accessionYP_001666345 
Protein GI167031114 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000895462 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCGCC CGAATATCCT GTTCATCATG GCCGACCAGA TGGCCGCACC CTTGCTGCCG 
ATCTACGCCC CTTCGCCCAT CCAGATGCCG CACCTGAGCC GCCTGGCCGA GCAGGCGGTG
GTGTTCGAGT CGGCCTACTG CAACAGCCCG CTGTGCGCAC CGTCCCGCTT TACCCTGGTC
AGCGGCCAAC TGCCTAGCCG CATCGGAGCC TACGACAATG CCGCCGATTT CCCTGCCGAT
GTGCCGACCT ATGCCCACTA CCTGCGCCGC CTGGGTTACC GCACCGCACT GTCAGGCAAG
ATGCACTTCT GCGGCCCGGA CCAACTGCAT GGCTACGAAG AGCGCCTGAC CAGCGATATT
TACCCAGCCG ACTATGGCTG GGCAGTGAAC TGGGATGAGC CGGATGTTCG CCCGAGCTGG
TACCACAACA TGTCCTCGGT GCTGCAGGCC GGTCCGTGCG TGCGCACCAA CCAGCTGGAT
TTCGACGAAG AAGTGGTGTT CAAGGCACGC CAGTACCTGT ACGACCACGT GCGTGATAAC
GATGGCCGAC CGTTCTGCCT GACCGTGTCC ATGACCCACC CGCACGACCC CTACACCATC
CCCAAGCGTT ACTGGGACCG CTACGAGGGT GTGGATATCC CCATGCCCCG TGCCGAGTTC
GGTCAGGCAG AACTCGACCC GCATTCGCAG CGCCTGCTGA AGGTCTATGA CCTGTGGAAC
AAGCCGCTGC CTATGGAAAA GATCCGCGAC GCCCGCCGCG CCTACTTCGG CGCTTGCAGC
TACATCGACG ACAACATCGG CCAGCTGCTG CAAACCCTGG AGGAGTGCAA CCTCGCCGAC
GACACCCTGA TCGTGTTCTC CGGCGACCAC GGCGACATGC TTGGCGAGCG AGGCCTCTGG
TACAAGATGC ACTGGTTCGA GATGTCGGCG CGGGTTCCGC TGCTTGTCCA TGCGCCCAAG
CGCTTTGCAG CAGGCCGGGT CAGCGCCTCG GTATCGACCT GCGACCTGCT GCCAACCCTG
GTCGAACTGG CCGGCGGGGC TGTGGATAAA AGCCTGCACC TGGACGGCCG CTCGCTTGTC
GGCCATCTGC AAGGGCAGGG CGGTCACGAT GAAGTGATCG GCGAATACAT GGCCGAAGGC
ACCGTCGGCC CGCTGATGAT GATCCGCCGC GGGCCGTACA AGTTCGTGTA CAGCGAGGAT
GACCCCAGCC TACTCTATGA CCTGAGCCGC GACCCGCACG AGCGGGAGAA CCTCACCGGC
AGCCCGGAGC ATCAGGCGCT GCTGCAGGCA TTTGTCGATG AAGCACAACA GCGCTGGGAT
ATCCCCAGCC TGCGCCAGCA GGTACTGGCC AGCCAGCGGC GCCGCCGCCT GGTGGCCGAG
GCGCTGGCCA TCGGCACGCT GAAAAGCTGG GACCATCAAC CGCTGGTGGA CGCCAGCCAA
CAGTACATGC GCAACCACAT CGATCTCGAC GACCTCGAGC GCAAGGCACG TTATCCACAG
CCCGCCCCCC TGGATTGA
 
Protein sequence
MTRPNILFIM ADQMAAPLLP IYAPSPIQMP HLSRLAEQAV VFESAYCNSP LCAPSRFTLV 
SGQLPSRIGA YDNAADFPAD VPTYAHYLRR LGYRTALSGK MHFCGPDQLH GYEERLTSDI
YPADYGWAVN WDEPDVRPSW YHNMSSVLQA GPCVRTNQLD FDEEVVFKAR QYLYDHVRDN
DGRPFCLTVS MTHPHDPYTI PKRYWDRYEG VDIPMPRAEF GQAELDPHSQ RLLKVYDLWN
KPLPMEKIRD ARRAYFGACS YIDDNIGQLL QTLEECNLAD DTLIVFSGDH GDMLGERGLW
YKMHWFEMSA RVPLLVHAPK RFAAGRVSAS VSTCDLLPTL VELAGGAVDK SLHLDGRSLV
GHLQGQGGHD EVIGEYMAEG TVGPLMMIRR GPYKFVYSED DPSLLYDLSR DPHERENLTG
SPEHQALLQA FVDEAQQRWD IPSLRQQVLA SQRRRRLVAE ALAIGTLKSW DHQPLVDASQ
QYMRNHIDLD DLERKARYPQ PAPLD