Gene Pput_0093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPput_0093 
Symbol 
ID5195170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas putida F1 
KingdomBacteria 
Replicon accessionNC_009512 
Strand
Start bp99396 
End bp100913 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content63% 
IMG OID640584534 
Productsulfatase 
Protein accessionYP_001265452 
Protein GI148545350 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGTC CGAATATCCT GTTCATCATG GCCGACCAGA TGGCCGCGCC CTTGCTGCCC 
ATCTACGGCC CTTCGCCCAT CCAGATGCCG CACCTCGGCC GCCTGGCCGA GCAAGCCGTG
GTGTTCGACT CGGCCTACTG CAACAGCCCA CTGTGCGCGC CATCACGCTT CACCCTGGTC
AGCGGTCAGT TGCCCAGCCG CATTGGCGCC TACGATAACG CGGCCGACTT CCCTGCCGAT
GTACCGACCT ACGCCCATTA CCTGCGTCGC CTGGGCTACC GCACCGCGCT GTCTGGCAAG
ATGCACTTCT GCGGCCCGGA CCAGTTGCAC GGCTACGAAG AACGCCTGAC CAGCGACATC
TACCCGGCCG ACTACGGTTG GGCGGTGAAC TGGGACGAAC CCGATGTGCG TCCAAGCTGG
TACCACAACA TGTCCTCGGT GCTGCAGGCC GGTCCGTGCG TGCGCACCAA TCAGCTGGAT
TTCGACGAGG AGGTAGTGTT CAAGGCGCGC CAGTACCTGT ACGACCATGT GCGCGAAAAC
GATGGCCGGC CATTTTGCCT GACCGTTTCG ATGACGCACC CGCACGACCC CTACACCATT
GCCAAACGCT ACTGGGACCG CTACGAAGGT GTGGATATCC CCATGCCCCG TGCCGAGTTC
AGCCAGGCAG CACTCGACCC GCACTCCCAG CGCCTGCTGA AAGTCTACGA CCTGTGGAAC
AAGCCGCTGC CTGTGGACAA GGTCCGCGAC GCCCGCCGCG CCTACTTCGG CGCTTGCAGC
TATATCGATG ACAATATTGG CCAATTGCTG CAGACCCTGG AGGAATGCAA CCTGGCCGAT
GACACACTGA TCGTGTTTTC CGGCGACCAC GGCGACATGC TTGGCGAGCG TGGCCTCTGG
TACAAAATGC ATTGGTTCGA AATGTCGGCG CGGGTGCCGC TGCTGATCCA TGCGCCGAAG
CGCTTCGCAG CGGGCCGGGT CACTGCCTCG GTGTCGACCT GCGACCTGTT GCCAACCTTG
GTCGAACTGG CTGGCGGCGC TGTGGATAAA GACCTGCAGC TGGACGGCCG CTCACTGCTG
GGCCACCTGC AAGGGCAGGG CGGTCACGAC GAGGTGATCG GCGAGTATAT GGCCGAAGGC
ACCGTCGGCC CGCTGATGAT GATCCGCCGC GGGCCCTACA AGTTCGTGTA CAGCGAAGAC
GACCCATGCC TACTCTATGA CCTGAGCCGC GACCCGCACG AGCGGGAGAA CCTCACCGGC
AGCCCGGACC ACCAGGTGCT GCTGCAGGCA TTTGTCGATG AAGCACAACA GCGCTGGGAC
ATCCCCAGCC TGCGCCAGCA AGTGCTGGCC AGCCAGCGCC GCCGCCGGCT GGTGGCCGAA
GCGCTGGCCA TCGGCAAGCT GAAAAGCTGG GATCACCAAC CACTGGTGGA CGCCAGCCAA
CAGTACATGC GCAACCACAT CGATCTCGAT GACCTCGAGC GCAAGGCACG TTATCCACAG
CCCGCCCCCC TGGATTGA
 
Protein sequence
MTRPNILFIM ADQMAAPLLP IYGPSPIQMP HLGRLAEQAV VFDSAYCNSP LCAPSRFTLV 
SGQLPSRIGA YDNAADFPAD VPTYAHYLRR LGYRTALSGK MHFCGPDQLH GYEERLTSDI
YPADYGWAVN WDEPDVRPSW YHNMSSVLQA GPCVRTNQLD FDEEVVFKAR QYLYDHVREN
DGRPFCLTVS MTHPHDPYTI AKRYWDRYEG VDIPMPRAEF SQAALDPHSQ RLLKVYDLWN
KPLPVDKVRD ARRAYFGACS YIDDNIGQLL QTLEECNLAD DTLIVFSGDH GDMLGERGLW
YKMHWFEMSA RVPLLIHAPK RFAAGRVTAS VSTCDLLPTL VELAGGAVDK DLQLDGRSLL
GHLQGQGGHD EVIGEYMAEG TVGPLMMIRR GPYKFVYSED DPCLLYDLSR DPHERENLTG
SPDHQVLLQA FVDEAQQRWD IPSLRQQVLA SQRRRRLVAE ALAIGKLKSW DHQPLVDASQ
QYMRNHIDLD DLERKARYPQ PAPLD