Gene PSPTO_0165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPSPTO_0165 
Symbol 
ID1181773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas syringae pv. tomato str. DC3000 
KingdomBacteria 
Replicon accessionNC_004578 
Strand
Start bp184922 
End bp186427 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content60% 
IMG OID637391542 
Productsulfatase family protein 
Protein accessionNP_790024 
Protein GI28867405 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCGCA AAAATATTCT GTTCATCATG GCCGATCAAA TGGCCGCGCC AATGTTGCCG 
TTCTACGCCC CGTCTCCCAT CCTGATGCCC AACCTGAGCC GCCTTGCTGC CGACGGCGTG
GTGTTCGACT CGGCGTATTG CAACAGCCCG CTGTGCGCGC CTTCGCGCTT TACTCTGGTG
AGCGGTCAGC TCCCGAGCAG GATCGGCGGG TACGACAACG CGGCGGACTT CCCGGCAGAC
GTTCCGACCT ACGCGCACTA CCTGCGTGCG CTGGGTTACA AGACCGCGCT GTCGGGCAAG
ATGCATTTTT GCGGGCCGGA TCAGTTGCAC GGTTACGAAG AGCGCCTGAC CAGTGACATC
TACCCGGCCG ACTACGGCTG GTCGGTCAAT TGGGACGAGC CGGACGTACG CCCGAGCTGG
TATCACAACA TGTCATCGGT ATTGCAGGCC GGCCCGTGCG TGCGCACCAA CCAGCTGGAT
TTCGATGAAG AGGTGCTGTT CAAGGCTCAG CAGTACCTCT ACGACCATGT GCGTCAGGAC
GGTGATGCGC CGTTCTGCCT GACCGTTTCC ATGACTCACC CTCACGACCC GTACACCATC
CCGCGTCCGT TCTGGGACCT GTACAGCGAC GACGAAATCC CGATGCCAAC GCCGCACGCC
AATCAGGCCG CGCTGGACCC GCACTCACAA CGGCTGCTCA AGGTGTATGA CCTGTGGGAC
AAGCCGATGC CGACAAACAA GATTCGTGAT GCGCGCCGTG CCTATTTCGG CGCGTGCAGC
TACATCGACC TGAACGTCGG CAAGCTGATG CAGACGCTTG ATGAGGTCGG GCTGGCGGAC
GACACCATCG TGGTGTTCTC TGGCGATCAC GGCGACATGC TGGGCGAGAA GGGTCTCTGG
TACAAAATGC ACTGGTTCGA AATGGCCGCT CGCGTGCCGC TGGTGGTGTA CGCGCCGGGG
CAGTTCAAGC CGGGGCGGGT CAGTGCGTCG GTGTCGACGG CCGACCTGTT ACCGACCTTT
GTCGAAATGG CCAAGGGCAC ACTGGACGCC GGCTTGCCGC TGGACGGGCG CTCGCTGATG
CCGCACCTGA AACGCAAAGG CGGGCACGAT GAGGTGTTTG GCGAATACAT GGCCGAAGGC
ACGACCAGCC CGCTGATGAT GATCCGTCGC GGTGCGTACA AATTCATCTA TTCGGAACAG
GACCCGTGCC TGTTGTTCGA TGTGAAGAAA GACCCGAAAG AGCAGAAAGA CCTGAGCCAG
TCGCCAGCCC ATGAAAAGCT GTTCAATGAT TTTCTGGCCG AAGCTCGGGC CAAGTGGGAC
ATACCGGCGA TACACCAACA GGTGCTCGCC AGCCAGCGCA GAAGGCGCTT TGTCGCCAAA
TCGCTGGCAA CCGGCAAGCT GAAGAGTTGG GATCACCAGC CACTGGTCGA CGCCAGTCAG
CAGTACATGC GCAACCACAT TGATCTCGAC GATCTGGAGC GCAAGGCACG TTTTCCGCAA
CCTTGA
 
Protein sequence
MKRKNILFIM ADQMAAPMLP FYAPSPILMP NLSRLAADGV VFDSAYCNSP LCAPSRFTLV 
SGQLPSRIGG YDNAADFPAD VPTYAHYLRA LGYKTALSGK MHFCGPDQLH GYEERLTSDI
YPADYGWSVN WDEPDVRPSW YHNMSSVLQA GPCVRTNQLD FDEEVLFKAQ QYLYDHVRQD
GDAPFCLTVS MTHPHDPYTI PRPFWDLYSD DEIPMPTPHA NQAALDPHSQ RLLKVYDLWD
KPMPTNKIRD ARRAYFGACS YIDLNVGKLM QTLDEVGLAD DTIVVFSGDH GDMLGEKGLW
YKMHWFEMAA RVPLVVYAPG QFKPGRVSAS VSTADLLPTF VEMAKGTLDA GLPLDGRSLM
PHLKRKGGHD EVFGEYMAEG TTSPLMMIRR GAYKFIYSEQ DPCLLFDVKK DPKEQKDLSQ
SPAHEKLFND FLAEARAKWD IPAIHQQVLA SQRRRRFVAK SLATGKLKSW DHQPLVDASQ
QYMRNHIDLD DLERKARFPQ P