Gene Psyr_0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPsyr_0029 
Symbol 
ID3365504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePseudomonas syringae pv. syringae B728a 
KingdomBacteria 
Replicon accessionNC_007005 
Strand
Start bp32731 
End bp34236 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content60% 
IMG OID637650372 
Productsulfatase 
Protein accessionYP_233141 
Protein GI66043300 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID[TIGR03417] choline-sulfatase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCA AAAATATTCT GTTCATCATG GCCGATCAAA TGGCCGCGCC AATGTTGCCG 
TTTTACGCCC GGTCCCCGAT TCTGATGCCG AACCTGAGCC GTCTCGCCGC CGACGGCGTA
GTGTTCGATT CGGCGTACTG CAACAGCCCG CTATGCGCGC CGTCGCGCTT CACCCTGGTC
AGCGGCCAGT TGCCAAGCAG GATCGGCGCG TACGACAACG CAGCCGACTT CCCGGCCGAT
ATTCCGACCT ATGCCCACTA CCTGCGTGCG CTGGGCTACA AGACCGCCCT CGCAGGCAAG
ATGCATTTCT GCGGACCGGA TCAACTGCAC GGCTACGAAG AACGCCTGAC CAGTGACATC
TACCCGGCCG ATTACGGCTG GTCGGTCAAC TGGGACGAGC CGGACGTGCG GCCAAGCTGG
TATCACAACA TGTCTTCGGT GTTGCAGGCC GGTCCGTGCA TTCGCACCAA CCAGCTGGAC
TTCGATGAAG AGGTGCTGTT CAAGTCCCAG CAATACCTCT ACGACCATGT GCGCCAGGAC
GGTGATGCGC CGTTCTGCCT GACCGTTTCA ATGACTCACC CTCATGACCC GTACACCATC
CCTCTGCCGT TCTGGGACCT GTACGCCGAC GACGAAATAC CGATGCCCAC GCCGCACGCC
AACCAGGCCG CGCTGGACCC GCATTCACAA CGGCTGCTCA AGGTCTATGA CCTGTGGGAC
AAGCCCATGC CGACCGACAA GATTCGTGAT GCCCGCCGCG CCTACTTCGG TGCGTGCAGC
TACATTGACC TCAACGTCGG CAAACTGATG CAGACGCTCG ATGAAGTCGG GTTGGCGGAA
GACACCATCG TGGTGTTCTC CGGCGATCAC GGCGACATGC TGGGCGAGAA GGGCCTCTGG
TACAAAATGC ACTGGTTCGA AATGGCCGCC CGCGTGCCGC TGGTGGTGTA CGCGCCGGGC
CAGTTCAAAC CGGGACGGGT CAGTGCGTCG GTGTCGACTG CCGACTTGCT GCCGACCTTC
GTCGAAATGG CCAAAGGCAA ACTGGACGCC GGTTTGCCAA TGGACGGACG TTCGCTGATG
CCGCATTTGA AACGCAAGGG CGGGCACGAT GAGGTGTTCG GTGAGTACAT GGCCGAAGGC
ACCACCAGCC CGCTGATGAT GATCCGTCGT GGCGCGTACA AATTCATCTA TTCGGAGCAG
GACCCGTGCC TGCTGTTCGA TGTGAAAAAG GACCCGAAAG AGCAGAAGGA CCTGAGCCAG
TCACCCACCC ATGAAAAGCT GTTCAACGAT TTCCTGGCCG AAGCTCGGGC CAAGTGGGAC
ATCCCGGCGA TACACCAACA GGTGCTCGCC AGCCAGCGCA GAAGGCGCTT CGTCGCCAAA
TCGCTGGCAA CCGGCAAGCT GAAGAGTTGG GATCACCAGC CACTGGTCGA CGCCAGTCAG
CAGTACATGC GCAACCACAT TGATCTGGAC GATCTGGAGC GCAAGGCACG TTTTCCGCAA
CCTTGA
 
Protein sequence
MKRKNILFIM ADQMAAPMLP FYARSPILMP NLSRLAADGV VFDSAYCNSP LCAPSRFTLV 
SGQLPSRIGA YDNAADFPAD IPTYAHYLRA LGYKTALAGK MHFCGPDQLH GYEERLTSDI
YPADYGWSVN WDEPDVRPSW YHNMSSVLQA GPCIRTNQLD FDEEVLFKSQ QYLYDHVRQD
GDAPFCLTVS MTHPHDPYTI PLPFWDLYAD DEIPMPTPHA NQAALDPHSQ RLLKVYDLWD
KPMPTDKIRD ARRAYFGACS YIDLNVGKLM QTLDEVGLAE DTIVVFSGDH GDMLGEKGLW
YKMHWFEMAA RVPLVVYAPG QFKPGRVSAS VSTADLLPTF VEMAKGKLDA GLPMDGRSLM
PHLKRKGGHD EVFGEYMAEG TTSPLMMIRR GAYKFIYSEQ DPCLLFDVKK DPKEQKDLSQ
SPTHEKLFND FLAEARAKWD IPAIHQQVLA SQRRRRFVAK SLATGKLKSW DHQPLVDASQ
QYMRNHIDLD DLERKARFPQ P