Gene YpsIP31758_0931 details

Gene Information

Locus tag: YpsIP31758_0931
Symbol:
ID: 5385100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 1116167
End bp: 1117774
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 49%
IMG OID: 640863897
Product: sulfatase
Protein accession: YP_001399915
Protein GI: 153946814
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
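
The start, end, gene length and protein length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = cds_lengths(1116167, 1117774)
print(gene_length, protein_length)  # 1608 535
```

With the values from this record the sketch gives 1608 bp and 535 aa, matching the Gene Length and Protein Length fields.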


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTAC CCGCAGGAAA AAGAAGCCTG TTGGCAGGGA TGATCGCTGC CGCTGGTATG 
AGTATGACAC CTGTGACTCT GGCGGCACCG GCAGAAAAAC CCAATGTATT GCTGGTAATC
ATGGATGATC TGGGTACCGG GCAGTTAGAT TTCACCCTCA ATAATCTGGA TAAAAAAGCA
CTAAGCCAGC GCCCAGTTCC CGTGCGCTAT CAAGGCGATC TGGACAAGAT GATCGATGCG
GCACAGCGGG CGATGCCGAA TGTGTCTTTG TTGGCCAAAA ACGGGGTCAA AATGACCAAT
GCGTTTGTGG CGCATCCGGT ATGCGGGCCT TCGCGCGCGG GTATTTATAC CGGTCGCCAC
CCAACCAGTT TTGGTACTTA CAGTAATGAT GATGCCATGC AGGGGATCCC ACTGGATATT
AAACTGCTGC CCGCCTTGTT TCAGGAGCAT GGCTATGCAA CCGCAAATAT CGGGAAATGG
CACAACGCAC GCATAGAGAA AAAAGCGTTC GTCGCCGATG AGGTCAAAAG CCGCGATTAT
CACGACAACA TGATCTCCGT CAGCGCCCCC GGATATGCAC CTGAAAAACG GGGTTTTGAC
TATTCCTACA GTTATTACGC CTCAGGCGCG GCATTGTGGC ACTCTCCGGC CATCTGGCAA
AACAGTAAAA ATATTGCCGC CCCAGGCTAT CTGACCCATA ACCTGACGGA TGAAACGCTG
AAATTTATTG ATGACTCAGG GAAAAAACCG TTTTTCATCA GCCTGGCTTA CAGCGTGCCA
CATATTCCAT TAGAGCAAGC ATCACCCGCG AAATATATGG ATCGGTTTAA TACCGGCAAC
GTTGAAGCAG ATAAATATTT TGCTGCCATT AATGCCGCAG ACGAGGGGAT TGGTAGAATT
GTTCAGCACT TACAAGAAAA AGGTGAGCTG GATAACACAC TGATTTTCTT CATTTCGGAT
AACGGGGCGG TTCATGAATC CCCAATGCCA ATGAATGGCA TGGACCGTGG ACATAAAGGA
CAAATGTATA ACGGGGGGGT GCATATTCCC TTCGTCGCTT ACTGGCCAAA ACAGATCCCC
GCAGGTACGC AAAGTGATGC ATTGGTGAGT GCATTAGATA TTTTACCGAC GGCATTGAAA
GCCGCGGGTA TTGCCATCCC AGCGGAGATG AGAGTGGATG GTAAAGATAT TCTGCCGGTA
CTGGCAGGTA AGGAACAAAC CTCGCCGCAT CAATATATGT ACTGGGCTGG GCCGGGGGCA
AAGCATTACA GCGATGAGAA TCAGTCATTC TGGCATGACT ACTGGAAATG GATCACTTAC
GAACATCAAC AGGCGCCTAA AAATGATCAT GTAGAGACAT TATCGAAAGC CTCTTGGGCA
ATCCGCGATC AGGAGTGGGC GCTCTACTTC TATGATGACG GCACCAATAC GCCAAAATTA
TTTAATGATA AGCATGATCC CATGGAATCA AAGGATTTAG CTGATCAGTA CCCTGAGCGT
GTCAGTGCAA TGAAAGCGGC ATTCTATGAT TGGATCAAAG ATAAACCCAA ACCCGTGGCT
TGGGGGCAAG ATCGCTATCA GATCTTAGCA AGCTCCGCAA AAAGTTAA
 
Protein sequence
MKLPAGKRSL LAGMIAAAGM SMTPVTLAAP AEKPNVLLVI MDDLGTGQLD FTLNNLDKKA 
LSQRPVPVRY QGDLDKMIDA AQRAMPNVSL LAKNGVKMTN AFVAHPVCGP SRAGIYTGRH
PTSFGTYSND DAMQGIPLDI KLLPALFQEH GYATANIGKW HNARIEKKAF VADEVKSRDY
HDNMISVSAP GYAPEKRGFD YSYSYYASGA ALWHSPAIWQ NSKNIAAPGY LTHNLTDETL
KFIDDSGKKP FFISLAYSVP HIPLEQASPA KYMDRFNTGN VEADKYFAAI NAADEGIGRI
VQHLQEKGEL DNTLIFFISD NGAVHESPMP MNGMDRGHKG QMYNGGVHIP FVAYWPKQIP
AGTQSDALVS ALDILPTALK AAGIAIPAEM RVDGKDILPV LAGKEQTSPH QYMYWAGPGA
KHYSDENQSF WHDYWKWITY EHQQAPKNDH VETLSKASWA IRDQEWALYF YDDGTNTPKL
FNDKHDPMES KDLADQYPER VSAMKAAFYD WIKDKPKPVA WGQDRYQILA SSAKS
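
The gene and protein sequences above are printed in blocks of ten characters, so they must be stripped of whitespace before use. The sketch below shows one way to do that, translate the DNA and compute GC content. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in permitted start codons, which is not modelled here). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; 0.0 for an empty sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First 60 bases of the gene sequence listed above.
first_block = "ATGAAGTTAC CCGCAGGAAA AAGAAGCCTG TTGGCAGGGA TGATCGCTGC CGCTGGTATG"
print(translate(first_block))  # MKLPAGKRSLLAGMIAAAGM
```

The translated first block matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.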