Gene YPK_0984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0984 
Symbol 
ID6087159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp1104972 
End bp1106579 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content49% 
IMG OID641596047 
Productsulfatase 
Protein accessionYP_001719738 
Protein GI170023233 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTAC CCGCAGGAAA AAGAAGCCTG TTGGCAGGGA TGATCGCTGC CGCTGGTATG 
AGTATGACAC CTGTGACTCT GGCGGCACCG GCAGAAAAAC CCAATGTATT GCTGGTAATC
ATGGATGATC TGGGTACCGG GCAGTTAGAT TTCACCCTCA ATAATCTGGA TAAAAAAGCA
CTAAGCCAGC GCCCAGTTCC CGTGCGCTAT CAAGGCGATC TGGACAAGAT GATCGATGCG
GCACAGCGGG CGATGCCGAA TGTGTCTTTG TTGGCCAAAA ACGGGGTCAA AATGACCAAT
GCGTTTGTGG CGCATCCGGT ATGCGGGCCT TCGCGCGCGG GTATTTATAC CGGTCGCCAC
CCAACCAGTT TTGGTACTTA CAGTAATGAT GATGCCATGC AGGGGATCCC ACTGGATATT
AAACTGCTGC CCGCCTTGTT TCAGGAGCAT GGCTATGCAA CCGCAAATAT CGGGAAATGG
CACAACGCAC GCATAGAGAA AAAAGCGTTC GTCGCCGATG AGGTCAAAAG CCGCGATTAT
CACGACAACA TGATCTCCGT CAGCGCCCCC GGATATGCAC CTGAAAAACG GGGTTTTGAC
TATTCCTACA GTTATTACGC CTCAGGCGCG GCATTGTGGC ACTCTCCGGC CATCTGGCAA
AACAGTAAAA ATATTGCCGC CCCAGGCTAT CTGACCCATA ACCTGACGGA TGAAACGCTG
AAATTTATTG ATGACTCAGG GAAAAAACCG TTTTTCATCA GCCTGGCTTA CAGCGTGCCA
CATATTCCAT TAGAGCAAGC ATCACCCGCG AAATATATGG ATCGGTTTAA TACCGGCAAC
GTTGAAGCAG ATAAATATTT TGCTGCCATT AATGCCGCAG ACGAGGGGAT TGGTAGAATT
GTTCAGCACT TACAAGAAAA AGGTGAGCTG GATAACACAC TGATTTTCTT CATTTCGGAT
AACGGGGCGG TTCATGAATC CCCAATGCCA ATGAATGGCA TGGACCGTGG ACATAAAGGA
CAAATGTATA ACGGGGGGGT GCATATTCCC TTCGTCGCTT ACTGGCCAAA ACAGATCCCC
GCAGGTACGC AAAGTGATGC ATTGGTGAGT GCATTAGATA TTTTACCGAC GGCATTGAAA
GCCGCGGGTA TTGCCATCCC AGCGGAGATG AGAGTGGATG GTAAAGATAT TCTGCCGGTA
CTGGCAGGTA AGGAACAAAC CTCGCCGCAT CAATATATGT ACTGGGCTGG GCCGGGGGCA
AAGCATTACA GCGATGAGAA TCAGTCATTC TGGCATGACT ACTGGAAATG GATCACTTAC
GAACATCAAC AGGCGCCTAA AAATGATCAT GTAGAGACAT TATCGAAAGC CTCTTGGGCA
ATCCGCGATC AGGAGTGGGC GCTCTACTTC TATGATGACG GCACCAATAC GCCAAAATTA
TTTAATGATA AGCATGACCC TCTGGAATCA AAGGATTTAG CTGATCAGTA CCCTGAGCGT
GTCAGTGCAA TGAAAGCGGC ATTCTATGAT TGGATCAAAG ATAAACCCAA ACCCGTGGCT
TGGGGGCAAG ATCGCTATCA GATCTTAGCA AGCTCCGCGA AAAGTTAA
 
Protein sequence
MKLPAGKRSL LAGMIAAAGM SMTPVTLAAP AEKPNVLLVI MDDLGTGQLD FTLNNLDKKA 
LSQRPVPVRY QGDLDKMIDA AQRAMPNVSL LAKNGVKMTN AFVAHPVCGP SRAGIYTGRH
PTSFGTYSND DAMQGIPLDI KLLPALFQEH GYATANIGKW HNARIEKKAF VADEVKSRDY
HDNMISVSAP GYAPEKRGFD YSYSYYASGA ALWHSPAIWQ NSKNIAAPGY LTHNLTDETL
KFIDDSGKKP FFISLAYSVP HIPLEQASPA KYMDRFNTGN VEADKYFAAI NAADEGIGRI
VQHLQEKGEL DNTLIFFISD NGAVHESPMP MNGMDRGHKG QMYNGGVHIP FVAYWPKQIP
AGTQSDALVS ALDILPTALK AAGIAIPAEM RVDGKDILPV LAGKEQTSPH QYMYWAGPGA
KHYSDENQSF WHDYWKWITY EHQQAPKNDH VETLSKASWA IRDQEWALYF YDDGTNTPKL
FNDKHDPLES KDLADQYPER VSAMKAAFYD WIKDKPKPVA WGQDRYQILA SSAKS