Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_0984 |
Symbol | |
ID | 6087159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | - |
Start bp | 1104972 |
End bp | 1106579 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641596047 |
Product | sulfatase |
Protein accession | YP_001719738 |
Protein GI | 170023233 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTAC CCGCAGGAAA AAGAAGCCTG TTGGCAGGGA TGATCGCTGC CGCTGGTATG AGTATGACAC CTGTGACTCT GGCGGCACCG GCAGAAAAAC CCAATGTATT GCTGGTAATC ATGGATGATC TGGGTACCGG GCAGTTAGAT TTCACCCTCA ATAATCTGGA TAAAAAAGCA CTAAGCCAGC GCCCAGTTCC CGTGCGCTAT CAAGGCGATC TGGACAAGAT GATCGATGCG GCACAGCGGG CGATGCCGAA TGTGTCTTTG TTGGCCAAAA ACGGGGTCAA AATGACCAAT GCGTTTGTGG CGCATCCGGT ATGCGGGCCT TCGCGCGCGG GTATTTATAC CGGTCGCCAC CCAACCAGTT TTGGTACTTA CAGTAATGAT GATGCCATGC AGGGGATCCC ACTGGATATT AAACTGCTGC CCGCCTTGTT TCAGGAGCAT GGCTATGCAA CCGCAAATAT CGGGAAATGG CACAACGCAC GCATAGAGAA AAAAGCGTTC GTCGCCGATG AGGTCAAAAG CCGCGATTAT CACGACAACA TGATCTCCGT CAGCGCCCCC GGATATGCAC CTGAAAAACG GGGTTTTGAC TATTCCTACA GTTATTACGC CTCAGGCGCG GCATTGTGGC ACTCTCCGGC CATCTGGCAA AACAGTAAAA ATATTGCCGC CCCAGGCTAT CTGACCCATA ACCTGACGGA TGAAACGCTG AAATTTATTG ATGACTCAGG GAAAAAACCG TTTTTCATCA GCCTGGCTTA CAGCGTGCCA CATATTCCAT TAGAGCAAGC ATCACCCGCG AAATATATGG ATCGGTTTAA TACCGGCAAC GTTGAAGCAG ATAAATATTT TGCTGCCATT AATGCCGCAG ACGAGGGGAT TGGTAGAATT GTTCAGCACT TACAAGAAAA AGGTGAGCTG GATAACACAC TGATTTTCTT CATTTCGGAT AACGGGGCGG TTCATGAATC CCCAATGCCA ATGAATGGCA TGGACCGTGG ACATAAAGGA CAAATGTATA ACGGGGGGGT GCATATTCCC TTCGTCGCTT ACTGGCCAAA ACAGATCCCC GCAGGTACGC AAAGTGATGC ATTGGTGAGT GCATTAGATA TTTTACCGAC GGCATTGAAA GCCGCGGGTA TTGCCATCCC AGCGGAGATG AGAGTGGATG GTAAAGATAT TCTGCCGGTA CTGGCAGGTA AGGAACAAAC CTCGCCGCAT CAATATATGT ACTGGGCTGG GCCGGGGGCA AAGCATTACA GCGATGAGAA TCAGTCATTC TGGCATGACT ACTGGAAATG GATCACTTAC GAACATCAAC AGGCGCCTAA AAATGATCAT GTAGAGACAT TATCGAAAGC CTCTTGGGCA ATCCGCGATC AGGAGTGGGC GCTCTACTTC TATGATGACG GCACCAATAC GCCAAAATTA TTTAATGATA AGCATGACCC TCTGGAATCA AAGGATTTAG CTGATCAGTA CCCTGAGCGT GTCAGTGCAA TGAAAGCGGC ATTCTATGAT TGGATCAAAG ATAAACCCAA ACCCGTGGCT TGGGGGCAAG ATCGCTATCA GATCTTAGCA AGCTCCGCGA AAAGTTAA
|
Protein sequence | MKLPAGKRSL LAGMIAAAGM SMTPVTLAAP AEKPNVLLVI MDDLGTGQLD FTLNNLDKKA LSQRPVPVRY QGDLDKMIDA AQRAMPNVSL LAKNGVKMTN AFVAHPVCGP SRAGIYTGRH PTSFGTYSND DAMQGIPLDI KLLPALFQEH GYATANIGKW HNARIEKKAF VADEVKSRDY HDNMISVSAP GYAPEKRGFD YSYSYYASGA ALWHSPAIWQ NSKNIAAPGY LTHNLTDETL KFIDDSGKKP FFISLAYSVP HIPLEQASPA KYMDRFNTGN VEADKYFAAI NAADEGIGRI VQHLQEKGEL DNTLIFFISD NGAVHESPMP MNGMDRGHKG QMYNGGVHIP FVAYWPKQIP AGTQSDALVS ALDILPTALK AAGIAIPAEM RVDGKDILPV LAGKEQTSPH QYMYWAGPGA KHYSDENQSF WHDYWKWITY EHQQAPKNDH VETLSKASWA IRDQEWALYF YDDGTNTPKL FNDKHDPLES KDLADQYPER VSAMKAAFYD WIKDKPKPVA WGQDRYQILA SSAKS
|
| |