Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0945 |
Symbol | |
ID | 5386840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 1129529 |
End bp | 1131082 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640863911 |
Product | sulfatase |
Protein accession | YP_001399929 |
Protein GI | 153946894 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTAA CAAGACGCAA TTTATTAAAA GGAATCGCAG TTTCTGGCGC ATTGGGGGCA ACCGCGGCCG CGACGGGGGT GCTCAGTACG GCTCAGGCCG TAGTACCTGC GAAAAAAACC GGTAAACAAC CAAATCTACT GATTATCTTC CCGGATGAAA TGCGTACCCA ATCTTTGGGA TTTATGGGAC AAGATCCCTC AATTACCCCT TTCATTAACC AATTCGCCAG TCAAAGTGTG GTATTGAAAC AAGCGGTGTC TAATTATCCA CTGTGTACCC CTTTCCGTGG CATGCTCATG ACCGGACAAT ACCCTTATCG CAATGGCTTA CAAGGCAACT GCCACACGGG GGCTGATGGC AATTTTGGGG GTAAAGATTT TGGTATTGAA CTTAAAAAAG AGTCGGTTAC GTGGTCTGAT ATCCTGAAAA AACAGGGATA CAGCATGGGC TACATTGGCA AATGGCATCT TGATGCCCCA GAAGCCCCCT TTGTGCCAAG CTATAACAAC CCAATGGAAG GGCGTTACTG GAATGATTGG ACACCACCTG AAAAACGCCA CGGCTTCGAT TTTTGGTACA GTTACGGGAC TTATGATTTA CATCTCAATC CTATGTATTG GACCAACGAC ACGCCGCGTG ATAAGCCCCT GAAAATTAAC CAGTGGAGCC CGGAGCATGA GGCCGATATC GCCATTAAGT ATCTGCGCAA TGAGGGGGGG AAATACCGAG ATAACGATCA ACCCTTCGCG TTAGTGGTCT CCATGAACCC ACCGCATTCA CCGTATGATC AAGTGCCACA AAAATACCTG GATCGTTTTA AAGACCACAC CTCAGAATCT CTGAATACCC GCCCGAATGT AGTATGGGAT AAAGCCTATC AGGACGGTTA CGGGCCAAAA TACTTTAAAG AGTATATGGC GATGGTGAAC GGTGTCGATG AGCAATTTGG CCGTATTGTG GCGGAGCTGG ATCGCCTAAA TCTGGATAAA GATACCCTAG TGGTCTTCTT CTCTGATCAT GGTTGCTGTA TGGGATCTAA CGGTCAGCCA ACCAAGAACG TGCATTACGA AGAATCGATG CGTATTCCTA TGATGTTCCG CTGGCCTGGA AAACTGCCGG TGCGGGAAGA TGAGTTGCTG TTCTCCGCCC CCGATATTTA TCCGACGCTA CTGGGTCTGA TGGGCATGAG CGAGCACATC CCGGATCAGA TCGAAGGCAC GGATTTCTCT AATACGGTTG CCGGGCGTCC GGGAGATAAA CGTCCTACCT CGCAGTTGTA TACTTTTATG CCTTATGGCG GACAATCTTA TGGTCAGCGC GGGGTGCGCA CAGACCGTTA TACGTTAGTC ATTGATCGTA AAGTTGGCAA GCCACTCACT TATACTCTGC ATGACAACAA AAATGATCCT TACCAGATGA AAAATATTGC TGCAGAAAAT ATGGCGCTGG TGAATCAATT GATTGCTGAC GAGTTAATCC CTTGGCTGGA GCACTCTGGT GATGTCTGGC GGCCAACAGA AGTGGCGGCC AATGCGGCCA AGGCTTATCT TTAA
|
Protein sequence | MSLTRRNLLK GIAVSGALGA TAAATGVLST AQAVVPAKKT GKQPNLLIIF PDEMRTQSLG FMGQDPSITP FINQFASQSV VLKQAVSNYP LCTPFRGMLM TGQYPYRNGL QGNCHTGADG NFGGKDFGIE LKKESVTWSD ILKKQGYSMG YIGKWHLDAP EAPFVPSYNN PMEGRYWNDW TPPEKRHGFD FWYSYGTYDL HLNPMYWTND TPRDKPLKIN QWSPEHEADI AIKYLRNEGG KYRDNDQPFA LVVSMNPPHS PYDQVPQKYL DRFKDHTSES LNTRPNVVWD KAYQDGYGPK YFKEYMAMVN GVDEQFGRIV AELDRLNLDK DTLVVFFSDH GCCMGSNGQP TKNVHYEESM RIPMMFRWPG KLPVREDELL FSAPDIYPTL LGLMGMSEHI PDQIEGTDFS NTVAGRPGDK RPTSQLYTFM PYGGQSYGQR GVRTDRYTLV IDRKVGKPLT YTLHDNKNDP YQMKNIAAEN MALVNQLIAD ELIPWLEHSG DVWRPTEVAA NAAKAYL
|
| |