Gene YPK_0996 details


Gene Information

Locus tag: YPK_0996
Symbol:
ID: 6089598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 1118147
End bp: 1119700
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 48%
IMG OID: 641596059
Product: sulfatase
Protein accession: YP_001719750
Protein GI: 170023245
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
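
The coordinate and length fields above are consistent with one another, which a short check can illustrate. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
gene_length = span_length(1118147, 1119700)
print(gene_length)                           # 1554
print(expected_protein_length(gene_length))  # 517
```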


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.884308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCTAA CAAGACGCAA TTTATTAAAA GGAATCGCAG TTTCTGGCGC ATTGGGGGCA 
ACCGCGGCCG CGACGGGGGT GCTCAGTACG GCTCAGGCCG TAGTACCTGC GAAAAAAACC
GGTAAACAAC CAAATCTACT GATTATCTTC CCGGATGAAA TGCGTACCCA ATCTTTGGGA
TTTATGGGAC AAGATCCCTC AATTACCCCT TTCATTAACC AATTCGCCAG TCAAAGTGTG
GTATTGAAAC AAGCGGTGTC TAATTATCCA CTGTGTACCC CTTTCCGTGG CATGCTCATG
ACCGGACAAT ACCCTTATCG CAATGGCTTA CAAGGCAACT GCCACACGGG GGCTGATGGC
AATTTTGGGG GTAAAGATTT TGGTATTGAA CTTAAAAAAG AGTCGGTGAC GTGGTCTGAT
ATCCTGAAAA AACAGGGATA CAGCATGGGC TACATTGGCA AATGGCATCT TGATGCCCCA
GAAGCCCCCT TTGTGCCAAG CTATAACAAC CCAATGGAAG GGCGTTACTG GAATGATTGG
ACACCACCTG AAAAACGCCA CGGCTTCGAT TTTTGGTACA GTTACGGGAC TTATGATTTA
CATCTCAATC CTATGTATTG GACCAACGAC ACGCCGCGTG ATAAGCCCCT GAAAATTAAC
CAGTGGAGCC CGGAGCATGA GGCCGATATC GCCATTAAGT ATCTGCGCAA TGAGGGGGGG
AAATACCGAG ATAACGATCA ACCCTTCGCG TTAGTGGTCT CCATGAACCC ACCGCATTCA
CCGTATGATC AAGTGCCACA AAAATACCTG GATCGTTTTA AAGACCACAC CTCAGAATCT
CTGAATACCC GCCCGAATGT AGTATGGGAT AAAGCCTATC AGGACGGTTA CGGGCCAAAA
TACTTTAAAG AGTATATGGC GATGGTGAAC GGTGTCGATG AGCAATTTGG CCGTATTGTG
GCGGAGCTGG ATCGCCTAAA TCTGGATAAA GATACCCTAG TGGTCTTCTT CTCTGATCAT
GGTTGCTGTA TGGGATCTAA CGGTCAGCCA ACCAAGAACG TGCATTACGA AGAATCGATG
CGTATTCCTA TGATGTTCCG CTGGCCTGGA AAACTGCCGG TGCGGGAAGA TGAGTTGCTG
TTCTCCGCCC CCGATATTTA TCCGACGCTA CTGGGTCTGA TGGGCATGAG CGAGCACATC
CCGGATCAGA TCGAAGGCAC GGATTTCTCT AATACGGTTG CCGGGCGTCC GGGAGATAAA
CGTCCTACCT CGCAGTTGTA TACTTTTATG CCTTATGGCG GACAATCTTA TGGTCAGCGC
GGGGTGCGCA CAGACCGTTA TACGTTAGTC ATTGATCGTA AAGTTGGCAA GCCACTCACT
TATACTCTGC ATGACAACAA AAATGATCCT TACCAGATGA AAAATATTGC TGCAGAAAAT
ATGGCGCTGG TGAATCAATT GATTGCTGAC GAGTTAATCC CTTGGCTGGA GCACTCTGGT
GATGTCTGGC GGCCAACAGA AGTGGCGGCC AATGCGGCCA AGGCTTATCT TTAA
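
The GC content reported in the record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as shown, in space-separated blocks of ten bases, and strips the whitespace before counting; `gc_content` is a hypothetical helper name, and the example only runs it on the first two blocks of the sequence.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two ten-base blocks of the gene sequence above.
print(gc_content("ATGTCGCTAA CAAGACGCAA"))  # 45.0
```

Passing the full 1554 bp sequence to the same function gives the whole-gene value to compare against the rounded figure in the record.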
 
Protein sequence
MSLTRRNLLK GIAVSGALGA TAAATGVLST AQAVVPAKKT GKQPNLLIIF PDEMRTQSLG 
FMGQDPSITP FINQFASQSV VLKQAVSNYP LCTPFRGMLM TGQYPYRNGL QGNCHTGADG
NFGGKDFGIE LKKESVTWSD ILKKQGYSMG YIGKWHLDAP EAPFVPSYNN PMEGRYWNDW
TPPEKRHGFD FWYSYGTYDL HLNPMYWTND TPRDKPLKIN QWSPEHEADI AIKYLRNEGG
KYRDNDQPFA LVVSMNPPHS PYDQVPQKYL DRFKDHTSES LNTRPNVVWD KAYQDGYGPK
YFKEYMAMVN GVDEQFGRIV AELDRLNLDK DTLVVFFSDH GCCMGSNGQP TKNVHYEESM
RIPMMFRWPG KLPVREDELL FSAPDIYPTL LGLMGMSEHI PDQIEGTDFS NTVAGRPGDK
RPTSQLYTFM PYGGQSYGQR GVRTDRYTLV IDRKVGKPLT YTLHDNKNDP YQMKNIAAEN
MALVNQLIAD ELIPWLEHSG DVWRPTEVAA NAAKAYL
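
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch, assuming that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may serve as starts) and that the gene begins with ATG, as this one does. Alternative start codons are not handled, and `translate` is a hypothetical helper, not a library call.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate("ATGTCGCTAA CAAGACGCAA TTTATTAAAA"))  # MSLTRRNLLK
```

The output matches the first ten residues of the protein sequence, including the twin-arginine (RR) pair that fits the Tat signal annotation in the record.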