Gene YPK_1378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1378 
Symbol 
ID6090743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp1517450 
End bp1519123 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content42% 
IMG OID641596442 
Productsulfatase 
Protein accessionYP_001720125 
Protein GI170023620 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAATT TAAAGGTTAA CAGGAGCATT ATTAGTGCTT CGATATCGGT CATCTTAGCT 
GCGGGCGTCA TGGGTGGCCC TGCATATGCT GATGATGTCA AACTCAAAGC AACAAACACC
AATGTTGCTT TCGCTGATTT TACGCCAAAA GAATACAGCA CGAAGAATAA GCCAAATATC
ATTGTCTTAA CCATGGATGA CTTAGGTTAT GGGCAGCTCC CTTTTGATAA GACCTCCTTT
GACCCTAAGT CGATGGAAGA TCGGGACGTT GTTGATACCT ACAAAATAGG CATTGATAAA
GCCATTGAAG CCGCCAAAAA GTCCACGCCA ACACTACTCT CGTTGATGGA TGAAGGGGTT
CGTCTGACGA ATGGCTACGT TGCTCATGGC GTATCAGGGC CTTCGCGGGC GGCCATTATG
ACGGGCCGGT CCCCTGCAAG GTTTGGTGTT TACTCCAATA CCGATGCTCA GAATGGGATT
TCATTAGAAG AGACATTCCT GCCTGAGTTA TTGCAAAACA ATGGCTATTA CACGGCGGCC
ATCGGAAAAT GGCATCTTTC AAAAATCAGT AATGTTCCTG TTCCTGAAGC GGAGCAAACG
CGCGATTACC ACGATAACTT TACAACTTAC TCAGCCGATG AATGGCAGCC TCAAAACCGA
GGCTTCCAGT ATTTTATGGG TTACCATGCC GCGGGAACGG CTTATTATAA TTCCCCGTCT
CTTTTCCATA ATAAAGAGCG GGTGAAAGCC AAAGGTTATA TCAGTGATCA ACTTACCGAT
GAGGCTATCG GTGTTGCCAA TAGAGCTAAA TCCTTAGATG AGCCATTCAT GATGTATTTG
GCTTACAGTG CTCCCCATTT ACCTAATGAT AATCCAGCGC CGGATGAATA TCAGAAACAC
TTTAATACAG GTAGCCAAAC TGCTGATAAC TTCTATGCCT CTGTCTATTC TGTTGACCAG
GGCGTAAAAC GGCTTCTTGA GCAGCTTAAA AAGAATGGTC AATATGACAA TACGATAATT
ATGTTTACCT CTGATAACGG TGCCGTTATC GATGGGCCAT TACCGTTGAA CGGTAATCAG
AAAGGGTATA AAAGCCAAAC ATTTCCTGGC GGAACCCATA CTCCAATGTT TATTTGGTGG
AAGGGGAAAT TGCAAACAGG AAATTATGAC AAGTTGATCT CTGCAATGGA TTTCATGCCT
ACAGCGCTTG AAGCCGCTGA GATTGATGCT CCAAATAATT TAGATGGTGT CTCACTGCTT
CCTTATTTGA CGGGGAAAAG CAAAGCTGAA CCGCATAAAT ATCTTACCTG GGTGACATCC
TATACCCACT GGTTCGATGA AGAGAATATT CCATTCTGGG ATGGTTACCA TAAATTTGTG
CGTAATGAAT CCAATGAATA TCCTAAAAAC CCAAATACCG AAGATCTTAG TCAATTCTCT
TATACCATCC GCAGTAATGA CTACTCTTTA ACCTATACCT ATGAAGGTAA TAAGTTAAAT
CTGTATAAAC TGAGTGATTT AAATCAAAAA CAAGACCTTG CAAGTACCCA TCCTGATGTT
GTTAAGGTAA TGCAAGCCGA GATGAGGAAC TTCATTAATC AGAGTCAATC TCCTGTTAGT
GAAGTTAATC AGGATAAATT TAATAAAATT AAGCAATCGC TTGGTATGAA TTAA
 
Protein sequence
MMNLKVNRSI ISASISVILA AGVMGGPAYA DDVKLKATNT NVAFADFTPK EYSTKNKPNI 
IVLTMDDLGY GQLPFDKTSF DPKSMEDRDV VDTYKIGIDK AIEAAKKSTP TLLSLMDEGV
RLTNGYVAHG VSGPSRAAIM TGRSPARFGV YSNTDAQNGI SLEETFLPEL LQNNGYYTAA
IGKWHLSKIS NVPVPEAEQT RDYHDNFTTY SADEWQPQNR GFQYFMGYHA AGTAYYNSPS
LFHNKERVKA KGYISDQLTD EAIGVANRAK SLDEPFMMYL AYSAPHLPND NPAPDEYQKH
FNTGSQTADN FYASVYSVDQ GVKRLLEQLK KNGQYDNTII MFTSDNGAVI DGPLPLNGNQ
KGYKSQTFPG GTHTPMFIWW KGKLQTGNYD KLISAMDFMP TALEAAEIDA PNNLDGVSLL
PYLTGKSKAE PHKYLTWVTS YTHWFDEENI PFWDGYHKFV RNESNEYPKN PNTEDLSQFS
YTIRSNDYSL TYTYEGNKLN LYKLSDLNQK QDLASTHPDV VKVMQAEMRN FINQSQSPVS
EVNQDKFNKI KQSLGMN