Gene YPK_2798 details


Gene Information

Locus tag: YPK_2798
Symbol:
ID: 6087815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 3088468
End bp: 3090264
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 49%
IMG OID: 641597867
Product: sulfatase
Protein accession: YP_001721528
Protein GI: 170025023
COG category: [R] General function prediction only
COG ID: [COG3083] Predicted hydrolase of alkaline phosphatase superfamily
TIGRFAM ID:
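
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3088468, 3090264)
print(length)                     # 1797
print(protein_length_aa(length))  # 598
```

Both results agree with the Gene Length and Protein Length fields in the record.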


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000743938
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGACAA ATCGTCAGCG CTATCGTGAA AAAGTTTCCC AGATGATCAG CTGGGGGCAC 
TGGTTCGCCT TATTTAATAT TCTTTTCAGC CTGGTTCTGG GTAGCCGCTA CCTGTTTGTC
TCCGATTGGC CCTCTTCGCT GATTGGCCGT GTTTATGCCC TTGTCAGTTG GCTGGGGCAC
TTTAGTTTTC TTGTTTTTGC CGCCTATTTA TTGGTGATAT TCCCGCTTAC CTTTGTGGTG
ATGTCACAAC GGCTACTCCG TTTTCTTTCT GCCGCCTTTG CGACCGCAGG TTTAACCCTG
TTATTGGTGG ATACTGAGGT TTTCGCCCGC TTCCATTTAC ACCTTAATCC GGTGGTATGG
GAGTTAGTTG TTAACCCCGA GCAAACAGAG TTGGCACGTG ACTGGCAGCT GATGTTTATT
TCAGTGCCGG TTATTTTTCT GATCGAGATG TTGTTCGGTA CCTGGGCCTG GCAAAAACTG
CGTAGCCTCA ACCGCCAGCG TTTTGTTAAA CCATTAGTTG CGGTACTGAT CTCTGCCTTT
GTGGCCTCAC ATCTGATGTA TATCTGGGCC GATGCGAATT TTTATCGTCC TATCAGTATG
CAGCGCGCTA ATCTGCCGCT CTCTTATCCA ATGACCGCCC GTAAATTCCT TGAGAAACAC
GGTCTGCTCG ATCAACAAGA GTATCAACGC CGTTTGACCG AGCAAGGCAA TCCCGATGCT
CTGGCAGTTG AATACCCACT AAACCCGATA ACATTCAATG ATAAAGGCAG CGGTTATAAC
CTGCTGTTGA TCGTGGTTGA TGGTATTCGG GTCAGCAGCC TGCAACAGGA TATGCCCGCA
TTGGCCGAGT TTGGTCGCGA AAACATTCAG TTTGATCACC ATTACAGTTC GGGTAACCGG
CAAGATACTG GTTTATTTGG CCTGTTTTAT GGCATTTCAC CGACTTACTG GGATGGGATT
CTCTCCGCCC GGGAACCTTC AGCTTTCATC ACTGCATTAG GCGCTCAAGG CTACCAATTT
GGTCTGTTCG CGTCTGATGG TTTTAAATCG GCATTATACC GCCAAGCGCT GTTAGCTGAT
TTTACCTTGC CAGCGCCGGT TGCGCAGGCT GATGCGGAAA CCACGGCTCA GTGGAAACAG
TGGCTGGCCA GTACCGCCGC CAATAGCAAC CCATGGTTCT CATATATTAG TTTGAGTGGT
CCCGCCGAAG CGCAAGATCC GGTAATTGGG CAAAAAGTTG CCTTACCCAC CGATTTTATA
CGCAACTATC AGTCGGGGGC CAAAGAGGTA GATCAGCAAA TTGCGGCCAT CCTTGAGACG
CTGAAGCAAA GTGGTCAACT CGATAAAACC GTCGTCATTA TTACTGCCAC CCATGGCGTC
GAATTCAATG ACAGTGGCAA TAATTACTGG GGTACGGGTA GCAGCTTCAA TCGCCAGCAA
TTGCAGGTAC CATTAGTGGT GCACTGGCCG GGTACCCCAC CACAAAACGT GGGTAAACTG
ACCAACCACG AAGATGTCAT GACGACATTG ATGCAACGCT TGCTGCATGT GAAAACAGCC
CCAGAAGATT ATTCTCAAGG GGAAGATTTG TTTGCCGCAC AGCGCAATAA CAACTGGGTC
GCAACAGGCG ACAACGGCAT ATTAGTCATC ACGACCCCAA CGCAAACTAT CGTGTTGGAC
AATAACGGGG GTTATCGCAC TTATGATCAG CAAGGTCATG AGGTCAAAGA TGAGAAACCT
CAACTACCGT TGCTGTTACA AGTGCTAACC GACGTGAAAC GCTTTATTGC AAACTAG
 
Protein sequence
MVTNRQRYRE KVSQMISWGH WFALFNILFS LVLGSRYLFV SDWPSSLIGR VYALVSWLGH 
FSFLVFAAYL LVIFPLTFVV MSQRLLRFLS AAFATAGLTL LLVDTEVFAR FHLHLNPVVW
ELVVNPEQTE LARDWQLMFI SVPVIFLIEM LFGTWAWQKL RSLNRQRFVK PLVAVLISAF
VASHLMYIWA DANFYRPISM QRANLPLSYP MTARKFLEKH GLLDQQEYQR RLTEQGNPDA
LAVEYPLNPI TFNDKGSGYN LLLIVVDGIR VSSLQQDMPA LAEFGRENIQ FDHHYSSGNR
QDTGLFGLFY GISPTYWDGI LSAREPSAFI TALGAQGYQF GLFASDGFKS ALYRQALLAD
FTLPAPVAQA DAETTAQWKQ WLASTAANSN PWFSYISLSG PAEAQDPVIG QKVALPTDFI
RNYQSGAKEV DQQIAAILET LKQSGQLDKT VVIITATHGV EFNDSGNNYW GTGSSFNRQQ
LQVPLVVHWP GTPPQNVGKL TNHEDVMTTL MQRLLHVKTA PEDYSQGEDL FAAQRNNNWV
ATGDNGILVI TTPTQTIVLD NNGGYRTYDQ QGHEVKDEKP QLPLLLQVLT DVKRFIAN
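
The relationship between the gene sequence and the protein sequence above can be illustrated with a short script. This is a minimal sketch rather than the tool that produced the record: it strips the 10-character block spacing, translates codons until the first stop, and computes the GC fraction. It assumes the sense-codon assignments of translation table 11 match the standard genetic code and does not handle alternative start codons; the helper names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; sense-codon assignments assumed shared by
# translation table 11 and the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record above.
first_line = "ATGGTGACAA ATCGTCAGCG CTATCGTGAA AAAGTTTCCC AGATGATCAG CTGGGGGCAC"
print(translate(first_line))  # MVTNRQRYREKVSQMISWGH
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein Length and GC content fields.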