Gene YpAngola_B0067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_B0067 
SymbolyopH 
ID5798273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010157 
Strand
Start bp44761 
End bp46167 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content47% 
IMG OID641337877 
Productprotein-tyrosine-phosphatase YopH 
Protein accessionYP_001604497 
Protein GI162417751 
COG category[T] Signal transduction mechanisms 
COG ID[COG5599] Protein tyrosine phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones98 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTAT CATTAAGCGA TCTTCATCGT CAGGTATCTC GATTGGTGCA GCAAGAGAGC 
GGTGATTGTA CCGGGAAATT AAGAGGTAAC GTTGCTGCCA ATAAAGAAAC TACCTTTCAA
GGTTTGACCA TAGCCAGTGG AGCCAGAGAG TCAGAAAAAG TATTTGCTCA AACTGTACTA
AGCCACGTAG CAAATGTTGT TCTAACTCAA GAAGATACCG CTAAGCTATT GCAAAGTACG
GTAAAGCATA ATTTGAATAA TTATGACTTA AGAAGTGTCG GCAATGGTAA TAGTGTACTT
GTCAGTTTAC GTAGTGACCA AATGACACTA CAAGACGCCA AAGTGCTGTT GGAGGCCGCA
TTGCGACAAG AGTCGGGAGC GAGGGGGCAT GTATCATCTC ATTCACATTC AGCCCTTCAC
GCACCGGGAA CCCCGGTGCG TGAAGGACTG CGTTCACATC TAGACCCCAG AACTCCACCG
TTGCCACCGC GTGAACGACC ACACACTTCT GGCCATCACG GGGCTGGCGA AGCCAGAGCC
ACCGCACCAA GCACTGTTTC TCCTTATGGC CCAGAAGCGC GCGCAGAACT CAGCAGCCGC
CTCACCACAT TGCGCAATAC GCTGGCGCCA GCAACGAATG ATCCGCGTTA CTTACAAGCC
TGCGGCGGTG AAAAGCTAAA CCGATTTAGA GATATTCAAT GCTGTCGGCA AACCGCAGTA
CGCGCCGATC TTAATGCCAA TTACATCCAG GTCGGTAACA CTCGTACCAT AGCGTGCCAG
TATCCGCTAC AATCTCAACT TGAAAGCCAT TTCCGTATGC TGGCAGAAAA CCGAACGCCA
GTGTTGGCTG TTTTAGCGTC CAGTTCTGAG ATAGCCAATC AAAGATTCGG TATGCCAGAT
TATTTCCGCC AGAGTGGTAC CTATGGCAGT ATCACTGTAG AGTCTAAAAT GACTCAGCAA
GTTGGTCTCG GTGACGGGAT TATGGCAGAT ATGTATACTT TAACGATTCG TGAAGCGGGT
CAAAAAACAA TCTCTGTTCC TGTGGTTCAT GTTGGCAATT GGCCCGATCA GACCGCAGTC
AGCTCTGAAG TTACCAAGGC ACTCGCTTCA CTGGTAGATC AAACAGCAGA AACAAAACGC
AATATGTATG AAAGCAAAGG AAGTTCAGCG GTAGGAGATG ACTCCAAATT ACGGCCGGTA
ATACATTGCC GTGCGGGTGT TGGCCGTACT GCGCAACTGA TTGGCGCAAT GTGCATGAAT
GATAGTCGTA ATAGTCAGTT AAGCGTAGAA GATATGGTCA GCCAAATGCG AGTACAAAGA
AATGGTATTA TGGTACAAAA AGATGAGCAA CTTGATGTTC TGATTAAGTT GGCTGAAGGA
CAAGGGCGAC CATTATTAAA TAGCTAA
 
Protein sequence
MNLSLSDLHR QVSRLVQQES GDCTGKLRGN VAANKETTFQ GLTIASGARE SEKVFAQTVL 
SHVANVVLTQ EDTAKLLQST VKHNLNNYDL RSVGNGNSVL VSLRSDQMTL QDAKVLLEAA
LRQESGARGH VSSHSHSALH APGTPVREGL RSHLDPRTPP LPPRERPHTS GHHGAGEARA
TAPSTVSPYG PEARAELSSR LTTLRNTLAP ATNDPRYLQA CGGEKLNRFR DIQCCRQTAV
RADLNANYIQ VGNTRTIACQ YPLQSQLESH FRMLAENRTP VLAVLASSSE IANQRFGMPD
YFRQSGTYGS ITVESKMTQQ VGLGDGIMAD MYTLTIREAG QKTISVPVVH VGNWPDQTAV
SSEVTKALAS LVDQTAETKR NMYESKGSSA VGDDSKLRPV IHCRAGVGRT AQLIGAMCMN
DSRNSQLSVE DMVSQMRVQR NGIMVQKDEQ LDVLIKLAEG QGRPLLNS