Gene YPK_0107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_0107 
Symbol 
ID6091212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp126853 
End bp130065 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content49% 
IMG OID641595167 
Productouter membrane autotransporter 
Protein accessionYP_001718873 
Protein GI170022368 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA ACCGCTCTAC GCTTTCCCCG TGCTTTCGTA AAACACTGAT AGCCAGTTTG 
CTGGTGCCTC TTTGCAGCCC CCTGTATAGC TGGGCGGTAC AAACGGCCAG CATAACGGAT
GGCAGCACGA TGGTTATCTC TGGGGGTTAT GACACTGAGG CTAATAACCA GACGGCAGTA
TTTGTGCAAG GTTCCGGTAG CACTATTAAT GGTGGCTCCG ATGTCGTTAT TGAAACCACG
GGTGTTGGTG CAATTGGTGC CAATGCCTCT GAAGGTGGGA CGCTGGGTCT GACGGGTTCG
ACTATCAAGA CCGAGAATAG TGTGGCTTTT GGTGTCTTAA ATGACAAAGG TACGGTGAAT
TTACAGGGTG GTACGATTAC CACGAAAGGT CAGACGGCAT ATGGCGTGTA TTCCTCTGGT
CTGGGCAGTA ATACCGATAT TCACAGTTCG GAGATCACGA CCAGCTACTC GTTAACCCAC
GCTATTTATG GTGCGGGCGG GACGGGATTG ACATTGAACA ATACCACCCT CAATACCAGT
GGCAGTGGTA GCTATGGCAT TTATCTGAAT GGCCCCGGAG GGAGCTTAAC GGGTGCGGAT
AATACCATTA ATAGTACTCA TGCGACCAAT GGTGCGGGTA TCTATATTTC ATCGGGTGGC
TCAAATGCGA CTTTAGATAA CACCACACTG AATATCACTA AAGGTGCTGT TGGTGTGAAT
GTGGGGGAGG GATCCTCTAT TACGATGGAT GGCCTTATTG CCACCGGTAA TATCACCAAC
CTATTTAAAG TGAACGGGAA TGCCTCGGTC AGTAATGCCA ATATCGAATT AGCCGCGGGT
GGCTTATTAA TGGCACAGGG CCACAGTGCA TCCAATCAAG CGGTCATCAT ATTAAATAAT
GTCGATGCTA TTTCTAACGG CGGCGGCACG ACACTGGTTG ATGTTAATAA GGACGCTGAC
GTCACCATTA ATGGGGGGGC TTACCACTCA AAAGGTAACA ATGCGAAGGG AATCTGGGTT
CGAGATAATA ACTCATCGCT GAATGTCGAT AACGTCGTGA TTATCACCGA GGGCGTGAAT
GCAACGGCGA TTGAAAATCG TGGCACCGCT ATCGTAAAAA ATACCACGGT GATAACCCAA
GGGAATAACT CTCACGGCCT CTACTCTGAG CAGAGCCTTG ATGCCACCAA TATGGCAATT
TCCACTGCGG GGATTGGCAG TATTGGGGCG GCGGCAGCTA AAGGCGGTAA CCTAAATCTG
AATGATGCCC TCATCGAGAC GACGGGTAAT TCAGGTATGG TGCTGGGTAC TTTTGCCGAC
TCATCCATCA GCGCTAAAAA TATTACAGGT CTATCGACCG GCGCTGGTGC TTATGCCTTG
TGGGTAGATG ATGGTAGCTC AATTCTTCTG GAAGAGAGTC AAATTACCAC TCAAGGCCAG
GGCGCAGGAG GGATTTATGC CTCAAATACA GGGACTGGCT CTCACACCGC TTACACTCAG
GTTACGCTGA ACAACTCGCA GATTCATAGT GAGCAGGGGC CGGGCATCTG GGCTAATGGT
GCTGACATTA ATGTTGATGT GAAGAATGGT TCGCAGTTAA CGGGAGGGAA TGGGTTATTG
ATCTACGCCT CGAGTAATGC AGGGGCTGCC AGTAATGTCA ATGTGAATGG CGATAACCAC
GCCGTCCTGT TGGGTGATAT TCACGCCGCA GAAAACAGCA ATATTAACCT GGCACTGAAT
AATAATTCCG TTTGGACGGG TGCGGCGACT AACGCCAAAC AGGTTGATAT CGACAGCAGC
AGTATCTGGA ATTTAACGGG TGATGCAGAT GTTGAGTCAA TGCATGTATT GGGCCAGATG
AACTTTATCT CAAATAGCAG TGACACCAAT TCACGAGCCC CCTACGATAA TTTCAGTACC
TTAACGATCA ACAGTAATGT CACCGGGAGT GGCAGTTTTA CCTTTAATGT GCAATTGGGT
GATAACGACT CGCCAGTGGA TAGACTCTAT GTAATAGGTA ATGCCTCTGG TGACCATGGG
GTTCAGGTTA TTAACCAAGG CGGTTTGGGT GCGTTGACCA CGGGTGACGG GATTAACCTG
ATTACCGTTG ATGGGGAGAC CCATTCTGGC TCATTTACTA TGAGCAACTC GGTGAGCGCA
GGGGCCTATG AGTATTTTTT GTATAAGATA GATGACTACC GTTGGAATCT GCAATCTAAT
CTCATCAATC CCGGTCCTGG TCCTGAACCA GAAATTGAAC CAGAAGAGAT AGCTTACCGC
CCTGAAGTTC CTGGCTATAT TGCCGCACCT TGGTTAAATG CATTTTATGG TTTTACTACT
TTGGGTAGCT TGCACGAACG CCGTGGCTCG GCCGAGGGGG CAGCCGAAGG GTTTAATCAA
GACTCATGGG GCCGTATCCG TGGGCAGCAT AATAATTTTG AGGCGGGCCG TTTTAGCTAC
GATTCAAATA TCTGGTTTAT GCAATTGGGT CATGATGTCT ATCAGGCCAA AAATGCCGCA
GGCACGCAAG TGACTGGCGG TATGATGATC ACCCTAGGTA AGCAGAATAG CGATACACGG
GATCGGGCGC GGGCGATAAA TCCGGATTTG TCGATCGATA CCGGCAAGAT CAAAACCGAG
GCTTATGGGT TTGGGGGTTA TTACACCCTG ATGACCGAGG AAGGCGGTTA CCTTGATATC
GTTAGCCAGG CGACGCTATA CCGCAACAAC TATGAGAGCC AACATAATAC CAAGCATAAT
GGCTACGGTG TTGTGATGTC TGCCGAAGTG GGTCAGCCGT ATCCACTGGC TGCTGGCTGG
GTAGTGGAGC CTCAGGGGCA GCTAAAATAT CAATACCTGC ACCTGAGTCC GAAGAATTTC
AACGATGCCA TTTCAGAGAT CGGGGGGACG GATTACTCTG TTGGTCAGGT ACGTGCCGGG
CTGCGTCTGT TCAGTGACGC GAGTGAGAAG CGAGATATTA AGCCTTATCT GACCACCGAT
GTGCTTCACC AGTTGGGCCG AAACCCACAG GTGACGGTAG CGACGGTGGA TATCCGTCCT
GACTTCACAA AAACCTTCTG GCAGGGGGGC GCAGGGGTGA CCGCTAAAGT GAATAGTCAG
GTTGATCTCT ATGCTGATGC GAAATACCAA AAATCTTTTG ATGGCAAATT AGATGGCTAC
TTAGGTAATT TGGGCGTGAA AGTCAGTTTC TGA
 
Protein sequence
MKKNRSTLSP CFRKTLIASL LVPLCSPLYS WAVQTASITD GSTMVISGGY DTEANNQTAV 
FVQGSGSTIN GGSDVVIETT GVGAIGANAS EGGTLGLTGS TIKTENSVAF GVLNDKGTVN
LQGGTITTKG QTAYGVYSSG LGSNTDIHSS EITTSYSLTH AIYGAGGTGL TLNNTTLNTS
GSGSYGIYLN GPGGSLTGAD NTINSTHATN GAGIYISSGG SNATLDNTTL NITKGAVGVN
VGEGSSITMD GLIATGNITN LFKVNGNASV SNANIELAAG GLLMAQGHSA SNQAVIILNN
VDAISNGGGT TLVDVNKDAD VTINGGAYHS KGNNAKGIWV RDNNSSLNVD NVVIITEGVN
ATAIENRGTA IVKNTTVITQ GNNSHGLYSE QSLDATNMAI STAGIGSIGA AAAKGGNLNL
NDALIETTGN SGMVLGTFAD SSISAKNITG LSTGAGAYAL WVDDGSSILL EESQITTQGQ
GAGGIYASNT GTGSHTAYTQ VTLNNSQIHS EQGPGIWANG ADINVDVKNG SQLTGGNGLL
IYASSNAGAA SNVNVNGDNH AVLLGDIHAA ENSNINLALN NNSVWTGAAT NAKQVDIDSS
SIWNLTGDAD VESMHVLGQM NFISNSSDTN SRAPYDNFST LTINSNVTGS GSFTFNVQLG
DNDSPVDRLY VIGNASGDHG VQVINQGGLG ALTTGDGINL ITVDGETHSG SFTMSNSVSA
GAYEYFLYKI DDYRWNLQSN LINPGPGPEP EIEPEEIAYR PEVPGYIAAP WLNAFYGFTT
LGSLHERRGS AEGAAEGFNQ DSWGRIRGQH NNFEAGRFSY DSNIWFMQLG HDVYQAKNAA
GTQVTGGMMI TLGKQNSDTR DRARAINPDL SIDTGKIKTE AYGFGGYYTL MTEEGGYLDI
VSQATLYRNN YESQHNTKHN GYGVVMSAEV GQPYPLAAGW VVEPQGQLKY QYLHLSPKNF
NDAISEIGGT DYSVGQVRAG LRLFSDASEK RDIKPYLTTD VLHQLGRNPQ VTVATVDIRP
DFTKTFWQGG AGVTAKVNSQ VDLYADAKYQ KSFDGKLDGY LGNLGVKVSF