Gene YpsIP31758_4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4055 
Symbol 
ID5384747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4565506 
End bp4568709 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table11 
GC content49% 
IMG OID640867083 
Productputative autotransporter protein 
Protein accessionYP_001402999 
Protein GI153950137 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAA ACCGCTCTAC GCTTTCCCCG TGCTTTAGTA AAACAGTGAT AGCCAGTTTG 
CTGGTGCCTC TTTGCAGTCC CCTGTATAGC TGGGCGGTAC AAACTGCCAG CATAACGGAT
GGCAGCACTA TGGTTATCTC TGGGGGTTAT GACACTGAGG CTAATAACCA CTCGGCGGTA
TTTGTGCAAG GCTCCGGTAG CACCATTAAT GGTGGCTCCG ATGTCGTTAT TGAAACCACG
GGCTTTGGTG CAATTGGTGC CTATTCTTCT GAAGGTGGGA CGTTGGGTTT GACGGGCTCG
ACTATCAAGA CCGAGAGTGG TACGGCTTTT GGTGTCTTAA ATGATAAAGG TACGGTGAAT
TTACAAGGTG GTACGATTAC CACGAAAGGT CAGACGGCAT ATGGCGTGTA TTCCTCTGGT
TTGGGCAGTA ATACCGATAT TCACAGTTCG GTGATCACGA CCAGCTACTC GTTAACCCAC
GCTATTTATG GTGCGGGCGG GACGGGGTTG ACATTGAACA ATACCACCGT CAATACCAGT
GGCAGTGGTA GCTATGGCAT TTATCTGAAT GGTCCCGGAG GGAGCTTAAC GGGTGCGGAT
AATACCATTA ATAGTACTCA TGCGACCAAT GGTGCGGGTA TCTATATTTC ATCGGGTGGC
TCAAATGCGA CTTTAGATAA CACCACACTG AATATCACTA AAGGTGCTGT TGGTGTGAAT
GTGGGGGAGG GATCCTCTAT TACGATGGAT GGCCTTATTG CCACCGGTAA TATCACCAAC
CTATTTAAAG TGAACGGGAA TGCCTCGGTC AGTAATGCCA ATATCGAATT AGCCGCGGGT
GGCTTATTAA TGGCACAGGG CCACAGTGCG TCCAATCAAG CGGTCATCAT ATTAAATAAT
GTCGATGCTA TTTCTAACAG CGGCGGCACG ACACTGGTTG ATGTTAATAA GGACGCTGAC
GTCACCATTA ATGAGGGGTC TTACCACTCA AAAGGTAACA ATGCGAAGGG GATCTGGGTT
CGAGATAATA GCTCATCGCT GAATGTCGAT AACGCCGTGA TTATCACCGA GGGCGTGAAT
GCAACGGCGA TTGAAAATCG TGGCACCGCT ATCGTAGAAA ATACTACGGT GATAACTAAA
GGGAATAACT CTCACGGCCT CTACTCTGAG CAGAGCCTTG ATGCCACCAA TATGGCAATT
TCCACTGCGG GGATTGGCAG TATTGGGGCG GCGGCAGCTA AAGGCGGTAA CCTAAATCTG
AATGATGCCC TCATCGAGAC GACGGGTAAT TCAGGTATGG TGCTGGGTAC TTTTGCCGAC
TCATCCATCA GCGCTAAAAA TATTACAGGT CTATCGACCG GCGCTGGTGC TTATGCCTTG
TGGGTCGATG ATGGTAGCTC AATCCTTCTG GAAGAGAGCC AAATTACCAC TCAAGGCCAG
GGCGCAGGAG GGATTTATGC CTCAAATACC GGGACCGGCT CTCACACCGC TTATACTCAG
GTTACGCTGA ACAACTCACA GATTCATAGT GAGCAGGGGC CGGGCATCTG GGCTAATGGT
GCTGACATTA ATGTTGATGT GAAGAATGGT TCGCAGTTAA CGGGAGGCAA TGGGTTATTG
GTCTACGCCT CGAGTAATGC AGGGGCAGCC AGTAATGTCA ATGTGAATGG CGATAACCAC
GCCGTTCTGT TGGGTGATAT TCACGCCGCA GAAAACAGCA ATATTAACCT GGCACTGAAT
AATAATTCCG TTTGGACGGG GGCGGCAACT AACGCCAAAC AGGTTGATAT CGACAGCAGC
AGTATCTGGA ATTTAACGGG TGATGCAGAT GTTGAGTCAA TGCATGTATT GGGCCAGATG
AACTTTATCT CAAATAGCAG TGACACCAAT TCACGAGCCC CCTACGATAA TTTCAGTACC
TTAACGATCA ACAGTAATGT CACCGGGAGT GGCAGTTTTA CCTTTAATGT GCAATTGGGT
GATAACGACT CGCCAGTGGA TAGACTCTAT GTAATAGGTA ATGCCTCTGG TGACCATGGG
GTTCAGGTTA TTAACCAAGG TGGTTTGGGT GCGTTGACCA CGGGTGACGG GATTAACCTG
ATTACCGTTG ATGGGGAGAC CCATTCTGGC TCATTTACTA TGAGCAACTC GGTGAGCGCA
GGGGCCTATG AGTATTTTTT GTATCAGATA GATGACCACC GTTGGAACCT GCAATCTAAT
CTCATCAAGC CTGATCCAGG CGTTGATCCA GGCGAAGAGA CAGCTTACCG CCCTGAAGTT
CCTGGCTATA TTGCCGCACC TTGGTTAAAT GCATTTTATG GTTTTACTAC TTTGGGTAGC
TTGCACGAAC GCCGTGGCTC GGCCGAGGGA GCAGCCGAAG GGTTTAATCA AGACTCATGG
GGCCGGATCC GTGGGCAGCA TAATAATTTT GAGGCGGGCC GTTTTAGCTA CGATTCAAAT
ATCTGGTTTA TGCAATTGGG TCATGATGTC TATCAGGCCA AAAATGCCGC AGGCACGCAA
GTGACTGGCG GTATGATGAT CACCCTAGGT AAGCAGAATA GCGATACACG GGATCGGGCG
CGGGCGATAA ATCCGGATTT GTCGATCGAT ACCGGCAAGA TCAAAACCGA GGCTTATGGG
TTTGGGGGTT ATTACACCCT GATGACCGAG GAAGGCGGTT ACCTTGATAT CGTTAGCCAG
GCGACGCTAT ACCGCAACAA CTATGAGAGC CAACATAATA CCAAGCATAA TGGCTACGGT
GTTGTGATGT CTGCCGAAGT GGGTCAGCCG TATCCACTGG CTGCTGGCTG GGTAGTGGAG
CCTCAGGGGC AGCTAAAATA TCAATACCTG CACCTGAGTC CGAAGAATTT CAACGATGCC
ATTTCAGAGA TCGGGGGGAC GGATTACTCT GTTGGTCAGG TACGTGCTGG CCTGCGTCTG
TTCAGTGATT CGAGCGAGAA GCGGGACATT AAGCCTTATC TGACCACCGA TGTGCTTCAC
CAGTTAGGCC GAAACCCACA GGTGACGGTA GCGACGGTGG ATATCCGTCC TGACTTCACA
AAAACCTTCT GGCAGGGGGG CGCAGGGGTG ACCGCTAAAG TGAATAGTCA GGTTGATCTC
TATGCCGATG CGAAATACCA AAAATCCTTT GATGGCAAAT TAGATGGCTA CTTAGGTAAT
TTGGGCGTGA AAGTCAGTTT CTGA
 
Protein sequence
MKTNRSTLSP CFSKTVIASL LVPLCSPLYS WAVQTASITD GSTMVISGGY DTEANNHSAV 
FVQGSGSTIN GGSDVVIETT GFGAIGAYSS EGGTLGLTGS TIKTESGTAF GVLNDKGTVN
LQGGTITTKG QTAYGVYSSG LGSNTDIHSS VITTSYSLTH AIYGAGGTGL TLNNTTVNTS
GSGSYGIYLN GPGGSLTGAD NTINSTHATN GAGIYISSGG SNATLDNTTL NITKGAVGVN
VGEGSSITMD GLIATGNITN LFKVNGNASV SNANIELAAG GLLMAQGHSA SNQAVIILNN
VDAISNSGGT TLVDVNKDAD VTINEGSYHS KGNNAKGIWV RDNSSSLNVD NAVIITEGVN
ATAIENRGTA IVENTTVITK GNNSHGLYSE QSLDATNMAI STAGIGSIGA AAAKGGNLNL
NDALIETTGN SGMVLGTFAD SSISAKNITG LSTGAGAYAL WVDDGSSILL EESQITTQGQ
GAGGIYASNT GTGSHTAYTQ VTLNNSQIHS EQGPGIWANG ADINVDVKNG SQLTGGNGLL
VYASSNAGAA SNVNVNGDNH AVLLGDIHAA ENSNINLALN NNSVWTGAAT NAKQVDIDSS
SIWNLTGDAD VESMHVLGQM NFISNSSDTN SRAPYDNFST LTINSNVTGS GSFTFNVQLG
DNDSPVDRLY VIGNASGDHG VQVINQGGLG ALTTGDGINL ITVDGETHSG SFTMSNSVSA
GAYEYFLYQI DDHRWNLQSN LIKPDPGVDP GEETAYRPEV PGYIAAPWLN AFYGFTTLGS
LHERRGSAEG AAEGFNQDSW GRIRGQHNNF EAGRFSYDSN IWFMQLGHDV YQAKNAAGTQ
VTGGMMITLG KQNSDTRDRA RAINPDLSID TGKIKTEAYG FGGYYTLMTE EGGYLDIVSQ
ATLYRNNYES QHNTKHNGYG VVMSAEVGQP YPLAAGWVVE PQGQLKYQYL HLSPKNFNDA
ISEIGGTDYS VGQVRAGLRL FSDSSEKRDI KPYLTTDVLH QLGRNPQVTV ATVDIRPDFT
KTFWQGGAGV TAKVNSQVDL YADAKYQKSF DGKLDGYLGN LGVKVSF