Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_3775 |
Symbol | |
ID | 5384632 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 4254954 |
End bp | 4258811 |
Gene Length | 3858 bp |
Protein Length | 1285 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640866799 |
Product | putative autotransporter protein |
Protein accession | YP_001402729 |
Protein GI | 153949827 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3468] Type V secretory pathway, adhesin AidA |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATATAC AATTCAGAGA GCCTATTTCT ACCTCACCCG CTCAACCGGA TGCAGGGCGT CAACACCTCT TCAGCCGTGT TCCTGCAGCC GTGATGAATC CGATTACATT GGCAATTATT GTTGCGTTCT CCCCCCTCGT CTTACCACAA ACCGCATGGG CGGCGTGCAA TAGCTCGGGG GTGGGAACCT ATGTTTGCGA AGGTGAAAAT AACACCGCGA TATCCCTGTT CGGCACGGAT ATCGCCGTTG AGACCCGGCC AGGTTTTGGC ATAACAGAAC ATGAAATTAC GGATTCTGCG CTCTCGTTAA CCGGTTCGGG TACCATCAGT TACCTCGATA CCAACAGTTC TGCACTGGAT ACCGACAGTC GATATTCTTT GTATATTAAG AATGACACAT TAATAACCGA GCAATCGGCA TCCATTAATG TTCAGAGCAA CGGCTCCATT AGCAGTGGCG TTTACATTGA CAACCAGAGC AGTGACGATT CGACCATACG GGTAGATCTC TCTGGCATCT TATCCAGCAG CCTGAGCGGC GCTCCAGCCC TTTCAATCTT TTCATCGGCA GGCAATGATT CAACTATCAT TCTGAACACC CACGCTATTT CAGGTGTTAC TGGCATCCAA AGCGATAATA ATTCACAAAA TGGAGCGACC ATCACCCACG TTGATGTTAC TGGTGATATT AACGTCGAAA ATTCAGGCGT CTCGATCCGA AATGCTGCCA ATGGTGGCAC CAGCATAATT AACTTCAATT CAAAGAGTAT TAATACAGAA TATGACAGTT TTTATATTCA AAATACCAAT TATGTAGGTG GAGTGATAAC TGATATCAAT ATTGATGGCG ATATCAGTTC AGCTAATAGT CAAGCAGCCA GGATTTACAA TTATACCAAT GGCGGTCTTG CGAGCCTTAG ATTTCGCGCA AATAATGTCA CTGGATCGAC TGGCCTTTAT ATCGATAATA GCAGCCAAAA TGGCGCTGTA ACCGATATTA TATTGACGGG TGATCTTACG GCTACTTCAG GATCTGCTCT ACAAGCCAAT GCCTACTCGG ATGAGGGAAA CATCGAGACC TCTATCAAGT TGAATAACGT TTATTCTTTG TATGATGCAC TCAATATTAG CGACTATACC CGATCCGGAA ATATACTGCA TGACCTTGAT ATTTCAGGAA CGATTACTGC AGAAAACGGT ACCGGAATTA AAGTGATGGG CGCGGCTGGT GAAGGCAGTT CTACTATGCT CATCAATGTG AATAATATTA CTTCCAGCAG TCAATCGCTT GATATCAACA ATTACAATTT TTTGGGTTCA GCTTTTTCTG CCATTACCGC GACCGGGCAT CTCACGGCGG AATGGGGACA AGGGGCAATG CTCCAGACCC ATTCCTCTCT GGGGGATGCC ACCACGCTCA TCCACTTCAA TGACATCACT GCAATGAGCA GTGGAATATC TCTCATTAAC GAAGCTAACC AAGGTACTTC AACCACCGAT ATCACCGTGA CGGGCCAGAT TAACGTCAGC CACGGAGAGG GGATTACCCT TAATGCACTG ACCACTGATG GCCGCACTTT GGTCAACGTT GACGTCAATA ATATTGCCAG TGAATACGAT GCCATACGCC TCTATAACTA TAACTATAAC TATAACTATA ACTATAACTA TAACTATAAC TATAACTATA ACTATAATGA TAACTACGCC ACAGGCGTGG ATGATGGCAC AGGCGCAGAC AACGGCACCT CGACCATTGA TCTGATCACC CGGGGCGCGC TGGTTTCACA GCAGGGCTAC GGGATTAATA TCGAAACCAA TACCGCAGAC ACCTATGTCA CCGTGGGCGG CTTGGTGCAC GGCGGCAACG GTACCGCAAT TGGCATTCAT CGGCTCGAAA ATGTTCAAAA ATCAGCCACG TTAGAGCTGC AATCTGGCTA TGCCCTTGAA GGCGTTACAC AGGCACTGGT CTTCACTGGC AGTTATGCGG AGATCAATGA TGCCGCGTTG GATCTGGCAA ACAGCCATCT GGTGCTGGGG GGAGCAGGGG ACGCTGCTTT CGATCTCACG CGTATTGATA ACCGTGAAGA GGCCATTCTG GATGGCGACC CGAACCGGAT CACCGGCTTC GGTACCCTGA CCAAAACCAA CAACAGTATC TGGACGTTAA CCGGCTCCAA TATGGCTGAC GGTGACGCCA ATGCCTTCCT GTCGGCCAAT ATCGCCGGGG GGATTTTGGT GCTGGATAAC GCCACGCTGG GCCTGACACC TGACGCCGGG GCACTGACTG GCCCCACAGT AAACCGCCTC AGTGCGGCCG ATATCGCCGC TGACCCGACG TTGGTGGCTA CCGAAACCGG TGCATTAACC CTGGGTGAAG GCGGCGCGCT GTCCTCGCTC GGTGATTCGG TTCTGAGCGG TAACCTCATC AGCGCTGGCG GGATCCTGCT GTCAAACCAC TATACCGGCG GCAATGGTAC CGCTACCGAT GATCGGCTGA CCGTGACCGG GACTTATTTG GGTGAAAATA ACGGTTCCGG TGAAGGGGCC TGGCTAACGC TGGATACGGT GCTGGGGGAT GACGATTCTG CCACCGACCG GTTAGTGATC AACGGCGATG CCACCGGCAC CACCTCGGTC CGGGTGAACA ATGCGGGCGG TCTGGGCGAT AAAACCCTCA ATGGCATCAA CCTGATCACC GTGGACGGTC TGGCGCAGGA TGACACCTTC CTGCTGGCCG GGGATTATGT CACCACGGAT GGCTATCAGG CGGTGGTGGG CGGGGCGTAT GCCTACACCT TACAGGCCGA CGGGGAAGCC GCCACTGCGG GGCGCAACTG GTATCTCTCG TCAGAGCTGA TGTTAACCGA GGGGGTACGC TATCAGGCGG GCGTGCCGCT GTATGAACAA TATCCGCAGG TGCTGGCCGC CCTGAATACC CTGCCGACGC TGCAACAGCG TGTCGGTAAC CGTTACGGGG CACCGGGCGC ACTGGCAGAC CTGAACTTTG ACGATAATCA ATGGGCCTGG GGCCGTATTG AAGGGAGCCA CCAGGTCACC GACCCGGCCC GCTCCACCAG CGGTTCACAG CGTGAGATTG ATGTGTGGAA GTTGCAGACC GGCATTGATG TGCCGTTGTA TCAGAGCCAG GGCGGTTCAC TGCTGACCGG CGGGGTGAAC TTCACCTACG GAAAAGCCAA AGCGGATATC CACTCATTCT TTGGTGATGG CCGCATCAAC AGCGCAGGTT ACGGTCTTGG CACCAGCCTG ACCTGGTATG GCAATAACGG CGTGTATGTG GATGGCCAGT TGCAGACGAT GTGGTTTGAC AGCGACCTGA GCTCCCGTAC CGCAGGGCAT GCGGTGGCCA GCGGTAACAA TGGTCGCGGG TATACCTCAG CGATAGAAGC CGGTAAAGGT TACGCACTGG GTAACGGGTT ATCGCTGACT CCGCAGATGC AGGTGACCTA CTCGCGGGTC GATTTCGATA CCTTCCGCGA TCCGTTTGAT AGCGAAGTCT CGCTGCAAGA GGGTGACAGC CTGCGGGGCC GCCTCGGTGT CTCACTGGAT AAGGAAACGA CCTGGAGTGC GAAAGACGGC ACCACCCGCC GCTCACACAT TTACAGTCAT CTCGATCTGC ACAATGAGTT CCTCAATGGC AGCAAAGTAC AGGTCTCAGG GGTGGAGTTC GCCACCCGGG ATGAGCGTCA GTCGGTGGGG TTAGGCGCGG GTGGCACCTA TGAATGGCAG AATGGTCGCT ACGCGGTTTA CGGCAATGTC AATCTGCTGG GGGCTACACG GAATATCAGT GACAACTATG CCGTCGGCGG CACGATAGGT GCACGAGTGA GCTGGTAA
|
Protein sequence | MHIQFREPIS TSPAQPDAGR QHLFSRVPAA VMNPITLAII VAFSPLVLPQ TAWAACNSSG VGTYVCEGEN NTAISLFGTD IAVETRPGFG ITEHEITDSA LSLTGSGTIS YLDTNSSALD TDSRYSLYIK NDTLITEQSA SINVQSNGSI SSGVYIDNQS SDDSTIRVDL SGILSSSLSG APALSIFSSA GNDSTIILNT HAISGVTGIQ SDNNSQNGAT ITHVDVTGDI NVENSGVSIR NAANGGTSII NFNSKSINTE YDSFYIQNTN YVGGVITDIN IDGDISSANS QAARIYNYTN GGLASLRFRA NNVTGSTGLY IDNSSQNGAV TDIILTGDLT ATSGSALQAN AYSDEGNIET SIKLNNVYSL YDALNISDYT RSGNILHDLD ISGTITAENG TGIKVMGAAG EGSSTMLINV NNITSSSQSL DINNYNFLGS AFSAITATGH LTAEWGQGAM LQTHSSLGDA TTLIHFNDIT AMSSGISLIN EANQGTSTTD ITVTGQINVS HGEGITLNAL TTDGRTLVNV DVNNIASEYD AIRLYNYNYN YNYNYNYNYN YNYNYNDNYA TGVDDGTGAD NGTSTIDLIT RGALVSQQGY GINIETNTAD TYVTVGGLVH GGNGTAIGIH RLENVQKSAT LELQSGYALE GVTQALVFTG SYAEINDAAL DLANSHLVLG GAGDAAFDLT RIDNREEAIL DGDPNRITGF GTLTKTNNSI WTLTGSNMAD GDANAFLSAN IAGGILVLDN ATLGLTPDAG ALTGPTVNRL SAADIAADPT LVATETGALT LGEGGALSSL GDSVLSGNLI SAGGILLSNH YTGGNGTATD DRLTVTGTYL GENNGSGEGA WLTLDTVLGD DDSATDRLVI NGDATGTTSV RVNNAGGLGD KTLNGINLIT VDGLAQDDTF LLAGDYVTTD GYQAVVGGAY AYTLQADGEA ATAGRNWYLS SELMLTEGVR YQAGVPLYEQ YPQVLAALNT LPTLQQRVGN RYGAPGALAD LNFDDNQWAW GRIEGSHQVT DPARSTSGSQ REIDVWKLQT GIDVPLYQSQ GGSLLTGGVN FTYGKAKADI HSFFGDGRIN SAGYGLGTSL TWYGNNGVYV DGQLQTMWFD SDLSSRTAGH AVASGNNGRG YTSAIEAGKG YALGNGLSLT PQMQVTYSRV DFDTFRDPFD SEVSLQEGDS LRGRLGVSLD KETTWSAKDG TTRRSHIYSH LDLHNEFLNG SKVQVSGVEF ATRDERQSVG LGAGGTYEWQ NGRYAVYGNV NLLGATRNIS DNYAVGGTIG ARVSW
|
| |