Gene Information |
Locus tag | YpsIP31758_1650 |
Symbol | |
ID | 5385772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 1919149 |
End bp | 1921791 |
Gene Length | 2643 bp |
Protein Length | 880 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640864631 |
Product | hemagglutination repeat-containing protein |
Protein accession | YP_001400627 |
Protein GI | 153947101 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
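
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with a small sketch. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1919149, 1921791)
print(length)                  # 2643
print(protein_length(length))  # 880
```

Both results agree with the Gene Length (2643 bp) and Protein Length (880 aa) fields listed in the record.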
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGTA TCCAAAAGTG TGACTGTAAT TATCTGGCCA GATTTGCAGT TTCAGATTCC TTTATCAGAG ATAATTCTTT TCGCGGCTTG CTTGCTATTT TTATGGTGAC GATATTTGTG CCAGGTACTG CATTTTCTGA ACTCTATGTG AACAATGCCA ATGATCCTGG CTGTTATGCC GTTGTGGATG ATAATAATCT GAATTTTAAG GGGAGGATCA CCGGTATTGT TACTAACCAT ATATATTGTA ATACGTTGAC TGAGGCGAAT CTGAATACCA ATGGGGGGCA GCTATTTGTT GGCGGACAAG GCGGGCTCCC AGGGTACGCT CCCACCCCCT GGACAGGGAC ATTCACGACG GCCGTTGGCA CCAGTAATGT CGCCACAGCA TACGGCTTTG TCGTTCAAAA AAATGGTGCA TTTATCAACG GTGATACCTA CGTGCAGGGG GGGTTATTTT TGAATGGTAG GAAAGCCACC AATCTTGCCC CCGCAACGAT TTCATCGACC TCCACCGATG CGGTGGTTGG TAGCCAGCTT TATACGGTGA TCCAAGATGG AACCCGCTAT TTCCACGCCA ACTCAGCGAA CCCGCAAGAC TCCGTGCCTA CGGGTCAGGA TGCTATTGCT GTCGGGCCGG CGACGGTGGT CAATGGTAAT AACGGGATTG GTATTGGTAA TAGCGCCGTT GTTGGGCCGA GTGCTGTCGG GGGCATTGCA ATTGGCCCCA ACACTCAGGC GACCGGTACC GCCAGCACGG CCCTTGGTGC CGGAACGCAG GCGCAGGGGG CACAGTCTTT GGCTCTTGGG GCCGGGGCGG TTACCCGCCA GGTAAACAGT ATTGCGTTAG GGGCGTCGTC GGTCACCACG GTGGGCGCTC AGGGCAGCTA CAGTGCATAC GGACTGCCGA CCACGCAGGC GTCGGTGGGG GAAGTGGGCA TAGGGACCGC GCAGGGTAAT CGCAAGATCA CCGGTGTGGC AGCGGGCTCG GCTGGTTATG ATGCGGTCAA CGTGACTCAG TTGACTGCTG TTGGTAATAA AGTCGACCAA AATACCGCTG ACATTACCAG TTTAGACGGC CGGGTCACCA ATGTTGAGGG GGGGATGACC AGCATCACCA ACGGGGGCGG TATAAAGTAC TTCCACACTC ACTCCACCGA GCCTGATTCG GTGGCCAGCG GCAGTGATTC GGTGGCGATC GGACCGAATG CGCAGGCGTC CGGTACCACG TCGATAGCCA TGGGGGCCGG GTCGACAGCG CAGGGAGCAC AGTCTCTGGC ATTGGGGGCG GGAGCGGCTG CCAGCCAGGC AAACAGTATT GCGTTAGGGG CGTCATCGGT CACCACGGTC GGTGCTGAGA GCAACTACAG TGCGTACGGA CTGACAGCTT CCCAAACGTC GGTGGGCGAG GTGGGGGTGG GCACGGCACA GGGGAATCGC AAGATCACCG GTGTGGCAGC CGGTTCGGCT GATTATGATG CGGTCAATGT CGCGCAATTG ACCGCTGTTG GTGACAAGGT CGATCAGAAT ACCGCTGACA TCACCAGTTT AGACGGCCGG GTCACCAATG TTGAGGGGGA GATGACCAGC ATCACCAACG GGGGCGGCGT GAAATACTTC CACACCCACT CCACCGAGCC TGATTCGGTG GCCAGCGGCA GTGATTCGGT GGCGATCGGA CCGAATGCGC AGGCGTCAGG TACGGCTTCG GTGGCCTCCG GCAAGGATAC GCTGGCCTCC GGTAACGGTG CGGTGGCGAT AGGTGATGCA GCAAGCGTCA GCGCAGAGGG CAGTGTTGCC 
CTGGGGCAGG GTTCCGCTGA CAACGGGCGC GGTGCAGAGA GCTACACCGG CAAGTACTCC ACTGCGGATA ACACCACCTC AGGTACGGTG TCGGTGGGCA ATGCGGCAAC CGGAGAGACC CGGACGGTCA GCAACGTTGC CGACGGGCGA GAGGCCATGG ATGCAGTCAA TCTGCGGCAA CTCGATGGTG CAATGGCGGC GGTGGGTGAC ACCGTATCAG GGTTGCAGAA CGGCACTGAC GGGATGTTCC AGGTGAACAA CAACAGCGGT CAGGCCAAGC CTTCGGCCAC CGGAACTGAT GCGATGGCGG GGGGGGCAGG TTCCGTGGCG TCTGGCAGCC ACAGTACCGC GATGGGTACG GGCAGCAAGG CGACGGCGGC AAACAGCACC GCGCTGGGGG CCAACTCAGT GGCGGATCGT GAAAACAGTG TCTCGGTGGG GTCAGTGGGT AATGAACGGC AGCTCACTAA TGTTGCTGCG GGGACTCAGG GCACTGATGC GGTGAATCTG GATCAACTCA ACCATAGCAT GTCGAATGTC ACCAACGACG CCAATGCTTA TACAGACCAG CGCTATTCTG CACTTAAAGA AGATCTGAAA AAACAGGATA GTACGTTAAG TGCGGGGATC GCCGGTGCCA TGGCGATGGC GAGCCTGACT CAACCCTATA CGCCGGGTGC CAGCATGGCG ACCATTGGTG CGGCCAGCTA TCGGGGCCAG TCGGCGCTGT CGGTGGGGGT GTCGAGTATT TCTGACAGTG GGCGATGGGT CAGCAAATTG CAGGCCTCCT CTAATACACA AGGCGATATG GGGGTTGGTG TCGGCGTCGG TTATCAATGG TAA
|
Protein sequence | MKSIQKCDCN YLARFAVSDS FIRDNSFRGL LAIFMVTIFV PGTAFSELYV NNANDPGCYA VVDDNNLNFK GRITGIVTNH IYCNTLTEAN LNTNGGQLFV GGQGGLPGYA PTPWTGTFTT AVGTSNVATA YGFVVQKNGA FINGDTYVQG GLFLNGRKAT NLAPATISST STDAVVGSQL YTVIQDGTRY FHANSANPQD SVPTGQDAIA VGPATVVNGN NGIGIGNSAV VGPSAVGGIA IGPNTQATGT ASTALGAGTQ AQGAQSLALG AGAVTRQVNS IALGASSVTT VGAQGSYSAY GLPTTQASVG EVGIGTAQGN RKITGVAAGS AGYDAVNVTQ LTAVGNKVDQ NTADITSLDG RVTNVEGGMT SITNGGGIKY FHTHSTEPDS VASGSDSVAI GPNAQASGTT SIAMGAGSTA QGAQSLALGA GAAASQANSI ALGASSVTTV GAESNYSAYG LTASQTSVGE VGVGTAQGNR KITGVAAGSA DYDAVNVAQL TAVGDKVDQN TADITSLDGR VTNVEGEMTS ITNGGGVKYF HTHSTEPDSV ASGSDSVAIG PNAQASGTAS VASGKDTLAS GNGAVAIGDA ASVSAEGSVA LGQGSADNGR GAESYTGKYS TADNTTSGTV SVGNAATGET RTVSNVADGR EAMDAVNLRQ LDGAMAAVGD TVSGLQNGTD GMFQVNNNSG QAKPSATGTD AMAGGAGSVA SGSHSTAMGT GSKATAANST ALGANSVADR ENSVSVGSVG NERQLTNVAA GTQGTDAVNL DQLNHSMSNV TNDANAYTDQ RYSALKEDLK KQDSTLSAGI AGAMAMASLT QPYTPGASMA TIGAASYRGQ SALSVGVSSI SDSGRWVSKL QASSNTQGDM GVGVGVGYQW
|
| |
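
The Sequence fields can be sanity-checked against the GC content and Protein sequence fields with a sketch like the one below. It assumes the sequence is pasted as text with the 10-base spacing shown above, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as the standard code (the two tables differ only in permitted start codons). The helper names are hypothetical, and only the first 30 bases of the gene are used here for brevity.

```python
from itertools import product

# Amino-acid string for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a sequence copied with grouping spaces."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGAAAAGTA TCCAAAAGTG TGACTGTAAT"  # first 30 bases of the gene sequence
print(translate(prefix))  # MKSIQKCDCN
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full Gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein.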