Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0868 |
Symbol | |
ID | 5386152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 1050115 |
End bp | 1052265 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640863833 |
Product | hemagglutination repeat-containing protein |
Protein accession | YP_001399852 |
Protein GI | 153947097 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACA TCATAAAATA TGACATTAAT TATTTTTTAA GGTTTGTTTG TTTTAATTAC TTTATGAACA TAAAGTTTTA TCGTGGTCTG TTTAGTATTT CTATAGTGAT GATTTATTTA TTTATCCCTG GCACCGCAAC TTCTCAGCTT TATGTGAATA ATGCTGATGA TCAGTCTTGT TATGCCATTA TGGATGGGCC GTCGGTGGGG CCTATAACTA GTCCGAATTG TAATACTCCA TCTATTGGGA TTATTAATAC AAATGCGGGG CAGCTATTTG TTGGTGGAAC GGGGGCTGCG GTATCGGGAA CATTAGGCCC TACTTTTTGG ACAGGGACAT TTGCTACACC GGTCGGCACC AGTAATACCG CAACTGCATA CGGTATGGTC GTTCAAAGCG GTGGTGCGTA CATCACGGGT AATACTTACG TACAAGGTGG GTTATTCTTG AATGGGCAAA AAGCGACCAA TCTTGCCCCT GCGACGATTT CATCTACCTC TACCGATGCG GTAGTTGGTA GCCAGCTTTA TAATCTGGTC CAAGATGGAA CCCGCTATTT CCATGCTAAC TCAGTGAATC CAACAGACTC ACTGGCCTCT GGCCTGGAAA CCATTGCTGT CGGGCCAGCG ACAGTGGTCA GTGGAGATAA CGGTGTTGGC ATTGGTAACA CCGCCCTTGT CGGGGCGGCA GCAACGGGGG GAATTGCAAT AGGTTTCGGT ACTCAGGTGA CAGCTGCCGG GGCTACGGCC ATCGGTTCCG CAGCGCAGGC TCAAGGGGCG CAATCTCTGG CGTTAGGCGC GGGGGCGGTG ACCAGCCAGG CAAACAGTAT CGCGTTGGGG GCTGCTTCGA TTAATACGGT CGGCGCTCAG AGCAGTTACA GTGCTTATGC ACTAACGGCT CCGCAGGTAT CGGTGGGGGA GTTAGGGATT GGCACCGCAT TGGGTAATCG TAAGATTACC GGTGTCGCAG CTGGATCGGC GAGTTCTGAT GCGGTCAATG TTGCTCAGTT GACTGCTGTT GGTGATCAGG TGCAGCAGAA TACCGCGAAC ATCACCAGCC TGGGTGGCCG GGTTACCACC ATTGAGGGGA GCATGGCCAG CATCGCCAAT GGTGGCGGTG TGAAGTATTT CCATGCCAAC TCGACTCAGC CTGACTCGGT AGCCAGTGGT ACCAATTCGG TGGCCATCGG GCCTGCTTCT CTGGCCTCTG GGGCTGCTTC TCTGGCTTCG GGTAATGCTG CTTTGGCTTC GGGGGCAGGG GCGGTGGCGA TAGGGGATGG TGCTGCCGCC AGCGCGGACG GCAGTGTTGC CATTGGTCAG GGATCTGGTG ATAACGGGCG TGGCGTAGAG AACTACATCG GTAAGTATTC CAATGCGAGT AACACCAGCT CGGGTACCGT ATCAGTGGGT AATACTGCTA CTGGAGAAAC CCGGACGGTC AGCAATGTTG CCGATGGGCT GCAGGCCACC GATGCGGTCA ATTTGCGACA ACTCGATGGT ATTGCTGCCT CTATCGTCGT CGTCGAAAAC AACGTTTCTG GGCTCCAGAA TGGTACCGAT GGTATGTTTC AGGTGAATAA TAGTAGCGGT CTTGCCAAGC CATCCGCAAC CGGTGCCAAT TCAGCAACTG GTGGTGCCGG TTCTGTGGCT TCCGGCAATA ACAGTACCGC GTTTGGCTCA GGTGCTAAAG CAACAGCGGC AAACAGTGCA GCCTTGGGGG CTAACTCAGT AGCTGATCGG GCAAACAGTG TTTCGGTAGG CTCTGTTGGT AATGAACGGC AGATCACCAA CGTCGCCCCC GCAACTCAGG GAACCGACGC GGTGAACTTC GATCAACTTA AGAGCATCTC CAATCAAACT AATGCTTACA CGAACCAACG TTATTCCGAG CTGAAGCAGG ATCTGAGGAA ACAAAATAGC GTGTTGAGTG CAGGGATAGC TAGCGCAATG TCTATGGCGA GCTTGACACA ACCATATACT TCGGGTTCCA GCATGACCAC TATCGGCGCG GCCAGTTACC GTGGTCAATC GGCTCTGTCA CTAGGGGTAT CGAGTATTTC TGACAGCGGG AGATGGGTCA GCAAACTACA GGCATCCTCT AACACTCAAG GTGATTTTGG GATTGGTGTC GGCGTAGGGT ATCAGTGGTA A
|
Protein sequence | MENIIKYDIN YFLRFVCFNY FMNIKFYRGL FSISIVMIYL FIPGTATSQL YVNNADDQSC YAIMDGPSVG PITSPNCNTP SIGIINTNAG QLFVGGTGAA VSGTLGPTFW TGTFATPVGT SNTATAYGMV VQSGGAYITG NTYVQGGLFL NGQKATNLAP ATISSTSTDA VVGSQLYNLV QDGTRYFHAN SVNPTDSLAS GLETIAVGPA TVVSGDNGVG IGNTALVGAA ATGGIAIGFG TQVTAAGATA IGSAAQAQGA QSLALGAGAV TSQANSIALG AASINTVGAQ SSYSAYALTA PQVSVGELGI GTALGNRKIT GVAAGSASSD AVNVAQLTAV GDQVQQNTAN ITSLGGRVTT IEGSMASIAN GGGVKYFHAN STQPDSVASG TNSVAIGPAS LASGAASLAS GNAALASGAG AVAIGDGAAA SADGSVAIGQ GSGDNGRGVE NYIGKYSNAS NTSSGTVSVG NTATGETRTV SNVADGLQAT DAVNLRQLDG IAASIVVVEN NVSGLQNGTD GMFQVNNSSG LAKPSATGAN SATGGAGSVA SGNNSTAFGS GAKATAANSA ALGANSVADR ANSVSVGSVG NERQITNVAP ATQGTDAVNF DQLKSISNQT NAYTNQRYSE LKQDLRKQNS VLSAGIASAM SMASLTQPYT SGSSMTTIGA ASYRGQSALS LGVSSISDSG RWVSKLQASS NTQGDFGIGV GVGYQW
|
| |