Gene YpsIP31758_0868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0868 
Symbol 
ID5386152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1050115 
End bp1052265 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content52% 
IMG OID640863833 
Producthemagglutination repeat-containing protein 
Protein accessionYP_001399852 
Protein GI153947097 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACA TCATAAAATA TGACATTAAT TATTTTTTAA GGTTTGTTTG TTTTAATTAC 
TTTATGAACA TAAAGTTTTA TCGTGGTCTG TTTAGTATTT CTATAGTGAT GATTTATTTA
TTTATCCCTG GCACCGCAAC TTCTCAGCTT TATGTGAATA ATGCTGATGA TCAGTCTTGT
TATGCCATTA TGGATGGGCC GTCGGTGGGG CCTATAACTA GTCCGAATTG TAATACTCCA
TCTATTGGGA TTATTAATAC AAATGCGGGG CAGCTATTTG TTGGTGGAAC GGGGGCTGCG
GTATCGGGAA CATTAGGCCC TACTTTTTGG ACAGGGACAT TTGCTACACC GGTCGGCACC
AGTAATACCG CAACTGCATA CGGTATGGTC GTTCAAAGCG GTGGTGCGTA CATCACGGGT
AATACTTACG TACAAGGTGG GTTATTCTTG AATGGGCAAA AAGCGACCAA TCTTGCCCCT
GCGACGATTT CATCTACCTC TACCGATGCG GTAGTTGGTA GCCAGCTTTA TAATCTGGTC
CAAGATGGAA CCCGCTATTT CCATGCTAAC TCAGTGAATC CAACAGACTC ACTGGCCTCT
GGCCTGGAAA CCATTGCTGT CGGGCCAGCG ACAGTGGTCA GTGGAGATAA CGGTGTTGGC
ATTGGTAACA CCGCCCTTGT CGGGGCGGCA GCAACGGGGG GAATTGCAAT AGGTTTCGGT
ACTCAGGTGA CAGCTGCCGG GGCTACGGCC ATCGGTTCCG CAGCGCAGGC TCAAGGGGCG
CAATCTCTGG CGTTAGGCGC GGGGGCGGTG ACCAGCCAGG CAAACAGTAT CGCGTTGGGG
GCTGCTTCGA TTAATACGGT CGGCGCTCAG AGCAGTTACA GTGCTTATGC ACTAACGGCT
CCGCAGGTAT CGGTGGGGGA GTTAGGGATT GGCACCGCAT TGGGTAATCG TAAGATTACC
GGTGTCGCAG CTGGATCGGC GAGTTCTGAT GCGGTCAATG TTGCTCAGTT GACTGCTGTT
GGTGATCAGG TGCAGCAGAA TACCGCGAAC ATCACCAGCC TGGGTGGCCG GGTTACCACC
ATTGAGGGGA GCATGGCCAG CATCGCCAAT GGTGGCGGTG TGAAGTATTT CCATGCCAAC
TCGACTCAGC CTGACTCGGT AGCCAGTGGT ACCAATTCGG TGGCCATCGG GCCTGCTTCT
CTGGCCTCTG GGGCTGCTTC TCTGGCTTCG GGTAATGCTG CTTTGGCTTC GGGGGCAGGG
GCGGTGGCGA TAGGGGATGG TGCTGCCGCC AGCGCGGACG GCAGTGTTGC CATTGGTCAG
GGATCTGGTG ATAACGGGCG TGGCGTAGAG AACTACATCG GTAAGTATTC CAATGCGAGT
AACACCAGCT CGGGTACCGT ATCAGTGGGT AATACTGCTA CTGGAGAAAC CCGGACGGTC
AGCAATGTTG CCGATGGGCT GCAGGCCACC GATGCGGTCA ATTTGCGACA ACTCGATGGT
ATTGCTGCCT CTATCGTCGT CGTCGAAAAC AACGTTTCTG GGCTCCAGAA TGGTACCGAT
GGTATGTTTC AGGTGAATAA TAGTAGCGGT CTTGCCAAGC CATCCGCAAC CGGTGCCAAT
TCAGCAACTG GTGGTGCCGG TTCTGTGGCT TCCGGCAATA ACAGTACCGC GTTTGGCTCA
GGTGCTAAAG CAACAGCGGC AAACAGTGCA GCCTTGGGGG CTAACTCAGT AGCTGATCGG
GCAAACAGTG TTTCGGTAGG CTCTGTTGGT AATGAACGGC AGATCACCAA CGTCGCCCCC
GCAACTCAGG GAACCGACGC GGTGAACTTC GATCAACTTA AGAGCATCTC CAATCAAACT
AATGCTTACA CGAACCAACG TTATTCCGAG CTGAAGCAGG ATCTGAGGAA ACAAAATAGC
GTGTTGAGTG CAGGGATAGC TAGCGCAATG TCTATGGCGA GCTTGACACA ACCATATACT
TCGGGTTCCA GCATGACCAC TATCGGCGCG GCCAGTTACC GTGGTCAATC GGCTCTGTCA
CTAGGGGTAT CGAGTATTTC TGACAGCGGG AGATGGGTCA GCAAACTACA GGCATCCTCT
AACACTCAAG GTGATTTTGG GATTGGTGTC GGCGTAGGGT ATCAGTGGTA A
 
Protein sequence
MENIIKYDIN YFLRFVCFNY FMNIKFYRGL FSISIVMIYL FIPGTATSQL YVNNADDQSC 
YAIMDGPSVG PITSPNCNTP SIGIINTNAG QLFVGGTGAA VSGTLGPTFW TGTFATPVGT
SNTATAYGMV VQSGGAYITG NTYVQGGLFL NGQKATNLAP ATISSTSTDA VVGSQLYNLV
QDGTRYFHAN SVNPTDSLAS GLETIAVGPA TVVSGDNGVG IGNTALVGAA ATGGIAIGFG
TQVTAAGATA IGSAAQAQGA QSLALGAGAV TSQANSIALG AASINTVGAQ SSYSAYALTA
PQVSVGELGI GTALGNRKIT GVAAGSASSD AVNVAQLTAV GDQVQQNTAN ITSLGGRVTT
IEGSMASIAN GGGVKYFHAN STQPDSVASG TNSVAIGPAS LASGAASLAS GNAALASGAG
AVAIGDGAAA SADGSVAIGQ GSGDNGRGVE NYIGKYSNAS NTSSGTVSVG NTATGETRTV
SNVADGLQAT DAVNLRQLDG IAASIVVVEN NVSGLQNGTD GMFQVNNSSG LAKPSATGAN
SATGGAGSVA SGNNSTAFGS GAKATAANSA ALGANSVADR ANSVSVGSVG NERQITNVAP
ATQGTDAVNF DQLKSISNQT NAYTNQRYSE LKQDLRKQNS VLSAGIASAM SMASLTQPYT
SGSSMTTIGA ASYRGQSALS LGVSSISDSG RWVSKLQASS NTQGDFGIGV GVGYQW