Gene YpsIP31758_1650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1650 
Symbol 
ID5385772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1919149 
End bp1921791 
Gene Length2643 bp 
Protein Length880 aa 
Translation table11 
GC content58% 
IMG OID640864631 
Producthemagglutination repeat-containing protein 
Protein accessionYP_001400627 
Protein GI153947101 
COG category[U] Intracellular trafficking, secretion, and vesicular transport
[W] Extracellular structures 
COG ID[COG5295] Autotransporter adhesin 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGTA TCCAAAAGTG TGACTGTAAT TATCTGGCCA GATTTGCAGT TTCAGATTCC 
TTTATCAGAG ATAATTCTTT TCGCGGCTTG CTTGCTATTT TTATGGTGAC GATATTTGTG
CCAGGTACTG CATTTTCTGA ACTCTATGTG AACAATGCCA ATGATCCTGG CTGTTATGCC
GTTGTGGATG ATAATAATCT GAATTTTAAG GGGAGGATCA CCGGTATTGT TACTAACCAT
ATATATTGTA ATACGTTGAC TGAGGCGAAT CTGAATACCA ATGGGGGGCA GCTATTTGTT
GGCGGACAAG GCGGGCTCCC AGGGTACGCT CCCACCCCCT GGACAGGGAC ATTCACGACG
GCCGTTGGCA CCAGTAATGT CGCCACAGCA TACGGCTTTG TCGTTCAAAA AAATGGTGCA
TTTATCAACG GTGATACCTA CGTGCAGGGG GGGTTATTTT TGAATGGTAG GAAAGCCACC
AATCTTGCCC CCGCAACGAT TTCATCGACC TCCACCGATG CGGTGGTTGG TAGCCAGCTT
TATACGGTGA TCCAAGATGG AACCCGCTAT TTCCACGCCA ACTCAGCGAA CCCGCAAGAC
TCCGTGCCTA CGGGTCAGGA TGCTATTGCT GTCGGGCCGG CGACGGTGGT CAATGGTAAT
AACGGGATTG GTATTGGTAA TAGCGCCGTT GTTGGGCCGA GTGCTGTCGG GGGCATTGCA
ATTGGCCCCA ACACTCAGGC GACCGGTACC GCCAGCACGG CCCTTGGTGC CGGAACGCAG
GCGCAGGGGG CACAGTCTTT GGCTCTTGGG GCCGGGGCGG TTACCCGCCA GGTAAACAGT
ATTGCGTTAG GGGCGTCGTC GGTCACCACG GTGGGCGCTC AGGGCAGCTA CAGTGCATAC
GGACTGCCGA CCACGCAGGC GTCGGTGGGG GAAGTGGGCA TAGGGACCGC GCAGGGTAAT
CGCAAGATCA CCGGTGTGGC AGCGGGCTCG GCTGGTTATG ATGCGGTCAA CGTGACTCAG
TTGACTGCTG TTGGTAATAA AGTCGACCAA AATACCGCTG ACATTACCAG TTTAGACGGC
CGGGTCACCA ATGTTGAGGG GGGGATGACC AGCATCACCA ACGGGGGCGG TATAAAGTAC
TTCCACACTC ACTCCACCGA GCCTGATTCG GTGGCCAGCG GCAGTGATTC GGTGGCGATC
GGACCGAATG CGCAGGCGTC CGGTACCACG TCGATAGCCA TGGGGGCCGG GTCGACAGCG
CAGGGAGCAC AGTCTCTGGC ATTGGGGGCG GGAGCGGCTG CCAGCCAGGC AAACAGTATT
GCGTTAGGGG CGTCATCGGT CACCACGGTC GGTGCTGAGA GCAACTACAG TGCGTACGGA
CTGACAGCTT CCCAAACGTC GGTGGGCGAG GTGGGGGTGG GCACGGCACA GGGGAATCGC
AAGATCACCG GTGTGGCAGC CGGTTCGGCT GATTATGATG CGGTCAATGT CGCGCAATTG
ACCGCTGTTG GTGACAAGGT CGATCAGAAT ACCGCTGACA TCACCAGTTT AGACGGCCGG
GTCACCAATG TTGAGGGGGA GATGACCAGC ATCACCAACG GGGGCGGCGT GAAATACTTC
CACACCCACT CCACCGAGCC TGATTCGGTG GCCAGCGGCA GTGATTCGGT GGCGATCGGA
CCGAATGCGC AGGCGTCAGG TACGGCTTCG GTGGCCTCCG GCAAGGATAC GCTGGCCTCC
GGTAACGGTG CGGTGGCGAT AGGTGATGCA GCAAGCGTCA GCGCAGAGGG CAGTGTTGCC
CTGGGGCAGG GTTCCGCTGA CAACGGGCGC GGTGCAGAGA GCTACACCGG CAAGTACTCC
ACTGCGGATA ACACCACCTC AGGTACGGTG TCGGTGGGCA ATGCGGCAAC CGGAGAGACC
CGGACGGTCA GCAACGTTGC CGACGGGCGA GAGGCCATGG ATGCAGTCAA TCTGCGGCAA
CTCGATGGTG CAATGGCGGC GGTGGGTGAC ACCGTATCAG GGTTGCAGAA CGGCACTGAC
GGGATGTTCC AGGTGAACAA CAACAGCGGT CAGGCCAAGC CTTCGGCCAC CGGAACTGAT
GCGATGGCGG GGGGGGCAGG TTCCGTGGCG TCTGGCAGCC ACAGTACCGC GATGGGTACG
GGCAGCAAGG CGACGGCGGC AAACAGCACC GCGCTGGGGG CCAACTCAGT GGCGGATCGT
GAAAACAGTG TCTCGGTGGG GTCAGTGGGT AATGAACGGC AGCTCACTAA TGTTGCTGCG
GGGACTCAGG GCACTGATGC GGTGAATCTG GATCAACTCA ACCATAGCAT GTCGAATGTC
ACCAACGACG CCAATGCTTA TACAGACCAG CGCTATTCTG CACTTAAAGA AGATCTGAAA
AAACAGGATA GTACGTTAAG TGCGGGGATC GCCGGTGCCA TGGCGATGGC GAGCCTGACT
CAACCCTATA CGCCGGGTGC CAGCATGGCG ACCATTGGTG CGGCCAGCTA TCGGGGCCAG
TCGGCGCTGT CGGTGGGGGT GTCGAGTATT TCTGACAGTG GGCGATGGGT CAGCAAATTG
CAGGCCTCCT CTAATACACA AGGCGATATG GGGGTTGGTG TCGGCGTCGG TTATCAATGG
TAA
 
Protein sequence
MKSIQKCDCN YLARFAVSDS FIRDNSFRGL LAIFMVTIFV PGTAFSELYV NNANDPGCYA 
VVDDNNLNFK GRITGIVTNH IYCNTLTEAN LNTNGGQLFV GGQGGLPGYA PTPWTGTFTT
AVGTSNVATA YGFVVQKNGA FINGDTYVQG GLFLNGRKAT NLAPATISST STDAVVGSQL
YTVIQDGTRY FHANSANPQD SVPTGQDAIA VGPATVVNGN NGIGIGNSAV VGPSAVGGIA
IGPNTQATGT ASTALGAGTQ AQGAQSLALG AGAVTRQVNS IALGASSVTT VGAQGSYSAY
GLPTTQASVG EVGIGTAQGN RKITGVAAGS AGYDAVNVTQ LTAVGNKVDQ NTADITSLDG
RVTNVEGGMT SITNGGGIKY FHTHSTEPDS VASGSDSVAI GPNAQASGTT SIAMGAGSTA
QGAQSLALGA GAAASQANSI ALGASSVTTV GAESNYSAYG LTASQTSVGE VGVGTAQGNR
KITGVAAGSA DYDAVNVAQL TAVGDKVDQN TADITSLDGR VTNVEGEMTS ITNGGGVKYF
HTHSTEPDSV ASGSDSVAIG PNAQASGTAS VASGKDTLAS GNGAVAIGDA ASVSAEGSVA
LGQGSADNGR GAESYTGKYS TADNTTSGTV SVGNAATGET RTVSNVADGR EAMDAVNLRQ
LDGAMAAVGD TVSGLQNGTD GMFQVNNNSG QAKPSATGTD AMAGGAGSVA SGSHSTAMGT
GSKATAANST ALGANSVADR ENSVSVGSVG NERQLTNVAA GTQGTDAVNL DQLNHSMSNV
TNDANAYTDQ RYSALKEDLK KQDSTLSAGI AGAMAMASLT QPYTPGASMA TIGAASYRGQ
SALSVGVSSI SDSGRWVSKL QASSNTQGDM GVGVGVGYQW