Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_3174 |
Symbol | |
ID | 5387756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 3573740 |
End bp | 3579472 |
Gene Length | 5733 bp |
Protein Length | 1910 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640866181 |
Product | hemagglutination domain-containing protein |
Protein accession | YP_001402134 |
Protein GI | 153947282 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.361804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGTA AATTGTACAA ACTTATTTTT TGTCGTCGGC TGGGCTGCTT AATTGCCGTG GGAGAATTTA CCCGATCATA TGGCCGAGCC TTTTCGTCGA AAGGCGGTCA AGCCGGCAAT AATCAGCGCC GGGCCGTTGG TATCTTAAGC CGTCTGGCGA TGATGACGGG CTTGGCGCTT GGGATATTCC CCTTGCTGGT TCTGGCCCAT CCGGTGTTAC CGGTTAATGG CCATGTTGTC ATCGGGCAAG GGATGTTGGA TCAGCAAAGC AATACGCTTA CCGTGACACA GCAAACGGAT AAGTTAGCGA TTAATTGGGC TAGTTTTGAT ATCGCGCACG GACACAGTGT CATTTATGCC CAGCCGGGTA GCCAGAGCAT TGCATTGAAT CAGGTGCAGG GGCAGAGCGC ATCGCAAATT TATGGCCGCT TACAGGCAAA TGGTCAGGTC TTCTTGCTTA ACCCACGCGG GATACTGTTT GGTAAAGAGG CGCAGGTTAA TGTCGGTGGG TTGGTGGCGA GCACGAAATA TATGTCAAAT CCAGATTTTC TGTCTGGTGA CTACCGCCTG ATCGGGGGAG AGCGTGAAGG CAATATTATT AATCAGGCGA ATTTACGCTC GGCCCCAGGG GGATATATCG CACTGGTGGG TAACCGGATT GATAACCAGC GTTCAGGGTC TATCACCACC CCACAAGGGA ACACTGTGCT GGCAGTCGGT CACAGCGTGA CCTTGAATCT GGACCATGGG AATTTATTAG GTGTACAAAT TCAGGGAGAA ACCGTTGCTG CACTGATTCA AAATGGTGGG TTAATCCAGG CTGATGGCGG AGTGATTCAA CTGACGGCTA AGGGTAAAGA TATGCTGATG GATACGGTGA TTGATAATAC CGGTATCCTG CAAGCTAAAG GGTTGTCAGC GAAAAATGGT GCTATTTATC TTGATGGCGG TGGTGAGGGG GTGGTCAGCC AGATGGGGAC CATCGATGTT AACAATCAGC AGGGGCGTGG AGGGCGTGCG GTTGTTGAAG GGAAACGTAT TTATCTGAAT AAGAACAGTA ACATAGAAGC GCAGGGTTCT GCTGGCGGTG GCACTGTTTT AGTGGGTGGG GGCTGGCAAG GCAAAGATAA CCAGATCAGA AATGCCACGG CTGTGGTGAT GGATAAAGGC AGTAATATTG ATGTTTCGGC ATCTCGCAAC GGGCCGGGAG GAACTGCGGT TTTATGGTCT GAGGATTACA CCGGTTTTCA CGGTAATATC CGAGCCAGGG GCGGGCCTCA ATCAGGCGAT GGCGGTCGGG TTGAAACATC CAGCCAACGC AATTTACAAG CTTTTGGGCA AGTGGATGCC AGTGCCGTTC GTGGATCCGC AGGTTATTGG TTATTGGACC CTGCGGAGGT AACAATTGTT AGCAGTGGTG CTGAGAGTGG TGTAATGACT AAAGTGGGTA ATATACCCGC AGAGTTTGTT TCTAGCGCCC ATATTTTTAT TCCAACGGCC AATATTACTC AGATCCTCAA TAGGAGTATT AATACCCAGC TTAATAGTGG CACTAATGTT ACGATAACCA CCAGCAACAA TAGTTTAACG GGGTGCCAAT GGTGCAATAT AACTGTACAA GCCGATATCA CTAAAACGGC GGGAGCCGAT GCGACGCTGA CATTACAGGC TGACGGTAAT CTTGTAGTTA ATAATAATAT TACCGCTGAT GCTGGAAAAT TAAATTTAAA TTTATTAGCA GGAAATACCA CTGCGGATTC TGCCATCACG CTGAATAATA GCAAGGTTTT ATTAAATGGC GGTGATTTTT TAGCTAAACA TGCCAATGAT AATAATACGG CACGTATTAG CTTACTGGGC GGGCGATATG ATGTTGGCAA TTTCACCTTG GATGGGAATA CCGCATTAGC CTCACAAGTT GGGGTGAATA TCAGTAATGC GGCTAATATT AGCGTCGCGG GCGAAACCGT TATCTCCGGT GTGAGCAGTA ACAGCCGTGG GCAAGGGTGG CGTGGTATTG ATATATCAAA TAATTCGATC TTAACGGGCG TGGGGAATAT GACTTTCTCG ATAGGCTCTA ACTCGAATGT TTCATGGATG GGGTCATTCA CTAATGCAAC AATTACCAGT GATAATAATA TAATATTCCA AGGGACGGGG AGTTCGTCAG GTGGAGTGGA CTTTGTTAAT AGCCGTATTT TGTCAAAATC TGGCCGTGTT TTATTTGATA TAAATGGCAA TATTGTTGTT AAGAATGTTT ATGGGCTTCG GGTCAATAAT TCACAGATCA GTGCTAAGGA TGTTAAATTT GCAGTCAATG TCACTGGAGT TGATGGTTTT TTACTAAGAG ATAGTCATAT TACCGCAACT TCCGGTGATA TTAATGCCAA TGCTAATACC ATCAATAAAG GTATTTGGAT CTCAGGTAAA ACCAATTTAA ACGCCAGTGG TAATGTTAAT CTGCATGGTG TGACGACTAA TTCGGCCTAT GCGGGTGCTG ATGCAATAAA GATCAGCGGT AATTCCAGTA GTAACAATGT CAATATTACC GCTGGTGGTC ATATCTCTCT GATCGCGGTT AATGAAGGGA AAGAGATTGG AAGCACTGTT TCTGTAGATT ACGCCAATAT AATAGCTAAG AATGGAGACT TTAATTTAAA TATTACTGGG GTGAAGGGGA GTCCTTTTAA TAACACTACG ATAACTGCCA ACAATATTTC AATGAATGGC AATATTACCG CTAATGATGC GGTGTTGATG ACTAATACAT TCCTCACGGC TAAAGGCGAT ATCAAAACTG ATTTAACCTC TCCCACTAAA GGTTTATGGT TTAGGGGAAA TGGTGGGATG ACCGCGGCTA ATAATATACT CTTGGTTGCT AACAGCACAT CGAGTGGAGA AGCAGTGAAA ATCAATGCGT CTTCATCGAA CAAAATGAAT ATCACTGCAG GAAAAGATAT ATCTATAATA GCCGGTAATA GTAAAACAGC TACGGGACCT AACATTAATA TTGAAAATGT CAATATAGAA ACCAATAATG GAAACTTTAC GACTAACGGC ATAACAAGTA CATGGCTGTC GGGAGTGAAT GTTAGCGCGA ATGGTGTTGA TATAACCTCT AATTCTACTG GCACCGGTGG CATAGTATTG GATAATACTA ATATCCTGAC AACAGTAGGT GATATTAATA CAATAGTAAC CAATTCTTCC GGCAAAGGCA TTTGGATTAA ATCTAACTCA ACATTGAATT CTAATAAAGA TATCACCTTG GTTGGAGTAT CCGCCGGACA GAATGAAGGG GTCATTATTC AAGGTTCTTC AGATGCTTCA CGTAACAATA TCTCTGCTCA AGGGAATATC ACCTTAATAG GTAAAACGGG CAATGGCTCT GGTCAACGTT CATTAATCAA TTTGGGTAAT GTTAGTCTAA CATCAAGCGG AAAAAATATT GATATTAATG GTTCGTCAGT CGGTGCCGGG GATGTTTATT TTACCAATGT AGAACTTAAT GCTACCGCAG GTAATGTTTC TATTTATGCT GAAACGAAAA CCGCTTTATC GACATCATTA AATGCCGTAT TAAGCTTGGG GGGTAATAAC AGTATCAAAG CTCAAAATGG ATGGCTTATT GGTAAAGCGT TTAATACGAC ACAAGGGGCG GGTATTGGTT TTAGAGCCAA TAGTAGCTTA TCTGTTGACG GCAATATCAT TTTGAAAGGC GAGACCGAAG GGGTTGGGGC CACACGCAAA GGGATTGATT TCTATGGCGC GAATACACTG AATATTATTA AAGGTAGCCA ATTATCTCTC CTCGGTGAAA ATAAAGGGGC TCAAGATACC TCAGGTGGTA ATGGCATAAG TTATACCAGC CCAGCTAGAT TAACGGTTAA TAATAATGGT TCTTTAAAAA TGGAGGGGCG TTCAACCAGT GGCACGGGAA TTAACTTCCC AAGCAGCAAT AATACGCTGG TATTCAATGG TGATGGTGAC ACGCTGATTA AAGGCAGCAG TGTCGCGGGT ACGGGGGCCG CTATTTCCGG TGTTGTTAAT AATAGTACCG GCCCCATGAC GATTGAAGGA ATCAGTACCG AGGGTGCCGG TGTTCACCTT TTCAGTGCAG AACATCGTAT TGATCGCATT AATGTCACAG GGAGTTCAAC TCACGCCGAA GGTCTGCGGG TCAGTGGTAA TGCAGCGATT GTCGATACCA CATTGACCGG AAAGTCGATC AATGGCAGTG GTGTGAAGAT TGATTCATTG CCGGGCTCCA GTGTTGTTAC CCGTTCCGTC TTGGATAATG CCACGCTCAA TGGCAGCAGT AGCAGTGGGA AAGGGGTGGA AATTACCAGT GATATCAATG GTATTCATCA CAGTTCGATT AACGGAACGA CTACTGGCAC GGGCTACGGC ATTAATATTG GCGAAAATTT AAACGTTACC GGGACCAGTG AAGCTGACTT GTTGATTCTA CAAGGTGTGG CGACAACAGG TACTGGCACC GGAATAAAAC TCAATGGTAA TAATGATTTA AGTAATACCA GTTTAAATAG TTCTGCGGTT GATGGTATCG CTTTGGATAT CACGGGCCCG CTAGCTAACC AAGGGAATGT GATCCTAAAC GGCACGGCTT CTGGTTCGGG GATTGGTGCG CAGGTCAATG GTTCGCTAAG TGATAGTGTG GTTAACGGTA CGTCGACGAA TGGTATTGGT GTGCAAATTA ATGGATCGCT TGAAAACAGC CGCATCAACG GCATTTCGGC CAATGGCAGC GGGGTTAAAG TCGATGGCGA GACCACGCTG GATAACGCCA CGCTCAATGG CCACAGCAGC GAAGGCAAGG GTGTCGATCT GGCGGCCAAT CTGTCCGGCA ACCATGGCAG CGCAGTGCAT GGCGACACGG TCAATGGCAC TGGCATCGAC GTGGGTAAAG GCGTCACCCT GAGTGGTGGT GGCACGGATG AACCGTTAAC GGTCAGCGGC AATGCCAGCG GTGAGAAAGG CACCGGCGTG CAACTGGGCG GCAATAATAC CCTCGATAAC ACCACGCTGA GCGGCAACGC CACCGATGGT CATGGGGTAG AGATTAACAG CCGATTAATC AATAACGGCA ATACTACGAT TAATGGCAGA ACGTCTGATG ATGGTCACGG CGTACATATT AATGGGGCCA TCAGTGGCGG AGAAATCAAT GGTCATTCAG ACAATAGCCA CGGTGTTTTC CTCGATGAAA GTGCGTTACT TAATGACATC GTTATCGGAG GAGGGACCGG CTCATATAAA CCGCCGGTGT TTATAGCATT GCCTAAAACC ATCGGTGAGC ACGTGACTCT TAATGGTAAA CCGATTGATA AAACCCAGCC AGAAGGCAGT AAGGCACGTG AAGGTGATAA CCTGACAAGG GGTAAATATA CACCTTTGCC ACCGGTTACT GACCCTGAAT TGCCCCCAGC ATCAACCGAT GATGAGAAAA ATACAAAACA GACCTCGACG CTAACCCCAT CTCAGAAAAG AGAGGATCCA GACATGTTGA TAATGGCGCG AAACCATATC TTGAGTACCC TTGAAGGGCG TGATTTATCT TCATCTGTTG TCACTGAATC GGAGCAAAGT GCGGCGGGCG TTACCGGAAT TATGGTTTGT CTCCCTCTGA GTGAGGCCTC TGAACATGAA CCTTGCGATA CGTATATTTT AGACAAGGGA CAACCCCATC TCCCCATGAT GGTCAAGAAG TAA
|
Protein sequence | MNSKLYKLIF CRRLGCLIAV GEFTRSYGRA FSSKGGQAGN NQRRAVGILS RLAMMTGLAL GIFPLLVLAH PVLPVNGHVV IGQGMLDQQS NTLTVTQQTD KLAINWASFD IAHGHSVIYA QPGSQSIALN QVQGQSASQI YGRLQANGQV FLLNPRGILF GKEAQVNVGG LVASTKYMSN PDFLSGDYRL IGGEREGNII NQANLRSAPG GYIALVGNRI DNQRSGSITT PQGNTVLAVG HSVTLNLDHG NLLGVQIQGE TVAALIQNGG LIQADGGVIQ LTAKGKDMLM DTVIDNTGIL QAKGLSAKNG AIYLDGGGEG VVSQMGTIDV NNQQGRGGRA VVEGKRIYLN KNSNIEAQGS AGGGTVLVGG GWQGKDNQIR NATAVVMDKG SNIDVSASRN GPGGTAVLWS EDYTGFHGNI RARGGPQSGD GGRVETSSQR NLQAFGQVDA SAVRGSAGYW LLDPAEVTIV SSGAESGVMT KVGNIPAEFV SSAHIFIPTA NITQILNRSI NTQLNSGTNV TITTSNNSLT GCQWCNITVQ ADITKTAGAD ATLTLQADGN LVVNNNITAD AGKLNLNLLA GNTTADSAIT LNNSKVLLNG GDFLAKHAND NNTARISLLG GRYDVGNFTL DGNTALASQV GVNISNAANI SVAGETVISG VSSNSRGQGW RGIDISNNSI LTGVGNMTFS IGSNSNVSWM GSFTNATITS DNNIIFQGTG SSSGGVDFVN SRILSKSGRV LFDINGNIVV KNVYGLRVNN SQISAKDVKF AVNVTGVDGF LLRDSHITAT SGDINANANT INKGIWISGK TNLNASGNVN LHGVTTNSAY AGADAIKISG NSSSNNVNIT AGGHISLIAV NEGKEIGSTV SVDYANIIAK NGDFNLNITG VKGSPFNNTT ITANNISMNG NITANDAVLM TNTFLTAKGD IKTDLTSPTK GLWFRGNGGM TAANNILLVA NSTSSGEAVK INASSSNKMN ITAGKDISII AGNSKTATGP NINIENVNIE TNNGNFTTNG ITSTWLSGVN VSANGVDITS NSTGTGGIVL DNTNILTTVG DINTIVTNSS GKGIWIKSNS TLNSNKDITL VGVSAGQNEG VIIQGSSDAS RNNISAQGNI TLIGKTGNGS GQRSLINLGN VSLTSSGKNI DINGSSVGAG DVYFTNVELN ATAGNVSIYA ETKTALSTSL NAVLSLGGNN SIKAQNGWLI GKAFNTTQGA GIGFRANSSL SVDGNIILKG ETEGVGATRK GIDFYGANTL NIIKGSQLSL LGENKGAQDT SGGNGISYTS PARLTVNNNG SLKMEGRSTS GTGINFPSSN NTLVFNGDGD TLIKGSSVAG TGAAISGVVN NSTGPMTIEG ISTEGAGVHL FSAEHRIDRI NVTGSSTHAE GLRVSGNAAI VDTTLTGKSI NGSGVKIDSL PGSSVVTRSV LDNATLNGSS SSGKGVEITS DINGIHHSSI NGTTTGTGYG INIGENLNVT GTSEADLLIL QGVATTGTGT GIKLNGNNDL SNTSLNSSAV DGIALDITGP LANQGNVILN GTASGSGIGA QVNGSLSDSV VNGTSTNGIG VQINGSLENS RINGISANGS GVKVDGETTL DNATLNGHSS EGKGVDLAAN LSGNHGSAVH GDTVNGTGID VGKGVTLSGG GTDEPLTVSG NASGEKGTGV QLGGNNTLDN TTLSGNATDG HGVEINSRLI NNGNTTINGR TSDDGHGVHI NGAISGGEIN GHSDNSHGVF LDESALLNDI VIGGGTGSYK PPVFIALPKT IGEHVTLNGK PIDKTQPEGS KAREGDNLTR GKYTPLPPVT DPELPPASTD DEKNTKQTST LTPSQKREDP DMLIMARNHI LSTLEGRDLS SSVVTESEQS AAGVTGIMVC LPLSEASEHE PCDTYILDKG QPHLPMMVKK
|
| |