Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YPK_2156 |
Symbol | |
ID | 6087846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis YPIII |
Kingdom | Bacteria |
Replicon accession | NC_010465 |
Strand | - |
Start bp | 2388368 |
End bp | 2393263 |
Gene Length | 4896 bp |
Protein Length | 1631 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641597224 |
Product | filamentous haemagglutinin domain-containing protein |
Protein accession | YP_001720895 |
Protein GI | 170024390 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.435206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA ATAAATTTAA ACTTTCCCCA GCAGGAAAGT TGACGGTTAT CTTATCTTTA ATTCTTACTC CAATCACAAA TAGCTATTCT GCTGAAATAG AAGCAGCGGG AAATACGTAC ATGAGGGGTA ATGAACATAT CCCAAGTGTT TATAATAATC CTGATGGTGT GAGTGTGATA AATATCGCTC CTCCGTCAGA GCATGGTCTC TCGCATAATC AATATATGGA ATTTCATGTT AATGAACATG GGGTCGTGTT TAATAATTCA CTTGAGAGAG TTGTAAAAAA TGGAGTGACT TATGATGCTA ACCTTAATTT ACGTGGCTCA CCAGCACGTG TGATATTAAA TGAAGTGGTG GGGCTAAATG CTTCAGTATT GGCTGGGCAC CAGGATATCG TAGGCATACC TGCAGACTAT ATTCTGGCAA ATGCTAACGG TATTAGCTGT CAGGGATGTA GTTTTGCGCC AGAGTTTAAA AATGTCACGT TAGCCGTAGG GAAAGTCGTT ACGGTTCGTG GTGATCTACG CAGTATAGAT ACCATAGGGA ATGCTAATTT ATTGAATGTG TCAAATGATC GCGATGATAA TAATATGGCT GATGCATTGA CACTAATTGC GCCAGTTATT AGCACTAATG GCCACATTAA AGTCAAAGAC GATGCGGATT TTGTTGTGGG CCAAAATATG TACCATTTTA TGAAAGATAA AACTCCAGAG GTAGAAGCAG GTAATAGTAA AATAAAAACA ATTGATGGCT ACTATCTTGG CAGTATTTCC GCTAACCGTA TTAATTTAGT TGACACAAGG GAAGATAATA ACATTAATTT ATTTGGTGAT GTGGCCGCAG AGGAAACCAA GGTGGTCACC TCTGGCACGT TGCGATTAAT AGCGGCAGAA GACGGCAGGC AGGATATAAC CATAAAAAAT GGAATGAATA TATCGGCGAA CAAGATTGAT ACTACACGGG AATTTACTGC TGATGAAGTG AAGTTATTTG ACATAAAAGA AAATAAAGTC AACAAAACCA TTATTAATGC TGGCCGTATT GATTTTGTTG CTGTCGAAGA TGTTAAATTG GCAGGTACTA CAATTTTAAG TAATGATGAT CTCTCAATTA CGGCAAAGAG TTTGCATGTT GACTCACATT TAATTAAACA TTCAAAGAGC ACAGGCGAGG TGGTTACTCA TGTGAGTATT ATTGATGAAC CAACTAAGAA AGTAGAAAAT GAATATAATG ATCATGTCTC ACAAGCCAGT GCAATTATGA GTCGGAAGAA CGTTAAATTA CATGGGCAGG ATGGTTTAGA ACTGAAAAAT GCCAATATCC AAGCCTATGG GGATATTAAA TTGTCTTCTG AAGGTGATAT TCATTTAAAT GGTTCAACTG AAACCAATAC TAGAATAAAT AATATAACCT ACATTAATCA TGATAATGAT TTTAAAAAAG GTCATGATAA TGTAAAAACT GTCACTGAAA GGTTTGCCCC TCTTGATATG AAGGCGAGGG GGAATATTAA TATACAGAGC AAAAACACGC ATATCCATGG TGCCAAAATA GCCTCAGAGG GTGAGTTATC AATAGATGCC AAGGGTGACG TCTATATTGG GGTGGCAAGT ATGTTAACTT CAGAGTTTAA AGATATTGAC TACAATCAGT GGGGGGGGGC TCATGGTTCG GAAAAAGACA AGATAGAAGA GTATGTTTAT ACCGGTAATA AGTCAGATTT AGTCGGAGGG CGGGTAAAAA TCACTGCGGG TAATGATGCC AGAATATTTG GTGGAAAAAT AAATGGAGTA GATGGTGGAG AGATCTCGGC TCAAAATTAT CTGAGCATTG ATGGTGTGCT GGGGACACGC AGCTTTAAAA GGGATCAAAA AACTGGCGGT ATTATGCATA CCACCAAGAA TACTTCTACT GCCGACAATC ACTATGAAAA ATTTATCGAC AGCGAAATTA GTTCTGATGG CGATTTCCGC ATATTCAGTC AAAAAGACCT TTATATTGAT GGTAGTCGAA TTAATGTAAA CGGAAAGCTA GATATTAATG CTAATGAGAA GTTGACCGTA CAGGCTGCTC GTCAGCAACA AAAAATAGAT GAGGAAAAAA CCCGCCTCAG CATAGAGTGG TTTGCTAAAG AAAGCAGTGA TAAGCAGTAT CGTGCGGGCT TTCTCATTAA TCATCAAAAA GACACTGAAA ATACACTGAG AGATGAACAC CAAATTGCAA CATTGAGTGC AGAACAGATT AATCTCACTG CCGGAGATGA TATTAAATTC TTTGGCACTG GCATCAGTAC CTCTAAGGGT GACGTGATAA TAAAAACACC TAAAAATGTT GGATTTTTCA CCGCGAAAAA CCGTGCACTA ATCAATAAAA ATCAGGTTAA TAATAGTGGG GGTTTTTATT TCACGTCAGG GATGGATAAA ACAGGTAATG GCTTACAATA TACCCATATT GATAAAGAAA GTTACAGTGA CATTGAGAAT AATCTGGTCG TTAAAACGCA TATTAAGGGT GATTTAAATA TTAATGCAGG AGGCGATCTT AATCAGCAAG GGACGCAGCA TGATGTGGCT AAAAACTATT CAGTTGAAGC CTCGAATATT AATAATATGG CGAGCAATAA TCTTGCTTTC TCCAAAACAG ATACATTACA GGTTGATGTC AGCATCGGTA ATAATATTGA TCACAGCGGG ATGACCCGTC CAATAGAAAA AGTCATTAAA GATCCGGCTA ATACGCTTGA TTATATCGGT GGCAGAGGAA GCCAGAAAGG TGTTTCAGAC CCAACGATTG GTCTGGATGT GGATGTATCA GGCAGTCGAA CCAAAACGTC AGACAATGAC GCGTTAGCGT TGGTTACCTC GATTAAAGCC CAAGACATTA AGCAGGTGGC GAAAAAAGAT GTGCTGGATG AAGGGACCCA ATATCACGCT ACCGAGGGCG GTATGAGTTT ACAGGGGGCA CGTCATTTCA GCCGTGCTGC AGTAAACAGT AAAGCAAATA CAACCGAGAA AGAAAAAGGT GAAGTAAGTC TTCGGGGGGG CATGACGGCC ACTCAGGAGA TAAAAGGCCA TCTGGGGGTT AAGGTGGAGA CCAGCCAGGG GGACAGCTAT GCTGAAGAGA TGTTGGTCGG GAATATTAAT GCCAAGTCGG GAGTTTCTAT CAAAACGACC GGGGATGCCT ATTATTATGC GACTAATATT GAGGGAGGAA ATGGGGATGT CACCATTGAT GCGGGCAATA ATCTTTATTT TGACCAGGTA CAGGATAGCC AACGCAGCAG TAATATAAAA TTTTCGGGTA ATGGAAAACT GAGTCTCGGT GGCTCTTCTG GCAGCAAGGA GTTTCGCCTT GAAGGGGGCG GGGGCTACCA ACAAGGTCGA AGCCAGCGCA CTGACGCTAT TGTAAGTAAA ATCAATACAC AGGGTAATGT AACGCTCAAA GCGGGCGCGG ATCTTACTAC AAAAGGTATG CAGATCGGCA AGCAAGGAAC CCGGGCAAGC GATGTCTCTT TGTTGGCGGC TGGAGAGGTC AAACTGTTGG CCGCGGTGTC TGGCAGTGCC GATATTAATG ATGGTGCGCT GGCCGACTTC CGTCTGGGGG GGAAACGAGC AACAGGCGGT GCTTCAAAAG AAGGTTTTGT GGGAGCAGGC GTTCAGGCTG ATAAAGTCAA TCAATCGGTT TCTGACCGTC AGGGCGGCCA TATTTATAGT AAGAATACGG TGTCTATAAA ATCAGACAGT GACAGCAATC AAGCCATCCA TTTAGAAGGG TTAAAAATTG ATGCACCAAA AGTCGATTTG AGTGCGCAAC AGGGTGGGGT ATTTATCGAG TCTGCATTAA GTGAATTGCC AAAGGATAAT TGGAATTTTG GGTTTAACCT GGATATGGTC CTGAAAAACA CATCGCCTAA AAAAGAAGAT GGAACAATTG ACAAGGATAA AGCCAGTGAA AGTTATTATA AAGGTGCTGG CATTAAAGTA ATGGTGGATA AACAAGATAT GTTTAAACAT CAGAATGCGC ATATCAACAC TGCATATTTT TCTTTGAATA CCAAAAAAGA TGCAGTAATG AAAGGGGCGC GAATTGAGAG CACGCGGGCT GATGTTAACG TGGGCCGGGA CCTGACGATC GAAAGCCTCA AGAGTAGGGA AGACAGTGTC AAGGTGGATG TTGAGTTATC GCTAAGCCAT ACCAATGATA AAAGCAGTAG CGTTACATCT AAACTCTCGA AGCTCACGAC CAAGAAGTTC GAGAAACAGA CTCAAGAAAA GTTAGACTCA GGTATTAAAG ATATTGGGTT AATGTATAAC AATAAAGTAA AACCAAAAGA TACGATGGGC GGGGTTGGTT TCAGTAAAGA ATCTCAAGGA GTATATCTTC CAACATTATC TTCTGAGACA AAATCCCGTA ATTTTGGTGA TAAAACGGCA CGCTATCTGG GCGGTCAATT GAAGGGGATC GCCAGCGGCC CAGAAGGCCT TGATGGGCGT GCGAAGTTGG ATGTACAAAT AGTAAACAAT GATGCGGTTG CTGAACAATC CGGTATTTTC GGTATTGATG AGACGAATAT TTCGGTAAAA GGGACCACCA AACTACACGG CGCGGAAATA AGTAGTGGAT CAGGGCAGCT CACCTTGGAA ACAGAGAACA GGGAACTGAG CAATATAAAA AATAGTACTC ACAAGGGGGG CGGTGGGTTT AATGTTTCTC CTAGTGTATT AGGGAATTTA ACTGGGGCTG GTAAAGATGT CTCTGAGGGG AAAACCCCCT TTATTCATCA CCCTCACAAT AGTTCAGATG AGAGTGAGTC TGTTGGTAAA ATCAAGGGTG AAAAAGCCAC TGGCCTTGCT TTTGATGAAC TCCCTTTTAG CCCAAAACCT CGCTAA
|
Protein sequence | MKINKFKLSP AGKLTVILSL ILTPITNSYS AEIEAAGNTY MRGNEHIPSV YNNPDGVSVI NIAPPSEHGL SHNQYMEFHV NEHGVVFNNS LERVVKNGVT YDANLNLRGS PARVILNEVV GLNASVLAGH QDIVGIPADY ILANANGISC QGCSFAPEFK NVTLAVGKVV TVRGDLRSID TIGNANLLNV SNDRDDNNMA DALTLIAPVI STNGHIKVKD DADFVVGQNM YHFMKDKTPE VEAGNSKIKT IDGYYLGSIS ANRINLVDTR EDNNINLFGD VAAEETKVVT SGTLRLIAAE DGRQDITIKN GMNISANKID TTREFTADEV KLFDIKENKV NKTIINAGRI DFVAVEDVKL AGTTILSNDD LSITAKSLHV DSHLIKHSKS TGEVVTHVSI IDEPTKKVEN EYNDHVSQAS AIMSRKNVKL HGQDGLELKN ANIQAYGDIK LSSEGDIHLN GSTETNTRIN NITYINHDND FKKGHDNVKT VTERFAPLDM KARGNINIQS KNTHIHGAKI ASEGELSIDA KGDVYIGVAS MLTSEFKDID YNQWGGAHGS EKDKIEEYVY TGNKSDLVGG RVKITAGNDA RIFGGKINGV DGGEISAQNY LSIDGVLGTR SFKRDQKTGG IMHTTKNTST ADNHYEKFID SEISSDGDFR IFSQKDLYID GSRINVNGKL DINANEKLTV QAARQQQKID EEKTRLSIEW FAKESSDKQY RAGFLINHQK DTENTLRDEH QIATLSAEQI NLTAGDDIKF FGTGISTSKG DVIIKTPKNV GFFTAKNRAL INKNQVNNSG GFYFTSGMDK TGNGLQYTHI DKESYSDIEN NLVVKTHIKG DLNINAGGDL NQQGTQHDVA KNYSVEASNI NNMASNNLAF SKTDTLQVDV SIGNNIDHSG MTRPIEKVIK DPANTLDYIG GRGSQKGVSD PTIGLDVDVS GSRTKTSDND ALALVTSIKA QDIKQVAKKD VLDEGTQYHA TEGGMSLQGA RHFSRAAVNS KANTTEKEKG EVSLRGGMTA TQEIKGHLGV KVETSQGDSY AEEMLVGNIN AKSGVSIKTT GDAYYYATNI EGGNGDVTID AGNNLYFDQV QDSQRSSNIK FSGNGKLSLG GSSGSKEFRL EGGGGYQQGR SQRTDAIVSK INTQGNVTLK AGADLTTKGM QIGKQGTRAS DVSLLAAGEV KLLAAVSGSA DINDGALADF RLGGKRATGG ASKEGFVGAG VQADKVNQSV SDRQGGHIYS KNTVSIKSDS DSNQAIHLEG LKIDAPKVDL SAQQGGVFIE SALSELPKDN WNFGFNLDMV LKNTSPKKED GTIDKDKASE SYYKGAGIKV MVDKQDMFKH QNAHINTAYF SLNTKKDAVM KGARIESTRA DVNVGRDLTI ESLKSREDSV KVDVELSLSH TNDKSSSVTS KLSKLTTKKF EKQTQEKLDS GIKDIGLMYN NKVKPKDTMG GVGFSKESQG VYLPTLSSET KSRNFGDKTA RYLGGQLKGI ASGPEGLDGR AKLDVQIVNN DAVAEQSGIF GIDETNISVK GTTKLHGAEI SSGSGQLTLE TENRELSNIK NSTHKGGGGF NVSPSVLGNL TGAGKDVSEG KTPFIHHPHN SSDESESVGK IKGEKATGLA FDELPFSPKP R
|
| |