Gene YPK_2156 details


Gene Information

Locus tag: YPK_2156
Symbol:
ID: 6087846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis YPIII
Kingdom: Bacteria
Replicon accession: NC_010465
Strand:
Start bp: 2388368
End bp: 2393263
Gene Length: 4896 bp
Protein Length: 1631 aa
Translation table: 11
GC content: 41%
IMG OID: 641597224
Product: filamentous haemagglutinin domain-containing protein
Protein accession: YP_001720895
Protein GI: 170024390
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
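
The Start bp, End bp, Gene Length and Protein Length values above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2388368
end_bp = 2393263
gene_length_bp = 4896
protein_length_aa = 1631

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

assert span == gene_length_bp
assert span % 3 == 0
assert expected_protein_length == protein_length_aa
```

All three assertions hold for the values listed in this record.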


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.435206
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAA ATAAATTTAA ACTTTCCCCA GCAGGAAAGT TGACGGTTAT CTTATCTTTA 
ATTCTTACTC CAATCACAAA TAGCTATTCT GCTGAAATAG AAGCAGCGGG AAATACGTAC
ATGAGGGGTA ATGAACATAT CCCAAGTGTT TATAATAATC CTGATGGTGT GAGTGTGATA
AATATCGCTC CTCCGTCAGA GCATGGTCTC TCGCATAATC AATATATGGA ATTTCATGTT
AATGAACATG GGGTCGTGTT TAATAATTCA CTTGAGAGAG TTGTAAAAAA TGGAGTGACT
TATGATGCTA ACCTTAATTT ACGTGGCTCA CCAGCACGTG TGATATTAAA TGAAGTGGTG
GGGCTAAATG CTTCAGTATT GGCTGGGCAC CAGGATATCG TAGGCATACC TGCAGACTAT
ATTCTGGCAA ATGCTAACGG TATTAGCTGT CAGGGATGTA GTTTTGCGCC AGAGTTTAAA
AATGTCACGT TAGCCGTAGG GAAAGTCGTT ACGGTTCGTG GTGATCTACG CAGTATAGAT
ACCATAGGGA ATGCTAATTT ATTGAATGTG TCAAATGATC GCGATGATAA TAATATGGCT
GATGCATTGA CACTAATTGC GCCAGTTATT AGCACTAATG GCCACATTAA AGTCAAAGAC
GATGCGGATT TTGTTGTGGG CCAAAATATG TACCATTTTA TGAAAGATAA AACTCCAGAG
GTAGAAGCAG GTAATAGTAA AATAAAAACA ATTGATGGCT ACTATCTTGG CAGTATTTCC
GCTAACCGTA TTAATTTAGT TGACACAAGG GAAGATAATA ACATTAATTT ATTTGGTGAT
GTGGCCGCAG AGGAAACCAA GGTGGTCACC TCTGGCACGT TGCGATTAAT AGCGGCAGAA
GACGGCAGGC AGGATATAAC CATAAAAAAT GGAATGAATA TATCGGCGAA CAAGATTGAT
ACTACACGGG AATTTACTGC TGATGAAGTG AAGTTATTTG ACATAAAAGA AAATAAAGTC
AACAAAACCA TTATTAATGC TGGCCGTATT GATTTTGTTG CTGTCGAAGA TGTTAAATTG
GCAGGTACTA CAATTTTAAG TAATGATGAT CTCTCAATTA CGGCAAAGAG TTTGCATGTT
GACTCACATT TAATTAAACA TTCAAAGAGC ACAGGCGAGG TGGTTACTCA TGTGAGTATT
ATTGATGAAC CAACTAAGAA AGTAGAAAAT GAATATAATG ATCATGTCTC ACAAGCCAGT
GCAATTATGA GTCGGAAGAA CGTTAAATTA CATGGGCAGG ATGGTTTAGA ACTGAAAAAT
GCCAATATCC AAGCCTATGG GGATATTAAA TTGTCTTCTG AAGGTGATAT TCATTTAAAT
GGTTCAACTG AAACCAATAC TAGAATAAAT AATATAACCT ACATTAATCA TGATAATGAT
TTTAAAAAAG GTCATGATAA TGTAAAAACT GTCACTGAAA GGTTTGCCCC TCTTGATATG
AAGGCGAGGG GGAATATTAA TATACAGAGC AAAAACACGC ATATCCATGG TGCCAAAATA
GCCTCAGAGG GTGAGTTATC AATAGATGCC AAGGGTGACG TCTATATTGG GGTGGCAAGT
ATGTTAACTT CAGAGTTTAA AGATATTGAC TACAATCAGT GGGGGGGGGC TCATGGTTCG
GAAAAAGACA AGATAGAAGA GTATGTTTAT ACCGGTAATA AGTCAGATTT AGTCGGAGGG
CGGGTAAAAA TCACTGCGGG TAATGATGCC AGAATATTTG GTGGAAAAAT AAATGGAGTA
GATGGTGGAG AGATCTCGGC TCAAAATTAT CTGAGCATTG ATGGTGTGCT GGGGACACGC
AGCTTTAAAA GGGATCAAAA AACTGGCGGT ATTATGCATA CCACCAAGAA TACTTCTACT
GCCGACAATC ACTATGAAAA ATTTATCGAC AGCGAAATTA GTTCTGATGG CGATTTCCGC
ATATTCAGTC AAAAAGACCT TTATATTGAT GGTAGTCGAA TTAATGTAAA CGGAAAGCTA
GATATTAATG CTAATGAGAA GTTGACCGTA CAGGCTGCTC GTCAGCAACA AAAAATAGAT
GAGGAAAAAA CCCGCCTCAG CATAGAGTGG TTTGCTAAAG AAAGCAGTGA TAAGCAGTAT
CGTGCGGGCT TTCTCATTAA TCATCAAAAA GACACTGAAA ATACACTGAG AGATGAACAC
CAAATTGCAA CATTGAGTGC AGAACAGATT AATCTCACTG CCGGAGATGA TATTAAATTC
TTTGGCACTG GCATCAGTAC CTCTAAGGGT GACGTGATAA TAAAAACACC TAAAAATGTT
GGATTTTTCA CCGCGAAAAA CCGTGCACTA ATCAATAAAA ATCAGGTTAA TAATAGTGGG
GGTTTTTATT TCACGTCAGG GATGGATAAA ACAGGTAATG GCTTACAATA TACCCATATT
GATAAAGAAA GTTACAGTGA CATTGAGAAT AATCTGGTCG TTAAAACGCA TATTAAGGGT
GATTTAAATA TTAATGCAGG AGGCGATCTT AATCAGCAAG GGACGCAGCA TGATGTGGCT
AAAAACTATT CAGTTGAAGC CTCGAATATT AATAATATGG CGAGCAATAA TCTTGCTTTC
TCCAAAACAG ATACATTACA GGTTGATGTC AGCATCGGTA ATAATATTGA TCACAGCGGG
ATGACCCGTC CAATAGAAAA AGTCATTAAA GATCCGGCTA ATACGCTTGA TTATATCGGT
GGCAGAGGAA GCCAGAAAGG TGTTTCAGAC CCAACGATTG GTCTGGATGT GGATGTATCA
GGCAGTCGAA CCAAAACGTC AGACAATGAC GCGTTAGCGT TGGTTACCTC GATTAAAGCC
CAAGACATTA AGCAGGTGGC GAAAAAAGAT GTGCTGGATG AAGGGACCCA ATATCACGCT
ACCGAGGGCG GTATGAGTTT ACAGGGGGCA CGTCATTTCA GCCGTGCTGC AGTAAACAGT
AAAGCAAATA CAACCGAGAA AGAAAAAGGT GAAGTAAGTC TTCGGGGGGG CATGACGGCC
ACTCAGGAGA TAAAAGGCCA TCTGGGGGTT AAGGTGGAGA CCAGCCAGGG GGACAGCTAT
GCTGAAGAGA TGTTGGTCGG GAATATTAAT GCCAAGTCGG GAGTTTCTAT CAAAACGACC
GGGGATGCCT ATTATTATGC GACTAATATT GAGGGAGGAA ATGGGGATGT CACCATTGAT
GCGGGCAATA ATCTTTATTT TGACCAGGTA CAGGATAGCC AACGCAGCAG TAATATAAAA
TTTTCGGGTA ATGGAAAACT GAGTCTCGGT GGCTCTTCTG GCAGCAAGGA GTTTCGCCTT
GAAGGGGGCG GGGGCTACCA ACAAGGTCGA AGCCAGCGCA CTGACGCTAT TGTAAGTAAA
ATCAATACAC AGGGTAATGT AACGCTCAAA GCGGGCGCGG ATCTTACTAC AAAAGGTATG
CAGATCGGCA AGCAAGGAAC CCGGGCAAGC GATGTCTCTT TGTTGGCGGC TGGAGAGGTC
AAACTGTTGG CCGCGGTGTC TGGCAGTGCC GATATTAATG ATGGTGCGCT GGCCGACTTC
CGTCTGGGGG GGAAACGAGC AACAGGCGGT GCTTCAAAAG AAGGTTTTGT GGGAGCAGGC
GTTCAGGCTG ATAAAGTCAA TCAATCGGTT TCTGACCGTC AGGGCGGCCA TATTTATAGT
AAGAATACGG TGTCTATAAA ATCAGACAGT GACAGCAATC AAGCCATCCA TTTAGAAGGG
TTAAAAATTG ATGCACCAAA AGTCGATTTG AGTGCGCAAC AGGGTGGGGT ATTTATCGAG
TCTGCATTAA GTGAATTGCC AAAGGATAAT TGGAATTTTG GGTTTAACCT GGATATGGTC
CTGAAAAACA CATCGCCTAA AAAAGAAGAT GGAACAATTG ACAAGGATAA AGCCAGTGAA
AGTTATTATA AAGGTGCTGG CATTAAAGTA ATGGTGGATA AACAAGATAT GTTTAAACAT
CAGAATGCGC ATATCAACAC TGCATATTTT TCTTTGAATA CCAAAAAAGA TGCAGTAATG
AAAGGGGCGC GAATTGAGAG CACGCGGGCT GATGTTAACG TGGGCCGGGA CCTGACGATC
GAAAGCCTCA AGAGTAGGGA AGACAGTGTC AAGGTGGATG TTGAGTTATC GCTAAGCCAT
ACCAATGATA AAAGCAGTAG CGTTACATCT AAACTCTCGA AGCTCACGAC CAAGAAGTTC
GAGAAACAGA CTCAAGAAAA GTTAGACTCA GGTATTAAAG ATATTGGGTT AATGTATAAC
AATAAAGTAA AACCAAAAGA TACGATGGGC GGGGTTGGTT TCAGTAAAGA ATCTCAAGGA
GTATATCTTC CAACATTATC TTCTGAGACA AAATCCCGTA ATTTTGGTGA TAAAACGGCA
CGCTATCTGG GCGGTCAATT GAAGGGGATC GCCAGCGGCC CAGAAGGCCT TGATGGGCGT
GCGAAGTTGG ATGTACAAAT AGTAAACAAT GATGCGGTTG CTGAACAATC CGGTATTTTC
GGTATTGATG AGACGAATAT TTCGGTAAAA GGGACCACCA AACTACACGG CGCGGAAATA
AGTAGTGGAT CAGGGCAGCT CACCTTGGAA ACAGAGAACA GGGAACTGAG CAATATAAAA
AATAGTACTC ACAAGGGGGG CGGTGGGTTT AATGTTTCTC CTAGTGTATT AGGGAATTTA
ACTGGGGCTG GTAAAGATGT CTCTGAGGGG AAAACCCCCT TTATTCATCA CCCTCACAAT
AGTTCAGATG AGAGTGAGTC TGTTGGTAAA ATCAAGGGTG AAAAAGCCAC TGGCCTTGCT
TTTGATGAAC TCCCTTTTAG CCCAAAACCT CGCTAA
 
Protein sequence
MKINKFKLSP AGKLTVILSL ILTPITNSYS AEIEAAGNTY MRGNEHIPSV YNNPDGVSVI 
NIAPPSEHGL SHNQYMEFHV NEHGVVFNNS LERVVKNGVT YDANLNLRGS PARVILNEVV
GLNASVLAGH QDIVGIPADY ILANANGISC QGCSFAPEFK NVTLAVGKVV TVRGDLRSID
TIGNANLLNV SNDRDDNNMA DALTLIAPVI STNGHIKVKD DADFVVGQNM YHFMKDKTPE
VEAGNSKIKT IDGYYLGSIS ANRINLVDTR EDNNINLFGD VAAEETKVVT SGTLRLIAAE
DGRQDITIKN GMNISANKID TTREFTADEV KLFDIKENKV NKTIINAGRI DFVAVEDVKL
AGTTILSNDD LSITAKSLHV DSHLIKHSKS TGEVVTHVSI IDEPTKKVEN EYNDHVSQAS
AIMSRKNVKL HGQDGLELKN ANIQAYGDIK LSSEGDIHLN GSTETNTRIN NITYINHDND
FKKGHDNVKT VTERFAPLDM KARGNINIQS KNTHIHGAKI ASEGELSIDA KGDVYIGVAS
MLTSEFKDID YNQWGGAHGS EKDKIEEYVY TGNKSDLVGG RVKITAGNDA RIFGGKINGV
DGGEISAQNY LSIDGVLGTR SFKRDQKTGG IMHTTKNTST ADNHYEKFID SEISSDGDFR
IFSQKDLYID GSRINVNGKL DINANEKLTV QAARQQQKID EEKTRLSIEW FAKESSDKQY
RAGFLINHQK DTENTLRDEH QIATLSAEQI NLTAGDDIKF FGTGISTSKG DVIIKTPKNV
GFFTAKNRAL INKNQVNNSG GFYFTSGMDK TGNGLQYTHI DKESYSDIEN NLVVKTHIKG
DLNINAGGDL NQQGTQHDVA KNYSVEASNI NNMASNNLAF SKTDTLQVDV SIGNNIDHSG
MTRPIEKVIK DPANTLDYIG GRGSQKGVSD PTIGLDVDVS GSRTKTSDND ALALVTSIKA
QDIKQVAKKD VLDEGTQYHA TEGGMSLQGA RHFSRAAVNS KANTTEKEKG EVSLRGGMTA
TQEIKGHLGV KVETSQGDSY AEEMLVGNIN AKSGVSIKTT GDAYYYATNI EGGNGDVTID
AGNNLYFDQV QDSQRSSNIK FSGNGKLSLG GSSGSKEFRL EGGGGYQQGR SQRTDAIVSK
INTQGNVTLK AGADLTTKGM QIGKQGTRAS DVSLLAAGEV KLLAAVSGSA DINDGALADF
RLGGKRATGG ASKEGFVGAG VQADKVNQSV SDRQGGHIYS KNTVSIKSDS DSNQAIHLEG
LKIDAPKVDL SAQQGGVFIE SALSELPKDN WNFGFNLDMV LKNTSPKKED GTIDKDKASE
SYYKGAGIKV MVDKQDMFKH QNAHINTAYF SLNTKKDAVM KGARIESTRA DVNVGRDLTI
ESLKSREDSV KVDVELSLSH TNDKSSSVTS KLSKLTTKKF EKQTQEKLDS GIKDIGLMYN
NKVKPKDTMG GVGFSKESQG VYLPTLSSET KSRNFGDKTA RYLGGQLKGI ASGPEGLDGR
AKLDVQIVNN DAVAEQSGIF GIDETNISVK GTTKLHGAEI SSGSGQLTLE TENRELSNIK
NSTHKGGGGF NVSPSVLGNL TGAGKDVSEG KTPFIHHPHN SSDESESVGK IKGEKATGLA
FDELPFSPKP R
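
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling, of how the blocked gene sequence could be joined, its GC fraction computed, and its codons translated. It assumes that the sense-codon assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical. Only the first and last lines of the gene sequence are used as examples.

```python
# Codon table built in the conventional TCAG order. Assumption: table 11
# uses the same sense-codon and stop-codon assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last display lines of the gene sequence in this record.
first_line = clean("ATGAAAATAA ATAAATTTAA ACTTTCCCCA GCAGGAAAGT TGACGGTTAT CTTATCTTTA")
last_line = clean("TTTGATGAAC TCCCTTTTAG CCCAAAACCT CGCTAA")

first_peptide = translate(first_line)  # compare with the start of the protein sequence
last_peptide = translate(last_line)    # compare with the final protein line
```

Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields of the record to be cross-checked in the same way.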