Gene YpsIP31758_0299 details


Gene Information

Locus tag: YpsIP31758_0299
Symbol: shlA
ID: 5387666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 341552
End bp: 346408
Gene Length: 4857 bp
Protein Length: 1618 aa
Translation table: 11
GC content: 45%
IMG OID: 640863269
Product: hemolysin
Protein accession: YP_001399293
Protein GI: 153949445
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
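
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 341552
end_bp = 346408

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 4857
print(protein_length)  # 1618
```

Both results agree with the Gene Length and Protein Length values listed in the record.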


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.420803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAC ACACCTTTAA GCTCTCACCG GCAGGGAAAT TGGCGGCGGC GGTAACCATT 
ATTTCGGTTT CTGTTGCCAC CTGTTATGCC GCCGGGATTG TCGGAGCGGG CGATTCTGCA
CATAAACCTG ATGTTAGCTC GGTAAATGGC ACATCAGTAA TTAATATCGT GCAACCGTCA
GCCTCCGGCT TATCACATAA CCAATTTCAG GATTTTAATG TTGGCGAAAA GGGGGCGGTA
CTGAATAACG CCACCAGTGC AGGTAATTCA ATACTTGCCG GGCAGTTAGC CGCTAACCAA
AATTTAAATG GTCAAGCGGC TAGCATTATT CTTAATGAAG TCATCAGCCG TAATCCTTCT
CTATTATTGG GCCAACAAGA AATTTTTGGT ATGACGGCGG ATTATATTCT GGCCAACCCA
AATGGCATTA CCTGTAATGG CTGTGGTTTT ATGAATACCA ACCGCGAGTC ATTGGTCGTC
GGTAATCCCT TGATTGAACA AGGATCGCTG AAGGGTTTTG AAACTTTCAA TAACCATAAT
TGGTTGAAAA TTCAGGATAA GGGGCTGACC GCCAATAAAA TATTAGATCT CATTGCACCC
AAGATTGAAA TCACAGGCGC AGTCACCTCA ACTGAAGCCA TCAACGCCTT ATCAGGCAGT
AACCGGGTTT CCCTTGATGG CACAATATTG CTATCCCGCC CAATACCATT TTTAGGTAAA
GACCATATTG ATGGTAAATA TCTTGGCAGT ATGCAATCGG GCCGGATTAA TTTAGTCAGT
ACCCTTGCAG GCAGTGGGGT AAAAATTGCT GGTAGTCTAA ACGCTAGCGA AGCAATAAAC
GCCACGGTTA AAGGAAAGCT GCAATTAGAA GCCGCCAAAT TAGATGGCAA TGACATTAAC
ATCAAAGCCA ATAGCATCCA GGCATTCGGT AATCTTCATA AAAATGAAGA CAATGGGGAT
GTTACGCAGA GTCTGGAACG CACTCAGTTG AAAGGCAAAA ATATTGCTAT TGTCGCGAAT
AAAAAGAATC AGCTTTCCGC AGTAAAAATC GACGCTGACA ATGTGACGCT GAGAGGAGGT
GAATTGGTAT TGGATGAGAG AATCCTCACC AATACAGAAA AGGAATCTTC ATCAGACGGT
GGAAGTGGGC TGTGGGGCCT TGGTAAGTGG AATTGGTCTG AAAATAAAGA GGAGAAACAA
CAAACCAGTA TTGGCACCAC CATTACAGCG AAAAATAATG CCTCTCTTGA ATCAACTCAA
GATGATATTA AACTGTCGGC TGCCACAATA ACCGCGGGTA AAAACCTTGC CATCAAAGCC
AAAAAAGATT TACACATAGA CGGTGCTATT GAAGAGAATT CAATTCACGA CCACGGCCAT
AATTATAAAC ATATGGTCAA AGATGAATCT TGGAATAATA AAACCACCAA ACAAACACTC
AATAAAACCA CACTTGAGGC CGGTAAGAAT TTGGGCCTCA CCGCAGAAAA TAAAATCACG
ACTCAGGGAA TCAAAGCTTC TGCGGGTGGT GACGTCGTAA TTGATGCCAA CGACGTTAAA
ATTGGGGTAC AAAAAACCAG TAATCAGGAG ACAACCGACG GTAAACATGA AAGAAACTTG
GGACTCGGTG GTGTCGATCA CAATAATAAC GATAAGTATG CGGAAACCAG CCATAGCTCA
GAAATAACCG CTGACGGTAA TATACTCATC AGCGTGAAAG ATGACGTCGC CATTACTGGC
AGTAAAGTAA AAGCAACTAA AGATGGTTTT GTTCAAGCTA AAGAGGGTGG GATCAAAATC
GATAACGCCA TCAGTACCAC AACCAGTAAA GTTGATGAGC GTACGGGGGT AGCATTTGAT
ATTACCGGCA GTTCTAAAAA AGCCAATAAC AGTGAAGAAA AATCTACCGG TAGTGAAGTT
ATCTCTGAAG CAAACCTGAA AATTATCAGT AAAAAGGATG TCGATGTAAT TGGCAGCCTG
GTAAAAAGCG CCGGTGAATT AGGCATTGAG ACCTTAGGCG ATATCAATGT CGCAGCAGCT
CAAGAGCAAG AAAAAATCGA TGAACAAAAA ACCCAGCTGA CGATTGATGG CTTTACCAGT
GATGACGGTA AAAACCAGTA TCAGGCGGGG CTAAAACTGG AACACACCAG CGAGAGTGAA
AAAACGGAGA AGGTGACAAA TCACGGCTCT ACCCTTGAGG GCGGCACCGT CAAACTGGAG
GCAGATAAAG ATGTGACATT CACTGGGTCA CAACTCAATA CCACCGGGGG AGATGCGGAT
ATCACTGCTG AAAACGTCTC TTTTGTCGCT GCACAAGACA CCACCACCAG CAATAAAGAA
AAAGAGACCG TGGGTGTGAA TGCTCACTAT ACCGGTGGGA TGGATAAAGC GGGTAGCGGT
GCGGGGGTTA ATTATGAAGA GACGAAAACG GACAGCGAGA AATCAACTGC GGTGGTTTCC
CAGACCGATA TTAAAGGTAA TCTGAATATT AACGCCGAGC AGGATATTAC CAACCAAGGT
ACTGATCATA AAGTTGATGG CTCTTATAAC GCAGATGCCA CTAACGTTCA CAATCTGGCC
GCTGAAAATA CCGAAGAAAC CACGACTAAC AGCACAACGG TTAAGGTTGG CGTCGGTGCC
AATGTGGCTT ATGACGGTAT CACCCGCCCT GTTGAACAAG TGATTGAAAA AGGTAAAAAG
CTGGATGTCG GGGGCGTTAT TGAAAATGCA GGAGAGGTCT CGCCTGACTC AGCCAATGTC
GGAATAGACC TCTCTGCTAA AGTCGATTTA AAGGAGAGCA CACTCAGTAG CTCTCAATCC
GTAGTAACGT CAATAAAAAG CGGCGATACC ACGATCAATG CGTCTGGTGA CATCGAAGAT
CAAGGAACGA AATACGAGGC CGATAAAGGC GCGATAAATC TACATGCCAC CAACCACACT
TTTGAAGCTG CCGTTGACCG AGTTGAAAAG CATGAGGAAG AAGTCACCGC TGGCGTAGAT
GTGCGAGTTT ATACCAACAC CGGTAAGGAT ATTACGGTTG ATGGTAAAGG TAAGGGAGCC
AATAACAAGC AGGATATCAA AGGAGAAATC TCCCAGGTTG GCAGCATGGT CGCAGCAAAT
GGCATCAATA TCACGGTAAA AGAAGATGCA ACTTATACAG GGACTGATCT CGATGCTGGA
GACGGAAAGA TTGCAATTAC GGCGGGCAAA GACATTCACT TTGAACAGGC AACCAACCAC
ACCAGTGAAA GCCATAATAA TATTGAGGCC AATGCCAAAG CAAACTTCGG CACTAAGGCA
AACAGTAAAG AGTTTGGCGG CGGCTTGGGT GGCGGTCATA GTCAAGGCAG CGCCAGTACC
GATATTGCAC AAGTCAGCCA TCTACAAGGT AAACAAGGCA TCGAACTGAA TGCTGGAAAT
GATTTAACAC TGCAAGGTAC CGAGTTTGGC ACTAAAGACA CGTCTACCGG TGATGTTCAA
CTCAACGCGG GTGGAAAAGT AGACTTCCAA GCGGCACAGT CACAAAGTTC TAAACAAGAT
ATGACCTGGT CTGCCAGCGT CAAGGCAGGA AAAAACAAAA GTAACAGCAC CAGTGAAAAT
GATGATGGCA CAAAAACGCA TACCAACAAC AAAGGGTTTA GCGCAGGAGC AGAAGCCAAA
GTTGCCAATA CGGACGAATC GAACACTCGC CATCAGGGCG GTGCAATCAA CAGTAATGGC
GTTGTAACCA TCAAAGCGGG TAGCGATGAT AAGCAAGCCA TCCATCTGCA AAATACCAAC
ATCACCAGTC AAGAGACCAT TCTGGACGCG GATAATGGCG GGATCGTAAT GGAGTCGGCT
CAGGATAAAG AGCACAAAAA TAACTGGAAT GTTAACGCCA ACCTTAATGG CAGCCAGAAA
AACACCATTA AAAGCGATGA TCAAGGTGTT GTAGATAAAG ATAGCGCCAA GAAAATTCAT
AATGCGGGCA TCAAACTCGG TGGCGGTGTT GATAAACTGG ACTCAGTAAC GCAACAAAAT
ACCCATATCA ACGGTGATAA GGTCACCCTG AACAGCGGTA AAAATACCGA ATTGGCGGGT
GCGGTGATCC AGGCTAACCA GATTAATGGG CAAATTGGTG GGGATCTAAA CGTCATTAGT
CGCGAAAATA GACTCAATAA AGTCAATGTT TCTGCGGCAC TTGGTGCCAG CCACAGTAAT
GCAAAACAAG ATAGCTTGAT CAGCCAGGTA TCTGATATCA GCCCTATCGG TTCAGATAAA
ATCAAAAATA AACTGGAAGA AATGTCCACC AAGATCTTCG ACAAAGTCGA AAATAAATTC
AACACCTTAG GAAAATCAAA GGACGATAGC GTCCAGACCA CCAGCTATAC CAAAGACGGT
AAAACGGTGA AGGTCAGCGA ATCCGATGAG AAGAAGGAAA CCAAAGATAA ATGGTGGCAG
AAAGGGGCTA AATCTGTCGG TAATAAAATC AAAAGCGCTG TACAGGATGA ACAAGTAGAA
GGTGGTAGTG GCAGCGTCAA AGTGAACGTA GAGGTTGTCG AGAGCCAAGG GGTTGAGGAG
CAATCTGCTA TCCGGGGTAC ACAGAATGTG GATCTGACCG TTAAGGGTAA AACTGAACTG
GTTGGCGGAA AGATATCCAG TAAGAATTCG GATGTTAACC TAAAAACTAA TGGGTTAGAT
ACGCAGGATA TCAACGGGAA ACATACTGAA GGAGGCGCAC GGTTGAACGC TTCTTCATCG
GTTATGGGTA TGATTAGCGA TGGCGCTAAG GATGTAATGG ATGGCAAGGC ACCACTGGTC
AGCGGGCATG GCAAATCTGA ACAGAAAAAT GCAACGGGTG GTGTCACCAG AGAATAA
 
Protein sequence
MKKHTFKLSP AGKLAAAVTI ISVSVATCYA AGIVGAGDSA HKPDVSSVNG TSVINIVQPS 
ASGLSHNQFQ DFNVGEKGAV LNNATSAGNS ILAGQLAANQ NLNGQAASII LNEVISRNPS
LLLGQQEIFG MTADYILANP NGITCNGCGF MNTNRESLVV GNPLIEQGSL KGFETFNNHN
WLKIQDKGLT ANKILDLIAP KIEITGAVTS TEAINALSGS NRVSLDGTIL LSRPIPFLGK
DHIDGKYLGS MQSGRINLVS TLAGSGVKIA GSLNASEAIN ATVKGKLQLE AAKLDGNDIN
IKANSIQAFG NLHKNEDNGD VTQSLERTQL KGKNIAIVAN KKNQLSAVKI DADNVTLRGG
ELVLDERILT NTEKESSSDG GSGLWGLGKW NWSENKEEKQ QTSIGTTITA KNNASLESTQ
DDIKLSAATI TAGKNLAIKA KKDLHIDGAI EENSIHDHGH NYKHMVKDES WNNKTTKQTL
NKTTLEAGKN LGLTAENKIT TQGIKASAGG DVVIDANDVK IGVQKTSNQE TTDGKHERNL
GLGGVDHNNN DKYAETSHSS EITADGNILI SVKDDVAITG SKVKATKDGF VQAKEGGIKI
DNAISTTTSK VDERTGVAFD ITGSSKKANN SEEKSTGSEV ISEANLKIIS KKDVDVIGSL
VKSAGELGIE TLGDINVAAA QEQEKIDEQK TQLTIDGFTS DDGKNQYQAG LKLEHTSESE
KTEKVTNHGS TLEGGTVKLE ADKDVTFTGS QLNTTGGDAD ITAENVSFVA AQDTTTSNKE
KETVGVNAHY TGGMDKAGSG AGVNYEETKT DSEKSTAVVS QTDIKGNLNI NAEQDITNQG
TDHKVDGSYN ADATNVHNLA AENTEETTTN STTVKVGVGA NVAYDGITRP VEQVIEKGKK
LDVGGVIENA GEVSPDSANV GIDLSAKVDL KESTLSSSQS VVTSIKSGDT TINASGDIED
QGTKYEADKG AINLHATNHT FEAAVDRVEK HEEEVTAGVD VRVYTNTGKD ITVDGKGKGA
NNKQDIKGEI SQVGSMVAAN GINITVKEDA TYTGTDLDAG DGKIAITAGK DIHFEQATNH
TSESHNNIEA NAKANFGTKA NSKEFGGGLG GGHSQGSAST DIAQVSHLQG KQGIELNAGN
DLTLQGTEFG TKDTSTGDVQ LNAGGKVDFQ AAQSQSSKQD MTWSASVKAG KNKSNSTSEN
DDGTKTHTNN KGFSAGAEAK VANTDESNTR HQGGAINSNG VVTIKAGSDD KQAIHLQNTN
ITSQETILDA DNGGIVMESA QDKEHKNNWN VNANLNGSQK NTIKSDDQGV VDKDSAKKIH
NAGIKLGGGV DKLDSVTQQN THINGDKVTL NSGKNTELAG AVIQANQING QIGGDLNVIS
RENRLNKVNV SAALGASHSN AKQDSLISQV SDISPIGSDK IKNKLEEMST KIFDKVENKF
NTLGKSKDDS VQTTSYTKDG KTVKVSESDE KKETKDKWWQ KGAKSVGNKI KSAVQDEQVE
GGSGSVKVNV EVVESQGVEE QSAIRGTQNV DLTVKGKTEL VGGKISSKNS DVNLKTNGLD
TQDINGKHTE GGARLNASSS VMGMISDGAK DVMDGKAPLV SGHGKSEQKN ATGGVTRE
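
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the Python below strips that whitespace, translates the DNA codon by codon, and computes GC content. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation codons; table 11's alternative start codons are not handled here, which is adequate for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listing.
first_line = "ATGAAGAAAC ACACCTTTAA GCTCTCACCG GCAGGGAAAT TGGCGGCGGC GGTAACCATT"
print(translate(first_line))  # MKKHTFKLSPAGKLAAAVTI
```

The printed residues match the first two blocks of the protein sequence listing. Passing the full gene sequence to `gc_percent` would give a figure to compare against the GC content field of the record.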