Gene Information |
Locus tag | YpsIP31758_0299 |
Symbol | shlA |
ID | 5387666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | - |
Start bp | 341552 |
End bp | 346408 |
Gene Length | 4857 bp |
Protein Length | 1618 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640863269 |
Product | hemolysin |
Protein accession | YP_001399293 |
Protein GI | 153949445 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
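The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length_bp` and `protein_length_aa` are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the stop codon, which is consistent with the values in this record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(341552, 346408)
print(length)                     # 4857
print(protein_length_aa(length))  # 1618
```

Both results agree with the Gene Length and Protein Length rows of the record.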
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.420803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAC ACACCTTTAA GCTCTCACCG GCAGGGAAAT TGGCGGCGGC GGTAACCATT ATTTCGGTTT CTGTTGCCAC CTGTTATGCC GCCGGGATTG TCGGAGCGGG CGATTCTGCA CATAAACCTG ATGTTAGCTC GGTAAATGGC ACATCAGTAA TTAATATCGT GCAACCGTCA GCCTCCGGCT TATCACATAA CCAATTTCAG GATTTTAATG TTGGCGAAAA GGGGGCGGTA CTGAATAACG CCACCAGTGC AGGTAATTCA ATACTTGCCG GGCAGTTAGC CGCTAACCAA AATTTAAATG GTCAAGCGGC TAGCATTATT CTTAATGAAG TCATCAGCCG TAATCCTTCT CTATTATTGG GCCAACAAGA AATTTTTGGT ATGACGGCGG ATTATATTCT GGCCAACCCA AATGGCATTA CCTGTAATGG CTGTGGTTTT ATGAATACCA ACCGCGAGTC ATTGGTCGTC GGTAATCCCT TGATTGAACA AGGATCGCTG AAGGGTTTTG AAACTTTCAA TAACCATAAT TGGTTGAAAA TTCAGGATAA GGGGCTGACC GCCAATAAAA TATTAGATCT CATTGCACCC AAGATTGAAA TCACAGGCGC AGTCACCTCA ACTGAAGCCA TCAACGCCTT ATCAGGCAGT AACCGGGTTT CCCTTGATGG CACAATATTG CTATCCCGCC CAATACCATT TTTAGGTAAA GACCATATTG ATGGTAAATA TCTTGGCAGT ATGCAATCGG GCCGGATTAA TTTAGTCAGT ACCCTTGCAG GCAGTGGGGT AAAAATTGCT GGTAGTCTAA ACGCTAGCGA AGCAATAAAC GCCACGGTTA AAGGAAAGCT GCAATTAGAA GCCGCCAAAT TAGATGGCAA TGACATTAAC ATCAAAGCCA ATAGCATCCA GGCATTCGGT AATCTTCATA AAAATGAAGA CAATGGGGAT GTTACGCAGA GTCTGGAACG CACTCAGTTG AAAGGCAAAA ATATTGCTAT TGTCGCGAAT AAAAAGAATC AGCTTTCCGC AGTAAAAATC GACGCTGACA ATGTGACGCT GAGAGGAGGT GAATTGGTAT TGGATGAGAG AATCCTCACC AATACAGAAA AGGAATCTTC ATCAGACGGT GGAAGTGGGC TGTGGGGCCT TGGTAAGTGG AATTGGTCTG AAAATAAAGA GGAGAAACAA CAAACCAGTA TTGGCACCAC CATTACAGCG AAAAATAATG CCTCTCTTGA ATCAACTCAA GATGATATTA AACTGTCGGC TGCCACAATA ACCGCGGGTA AAAACCTTGC CATCAAAGCC AAAAAAGATT TACACATAGA CGGTGCTATT GAAGAGAATT CAATTCACGA CCACGGCCAT AATTATAAAC ATATGGTCAA AGATGAATCT TGGAATAATA AAACCACCAA ACAAACACTC AATAAAACCA CACTTGAGGC CGGTAAGAAT TTGGGCCTCA CCGCAGAAAA TAAAATCACG ACTCAGGGAA TCAAAGCTTC TGCGGGTGGT GACGTCGTAA TTGATGCCAA CGACGTTAAA ATTGGGGTAC AAAAAACCAG TAATCAGGAG ACAACCGACG GTAAACATGA AAGAAACTTG GGACTCGGTG GTGTCGATCA CAATAATAAC GATAAGTATG CGGAAACCAG CCATAGCTCA GAAATAACCG CTGACGGTAA TATACTCATC AGCGTGAAAG ATGACGTCGC CATTACTGGC AGTAAAGTAA AAGCAACTAA AGATGGTTTT GTTCAAGCTA AAGAGGGTGG GATCAAAATC 
GATAACGCCA TCAGTACCAC AACCAGTAAA GTTGATGAGC GTACGGGGGT AGCATTTGAT ATTACCGGCA GTTCTAAAAA AGCCAATAAC AGTGAAGAAA AATCTACCGG TAGTGAAGTT ATCTCTGAAG CAAACCTGAA AATTATCAGT AAAAAGGATG TCGATGTAAT TGGCAGCCTG GTAAAAAGCG CCGGTGAATT AGGCATTGAG ACCTTAGGCG ATATCAATGT CGCAGCAGCT CAAGAGCAAG AAAAAATCGA TGAACAAAAA ACCCAGCTGA CGATTGATGG CTTTACCAGT GATGACGGTA AAAACCAGTA TCAGGCGGGG CTAAAACTGG AACACACCAG CGAGAGTGAA AAAACGGAGA AGGTGACAAA TCACGGCTCT ACCCTTGAGG GCGGCACCGT CAAACTGGAG GCAGATAAAG ATGTGACATT CACTGGGTCA CAACTCAATA CCACCGGGGG AGATGCGGAT ATCACTGCTG AAAACGTCTC TTTTGTCGCT GCACAAGACA CCACCACCAG CAATAAAGAA AAAGAGACCG TGGGTGTGAA TGCTCACTAT ACCGGTGGGA TGGATAAAGC GGGTAGCGGT GCGGGGGTTA ATTATGAAGA GACGAAAACG GACAGCGAGA AATCAACTGC GGTGGTTTCC CAGACCGATA TTAAAGGTAA TCTGAATATT AACGCCGAGC AGGATATTAC CAACCAAGGT ACTGATCATA AAGTTGATGG CTCTTATAAC GCAGATGCCA CTAACGTTCA CAATCTGGCC GCTGAAAATA CCGAAGAAAC CACGACTAAC AGCACAACGG TTAAGGTTGG CGTCGGTGCC AATGTGGCTT ATGACGGTAT CACCCGCCCT GTTGAACAAG TGATTGAAAA AGGTAAAAAG CTGGATGTCG GGGGCGTTAT TGAAAATGCA GGAGAGGTCT CGCCTGACTC AGCCAATGTC GGAATAGACC TCTCTGCTAA AGTCGATTTA AAGGAGAGCA CACTCAGTAG CTCTCAATCC GTAGTAACGT CAATAAAAAG CGGCGATACC ACGATCAATG CGTCTGGTGA CATCGAAGAT CAAGGAACGA AATACGAGGC CGATAAAGGC GCGATAAATC TACATGCCAC CAACCACACT TTTGAAGCTG CCGTTGACCG AGTTGAAAAG CATGAGGAAG AAGTCACCGC TGGCGTAGAT GTGCGAGTTT ATACCAACAC CGGTAAGGAT ATTACGGTTG ATGGTAAAGG TAAGGGAGCC AATAACAAGC AGGATATCAA AGGAGAAATC TCCCAGGTTG GCAGCATGGT CGCAGCAAAT GGCATCAATA TCACGGTAAA AGAAGATGCA ACTTATACAG GGACTGATCT CGATGCTGGA GACGGAAAGA TTGCAATTAC GGCGGGCAAA GACATTCACT TTGAACAGGC AACCAACCAC ACCAGTGAAA GCCATAATAA TATTGAGGCC AATGCCAAAG CAAACTTCGG CACTAAGGCA AACAGTAAAG AGTTTGGCGG CGGCTTGGGT GGCGGTCATA GTCAAGGCAG CGCCAGTACC GATATTGCAC AAGTCAGCCA TCTACAAGGT AAACAAGGCA TCGAACTGAA TGCTGGAAAT GATTTAACAC TGCAAGGTAC CGAGTTTGGC ACTAAAGACA CGTCTACCGG TGATGTTCAA CTCAACGCGG GTGGAAAAGT AGACTTCCAA GCGGCACAGT CACAAAGTTC TAAACAAGAT ATGACCTGGT CTGCCAGCGT CAAGGCAGGA AAAAACAAAA GTAACAGCAC CAGTGAAAAT GATGATGGCA 
CAAAAACGCA TACCAACAAC AAAGGGTTTA GCGCAGGAGC AGAAGCCAAA GTTGCCAATA CGGACGAATC GAACACTCGC CATCAGGGCG GTGCAATCAA CAGTAATGGC GTTGTAACCA TCAAAGCGGG TAGCGATGAT AAGCAAGCCA TCCATCTGCA AAATACCAAC ATCACCAGTC AAGAGACCAT TCTGGACGCG GATAATGGCG GGATCGTAAT GGAGTCGGCT CAGGATAAAG AGCACAAAAA TAACTGGAAT GTTAACGCCA ACCTTAATGG CAGCCAGAAA AACACCATTA AAAGCGATGA TCAAGGTGTT GTAGATAAAG ATAGCGCCAA GAAAATTCAT AATGCGGGCA TCAAACTCGG TGGCGGTGTT GATAAACTGG ACTCAGTAAC GCAACAAAAT ACCCATATCA ACGGTGATAA GGTCACCCTG AACAGCGGTA AAAATACCGA ATTGGCGGGT GCGGTGATCC AGGCTAACCA GATTAATGGG CAAATTGGTG GGGATCTAAA CGTCATTAGT CGCGAAAATA GACTCAATAA AGTCAATGTT TCTGCGGCAC TTGGTGCCAG CCACAGTAAT GCAAAACAAG ATAGCTTGAT CAGCCAGGTA TCTGATATCA GCCCTATCGG TTCAGATAAA ATCAAAAATA AACTGGAAGA AATGTCCACC AAGATCTTCG ACAAAGTCGA AAATAAATTC AACACCTTAG GAAAATCAAA GGACGATAGC GTCCAGACCA CCAGCTATAC CAAAGACGGT AAAACGGTGA AGGTCAGCGA ATCCGATGAG AAGAAGGAAA CCAAAGATAA ATGGTGGCAG AAAGGGGCTA AATCTGTCGG TAATAAAATC AAAAGCGCTG TACAGGATGA ACAAGTAGAA GGTGGTAGTG GCAGCGTCAA AGTGAACGTA GAGGTTGTCG AGAGCCAAGG GGTTGAGGAG CAATCTGCTA TCCGGGGTAC ACAGAATGTG GATCTGACCG TTAAGGGTAA AACTGAACTG GTTGGCGGAA AGATATCCAG TAAGAATTCG GATGTTAACC TAAAAACTAA TGGGTTAGAT ACGCAGGATA TCAACGGGAA ACATACTGAA GGAGGCGCAC GGTTGAACGC TTCTTCATCG GTTATGGGTA TGATTAGCGA TGGCGCTAAG GATGTAATGG ATGGCAAGGC ACCACTGGTC AGCGGGCATG GCAAATCTGA ACAGAAAAAT GCAACGGGTG GTGTCACCAG AGAATAA
|
Protein sequence | MKKHTFKLSP AGKLAAAVTI ISVSVATCYA AGIVGAGDSA HKPDVSSVNG TSVINIVQPS ASGLSHNQFQ DFNVGEKGAV LNNATSAGNS ILAGQLAANQ NLNGQAASII LNEVISRNPS LLLGQQEIFG MTADYILANP NGITCNGCGF MNTNRESLVV GNPLIEQGSL KGFETFNNHN WLKIQDKGLT ANKILDLIAP KIEITGAVTS TEAINALSGS NRVSLDGTIL LSRPIPFLGK DHIDGKYLGS MQSGRINLVS TLAGSGVKIA GSLNASEAIN ATVKGKLQLE AAKLDGNDIN IKANSIQAFG NLHKNEDNGD VTQSLERTQL KGKNIAIVAN KKNQLSAVKI DADNVTLRGG ELVLDERILT NTEKESSSDG GSGLWGLGKW NWSENKEEKQ QTSIGTTITA KNNASLESTQ DDIKLSAATI TAGKNLAIKA KKDLHIDGAI EENSIHDHGH NYKHMVKDES WNNKTTKQTL NKTTLEAGKN LGLTAENKIT TQGIKASAGG DVVIDANDVK IGVQKTSNQE TTDGKHERNL GLGGVDHNNN DKYAETSHSS EITADGNILI SVKDDVAITG SKVKATKDGF VQAKEGGIKI DNAISTTTSK VDERTGVAFD ITGSSKKANN SEEKSTGSEV ISEANLKIIS KKDVDVIGSL VKSAGELGIE TLGDINVAAA QEQEKIDEQK TQLTIDGFTS DDGKNQYQAG LKLEHTSESE KTEKVTNHGS TLEGGTVKLE ADKDVTFTGS QLNTTGGDAD ITAENVSFVA AQDTTTSNKE KETVGVNAHY TGGMDKAGSG AGVNYEETKT DSEKSTAVVS QTDIKGNLNI NAEQDITNQG TDHKVDGSYN ADATNVHNLA AENTEETTTN STTVKVGVGA NVAYDGITRP VEQVIEKGKK LDVGGVIENA GEVSPDSANV GIDLSAKVDL KESTLSSSQS VVTSIKSGDT TINASGDIED QGTKYEADKG AINLHATNHT FEAAVDRVEK HEEEVTAGVD VRVYTNTGKD ITVDGKGKGA NNKQDIKGEI SQVGSMVAAN GINITVKEDA TYTGTDLDAG DGKIAITAGK DIHFEQATNH TSESHNNIEA NAKANFGTKA NSKEFGGGLG GGHSQGSAST DIAQVSHLQG KQGIELNAGN DLTLQGTEFG TKDTSTGDVQ LNAGGKVDFQ AAQSQSSKQD MTWSASVKAG KNKSNSTSEN DDGTKTHTNN KGFSAGAEAK VANTDESNTR HQGGAINSNG VVTIKAGSDD KQAIHLQNTN ITSQETILDA DNGGIVMESA QDKEHKNNWN VNANLNGSQK NTIKSDDQGV VDKDSAKKIH NAGIKLGGGV DKLDSVTQQN THINGDKVTL NSGKNTELAG AVIQANQING QIGGDLNVIS RENRLNKVNV SAALGASHSN AKQDSLISQV SDISPIGSDK IKNKLEEMST KIFDKVENKF NTLGKSKDDS VQTTSYTKDG KTVKVSESDE KKETKDKWWQ KGAKSVGNKI KSAVQDEQVE GGSGSVKVNV EVVESQGVEE QSAIRGTQNV DLTVKGKTEL VGGKISSKNS DVNLKTNGLD TQDINGKHTE GGARLNASSS VMGMISDGAK DVMDGKAPLV SGHGKSEQKN ATGGVTRE
|
| |
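The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how they might be cleaned, checked for GC content, and translated is given below. The function names are hypothetical. The codon-to-amino-acid assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). Although the record lists the strand as "-", the gene sequence as displayed begins with ATG and ends with TAA, so the sketch assumes it is already in coding orientation and does not reverse-complement it.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = clean("ATGAAGAAAC ACACCTTTAA GCTCTCACCG")
print(translate(prefix))  # MKKHTFKLSP
```

The translated prefix matches the first block of the protein sequence. Applying the same functions to the full gene sequence would allow the 45% GC content and the 1618-residue protein to be checked against the record.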