Gene Information |
Locus tag | BCZK0459 |
Symbol | |
ID | 3027175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 534407 |
End bp | 537745 |
Gene Length | 3339 bp |
Protein Length | 1112 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637544676 |
Product | internalin protein |
Protein accession | YP_082066 |
Protein GI | 52144762 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
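The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp, End bp.
length = gene_length_bp(534407, 537745)
protein = expected_protein_length_aa(length)
print(length, protein)  # 3339 1112
```

Both results agree with the Gene Length (3339 bp) and Protein Length (1112 aa) fields in the record.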
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.695574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGTGG CTGCGACATT ATCGTTGCCG TTTGCGGTTT ATGCTACACC TATCTTAGCT GCTACTGCTG CTACAGAGAA TATGGCTGTA CAAAGTCCAA AAAAGCATGT TTTTGATGCG GTAATAAAGG CTTATAAAGA TAACTCAGAT GAAGAGTCAT ATGCAACTGT ATATATAAAA GATCCAAAGC TGACGATTGA AAATGGGAAA AGAATAATTA CAGCAACATT AAAAGATAGT GATTTCTTTG ACTATCTGAA AGTCGAAGAT AGTAAAGAGC CAGGTGTCTT CCATGATGTA AAGGTGCTTT CAGAAGATAA AAGAAAACAT GGAACGAAAG TTATACAATT TGAAGTAGGT GAGTTAGGAA AAAGATATAA TATGCAAATG CATATTTTAA TTCCCACTTT AGGGTACGAT AAGGAATTCA AAATTCAGTT TGAAGTAAAT ATGCGCACAT TTGTAGAAAG CGATATAGAA GAGGATGAAG AAGAACAAAT TGAAGATACA CAAAATATCA TACGTGATAA ACGATTACAA CAAGCAATTA ATAAAAATGT ATTAAATAGA AAAGATGTAA ATGAACCTAT ATTTGAAGAA GATTTAAAAG AAATTAAAGA GCTAAATATA TATGCAGGTC AAGGAATTGA GAGTCTAAAA GGTTTAGAGT ATATGGAAAA TCTAGAAAGA ATAACAATAC AAGGATCTGA TGTACGAAAT ATAGCTCCTA TTTCACAACT AAAACGTTTA AAAGTAGTTG ATCTATCTTT TAATAAAATA GAAAATGTTG AGCCGCTTGT AAACTTAGAA AAACTGGATA TACTAGAGCT ACAAAATAAT AGAATTGCTG ACGTAACGCC ACTAAGTCAA CTTAAAAAGG TTAGGACAAT TAATTTATCA GGTAATAAAA TTAGTGATAT AAAGCCTTTA TATAATGTTT CTTCTTTAAG AAAGTTATAT GTAAGCAATA ATAAAATTAC TGATTTTACA GGCATTGAGC AATTGAATAA ATTAGGGACA TTAGGGGTAG GAAGTAACGG GCTTGTAAAT ATTGAACCGA TTAGTCAGAT GAGTGGCATT GTTGAACTTA ATCTTGAAAA AAATGATATT AAAGATATTA CATCATTATC TAAACTAACT GGCTTACAAT CACTTAACTT GGAAGAAAAC TATGTTTCGG ATGTATCATC ACTTAGTAAT TTGATTAATT TATATGAATT AAAACTTGCG ACAAATGAGA TTCGTGATAT AAGACCTATT CAAGAATTAG GAAAACGAAT TAAGATTGAT GCTCAAAGGC AAAAGGTCTT TTTAGATGAA GCCTATATGA ATGAAGAAGT GAAAATTCCT GTATATGATG TAAATGGGAC AGCACTTCAA AATATTGAGT GGAAGAGTGA AGGCGGAAGT ATTACGAACG GAGTAATAAA GTGGAATAGC CTTGGGGAAA AAATGTATGA ATTTAAGATG GATGCTGGCG AAAGTAAGAT AAGGTTCCAA GGGAGGGTAA TACAAAATAT TGTTGAAAAA CGAGAAGAGA GTTCGAACGT AATTCAAGAT ATGAAACTAA GACAATACAT GAATAAACAT AATTTTGAAC GGAAAAATGT AAATACCCCT ATAACGAAAG AAGATTTATT AACAGTTAAG GCTTTGAAAA TTACGGATGG GAAAAAAGAG GGGATAACAG ATTTTTCTGG ATTAGAATTC ATGACAAATA TGGAAGAATT GATATTACAA AATGCTAATA TGAAAAATGT GAAATTTATC TCAAGTTTGA GAAATTTGAA GTCAGTAGAT 
TTATCCTATA ATCAAATTGA AGATATTAAA CCGCTTCATT CATTAGAGAA TCTTGAAAAA TTAAATATTA GCAATAACGG TATAAAAAAT GTTCCAGAAC TATTTAAGAT GCAGACATTA AAAACTCTAG ACCTATCAAA TAATAAACTT GATAATGCTG CTTTGGATGG AATTTATCAA TTGGAAAATC TAGATGCATT GTTAGTAAAT AATAATGAAA TCAATAATTT AGATGAGATT GGCAAAGTTA GCAAATTGAA TAAGCTAGAA ATGATGGGCA ATAAAGTACG AGATATTTCT CCATTAGCTA ACTTGAAAAA CTTACAGTGG TTAAATTTAG CCAATAATAA GATTCAAGAT ATCTCTAGTT TATCCTCTAT ACTTGATTTA CTTAGTTTGA AATTAGCTGG AAACGAGATT CGTGATGTAA GACCAATTAT TCAATTGGCT CAATGGATAA CAGTTGATAT TAAAAACCAA AAAATTGTTT TAGAAGATGG ACAAATGAAT CAAGAAATCC AAATTCCTAT CTATGATTTA GAGGGAGAAA TCTTTGAAGA TATTGAACTG AAGAGTGAAG CCGGTATCGT TACCGATAGA GGAACAGTCG TATGGAAAAC TCCAGGAGAA AAAAATTATG TATTCTCCTT AAATGGTAAT TATCACGGTC TATCTCTATT ATTCAGTGGT ACAGTTATGC AAAATATAGT AGCGAAAGAA GAACCAAAAG AACCAGTGGA AGAAGTTGAA GGTTCGAAAG AAGAACCAAT AAAAGAAGCT GAAGGATCAA AAGAAGAGCC AAAAGGGCCA GCAAAAGAAG TTGAAGGTCC GAAAGAAGAA GTGAAAGAAC CGGCAAAAGA AGTTGAAGGT CCGAAAGAAG AAGTGAAAGA ACCGGCAAAA GAAGTTGAAG GCCTGAAAGA AGAAGTAAAA GAACCGGCAA AAGAAGTTGA AGGCCCGAAA GAAGAAGTAA AAGAACCGGC AAAAGAAGTT GAAGGCCCGA AAGAAGAAGT AAAAGAACCG GCAAAAGAAG TTGAAGGCCC GAAAGAAGAA GTAAAAGAAC CAGCAAAAGA AGTTGAAGGT CCGAAAGAAG AAGTGAGAGA ACCAACAAAA GAAGTTGAAG GTCCGAAAGA AGAAGTGAAA GAACCAATGA AAGAAGTTGA AGGATCGAAA GAAGAAGTGA AAGAACCAAC GAAAGAAGCT GAAGGATCGA AAGAAGAAGT GAAAGAGCCA ACAACAGAAG TTGAAGGATC GAAAGAAGTA AAAGAACCAG GAAAAGAAGT TGAAGGTTCA AAAGATGCAA TAAATCAATC AGCAGTAGCT CAAGAAACAA ACGTGAACAA TCAAGTTGGG AAAGAAAAAG TAGTAGAGAA TCAAAACATG AAAGAAAATA AACCAGCTGT TACTAAGCAA GAAGAAAGTA AGAAATCACT AGGAGCAACA GGTGGACAAG AGAATACATC AACATTACTT TCAGGCTTAG CACTAGTTCT TTCAGCATTG AGTATGTTTG TATTTAGAAA GAGATTATTT AAGAAATAA
|
Protein sequence | MLVAATLSLP FAVYATPILA ATAATENMAV QSPKKHVFDA VIKAYKDNSD EESYATVYIK DPKLTIENGK RIITATLKDS DFFDYLKVED SKEPGVFHDV KVLSEDKRKH GTKVIQFEVG ELGKRYNMQM HILIPTLGYD KEFKIQFEVN MRTFVESDIE EDEEEQIEDT QNIIRDKRLQ QAINKNVLNR KDVNEPIFEE DLKEIKELNI YAGQGIESLK GLEYMENLER ITIQGSDVRN IAPISQLKRL KVVDLSFNKI ENVEPLVNLE KLDILELQNN RIADVTPLSQ LKKVRTINLS GNKISDIKPL YNVSSLRKLY VSNNKITDFT GIEQLNKLGT LGVGSNGLVN IEPISQMSGI VELNLEKNDI KDITSLSKLT GLQSLNLEEN YVSDVSSLSN LINLYELKLA TNEIRDIRPI QELGKRIKID AQRQKVFLDE AYMNEEVKIP VYDVNGTALQ NIEWKSEGGS ITNGVIKWNS LGEKMYEFKM DAGESKIRFQ GRVIQNIVEK REESSNVIQD MKLRQYMNKH NFERKNVNTP ITKEDLLTVK ALKITDGKKE GITDFSGLEF MTNMEELILQ NANMKNVKFI SSLRNLKSVD LSYNQIEDIK PLHSLENLEK LNISNNGIKN VPELFKMQTL KTLDLSNNKL DNAALDGIYQ LENLDALLVN NNEINNLDEI GKVSKLNKLE MMGNKVRDIS PLANLKNLQW LNLANNKIQD ISSLSSILDL LSLKLAGNEI RDVRPIIQLA QWITVDIKNQ KIVLEDGQMN QEIQIPIYDL EGEIFEDIEL KSEAGIVTDR GTVVWKTPGE KNYVFSLNGN YHGLSLLFSG TVMQNIVAKE EPKEPVEEVE GSKEEPIKEA EGSKEEPKGP AKEVEGPKEE VKEPAKEVEG PKEEVKEPAK EVEGLKEEVK EPAKEVEGPK EEVKEPAKEV EGPKEEVKEP AKEVEGPKEE VKEPAKEVEG PKEEVREPTK EVEGPKEEVK EPMKEVEGSK EEVKEPTKEA EGSKEEVKEP TTEVEGSKEV KEPGKEVEGS KDAINQSAVA QETNVNNQVG KEKVVENQNM KENKPAVTKQ EESKKSLGAT GGQENTSTLL SGLALVLSAL SMFVFRKRLF KK
|
| |
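The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used by the database: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it strips the spaces that group the sequence into blocks of ten. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the sequence starts in frame. Alternative start codons
    (such as GTG or TTG) would need to be read as M; that case is not
    handled here because this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGTTAGTGG CTGCGACATT ATCGTTGCCG"))  # MLVAATLSLP
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and comparing against `clean` of the protein sequence would extend the same check to the whole record.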