Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH187_A0682 |
Symbol | |
ID | 7074929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH187 |
Kingdom | Bacteria |
Replicon accession | NC_011658 |
Strand | + |
Start bp | 624887 |
End bp | 627868 |
Gene Length | 2982 bp |
Protein Length | 993 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643449175 |
Product | internalin protein |
Protein accession | YP_002336685 |
Protein GI | 217958141 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000101433 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAGAA ATAAAAGAAA ACATATAAAT GCAATGATTA TAGCGGCGAC GTTATCACTT CCGTTTGCTG TTTATTCAAC ACCTGCTTTA GCGGCAGTAG CAATTGAGGC GAATAAAACG GGACAAGGTT TAGAAGATGG TACATATGAC GCTGTTATTA AAGCGTATAA AGATAAAACA AATGAAGAGT CTATGGCAGC TGTTTATATA AAAAATCCGA AATTAACAGT TGAGAACGGA AAGAAAATTG TAACAGCAAC GTTAAGTGAT AGTGATTTCT TTCAATACTT GAAAACAGAG GATATACATA CTCCAGGTGT GTTTCATGAT GTGAAAGTAA TATCAGAAGA TAAAAAGAAA AATGGAACGA AAGTGATTCA GTTTGAAGTA GGGGAATTAG GGAAGAGATA TAATATGCAG ATGCATATTT ATATTCCGAC AATGGCTTAT GACAATAAGT ATCAAGTACA GTTTGAAGTA AACACATTAA ATTTAGAAAA TAATGTTCCA GAAGAACAAA AGGAAAATGA AGAGGATAAA TTGGATCAAC AAGATAAAAA CGGAAATGTA GTATTAGATA AGCAATTACA AAAGCATATT AATAAATATA ACTTGAATAG AGAAAATTTA GATACCCCAA TAACTAAGGA AGATTTATTA AAAGTTAAAT CTTTAATAGT CGTTGAAGCC AAAAGTAAAG GAATAAAAGA CGTAAGCGGT CTAGAATATA TGAAGAACTT AGAAAACTTA ACGTTGGAAG AAGTTAAGTT AGAAAATATA AAATTTATCT CGAATTTGAG GCAATTGAAA TCATTAAGTA TAACCTATGG CGAACTTGAA GATATTGGAC CTTTGGCTGA GTTAGAACAT ATTGAGAGTT TAAGCTTGAG AAATAATAAA ATTTCAGATT TAAGCCCACT AAGTCAAATG AAGAAGATTA AATTGCTAGA TTTAAATAGT AATTATATAA AAGATATTAA ACCATTATTT ACAGCGAAAT CTTTAAGGAC TTTGACTGTA GCAAATAATC AAATTAGTAA TGCTAATCTT GCTGGGATTG AGCAACTGAA GAATGTGAAG AGTTTATCTT TAAGTAACAA TGGACTTACT AATATTGAAC ATATTACACC AATGAAAAAA TTAGTAGAGT TAGACCTTTC TAAAAATGAA TTAGAAAACA TCGAACCTTT ATCAAGAATG TCTACTGTAC AATCACTTAA TTTAGAAGAA AACTATATTT CAGATATAAC ACCACTTAGT CAATTAACAG GTTTATATGA TTTAAAGCTA GCGTCAAATG AAATTCGTGA TGTTAGACCG GTTCAAGAGT TAGGAAAAAG AATGTACATT GACGTTCAAA GACAAAAAAT CTTTTTAGAT GATGTAGAAA AAGATAAGGA AGTTAAAATA CCTATCTATA ATTTACAAGG AGAGCCACTC GATACTATTC AATTAAAGAA TGGAGATGGA ATAGTTAATA ATGGTTCTGT TAAATGGAGT ACTACCGGTG AAAAAACATA CGAATTTATA TTAGATATAA AGCCAGAAGA AAATCGTATT AAGTTTAATG GAATAGTAAT TCAAAATGTT GTTGAAAGGT TAGATGAAAT AAAAGAGGAT AATGAACAAA AGGAAAATGT AATTCTCGAT AAAACTTTAC AACAACATAT TAATAAAGAG AATTTAGGTA GAGAGAATTT AAACACTCCC ATCACCAAAG AAGATTTATT ACAGATTAAA AAATTAGAGA TCCTTAAAGA AAAAGGAAAT GAGATAAAAG ATATAACAGG TTTAGAGTAC ATGACGAACT TAGAAAATCT CACTTTAGAA GGAGTAGGTC TGAAAAATAT TGAGTTCATC TCAAACTTGA AACAATTGAA TAATGTGAAT GTATCTCATA ATCAAATTGA AGATATAACA CCACTATCTT CATTGGAAAA TTTACAGTGG TTAAATCTTG AAGACAATCA TATTAAAGAT GTAACGGTTA TTGGTTCCAT GCTAAACCTA TTTAGCTTAA ATCTAGCTGG GAATGAGATT CGTGATGTAA GGCCGTTAAT ACAATTAGGC CAGTGGGGAA CAATTGATGT TAGAAGGCAA AAGGTCATTT TGGATGATGC AGAAATAAAT AAAGAAGTGA AGATACCTGT ATATGATTTA GAAGGGGAAC GAATTGAAAA GATTACGTTA AAGAGTGCAG GTGGAATGCT TACTGATGAG GGAATCATTT GGAGTACTCT AGGAGAAAAA ATATATGAAT TTGACTTGGA TGCAGATCAT TATGAGACGG GCATATTATA TAGTGGCATC GTCATGCAGA ATATAGTAGA AAAATTAATA CCGAAAGAAG AAGTAAAAGA GCCGGAAAAA GAAGTTGAAG AAACAAAAGA AGAAGTGAAA GAAACGATAA AAGAAGTTGA AGAAGAGCAA GAAGAAGTAA AAGAGCCGGA AAAAGAAGTT GAAGAAACAA AAGAAGAAGT GAAAGAAACG ATAAAAGAAG TTGAAGAAGA GCAAGAAGAA GTAAAAGAGC CGGAAAAAGA AGTTGAAGAA ACAAAAGAAG AAGTGAAAGA AACGATAAAA GAAGTTGAAG AAGAGCAAGA GGAAGTAAAA GAGTCAATAA AAGAAGTTGA AGAAGAGCAA GAGGAAGTAA AAGAGCCAAT AAAAGAAGTT GAAGAGGTAA AAGAAGAAGT GGACGAGCCA ACAACAGGAG TTGAAGAGGC GAAAGCTGAG ATAAAAGGAA CAGGAAAAGA AATTGAAGGT TCAAAAGACG CAGTAAATCA ATCCACAGTA GTCCAAGAAC AAAACGTGAA TAATCAAGTT GTGAAAGAAA ATAAACCAGT TGTTAATAAG CAAGAAGAAA GTAAGAAATC ATTAGGAGCA ACAGGTGGAC AAGAGAATAC ATCAACATTA CTTTCAGGCA TAGCGTTAGT TCTTTCAGCG ATGAGTATGT TTGTATTTAG AAAGAGATTA TTTAAGAAAT AA
|
Protein sequence | MKRNKRKHIN AMIIAATLSL PFAVYSTPAL AAVAIEANKT GQGLEDGTYD AVIKAYKDKT NEESMAAVYI KNPKLTVENG KKIVTATLSD SDFFQYLKTE DIHTPGVFHD VKVISEDKKK NGTKVIQFEV GELGKRYNMQ MHIYIPTMAY DNKYQVQFEV NTLNLENNVP EEQKENEEDK LDQQDKNGNV VLDKQLQKHI NKYNLNRENL DTPITKEDLL KVKSLIVVEA KSKGIKDVSG LEYMKNLENL TLEEVKLENI KFISNLRQLK SLSITYGELE DIGPLAELEH IESLSLRNNK ISDLSPLSQM KKIKLLDLNS NYIKDIKPLF TAKSLRTLTV ANNQISNANL AGIEQLKNVK SLSLSNNGLT NIEHITPMKK LVELDLSKNE LENIEPLSRM STVQSLNLEE NYISDITPLS QLTGLYDLKL ASNEIRDVRP VQELGKRMYI DVQRQKIFLD DVEKDKEVKI PIYNLQGEPL DTIQLKNGDG IVNNGSVKWS TTGEKTYEFI LDIKPEENRI KFNGIVIQNV VERLDEIKED NEQKENVILD KTLQQHINKE NLGRENLNTP ITKEDLLQIK KLEILKEKGN EIKDITGLEY MTNLENLTLE GVGLKNIEFI SNLKQLNNVN VSHNQIEDIT PLSSLENLQW LNLEDNHIKD VTVIGSMLNL FSLNLAGNEI RDVRPLIQLG QWGTIDVRRQ KVILDDAEIN KEVKIPVYDL EGERIEKITL KSAGGMLTDE GIIWSTLGEK IYEFDLDADH YETGILYSGI VMQNIVEKLI PKEEVKEPEK EVEETKEEVK ETIKEVEEEQ EEVKEPEKEV EETKEEVKET IKEVEEEQEE VKEPEKEVEE TKEEVKETIK EVEEEQEEVK ESIKEVEEEQ EEVKEPIKEV EEVKEEVDEP TTGVEEAKAE IKGTGKEIEG SKDAVNQSTV VQEQNVNNQV VKENKPVVNK QEESKKSLGA TGGQENTSTL LSGIALVLSA MSMFVFRKRL FKK
|
| |