Gene Information |
Locus tag | BCE_0607 |
Symbol | |
ID | 2751406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | + |
Start bp | 624479 |
End bp | 627745 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637277414 |
Product | internalin, putative |
Protein accession | NP_976934 |
Protein GI | 42779687 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
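
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 624479
end_bp = 627745
protein_length_aa = 1088

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons. The final codon is assumed
# to be the stop codon, which adds no amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)            # 3267
print(remainder)                 # 0
print(expected_protein_length)   # 1088
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the computed values agree with the listed Gene Length (3267 bp) and Protein Length (1088 aa).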
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAATA ATAAAAGAAA ACATATAAAT GCAATGTTAA TAGCGGCGAC GTTGTCGTTG CCATTTGCTA CATATTCTAC GCCGGCGTTA GCAGCTGTAG TAAGTGAAGT AAATAAAGCT GGGCATATTT TAAAGGATGG AACGTATGAT GTTGTTTTAA AAGCATATAA TGAGAAAACA AATGAAGAAT CTCGAGCTAC AACGTATATA AAAGAACCGA AAGTAACAAT TGAGAACGGT AAAAAAATTG TAACAGCTAC GCTAAATGAT AGTGATTTTT TCCAATACCT TAAGGTAGAA GATAGTCAAA ATCCAGGTAC TTTACATGAT GTAAAAGTAC TTTCAGAAGA TAAGAGAAAA AATGGGACAA AAGTGATTCA GTTTGAAATT GGAGAACTAG GAAAAAGATA TAAAATGCAA ATGCATATTT TCATTCCATC TATGGGATAT GATGAAAAGT ATCAAGTGCA GTTTGAAGTA AATACAGTTC ACTCAGAAAA TAATACTATA GAAGAATCAA AAGAGAAAAA AGAAGAGCAA CAACAAGTAA AAAACATAAT ATCTGATAAT AAATTACAAC AATATATTAA TAAAAGTGTT TTACAACGAG CAGATATAAA TGCACCTATT ACTGAGGAAG ATGCAGCACA AATTAAAGAG TTAAAGGTGT ACTTAGGAAA GGGTATTGAG AGTTTAGAAG GTCTGCAGTA TATGGAAAAT CTAGAAGCAT TTGAATTACA TGAGTCTAAT GTCAAAGATA TATCACCAAT ATCAAGTCTA AAAAAATTAA AAACAATGAA GTTATATTTG AATCCAATCG AAAATATTGC ACCTATTTCT CAACTGGAAA AACTACAATT CTTAACTTTA CGTGATAACA AAATTAGTGA TTTAACACCA TTAAGCCAGT TGAAAAAAGT AAAGGTGTTA GATTTAATTG GAAATGAAAT CACGGATATT AAGCCGCTAT TTTCAATGGA CTCTGTAACT AAATTATATT TAAGTAATAA TAAAATTAGT GATCTGACAG ATATTGAGAA ATTAGATGAT TTACGCTTGT TATGGATAGG AAATAATTAT ATTGATAATC TGACAGAAAT TGGTAAATTG AAGAATCTTG TTGAACTAGA AGTTGCAAAT GCTGAAATCA GAGATTTAAC ACCATTAGCA AAAATGCAAC AGTTACAATC ACTTGATTTA GAGCAAAATT ATATTTCTGA TATTTCGCCA ATTAGTAAAT TAAATAATTT ATATGCTTTG AATTTAATAG CAAATGAAAT TCGTGACATT AGACCAGTGA AAGAATTAGG GAAAAGGGTT CCTATTAAAC TTCAGCGACA AAAAATCTTT TTAAGTGATG GAGTAGTAAA TGAAGATATA AAAATTCCTA TATACGATTC AAATGGTGAA ATAGTTCAAA ATATTAAATG GCAGGGCGAG GAAGGGACGC TTAATAACGG ATCTGTTAAA TGGAATAGTA CAGGAGAAAA AGTATATGAA TTTAAATTAG AAACAGATTC TGCTGAAAGT CAAATACTAT TTAATGGAAC AGTATACCAA AACATCGTTG AAAAACATGA AGATATAAAT ATTATTCAAG ATAAAAATTT ACAAAAATTC ATTAATAAAA ATGGTTTAGG AAGAGCGAAC TTAGAATCAT CTATAACAAA AGAAGATTTA TTACAAATTA AATCATTAAA AATAGTTGAT GGGAAAAATC AAGGCATTAC TGATATTTCT GGTCTAGAAT ATATGACAAA CATAGAAGAG TTGGTTCTAG ATAATATTGA GCTGAAAAAT 
GTAGATTTTA TCTCGAATTT GAGAAGCTTG AAGGCTGTGA AATTAACTTC GAATCAACTT GAAAATATTG AACCACTTTC GAAATTAGAT AAGCTTGAAA AAATAGATAT AAGTGACAAT AATGTGAAGA ACATTAGACC ATTGTTCACA TTAAATGCAA TGAAAAATTT AAATGTATCT AATAATAAGC TTAATGATGC ATCACTTCAA GAAATTCAAC AGTTGAAGAA TTTAGAAGTA TTAAAGTTAA ATCATAATGA AATTAGTAAT GTGGAAGCTA TTAGTGAAAT AAGTATGTTG AACGAACTTG AATTGGTAGG AAATAAAGTA GTAGATATAA CGCCATTAAG TAAATTGAAA AATTTACAGT GGTTAGATTT ATCCGATAAT AAAATTCAAG ATATTTCTAT TTTTGCTTCG ATGCTAGATT TAATAAGCTT AAAGTTACCT GGTAATGAGA TTCGTGACAT TAGACCGATT ATACAATTAT CTCAGTGGAG TACAATAGAT ATTAGAAGGC AGAAAATTAC TTTAGATGAT GTGCAAATGA ATCAAGCTGT GAAGATCCCT GTTCATGATG TAGAAGGAGT ACCGCTTGAG GACATTACAC TGAAAAGTGA AGGCGGAATT ATTAATGAAG AAGGTACAAT CACTTGGAGT ACGCCAGGGG AGAAAGTATA TGAGTTTACG TTTGATGGAA ATAATTATTT TGGATTAGGT ATTTGGTTTA GTGGAGAAGT AATACAAAAC GTTGTAAATA AAACTGAATC AAAAGAAGAA ACACCTAAAC CAGTAGTAGA AGAAAAGCCA AAAGAGGAAA CGACTAAACC AGTGGTAGAA GAAAAACCAA AAGAGGAAAC ATCTAAACCA GTAGTAGAAG AAAAACCAAA AGAGGAAACA TCTAAACCAG TAGTAGAAGA AAAACCAAAA GAGGAAACGA CTAAACCAGT GATGGAAGAG AAACCAAAAG AAGAAACGAC TAAACCAGTG GTAGAAGAAA AACCAAAAGA GGAAACGACT AAACCAGTGG TAGAAGAAAA ACCAAAAGAG GAAACATCTA AACCAGTGGT AGAAGAAAAA CCAAAAGAGG AAACGACTAA ACCAGCGGTG GAAGAGAAAC CAAAAGAAGA AACAAGTAAA CCAGTGGTAG AAGAAAAACC AAAAGAGGAA ACGACTAAAC CAGCGGTGGA AGAGAAGTCG AAAGAAGAAA CACCTAAACT AGTTATGGAA GAGAAATCAA AAGAAGGAAC AAGTAAACCA GTAGTAGAAG AGCGACGAAA AGAAGGCAAC AAGCTAGCAA AGGAAAATGA ATCCAATAAA CAAGTGGATA ATAAAAAAGA AGAGAGTAAA AACACTTTAG CTGCAACGGG TGGGCAAGAG AGCAATGTAT CTTTACTTTC TGGACTAGCA TTCGTTTTAT CTGCACTTAG TATGTTTGTA TTTAGAAAAA AATTATTTAA GAAGTAA
|
Protein sequence | MKNNKRKHIN AMLIAATLSL PFATYSTPAL AAVVSEVNKA GHILKDGTYD VVLKAYNEKT NEESRATTYI KEPKVTIENG KKIVTATLND SDFFQYLKVE DSQNPGTLHD VKVLSEDKRK NGTKVIQFEI GELGKRYKMQ MHIFIPSMGY DEKYQVQFEV NTVHSENNTI EESKEKKEEQ QQVKNIISDN KLQQYINKSV LQRADINAPI TEEDAAQIKE LKVYLGKGIE SLEGLQYMEN LEAFELHESN VKDISPISSL KKLKTMKLYL NPIENIAPIS QLEKLQFLTL RDNKISDLTP LSQLKKVKVL DLIGNEITDI KPLFSMDSVT KLYLSNNKIS DLTDIEKLDD LRLLWIGNNY IDNLTEIGKL KNLVELEVAN AEIRDLTPLA KMQQLQSLDL EQNYISDISP ISKLNNLYAL NLIANEIRDI RPVKELGKRV PIKLQRQKIF LSDGVVNEDI KIPIYDSNGE IVQNIKWQGE EGTLNNGSVK WNSTGEKVYE FKLETDSAES QILFNGTVYQ NIVEKHEDIN IIQDKNLQKF INKNGLGRAN LESSITKEDL LQIKSLKIVD GKNQGITDIS GLEYMTNIEE LVLDNIELKN VDFISNLRSL KAVKLTSNQL ENIEPLSKLD KLEKIDISDN NVKNIRPLFT LNAMKNLNVS NNKLNDASLQ EIQQLKNLEV LKLNHNEISN VEAISEISML NELELVGNKV VDITPLSKLK NLQWLDLSDN KIQDISIFAS MLDLISLKLP GNEIRDIRPI IQLSQWSTID IRRQKITLDD VQMNQAVKIP VHDVEGVPLE DITLKSEGGI INEEGTITWS TPGEKVYEFT FDGNNYFGLG IWFSGEVIQN VVNKTESKEE TPKPVVEEKP KEETTKPVVE EKPKEETSKP VVEEKPKEET SKPVVEEKPK EETTKPVMEE KPKEETTKPV VEEKPKEETT KPVVEEKPKE ETSKPVVEEK PKEETTKPAV EEKPKEETSK PVVEEKPKEE TTKPAVEEKS KEETPKLVME EKSKEGTSKP VVEERRKEGN KLAKENESNK QVDNKKEESK NTLAATGGQE SNVSLLSGLA FVLSALSMFV FRKKLFKK