Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCB4264_A0588 |
Symbol | |
ID | 7099558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus B4264 |
Kingdom | Bacteria |
Replicon accession | NC_011725 |
Strand | + |
Start bp | 561422 |
End bp | 564406 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643468143 |
Product | internalin protein |
Protein accession | YP_002365348 |
Protein GI | 218235426 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.887348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAACAAA ATAAAAGAAA ACGTATAAAT GCAATGATTA TAGCGGCGGC GTTATCACTT CCGTTTGCTG TGTATTCAAC ACCTGCTTTA GCGGCAGTAG CAATTGAGGC GAATAAAACT GGACAAGGTT TAGAAGATGG TACATATGAC GCTGTTATTA AAGCGTATAA AGATAAAACG AATGAAGAGT CTATGGCAGC TGTTTATATA AAGGATCCGA AATTAACAAT TGAGAATGGA AAGAAAATTG TAACAGCAAC ATTAAGTGAT AGTGATTTCT TTCAATACTT GAAAACAGAG GATATTCATA CTCCAGGTGT GTTTCATGAT GTGAAAGTAA TATCAGAAGA CAAAAAGAAA AATGGAACGA AAGTGATTCA GTTTGAAGTA GGGGAATTAG GAAAAAGGTA TAATATGCGA ATGCATATTT ATATTCCAAC AATGGCCTAT GACAATAAGT ACCAAGTACA GTTTGAAGTA AATACATTGA ATTTAGATAA AGATGTTCCA GAAGCACAAA AGGAAAATAA AGAGGATAAA GTGGATCAAC AAGATGCGAA TGTAATAGTA GATAAGCAAT TACAAAGGCA TATTAATAAA TATAACTTGA ATAGAGAGAA TCTAGATACT CCAATAACTA AGGAAGATTT ATTAAAAGTT AAATCCTTAA TAGTCGTTGA AGCTAAAAGT AAAGGAATAA AAGACGTAAG TGGTCTAGAA TATATGACGA ACTTAGAAAA CTTAACGTTG GAAGAAGTTA AGTTAAAAAA TATAAAATTT ATCTCGGACT TGAGACAATT GAAATCATTA AGTATAACCT ATGGTGAACT TGAAGATATT GGGCCTTTGG CTAAGTTAGA GCATATTGAG TTTTTAACTT TGAGAAACAA TAAAATCTCA GATTTAAGCC CATTAAGCCA AATGAAGAAG ATTAAAATGC TAGATTTAAA TAGTAATTAT ATAAAAGATA TTAAACCATT ATTTACAGTG AAATCTTTAA GGACTTTGAC TGTAGCAAAT AATCAAATTA GTAATGATAA CCTTGCTGGA ATTGAGCAAT TGAAAAATGT AAAAAACTTA TCTTTAAGTA ACAATGGACT TACGAATATT GAACATATTA CATCAATGAA AAAATTAGTA GAGTTAGATC TTGCTAAAAA TGAATTAGAA AACATCGAAC CTTTATCAAG ATTATCTACT GTACAATCAC TTAATTTAGA AGAAAACTAT ATTTCAGATA TAACGCCACT TAGTCACTTA ACAGACTTAT ATGATTTAAA GCTAGGTTCA AATGAAATTT GTGATGTTAG ACCTGTTCAA GAGCTAGGAA AAAGAATATA TATCGATATT CAAAGACAAA AAATCTTTTT AGATAATGTA GAAAAAGATA AGGAAGTTAA AATACCTATC TATAATTTAC AAGGAGAGCC ACTTGATACT ATTCAATTGA AGAGTGAAGA TGGGATAGTT AATAATGGTT CTGTTAAATG GGGGACTACT GGTGAAAAAA CATACGAATT TACGTTGGAT ATAAAGCCAG AAGAGAATCG TATTAAGTTT AATGGAACAG TAATTCAAAA TGTTGTTGAA AGATTAGACG AAATCAAGGA AACAATAAAA GAGGATAATG AACAAAAGGA AAATGTAATT CTCGATAAAA CTTTACAACA ACATATTAAT AAAGAGAATT TAGGTAGAGA GAATGTAAAT GCTCCTATCA CAAAAGAAGA TTTATTACAG ATTAAAAAAC TAGAGATACT TAAAGAAAAA GGAAATGAGA TAAAAAATAT AACAGGTTTA GAGTACATGA CGAACTTAGA AAACCTCACT TTAGAAGGAG TAGGCCTGAA AAATATTGAG TTCATCTCAA ACTTGAAACA ATTGAATAAT GTGAATGTAT CTCATAATCA AATTGAAGAT ATAACACCAC TATCTTCATT GGAAAATTTA CAGTGGTTAA ATCTTGCGGA CAATCATATT AAAGATGTAA CGGTTATTGG TTCCATGCTA AACTTATTTA GCTTAAATCT AGCTGGGAAT GAGATTCGTG ATGTAAGGCC GTTAATACAA TTAGGTCAGT GGGGAACAAT TGATGTTAGA AGGCAAAAAG TCATTTTGGA TGATGCAGAA ATAAATAAAG AAGTGATAAT TCCTGTATAT GATTTAGAAG GAGAGCCAAT TGAAAAGATT ACACTAAAGA GTGAAGGTGG AACACTTACT GATGAGGGAA TTATTTGGAG TACTCTAGGG GAAAAAATAT ATGAATTTGA TTTAGATGCA GATCATTATG AGACTGGCAT ATTATATAGT GGCATTGTCA TGCAGAATAT AGTAGAAAAA TTAATACCAA AAGAAGAAGT GAAAGAACCA GCAAAAGAAG TTGAAGAAAC AAAAGAAGAA GTGAAAGAAC CGATAAAAGA AGTTGAAGAA ACAAAAGAAG AAGTGAAAGA ACCGATAAAA GAAGTTGAAG AAACAAAAGA AGAAGTGAAA GAACCGGTAA AAGAAGTTGA AAGTACAAAA GAAGAAGTGA AAGAACCGGT AAAAGAAGTT GAAGAAACAA AAGAAGAAGT GAAAGAACCG GTAAAAGAAG TTGAAGAAAC AAAAGAAGAA GTGAAAGAAC CGGTAAAAGA AGTTGAAGAA GCAAAAGAAG AAGTAAAAGA ACCGGTAAAA GAAGTTGAAG AAACAAAAGA AGAAGTAAAA GAGCCGGTAA AAGAAGTTGA AGAAGCGAAA GAACCAAAGA AAGAAGTAAA AGAATCAGCA ACAGGATTGG ATCAAGAGCC AAAAGGGAAT AATCAAGTTG TTGAAAATGA GGGAAGAAAA GCAGACACTT TAAATAAACA ACATACTAAT AAGCCAGAGG AAGGCAAGAA ATTTTTACCA TCAACAGGCG GTGAAGCTAG CACATCGACT TTACTTTCTG GGTTAACACT TGTTCTTTCC GCACTAAGTA TGTTCGTATT TAGAAAGAGA TTATTTAAGA AATAA
|
Protein sequence | MKQNKRKRIN AMIIAAALSL PFAVYSTPAL AAVAIEANKT GQGLEDGTYD AVIKAYKDKT NEESMAAVYI KDPKLTIENG KKIVTATLSD SDFFQYLKTE DIHTPGVFHD VKVISEDKKK NGTKVIQFEV GELGKRYNMR MHIYIPTMAY DNKYQVQFEV NTLNLDKDVP EAQKENKEDK VDQQDANVIV DKQLQRHINK YNLNRENLDT PITKEDLLKV KSLIVVEAKS KGIKDVSGLE YMTNLENLTL EEVKLKNIKF ISDLRQLKSL SITYGELEDI GPLAKLEHIE FLTLRNNKIS DLSPLSQMKK IKMLDLNSNY IKDIKPLFTV KSLRTLTVAN NQISNDNLAG IEQLKNVKNL SLSNNGLTNI EHITSMKKLV ELDLAKNELE NIEPLSRLST VQSLNLEENY ISDITPLSHL TDLYDLKLGS NEICDVRPVQ ELGKRIYIDI QRQKIFLDNV EKDKEVKIPI YNLQGEPLDT IQLKSEDGIV NNGSVKWGTT GEKTYEFTLD IKPEENRIKF NGTVIQNVVE RLDEIKETIK EDNEQKENVI LDKTLQQHIN KENLGRENVN APITKEDLLQ IKKLEILKEK GNEIKNITGL EYMTNLENLT LEGVGLKNIE FISNLKQLNN VNVSHNQIED ITPLSSLENL QWLNLADNHI KDVTVIGSML NLFSLNLAGN EIRDVRPLIQ LGQWGTIDVR RQKVILDDAE INKEVIIPVY DLEGEPIEKI TLKSEGGTLT DEGIIWSTLG EKIYEFDLDA DHYETGILYS GIVMQNIVEK LIPKEEVKEP AKEVEETKEE VKEPIKEVEE TKEEVKEPIK EVEETKEEVK EPVKEVESTK EEVKEPVKEV EETKEEVKEP VKEVEETKEE VKEPVKEVEE AKEEVKEPVK EVEETKEEVK EPVKEVEEAK EPKKEVKESA TGLDQEPKGN NQVVENEGRK ADTLNKQHTN KPEEGKKFLP STGGEASTST LLSGLTLVLS ALSMFVFRKR LFKK
|
| |