Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH820_0608 |
Symbol | |
ID | 7191640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | + |
Start bp | 566143 |
End bp | 569181 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643554019 |
Product | internalin protein |
Protein accession | YP_002449581 |
Protein GI | 218901747 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 202 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAACAAA ATAAAAGAAA ACGTATAAAT GCAATGGTTA TAGCGGCGGC GTTATCACTG CCGTTTGCTG TTTATTCAAC ACCTGCTTTA GCGGCAGTGG CAATTGAGGC GAATAAAACT GGACATGTTT TAGAAGATGG TACATATGAC GCTGTTATTA AGGCGTATAA AGATAAAACG AATGAAGAAT CTATGGCAGC TGTTTATATA AAAAATCCGA AATTAACAAT TGAGAATGGA AAGAAAATTG TAACGGCAAC GTTAAGTGAT AGTGATTTCT TCCAATATCT AAAAACAGAA GATATTCATA CTCCTGGTGT ATTTCATGAT GTGAAAGTAA TATCAGAAGA TAAAAAGAAA AATGGAACGA AAGTGATTCA GTTTGAAGTA GGAGAATTAG GAAAAAGGTA TAATATGCGA ATGCATATTT ATATTCCAAC AATGGCCTAT GATAATAAGT ACCAAGTACA ATTTGAAGTA AATACATTGA ATTTAGATAA AGATGTTCCA GAAGAACAAA AGGAAAATAA GGAGGATAAA TTGGATCAAC AAGATGCGAA TGTAATAATA GATAAGCAAT TACAAAGGCA TATTAATAAA TATAACTTGA ATAGAGAGAA TTTAAATGCG CCAATAACTA AGGAAGATTT ATTAAAAGTT AAATCTTTAA TAGTCGTTGA AGCTAAAAGT AAAGGAATAA AAGACGTAAG CGGTCTAGAA TATATGACGA ACTTAGAAAA CTTAACGTTG GAAGAAGTTA AGTTAGAAAA TATAAAATTT ATCTCGAATT TGAGGCAATT AAAATCAGTA AGTATAACCT ATGCCGAACT TGAAGATATT GGACCTTTGG CTGAGTTAGA ACATATTGAG AGTTTAAGCT TGAGAAATAA TAAAATTTCA GATTTAAGCC CACTAAGTCA AATGAAGAAG ATTAAATTGC TAGATTTAAA TAGTAATTAT ATAAAAGATA TAAAGCCATT ATTTACAGTG AAATCTTTAA GGACTTTAAC TGTAGCAAAT AACCAAATTA GTAATGCAGG TCTTGAAGGA GTTCACCAAT TAAAGAATTT AAAGACATTT GAAATAAGCA ATAATGGATT GAGTAATGTC GAACATATTA ATGGAATGAA TAAATTAATT GAATTAGGGC TTTCCAAAAA TGAATTAGTA GATCTTACAC CATTATCAAA ATTATCAGGG TTACAAAAAC TAAATTTAGA AGAAAACTTT ATTTCAGATA TAACGCCACT TAGTCAATTA ACAAGTTTAT ATGATTTAAA ACTAGGTTCA AATGAAATTC GTGATGTTAG ACCGGTTCAA GAGCTAGGAA AAAGAATGTA TATTGATATT CAAAGACAAA AAATCTTTTT AGATGATGTA GAAAAAGATA AGGAAGTTAA AATACCTATC TATAATTTAC AAGGAGAGCC AATTGATACT ATTCAATTGA ATAGTGAAGA TGGAATAGTT AATAATGGTT CTGTTAAATG GGGTACTACC GGTGAAAAAA CATACGAATT TATGTTAGAT ATAAAGCCAG AAGAGAATCG TATTAAGTTT AATGGAACAG TAATTCAAAA TGTTGTTGAA AGGTTAGATG AAATAAAAGA GGATAATGAA CAAAAGGAAA GTGTAATTCT CGATAAAACT TTACAACAAC ATATTAATAA AGAGAATTTA GGTAGAGAGA ATTTAAACGC TCCTATCACA AAAGAAGATT TATTACAGAT TAAAAAATTA GAGATACTTA AAGAAAAAGG AAAAGAGATA AAAGATATAA CAGGTTTAGA GTACATGACG AACTTAGAAA AACTCACTTT AGAAGGAGTA GGTTTAAAGA ATCTCGAATT TATCTCGAAC TTAGAAAAGT TGAACGATGT GAATGTATCT CATAATCAAA TTGAGGATAT AACACCACTA TCTGCATTAA AAAATCTACA ATGGTTAAAT CTTGCGGACA ATCATATTAA AGATGTATCG GTTCTCGGTT CCATGCTAGA TTTACTTAGC TTAAAATTAT CTGGAAATGA GATTCGTGAT GTAAGGCCGT TAATACAATT AGGTCAGTGG TTTTCAATTG ATGTGGGAAG ACAAAAAATC GTTTTAAGTG AAGCGAAAGT AAATGAGGAA ATTCAAGTTC CTGTATATGA TTTAGAAGGA GAAAGTATTG AGAATATTAA ATTGATAAGC GAAGGAGGGA CGTTTAATAA CGGAGTAATA AAATGGAATA CCCCAGGTGA AAAGGTATAT AAATTTGATT TAGATTCTGA TGGAATTAGC ATAAGGTTTA ACGGAACAGT TATACAGAGT ATAGTGGAAA AAGAAGAAGT GAAAGAACCG GTAAAAGAAG TTGAAGAAGC AAAAGAAGAA GTGAAAGAAC CGGTAAAAGA AGTTGAAGAA GCAAAAGAAG AAGTGAAAGA ACCGGTAAAA GAAGTTGAAG AAACAAAAGA AGAAGTAAAA GAGCCGGTAA AAGAAGTTGA AGAAGCAAAA GAAAAAGTGA AAGAACCGGT AAAAGAAGTT GAAGAAGCAA AAGAAGAAGT GAAAGAACCG GTAAAAGAAG TTGAAGAAAC AAAAGAAGAA GTAAAAGAGC CGGTAAAAGA AGTTGAAGAA GCAAAAGAAG AAGTGAAAGA ACCGATAAAA GAAGTTGAAG AAACAAAAGA AGAAGTGAAA GAACCGGTAA AAGAAGTTGA AGAAACAAAA GAAGAAATAA AAGAGCCGGT AGAAGAAGTT GAAGGTACAA AAGAAGAAGT AAAAGAGCCA ATAAAAGAAG TTGAAGAAGC GAAAGAACCA AAGAAAGAAG TAAAAGAATC AGCAACAGGA TTGGATCAAG AGCCAAAAGG GAAAAATCAA GTTGTTGAAA ACGAGGGAAG AAAAGCAAAC ACTTTAAATA AACAATATAC TAATAAGCCA GAGGAAGGCA AGAAATCTTT ACCATCAACA GGCGGTGAAG CTAGCACATC GACTTTACTT TCTGGCATAA CACTTGTTCT TTCCGCACTA AGTATGTTCG TATTTAGAAA GAGGTTATTT AAGAAATAA
|
Protein sequence | MKQNKRKRIN AMVIAAALSL PFAVYSTPAL AAVAIEANKT GHVLEDGTYD AVIKAYKDKT NEESMAAVYI KNPKLTIENG KKIVTATLSD SDFFQYLKTE DIHTPGVFHD VKVISEDKKK NGTKVIQFEV GELGKRYNMR MHIYIPTMAY DNKYQVQFEV NTLNLDKDVP EEQKENKEDK LDQQDANVII DKQLQRHINK YNLNRENLNA PITKEDLLKV KSLIVVEAKS KGIKDVSGLE YMTNLENLTL EEVKLENIKF ISNLRQLKSV SITYAELEDI GPLAELEHIE SLSLRNNKIS DLSPLSQMKK IKLLDLNSNY IKDIKPLFTV KSLRTLTVAN NQISNAGLEG VHQLKNLKTF EISNNGLSNV EHINGMNKLI ELGLSKNELV DLTPLSKLSG LQKLNLEENF ISDITPLSQL TSLYDLKLGS NEIRDVRPVQ ELGKRMYIDI QRQKIFLDDV EKDKEVKIPI YNLQGEPIDT IQLNSEDGIV NNGSVKWGTT GEKTYEFMLD IKPEENRIKF NGTVIQNVVE RLDEIKEDNE QKESVILDKT LQQHINKENL GRENLNAPIT KEDLLQIKKL EILKEKGKEI KDITGLEYMT NLEKLTLEGV GLKNLEFISN LEKLNDVNVS HNQIEDITPL SALKNLQWLN LADNHIKDVS VLGSMLDLLS LKLSGNEIRD VRPLIQLGQW FSIDVGRQKI VLSEAKVNEE IQVPVYDLEG ESIENIKLIS EGGTFNNGVI KWNTPGEKVY KFDLDSDGIS IRFNGTVIQS IVEKEEVKEP VKEVEEAKEE VKEPVKEVEE AKEEVKEPVK EVEETKEEVK EPVKEVEEAK EKVKEPVKEV EEAKEEVKEP VKEVEETKEE VKEPVKEVEE AKEEVKEPIK EVEETKEEVK EPVKEVEETK EEIKEPVEEV EGTKEEVKEP IKEVEEAKEP KKEVKESATG LDQEPKGKNQ VVENEGRKAN TLNKQYTNKP EEGKKSLPST GGEASTSTLL SGITLVLSAL SMFVFRKRLF KK
|
| |