Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B4750 |
Symbol | |
ID | 7183450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 527943 |
End bp | 530930 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643548324 |
Product | internalin protein |
Protein accession | YP_002444017 |
Protein GI | 218895606 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.000909405 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGAAACAAA ATAAAAGAAA ACGTATAAAT GCAATGATTA TAGCGGCGGC GTTATCACTT CCGTTTGCTG TTTATTCAAC ACCTGCTTTA GCGGCAGTGG CAATTGAGGC GAATAAAACG GGACAAGGTT TAGAAGATGG TACATATGAT GCTGTTATTA AAGCGTATAA AGATAAAACG AATGAAGAGT CTATGGCAGC TGTTTATATA AAGGATCCGA AATTAACAAT TGAGAATGGA AAGAAAATTG TAACAGCAAC GTTAAGTGAT AGTGATTTCT TCCAATACTT GAAAACAGAG GATATTCATA CGCCAGGTGT GTTTCATGAT GTAAAGGTCC TATCAGAAGA CAAAAAGAAA AATGGGACGA AGGTTATTCA ATTTGAAGTT GGAGAATTAG GAAAAACATA CAATATGCAA ATGCATATTT ATATTCCAAC GATGGCCTAT GATAATAAAT ATCAAGTGCA GTTTGAAGTG AATGCTATAA ATTTAGAAAA CAATGTTTCA GAAAAACAAA AGGAAAATAA AGAGGAGCAA CAAGATGAAA ACGGAAATGT AATATTAGAT AAGCAATTAC AAAAATATAT TAATAAATAT AACTTAGATA GAGATAATGT AGATGCGCCA ATCACAAAGA AAGATTTATT ACAAATTAAA ACATTATCCA TTTATTCAGG TAAAGGGATA AATGAAATAG CTGGTTTAGA GTATATGACA AATTTAGAGA AGTTGACGTT ACGAGAGTCT AATGTAACAG ATATATCAGC TATCTCGAAA TTGAGAAGTT TGAAGTACGT TGATTTAACT TCTAATTCAA TTGAAAGTAT TCATCCAATT GGGCAATTAG AGAATATTAA TATGCTTTTT TTAAGAGATA ATAAAATTTC TGATCTTACA CCATTAAGTA AAATGAAAAA AATCAAAACA TTAGATTTAA TCGGTAATAA CATTAAAGAT ATCCAGCCAT TATTTACATT ATCAACTATG AAACAATTAT ACTTAGCAAA TAATCAAATC AGTGATCTTA ATGGAATTGA TCGATTAAAT AATGTGGAAC TATTATGGAT AGGGAACAAT AAAATTAATA ATGTTGAATC TATTAGTAAA ATGAGTAATC TTATTGAACT AGAAATTGCT GATAGTGAAA TAAAAGATAT ATCACCATTA TCTCAATTAG GAATTTTACA AGTGCTGAAT TTAGAAGAGA ATTATATCTC TGATATATCG CCGTTGAGCA CTTTAACAAA TTTACATGAG ATAAATCTTG GAGCAAATGA AATTTCTGAC GTAAGGCCTG TTGAGGAATT AGGTAAGCGA ATTTCAATTG ACATTCAAAG ACAAAAAATC TTTTTAAATG AAGCAAGCGT AGATGAGGAA TTAAAAATCC CAGTATACAA CCTTAAGGGA GAACCACTTC AAAATATTAA TGTAAAAAGT GAGGGGGCTA CTCTGAATAA CGGATTTATA AAATGGAATA GTCCTGGAGA AAAAATATAT GAATTTAAAC TAGATACTAA TTCTACTGAA AGTAAAATAA GATTTAATGG TACGGTTATA CAGAATATAG TTGAAAAACA AAAAGAACGT GCAAATGTAA TTCTCGATAA AACTTTACAA CAACATATTA ATAAAGAGAA TTTAGGTAGA GAGAACTTAA ACGCTCCTAT CACAAAAGAA GATTTATTAC AGGTTAAAAA ATTAGAGATA CTTAAAGAAA AAGGAAATGA GATAAAAGAT ATAACAGGTT TAGAGTACAT GACGAACTTA GAAAACCTTA CTTTAGAAGG AGTAGGCCTG AAAAATATTG ATTTCATCTC AAACTTGAAA CGATTGAATA ATGTGAATGT ATCTCATAAT CAAATTGAAG ATATAACACC GCTATCTTCA TTGAAAAATT TACAGTGGTT AAATCTTACT GAGAATCGTA TTACAGATGT AACGGTTCTT GGCTCAATGT TAGACTTACT TAGTTTAAAA TTAGCTGAAA ATGAGATTCG TGATGTAAGG CCATTAATAC AATTAGGTCA GTGGGTAACA ATTGATGTTA GAAGGCAAAA GGTCATTTTG GATGATGCAG AAATAAATAA AGAAGTGAAA ATACCTGTAT ATGATTTAGA GGGAGAGCCA ATTGAAAAGA TTACACTAAA GAGTGAAGGT GGAACTCTTA CTGATGAGGG AATCATTTGG CGTACTTTAG GAGAAAAAAT ATATGAATTT GATTTAGATG CAGATCATTA TGAGACTGGC ATATTATATA GTGGCATTGT AATGCAGAAT ATAGTAGAAA AATTAATACC AAAAGAAGAA GTGAAAGAAC CAACAAAGGA AGTTGAAGAG TCAAAAGAAG AAGTGAAAGA ACCAACAAAA GAAGTGGAAG AAACAAAAGA AGAAGTGAAA GAACCAACAA AGGAAGTGGA AGAGTCAAAA GAAGAAGTGA AAGAACCAAC AAAGGAAGTG GAAGAGTCAA AAGAAGAAGT GAAAGAACCA ACAAAAGAAG TTGAAGAGTC AAAAGAAGAA GTAAAAGAAC CAACAAAAGA AGTTGAAGAG TCAAAAGAAG AAGTGAAAGA ACCAACAAAG GAAGTGGAAG AGTCAAAAGA AGAAGTGAAA GAACCAACAA AGGAAGTGGA AGAGTCAAAA GAAGAAGTGA AAGAACCAAC AAAAGAAGTT GAAGAGTCAA AAGAAGAAGT AAAAGAACCA ACGAAAGAAG TGGAAGAGTC AAAAGAAGAA GTGAAAGAAC CAACAAAAGA AGTTGAAGAA GCGAAAGAGG AAGTAAAAGA GCCAAAAGGA AATAATCAGG TTGTTGAAAA CGAAGGCAGA ACAGCAGATA CTTTAAATAC ACAACATGTT AATAAGACGG AGGAAGGAAA GAAATCTTTA CCATCAACAG GCGGTGAAGC TAGCACATCG ACTTTACTTT CTGGAATAAC ACTTGTTCTT TCCGCACTAA GTATGTTCGT ATTTAGAAAG AGGTTATTTA AGAAATAA
|
Protein sequence | MKQNKRKRIN AMIIAAALSL PFAVYSTPAL AAVAIEANKT GQGLEDGTYD AVIKAYKDKT NEESMAAVYI KDPKLTIENG KKIVTATLSD SDFFQYLKTE DIHTPGVFHD VKVLSEDKKK NGTKVIQFEV GELGKTYNMQ MHIYIPTMAY DNKYQVQFEV NAINLENNVS EKQKENKEEQ QDENGNVILD KQLQKYINKY NLDRDNVDAP ITKKDLLQIK TLSIYSGKGI NEIAGLEYMT NLEKLTLRES NVTDISAISK LRSLKYVDLT SNSIESIHPI GQLENINMLF LRDNKISDLT PLSKMKKIKT LDLIGNNIKD IQPLFTLSTM KQLYLANNQI SDLNGIDRLN NVELLWIGNN KINNVESISK MSNLIELEIA DSEIKDISPL SQLGILQVLN LEENYISDIS PLSTLTNLHE INLGANEISD VRPVEELGKR ISIDIQRQKI FLNEASVDEE LKIPVYNLKG EPLQNINVKS EGATLNNGFI KWNSPGEKIY EFKLDTNSTE SKIRFNGTVI QNIVEKQKER ANVILDKTLQ QHINKENLGR ENLNAPITKE DLLQVKKLEI LKEKGNEIKD ITGLEYMTNL ENLTLEGVGL KNIDFISNLK RLNNVNVSHN QIEDITPLSS LKNLQWLNLT ENRITDVTVL GSMLDLLSLK LAENEIRDVR PLIQLGQWVT IDVRRQKVIL DDAEINKEVK IPVYDLEGEP IEKITLKSEG GTLTDEGIIW RTLGEKIYEF DLDADHYETG ILYSGIVMQN IVEKLIPKEE VKEPTKEVEE SKEEVKEPTK EVEETKEEVK EPTKEVEESK EEVKEPTKEV EESKEEVKEP TKEVEESKEE VKEPTKEVEE SKEEVKEPTK EVEESKEEVK EPTKEVEESK EEVKEPTKEV EESKEEVKEP TKEVEESKEE VKEPTKEVEE AKEEVKEPKG NNQVVENEGR TADTLNTQHV NKTEEGKKSL PSTGGEASTS TLLSGITLVL SALSMFVFRK RLFKK
|
| |