Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B3962 |
Symbol | |
ID | 7183524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 1282156 |
End bp | 1284438 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 643549102 |
Product | putative internalin |
Protein accession | YP_002444772 |
Protein GI | 218896361 |
COG category | [M] Cell wall/membrane/envelope biogenesis [S] Function unknown |
COG ID | [COG4886] Leucine-rich repeat (LRR) protein [COG5386] Cell surface protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.769038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAAAAA ATTATATGAA GGCGCTAGTA GTAGCGACAA CATTAGCAAT TCCATTTGCT ACGTACTCTA CTCCAGCATT AGCAGCACTA AAAGTTGAAG CAAATCAATC GGTAGCAGCA GCGAGTGATC GCACGTATGA TACTGAGATT AAAATATATA AGGATCAAAA AGACGAGCCA TCTATGGTTT CTCAATATAT AAAAGATCCT AAAGTAACAA TTGCAGCTGG CAAAAAAATT GTCACTGTAA CAATGCAAGA TAGCGATTAT TTTCAATATC TTAGAATAGA AGATAGAAAC CAGCCTGGTG TATTTCATGA TGTGAAAGTT TTGTCAGAAG ATAAGAGGAA GAATGGAACG AAAGTAGTTC AATTTGAAAT TGGCGAGTTT GAGAAGAAGC ACAATATGCA AATGCATATA CTTATTCCAG CTATTGGATA TGATCACAAA TATCAAGTTC AATTTGAAAT TAAAGATCCA ACTGTAGGTA ACAAAGAAAC AGAGAAACCA GATGATAACT CTAATTCAGG CAATACGGAA ACGGATAATC CAGTTGATAA TCAAAACATG ATAACAGATA ACAAATTAAG AGAACTTGTT AATAAAAAAG TATTTAATAG AAAAGATTTA AATACACCAA TTACGAAAGA AGAGTTATTA CAAGTAAAGG ATTTGTTTTT AAATACAAAC GAGATACTTG ATTATAGTGC ATTAAAATAT ATGCCAAATT TAAAATCTTT AACAGTTGCG AATGCAAAGA TAAAAGATCC GTCGTTCTTT GCGAACCTAA AGCAATTAAA TCATTTAGCT TTGCGTGGTA ATGAGTTTTC AGATGTAACG CCACTTGTTA AGATGAATAA TTTAGAGTCT CTTGATTTAA GTAATAATAA AATTACAAAT GTTGCACCAC TAACTGAAAT GAAAAATGTA AAAACTTTAT ATCTATCAGG CAACCAAATA GAAGATGTAA CAGCATTAGC GAAAATGGAA CAACTAGATT ACTTGAATTT AGCAAATAAT AAAATTAAGA ACGTTGCTCC ATTAAGTGCT TTAAAAAATG TAACATACTT AACTTTAGCA GGTAATCAAA TTGAAGATAT TAAACCGTTA TATTCATTAC CTTTAAAAGA TTTAGTATTA ACGCGTAATA ATGTTAAAGA TTTATCGGGT ATTGATCAAA TGAATCAATT AAATAAATTA TTTGTCGGGA AAAATCAAAT TAAAGATGTG ACACCACTTG CTAAAATGAC TCAGCTTACA GAATTAGATT TACCCAATAA TGAATTAAAA GATATTACTC CATTATCAAG TCTAGTAAAC TTACAAAAGC TTGATTTAGA AGCGAATTAT ATTACAGACT TATCACCAGT GAGCAATTTG AAAAAATTAG TATTTCTAAG TTTTGTTGCA AATGAAATCC GTGATGTCCG ACCAGTTATA GAACTAAGTA AGACGGCTTA TATCAATGTC CAAAACCAAA AAGTATTTTT AGAGGAAACA GAAGTAAATA AAGAAGTAAA AGTACCTATA TATGAAAAAG ATGGTGAGAT TTCTACGAAA ATTCGTCTGA AGAGCGATAA CGGTACGTAT AGTAATGGTG TAGTGAAATG GAGTACACCA GGTGAGAAAG TATATGAATT TGGTGTGAAA GATCCATTTG CGGATACAGG AATCTTCTTT ACAGGGTCTG TTATTCAAAA TGTAGTAGAA AGTAAAGACG GTAATACATC TAAAGAAGAT GAGAAAACAG AAGTGGTAGA ATTTAAAGAT GTACCAAAGG GACATTGGTC AGAAGAAGCA ATTAATTACT TAGCGAAAGA AAAGTTATTT ATAGGCTATG GAAATGGTGA ATTTGGATTT GGTGATAACA TTACTCGTGG ACAAGTAGCT CTTCTAATAC AAAGGTATTT AAAATTAGAA AATAATCTAG AACCAAAAAC GGCATTTACA GATACGAAAG GAAATATGTA TGAAACGGCT ATTGATGCAG TGGTTCAAGC TGGTATTATG ACAGGGTATG GAAATGATAT ATTCCGTCCA GATGGAGTAT TAACTCGATA TGAAATGTCA GTAGTACTAC AAAGAGTATT TCAGTTAAAA GAAAATGAAA ATAGTGCAGA GAATTTCAAA GATATACCAA ATGGCCATTG GGCGAAAGGA TATGTGAAAG CTCTAGTAGA TAATAAAATA TCAAAAGGTG ACGGGGAAGG GAATTTTTTA GGAGATAATT TCGTAACACG TGAACAATAT GCACAGTTTT TGTATAATGC AATAAAGAAA TAA
|
Protein sequence | MKKNYMKALV VATTLAIPFA TYSTPALAAL KVEANQSVAA ASDRTYDTEI KIYKDQKDEP SMVSQYIKDP KVTIAAGKKI VTVTMQDSDY FQYLRIEDRN QPGVFHDVKV LSEDKRKNGT KVVQFEIGEF EKKHNMQMHI LIPAIGYDHK YQVQFEIKDP TVGNKETEKP DDNSNSGNTE TDNPVDNQNM ITDNKLRELV NKKVFNRKDL NTPITKEELL QVKDLFLNTN EILDYSALKY MPNLKSLTVA NAKIKDPSFF ANLKQLNHLA LRGNEFSDVT PLVKMNNLES LDLSNNKITN VAPLTEMKNV KTLYLSGNQI EDVTALAKME QLDYLNLANN KIKNVAPLSA LKNVTYLTLA GNQIEDIKPL YSLPLKDLVL TRNNVKDLSG IDQMNQLNKL FVGKNQIKDV TPLAKMTQLT ELDLPNNELK DITPLSSLVN LQKLDLEANY ITDLSPVSNL KKLVFLSFVA NEIRDVRPVI ELSKTAYINV QNQKVFLEET EVNKEVKVPI YEKDGEISTK IRLKSDNGTY SNGVVKWSTP GEKVYEFGVK DPFADTGIFF TGSVIQNVVE SKDGNTSKED EKTEVVEFKD VPKGHWSEEA INYLAKEKLF IGYGNGEFGF GDNITRGQVA LLIQRYLKLE NNLEPKTAFT DTKGNMYETA IDAVVQAGIM TGYGNDIFRP DGVLTRYEMS VVLQRVFQLK ENENSAENFK DIPNGHWAKG YVKALVDNKI SKGDGEGNFL GDNFVTREQY AQFLYNAIKK
|
| |