Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH820_5324 |
Symbol | |
ID | 7189951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | + |
Start bp | 5020245 |
End bp | 5022041 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643558734 |
Product | enterotoxin |
Protein accession | YP_002454244 |
Protein GI | 218906410 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0791] Cell wall-associated hydrolases (invasion-associated proteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 7.216360000000001e-60 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGAAAAAAT ACCTTGCCGG TCTTGCGGCA GTGTCTGTAG CAGGAGGAGC AGCACCTACA CTTGATAGTG TTCAAGCTGC CCCTGAACAA AATACACAAA AAACTGCTAC AACTGTCCAA GCTTCTGCAT CAAACAGCTC ATCTTATACG GTAAACGCTA GCGTATTACA TGTTCGTGCA GGATCAAGTA CTTCTCACGA CATCATCTCT CGCGTTTATA ACGGTCAATC ACTAAACGTG ATTGGCGAAG AAAATGGTTG GTACAAAATT AACATTAATG GAAAAACAGG CTTTGTTAGT GGTGAATTTG TATCAAAAAA TGGTACAAGC AATTCAAATG TAAGTACAAC AGGTGGAAAA AATAAAGTTA CTGCTGATGT ATTACGTGTA CGTACTGCTC CTAACACTTC TAGTTCTGTT TCAGGACGTG TATATGAAGG ACAAACATTA AACGTAATTG GTCAAGAAAA TGGTTGGGTA AAAATCAATC ATAATGGACA AGTTGGCTAT GTAAGTGGCG AATTCGTATC TGGTGTTTCT TCTAATGCAG GTTCTTCAAA CAGCAATACG AATAATAATA ACCAAGAATC TGTAAAACCA GCAAGCGGAA ACTATACAGT AAATGTATCT TCCCTTCGTG TTCGTACAGG CCCTAGCACT TCTCACACAA CTGTAGGTTC TGTTACAAAA GGACAAGTAG TACAAGTTGT TGGCGAAGTG CAAGATTGGT TCAAAATCAA TTATGCAGGT CAAACGGCTT ACGTAAGTAA AGACTACGTA ACAAAAGGCG GTTCTAGCGA TAACGTTACA CAAGGAAACA ACCAAAATAA TAATCAAAAC AATAATGTAA CTGTTCAAAC TGGTGGTACT TACGTTGTTA ACGCAACATC TCTACGCGTT CGTACAGGTC CTGCTACTTA CCATAGCGTA ATTGGTGGCG TATTAAATGG TACGACATTA AACGTAATTG GCTCTGAAGG TAGCTGGTTT AAAGTTAACT ATCAAGGAAA AACAGGCTAC GTTAGTAGCG AATTCATGAA ATTCGTTAAA GGTGGCACTA CTACTCCTGA GCAACCAAAA CAACCTGAAC AACCTAATCA AGGTGCAATT GGTGACTACT ACATTAATGC TTCTGCCTTA AATGTACGTA GTGGTGAAGG TACAAATTAT AGAATCATAG GCGCACTTCC ACAAGGACAG AAGGTTCAAG TAATCTCTGA AAACTCTGGA TGGAGCAAAA TTAACTACAA CGGTCAAACT GGTTATATCG GAACACGTTA CCTTTCTAAA ACACCAGTTG GCGGCGCAGT AGATAATAAT AAGCCTAACA ACAACCAAAA TAACAATCAA AACAATAACA ACAATAACAA CAATAACAAC AATAACAACA ATAACAACAA TAACAACAAT AACAACAATA CAGGTAATAA TAGCGGCAAC AGTTCTTCCA TACTTGCATA TGCAAAAGGA ATGCAAGGCG TACCATACGT TTGGGGCGGA ACTTCTGCTA ACGGTGTGGA CTGCAGTGGC TACATCTACC ACGTATTTAA GAAATTTGGT CATAACATTA GCCGTCAAAG TGTTGCGGGA TATTGGGGTA GCCTACCACA AACTTCAAAT CCACAACCAG GCGACTTAAT TTATTTCCAA AACACTTATA AATCGGGTCC TTCTCACATG GGTATTTACC TTGGGGGCGG ATCATTTATC CAAGCTGGAG ATAAAGGTGT AGCAATCGCT TCATTAAGCA ATTCTTATTG GAGTAAGCAC TTCTTAGGTT ATACGAAAGC ACCTTAA
|
Protein sequence | MKKYLAGLAA VSVAGGAAPT LDSVQAAPEQ NTQKTATTVQ ASASNSSSYT VNASVLHVRA GSSTSHDIIS RVYNGQSLNV IGEENGWYKI NINGKTGFVS GEFVSKNGTS NSNVSTTGGK NKVTADVLRV RTAPNTSSSV SGRVYEGQTL NVIGQENGWV KINHNGQVGY VSGEFVSGVS SNAGSSNSNT NNNNQESVKP ASGNYTVNVS SLRVRTGPST SHTTVGSVTK GQVVQVVGEV QDWFKINYAG QTAYVSKDYV TKGGSSDNVT QGNNQNNNQN NNVTVQTGGT YVVNATSLRV RTGPATYHSV IGGVLNGTTL NVIGSEGSWF KVNYQGKTGY VSSEFMKFVK GGTTTPEQPK QPEQPNQGAI GDYYINASAL NVRSGEGTNY RIIGALPQGQ KVQVISENSG WSKINYNGQT GYIGTRYLSK TPVGGAVDNN KPNNNQNNNQ NNNNNNNNNN NNNNNNNNNN NNNTGNNSGN SSSILAYAKG MQGVPYVWGG TSANGVDCSG YIYHVFKKFG HNISRQSVAG YWGSLPQTSN PQPGDLIYFQ NTYKSGPSHM GIYLGGGSFI QAGDKGVAIA SLSNSYWSKH FLGYTKAP
|
| |