Gene Information |
Locus tag | BCAH820_4661 |
Symbol | |
ID | 7189763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | - |
Start bp | 4412591 |
End bp | 4415314 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643558071 |
Product | cell surface protein |
Protein accession | YP_002453607 |
Protein GI | 218905773 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5386] Cell surface protein |
TIGRFAM ID | [TIGR03063] sortase B cell surface sorting signal; [TIGR03656] heme uptake protein IsdC |
|
|
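The gene and protein lengths in the table above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention explicitly, although both agree with the reported values. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must be greater than or equal to start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for BCAH820_4661.
length = gene_length_bp(4412591, 4415314)
print(length)                     # 2724
print(protein_length_aa(length))  # 907
```

The strand (`-`) does not affect the length calculation; it only determines which strand of the replicon is read when extracting the sequence.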
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 122 |
Fosmid unclonability p-value | 0.442353 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTTAA TGATATTTAC ATTCGTATCA ACACTACAAC CACTTGCAGT TCAAGCAGCT ACTCAATTAG CTGACGGTGA ATACTCAATC GGTTTTAAAG TTCTTAAAGA CGCATCGGAT GAAGTATCCA TGATGAATGA ATACTCTGTA AGTCCAGGAA CTTTAAAAGT GAAGGATGGG AAAAAGAAAG TGTCCTTTAC ATTAAAAAAT AGTTCATGGA TTACGAAATT TGAAACAGAC AAAGCAGGTC AACTTGTTGA GACAAATGTA ATTAGTGAAG ATAAAGAAAA AGATACAAGA GTAGTAGAAT TCGATGTGGA AGATGTAGAG AAGATATTAA AAGCGAAAGT AAAAGTAGAT ATTGATTTTC TGAACTATCA TCATGAATAT GATGTTCGTA TTGCATTTGA TCAAAATAGC ATTACACCAA TTCATGTAGA AAAACCAGAT GAAAAAGAGG ACCCAGCTAA TAAGCCAGAT CCAAATGAAA CTACGGATCC AGGTCAGAAG CCCGACCAAA AGCCTGACCC AGATCAACAA CCAAATTCTA ACACAATTGA AGATGGTGCG TACAGCATTC CTTTCAAAGT GTTAAAAGAT AAAACAGATG AAGAATCTAA AATGAATAGT TACATGGAAA ATCCAGGAGT ATTGAAAGTA GAAAATGGTA AGAAAAAAGC GGTTGTAACG TTAAAAAGTA GCTCATTAAT TAAAAATTTC CAAACGGAAA AAGATGGTGC ATTTGTTGAT GCAAAAGTAG TGAGTGAAGA TAAAGAAAAA GATACAAGAG TAGTAGAGTT TGAAATAGCT GATTTATCGA AAAAACTTAA TACAAAAGTA TTTATTGAGA TGGCATCAAG AAATTATAAA CAAACGCATG ACGTACAACT TGTATTTGAA CAAGACAAAT TGGAACCTAT TAAAAGTGAA GACAAACAAC CAGACGGAGA TAAACAACCA GACGGAGATA AACAACCAGA CGGAGGCAAA CAACCAGATG GAGATAAACA ACCAGACGGA GGCAAACAAC CAGACGGAGA CAAGCAACCA GACGGAGACA AACAACCAGA CGTAGATACC ATTAAAGATG GTGAATACAG TATTGGTTTT AAAGTATTGA AAGATAAAAC AGAAGAAATT TCAATGATGA ATACGTACAC GAAGAGTCCA GGTGTACTAA AAGTGAAAGA TGGAAAGAAA TATGTATCCT TCACATTAAC GAATAGCTCA TGGATTACAA AGTTCGAATT TGAAAAGAAT AATTCATTTG TTGATGCAAG TGTATTAAGT GAAGATAAGA AAGCTGATAC ACGTGTAGTA GAAGTAGAAG TAGCTGATTT ATCTAAGAAA CTAAATGCAA AAGTGAAAGT AGATATTGAT TCAATGAATT ATCACCATTT CTATGATATT CAATTTGCAT TTGATAACGA TAGTATTCAA CCGTTAGACA ATCAAGGCGA AAATGACAAC CAAGGTGGAA ACGACAACCA AGGTGGAAAC GACAACCAAG GTGGAAATGA CAACCAAGGC GGAAACAACA ACCAAGGCGG AAACGACAGC CAAGACGGTA ACACAGCAAT TGATCCAAAC GCTCTTAAAG ACGGTGAATA CAGTATCGGT TTTAAAGTGT TAAAAGATAA AACAGAAGAA ATTTCAATGA TGAACACATA TACGAAGAAT CCAGGTGTAT TAAAAGTGAA AGATGGAAAG AAATATGTAT CCTTCACATT AACAAATAGC TCATGGATTA CGAAGTTTGA GTTTGAAAAG AATGGTGCGT TCGTCGATGC GCAAGTATTA 
GGTATAAATA AAGAGAAAGA TACAAGAGTA GTAGAAGTGG AAATAGATGA TTTATCGAAA AAGTTAAATG CAAAAGTGAA GGTAGATATC GATGCGATGA ATTATCATCA TTTCTATGAT ATTCAATTTG CATTTGATAA AGGAAGTATT AAAGCTTTAG GTAACCAAGG TGGAGATACT AACCAAGATG GTAATGGTAA TCAAGTCGGA AGCGATAACC AAGGTGGAAG TAACAACCAA GATGGAACGA ATAATCTAAA TGAAAACCCA ACAGTTGATC CGAAAAATTT AAAAGATGGT CAGTATGATA TTGCTTTTAA AGTGTTAAAA GATAAGACAG AAGAAATTTC AATGATGAAT CAATATGTTG TAAGTCCAGC AAGATTAACA GTGAAAGATG GCAAGAAGTA TGTTGCAATG ACACTGAAAA ATAGTGAATG GATTACGAAA TTCCAAACAG AAAAGAATGG TGGATTTGCA GATGCGAAAG TAGTAAGTGA AGATAAGGCT ACCAATACAA GAGTAGTAGA ATTTGAAGCT AATGATTTAT TTGCAAAATT AAATGCAAAA GTACAAGTAG ATATCGATTC AATGAATTAC CATCATTTCT ACGACGTACA AATTCAATTT GATCCGACGA AGATTGGCGC TGTAGGAACG GTAAAAGAAG AGCCAAAAAA AGATCCAAAG AATGAACCGA AAAACCCAGT AACTACACCA AAAGTAGATA ATGTAAAAAC AGTAGGAACT CCTGATTTTA ACCGGAATGC AGATGGTAAA AAGAAAAACG AAGCTACAAA TAATGATTCG AAAAAAGAGA AAAACTCAAA AACTGCAGAT ACAGCACAAC TTGGTTTATA CATGGTGTTA CTGCTAGGTT CACTTGCTTT ACTAGTTCGT AAATATAGAG CAGGTAGATT GTAA
|
Protein sequence | MFLMIFTFVS TLQPLAVQAA TQLADGEYSI GFKVLKDASD EVSMMNEYSV SPGTLKVKDG KKKVSFTLKN SSWITKFETD KAGQLVETNV ISEDKEKDTR VVEFDVEDVE KILKAKVKVD IDFLNYHHEY DVRIAFDQNS ITPIHVEKPD EKEDPANKPD PNETTDPGQK PDQKPDPDQQ PNSNTIEDGA YSIPFKVLKD KTDEESKMNS YMENPGVLKV ENGKKKAVVT LKSSSLIKNF QTEKDGAFVD AKVVSEDKEK DTRVVEFEIA DLSKKLNTKV FIEMASRNYK QTHDVQLVFE QDKLEPIKSE DKQPDGDKQP DGDKQPDGGK QPDGDKQPDG GKQPDGDKQP DGDKQPDVDT IKDGEYSIGF KVLKDKTEEI SMMNTYTKSP GVLKVKDGKK YVSFTLTNSS WITKFEFEKN NSFVDASVLS EDKKADTRVV EVEVADLSKK LNAKVKVDID SMNYHHFYDI QFAFDNDSIQ PLDNQGENDN QGGNDNQGGN DNQGGNDNQG GNNNQGGNDS QDGNTAIDPN ALKDGEYSIG FKVLKDKTEE ISMMNTYTKN PGVLKVKDGK KYVSFTLTNS SWITKFEFEK NGAFVDAQVL GINKEKDTRV VEVEIDDLSK KLNAKVKVDI DAMNYHHFYD IQFAFDKGSI KALGNQGGDT NQDGNGNQVG SDNQGGSNNQ DGTNNLNENP TVDPKNLKDG QYDIAFKVLK DKTEEISMMN QYVVSPARLT VKDGKKYVAM TLKNSEWITK FQTEKNGGFA DAKVVSEDKA TNTRVVEFEA NDLFAKLNAK VQVDIDSMNY HHFYDVQIQF DPTKIGAVGT VKEEPKKDPK NEPKNPVTTP KVDNVKTVGT PDFNRNADGK KKNEATNNDS KKEKNSKTAD TAQLGLYMVL LLGSLALLVR KYRAGRL
|
| |
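The GC content reported in the Gene Information table can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted as plain text in the space-separated 10-base blocks shown above. The function name `gc_percent` is illustrative, and the database may round or count ambiguous bases differently, so small differences from the reported figure are possible.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input: only the first two 10-base blocks of the gene sequence above,
# not the full gene, so the result is not the gene-wide GC content.
print(gc_percent("ATGTTTTTAA TGATATTTAC"))  # 15.0
```

To check the table value, pass the full gene sequence string instead of the two-block toy input.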