Gene Information |
Locus tag | BCAH820_3799 |
Symbol | |
ID | 7186853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | - |
Start bp | 3631593 |
End bp | 3632879 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643557210 |
Product | zinc protease, insulinase family |
Protein accession | YP_002452749 |
Protein GI | 218904915 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
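Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts one terminal stop codon (the listed gene sequence does end in TAA). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3631593
END_BP = 3632879
GENE_LENGTH_BP = 1287
PROTEIN_LENGTH_AA = 428


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks agree with the record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3632879 - 3631593 + 1 = 1287 bp and 1287 / 3 - 1 = 428 aa, which matches the Gene Length and Protein Length fields.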
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 0.000000108391 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGAAAA TTGTTTATGA GCAATTAAAA GAGACACTCT ATTATGAAAA ACTTCCTAAT GGATTAGATG TATATATTTT ACCGAAGCAA GGATTTAATA AAACATTTGC AACGTTTACG ACAAAATATG GTTCTGTGGA TAATACATTC GTACCACTAG GTAAAGAAGA AATGATTCGT GTACCTGATG GGATTGCTCA TTTTCTTGAG CATAAATTAT TTGAAAAAGA AGATCATGAC GCTTTCCAAT TGTTTAGTAA ACAAGGGGCT TCCGCGAATG CTTTCACGTC TTTCACAAGA ACAGCTTATC TTTTTTCGTG TACATCAAAT GTAGAACAAA ATTTAAATAC ATTGTTAAAC TTCGTACAAG AGCCTTACTT TTCTGAAAAA ACAGTCGAAA AAGAGAAGGG GATTATCGGA CAAGAAATTC AAATGTATCA AGATAATCCA GATTGGCGCT TGTACTTTGG ATTAATTGAT AGTTTGTTTG TAAAGCACCC GATTAAAATT GATATTGCAG GGACGATCGA GTCTATTAGT AAAATTACGA AAGACCTACT ATATGAATGT TATGAAACGT TTTATCATCC AAGCAATATG TTAATGTTTG TTGTGGGTGC AATTGATCCA GAGAAAACAA TGGATTTAGT ACGTGAAAAT CAAGCGAAAA AAGATTATAA AAACCAGCCG GAAATTGTAC GTTCATTTGA AGAAGAACCA GATGAGGTAA ATGAAAAGAA GAAAATTATT TCCATGCCTG TACAAACTCC GAAATGTTTA GTTGGTATTA AAGCGACAAA CTTAAAAGAA AAGGGAGAAG CCCTTTTAAA ACAAGAAATT GCGCTTACGT TACTTTTAGA TTATTTATTT GGGAAAAGCT CCGTTCATTA CGAATCTTTA TATAATGAAG GGCTCATCGA TGATTCGTTC TCGTATGATT ATACAGAAGA GAATAACTTC GGTTTTGCAA TGGTTGGCGG CGATACGAAG CAACCTGATG AGCTGGAAGA GCGTTTGAAA AGTATTTTAT TAAACACAAA TTATAATCAA TTAGATGAGG CGGCATTAGA ACGAGTAAAG AAAAAGAAAA TAGGTGGCTT TTTACGTTCT TTGAATTCAC CGGAATATAT TGCAAATCAA TTTACACGAT ATGCGTTTAA TGAATCGAGT CTGTTTGATG CATTGACTGT ATTAGAAAGT CTAACGGTTC AAGATTTACA AGAAGTAGCT CAATTACTAT TATCAGAAGA GAAAATGAGT GTTTGCCAAG TTTTACCGAA AAAATAA
Protein sequence | MEKIVYEQLK ETLYYEKLPN GLDVYILPKQ GFNKTFATFT TKYGSVDNTF VPLGKEEMIR VPDGIAHFLE HKLFEKEDHD AFQLFSKQGA SANAFTSFTR TAYLFSCTSN VEQNLNTLLN FVQEPYFSEK TVEKEKGIIG QEIQMYQDNP DWRLYFGLID SLFVKHPIKI DIAGTIESIS KITKDLLYEC YETFYHPSNM LMFVVGAIDP EKTMDLVREN QAKKDYKNQP EIVRSFEEEP DEVNEKKKII SMPVQTPKCL VGIKATNLKE KGEALLKQEI ALTLLLDYLF GKSSVHYESL YNEGLIDDSF SYDYTEENNF GFAMVGGDTK QPDELEERLK SILLNTNYNQ LDEAALERVK KKKIGGFLRS LNSPEYIANQ FTRYAFNESS LFDALTVLES LTVQDLQEVA QLLLSEEKMS VCQVLPKK
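Note: the sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a field could be cleaned up and its GC fraction computed is given below. The helper names are hypothetical, and the result is not guaranteed to reproduce the GC content field exactly, since the rounding used by the source database is not stated.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence field, used here as a short example.
    example = clean_sequence("ATGGAGAAAA TTGTTTATGA")
    print(example)
    print(gc_fraction(example))
```

To check the full record, pass the complete Gene sequence value through clean_sequence before calling gc_fraction.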