Gene Information |
Locus tag | BCG9842_B2551 |
Symbol | |
ID | 7184895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 2641084 |
End bp | 2642787 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643550496 |
Product | metalloendopeptidase |
Protein accession | YP_002446166 |
Protein GI | 218897755 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
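The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2641084
END_BP = 2642787
GENE_LENGTH_BP = 1704
PROTEIN_LENGTH_AA = 567


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold for a self-consistent record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

For this record, 2642787 - 2641084 + 1 gives 1704 bp, and 1704 / 3 - 1 gives 567 aa, matching the listed lengths.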
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000846614 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0000000000277026 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGAAAAACA CAAAGACATT AACTAAGGTA GCATTAACAA CAGGATTAGC TTTAACAGCT GTGGCACCAT ATGGGGTAGG TCATGCAGAG GAAACAGATC AGTTACAAGT TCAAATTCAG GAGGAATCGT TCCGTTCGGG TGAACTTACA CAACCGTCAC AAAAGGCACC AGAAAATGTA GTAAAGGATG CACTTAAGGA GAAAACAGAG CAAGCTCTCT CCCAAAAACA AGTCAATGGA GAAGCAGGAG TAGATTATAA AGTCCTTCAA AAACGTGGTT CGTATGATGG AACTACACTT GTACGTATGC AGCAAACATA CGAAGGAAAA GAAGTATATG GTCATCAATT AACTGCACAC GTAGATAATA ATGGTGTTAT TAAAAGTGTT TCAGGTGATA GCGCGCAAAA TCTAAAACAA GAAGAATTGA AAAAACCTAT TAATCTGTCA AAAGATGAAG CAAAACAATT TATTTATACG AAGTACGGGA ACGATATCAA GTTTATTACT GAGCCAGAAG TTAAAGAGGT TATTTTTGTT GATGAAAATA ATGGACAAGC TAAAAATGCA TACCAAGTTA CCTTTGAAGC TGCAACACCA AACTATGTTT CTGGTACTTA TTTAGTAAAT GCACAAAATG GTGATATGTT AAAAAATATG GTACAAGAAT CTAACTTAAA AGCTAGTGAC AAACTAGTTG GAGCTTTAAA GAAAAGTAAA CAAAGCAGTC TTACATCATT AACTGGAACA GGAAAAGATG ATTTAGGTAT TTCTCGTTCA TTTGGTATCT CTAAACAAAG TAATGGCAAA TATGCTCTTG CTGATTATAC AAGAGGGCAA GGAATTGAAA CATATGATGT GAATTACAGA GATATTACTA AAGAAGAAAG CTATTATCCT GGTACATTAG CGACTAGTAC TTCGGCAACA TTTAACGATC CAAAAGCAGT AAGTGCTCAT TACTTAGCAA CAAAAGTATT TGATTTTTAT AAAGATAAAT ACAAACGTAA TAGCTTTGAT AATAAGGGAC AAAAGGTAGT TTCAGTTGTA CATGCTTGGG ATTCTGAAGA GACGAATGAT CCGAAAAATT GGCAGAATGC ATTAAGTGCT AATAATGGAA GTATGCTTGT ATATGGGGAT CCTATTGTTA AAGCATATGA CGTAGCTGGA CATGAATTTA CACATGCTGT TACTTCTAGT GAATCTAATC TTGAATATTA CGGTGAATCC GGTGCGATTA ATGAGGCGCT ATCTGATATT ATGGGAACAT CTATTGAGAA ATACGTAAAT AATGGAAACT TTAATTGGAC AATGGGAGAA CAAACGGGAT CTGTTTTCCG TGATATGGAA AACCCAGCTT CTGTTCCATC TTCACTCGGA GTACCTTATC CAGATGATTA CAGTGAATTT AACGATTTTA ATGGATGGGA TCAAGGCGGT GTTCATTTTA ATTCAAGTAT TATTAATAAA GTTGCGTATC TCATTGCAAA AGGCGGAACT CATAACGGAG TAACTGTTAA AGGAATTGGT GAAGATAAAA TGTTTGATAT TTTCTATTAT GCAAATACTG ATGAGTTAAA CATGACTTCT AATTTCAAAG AATTGAAATC AGCATGTATT CGAGTGGCAA CTAATAAATA TGGGGCGAAT ACAGCTGAAG TTCAAGCAGT TCAAAAGGCA TTTGATGCAG CAAAAATTAA GTAA
Protein sequence | MKNTKTLTKV ALTTGLALTA VAPYGVGHAE ETDQLQVQIQ EESFRSGELT QPSQKAPENV VKDALKEKTE QALSQKQVNG EAGVDYKVLQ KRGSYDGTTL VRMQQTYEGK EVYGHQLTAH VDNNGVIKSV SGDSAQNLKQ EELKKPINLS KDEAKQFIYT KYGNDIKFIT EPEVKEVIFV DENNGQAKNA YQVTFEAATP NYVSGTYLVN AQNGDMLKNM VQESNLKASD KLVGALKKSK QSSLTSLTGT GKDDLGISRS FGISKQSNGK YALADYTRGQ GIETYDVNYR DITKEESYYP GTLATSTSAT FNDPKAVSAH YLATKVFDFY KDKYKRNSFD NKGQKVVSVV HAWDSEETND PKNWQNALSA NNGSMLVYGD PIVKAYDVAG HEFTHAVTSS ESNLEYYGES GAINEALSDI MGTSIEKYVN NGNFNWTMGE QTGSVFRDME NPASVPSSLG VPYPDDYSEF NDFNGWDQGG VHFNSSIINK VAYLIAKGGT HNGVTVKGIG EDKMFDIFYY ANTDELNMTS NFKELKSACI RVATNKYGAN TAEVQAVQKA FDAAKIK
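The relationship between the two sequences can be sketched in code. The example below is an illustration only: it builds the codon-to-amino-acid mapping shared by translation tables 1 and 11 from the standard compact string, strips the spaces that separate the 10-character blocks, and translates until the first stop codon. Because the gene sequence is shown starting with ATG, it is assumed to be given in coding orientation already, even though the gene lies on the minus strand. Table 11 also allows alternative start codons, which this sketch does not handle since this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI ordering (bases T, C, A, G at each position).
# Translation table 11 uses the same codon assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the whitespace used to group the sequence into blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence.
    print(translate("ATGAAAAACA CAAAGACATT AACTAAGGTA"))
```

Applied to the first 30 bases of the gene sequence, this yields MKNTKTLTKV, which matches the first block of the protein sequence.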