Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK3238 |
Symbol | colA |
ID | 3027193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | - |
Start bp | 3359868 |
End bp | 3362924 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637547458 |
Product | collagenase |
Protein accession | YP_084824 |
Protein GI | 52142005 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCAA AGTCAAGTGT GAATTTTGCA CGGTATGGAT ACATATGTGA ACGTTACCTT ATTTTTTCTC AATACAGTAT AAAAAGTAAA AATAGTAAAA TAATGGCGTG GGACACTAAA AAATGTAAGG TAGGGAAGAG CATGAAAGGT TATTCAAAAA AAATGTTAGT AGGGGTAAGT TTTGCTAGTT TAATGTTAGG GAGTTTTCAA GGGGGCGCAT TGGCAGAAGG TACAAAGGGA GAGCAAGTTT CATATCGGAA TGTGCTCAAA ATGGAACCAG TTGGTGTACA ATTACCTGTG CAAGAATTAG CTCATTCATC AAAAGTGCTG AAAAATAAGT CTTTTGAGAA AAGGCTACAA TTTGCCGATT TGTCACAAAG GCCACCTGAA GTAAAAAAGG AAAGTAAGCA ATTAGCTGTA GCGAAAACGT ATACAATTGC TGAATTAAAT CAATTGAGTA ATCAGCAGTT AGTAGATTTA CTTGTAACAA TCGATTGGGA GCAAATTACT GGGCTATTTC AGTTTAACAA GGATAGTCTT GCATTCTATC AAAATGATAG TAGGATACAG GCAATTATTG ATAAATTGAA CCAGCAAGGA CAAGCGTATA CGAAAGATGA TTCCAAAGGG ATTGAAACTT TAGTAGAGGT ATTACGATCT GGTTTTTATT TAGGATTTTA TCATACAGAA TTAAGTAAAC TAAATGAGCG AAGCTATCAT GATAAATGCT TACCTGCATT AAAAACGATT GCGAATAACC CGAATTTCAA ACTAGGTACG TTAGAACAAA ATAGAGTTGT ATCATCATAC GGAAAATTAA TAGGAAATGC TTCGAGTGAT GTGGAAACGA TAACATCAGC TGCAAAGATT TTTAAACAAT ATAATGATAA TTTTTCTACA TGGGTAGATA ATCTTTCAGC TGGAAATGCG ATTTACGATA TTATGCAAGG CGTTGACTAC GATATTCAAT CGTATTTGTA CGATACGAGA AAAGCACCGA AAGATACAGT ATGGTATCAA AAAATTGATA GCTATATTAA TGAATTAAGT CGTTTTGCTT TAATTGGAAC GGTGACAGAG AAGAATGGTT GGCTTATTAA TAATGGTATT TATTATACAG GTAGACTTGG TACGTTCCAT AGTACAGGGA CGAAAGGGTT GCAAGTTGTA ACAGATGCCA TGAAAATGTA TCCGTATTTA GGGGAGCAAT ATTTCGTAGC GGCTGAGCAA ATTGCGACGA ATTATGGCGG GAAAGATGCA AATGGGAATG TTGTGAATTT AGATCAAATA CGAGAAGATG GTAAGAAGAA ATATTTACCG AAAACATATA CATTTGACGA TGGGACAATT GTTTTAAAAG CTGGAGATAA AGTGACAGAA GAAAAAGTAA AACGTCTATA TTGGGCGGCA AAAGAAGTGA AGGCTCAATT CCATCGTACG GTTGAAAGTG ACCAGCCGTT AGAAAAAGGG AATGCTGATG ATGTATTAAC GATGGTTATT TATAATAGCC CAGCTGAATA TCAATTTAAC CGTCAATTGT ACGGGTATGA AACGAATAAC GGCGGTCTTT ATATAGAAGG AACAGGTACG TTCTTTACTT ATGAGCGTAC GCCAGAAGAA AGTATTTATA GTTTAGAGGA ATTGTTCCGG CACGAGTTCA CACATTACTT ACAAGGTAGA TATGAAGTGC CAGGACTTTG GGGACAAGGT AAGATTTATG AGAATGAGAG ATTATCTTGG TTTGAAGAAG GCAATGCAGA GTTTTTTGCA GGTGCAACGA GAACAGATAA TGTTGTACCG AGAAAGAGCA TTATAGGAGG AATATCTTCA AATCCGGCAG AACGTTATAC GGCAGAGAGA ACGTTAAATG CAAAGTACGG AACATGGGAT TTTTATAATT ATTCCTTCGC TTTACAATCG TACATGTACA ATAAGAGATA TGATATGTTT GACAAAGTTC ATGATCTTAT TAGAAAAAAT GATGTAACAG CATATGATGC ATATCGCTCT GCATTAAGTA AAGATGTGAA TTTAAATAAA GAGTATCAAG ACTATATGCA AATGTTAGTC GACAATCGTG ATAAATATAA TGTTCCATTA GTATCAGATG ATTATTTAGC AACTCACGCA CCGAAACCAG TCTCAGATAT TGTGGCAGAA ATTACGGCAG AAGCAAAATT AAGTAATGTA TCAGTTAAGA AAAATAAATC ACAGTTCTTT CATACATTTA CACTGCAAGG AACATATACA GGTACGACTG CAAAAGGAGA ATATGAAGAC TGGAAATCAA TTACACAAAA CGTAAATGAT ACGTTAAAAC GTTTAAGTGC AAAAGAATGG ACAGGCTATA AAACAGTAAC AGCTTATTTC GTAAATTACC GTGTGAATGC ATCAGGACAA TTTGAATATG ACGTTGTATT CCATGGTATT AATACAGAAG AAGGCGCTGT GAATAAAGCG CCAGTTGCGG TTATAAATGG TCCCTATAGT GGGAATGTAA ATGAAGCAAT TTCGTTTAAA AGCGATGGAT CAAAAGATGA AGATGGGAAA ATCACTTCGT ATAAATGGGA GTTTGGTGAT GGAGCAGTAA GTAATGAGCA AAATCCGACT CACGTGTATA CAAAAGAAGG AACATATACA GCGAGATTAA CAGTAACAGA TGATAAAGGG TTAACGAATA CTGTTACAAC GAATGTAACG GTTCAAAAGA AAGAAGATAA CAGTGTAGAA AAAGAACCAA ATAACTCATT CCAGACAGCA AATACACTGC AATTCAATCA AGTTTTACGC GCAAGTTTAG GAAATGGTGA TACGAGTGAT TTCTTTGAAA TAAATGTGGA AACGGCGAAA AATCTGCAAA TTAATGTAAC GAAGGAAAAT AATATCGGAG TAAACTGGGT TCTTTATTCG GAAGCAGATT TAAATAACTA TATTACGTAT GCACAGCAAG AAGGGAATAA ATTAGTAGGA AGTTATTATA CGTATCCAGG TAAGTATTAT TTACATGTGT ATCAGTATGG TGGTGAGTTT GGGAATTATA CGGTAGAAGT GAAGTAG
|
Protein sequence | MNAKSSVNFA RYGYICERYL IFSQYSIKSK NSKIMAWDTK KCKVGKSMKG YSKKMLVGVS FASLMLGSFQ GGALAEGTKG EQVSYRNVLK MEPVGVQLPV QELAHSSKVL KNKSFEKRLQ FADLSQRPPE VKKESKQLAV AKTYTIAELN QLSNQQLVDL LVTIDWEQIT GLFQFNKDSL AFYQNDSRIQ AIIDKLNQQG QAYTKDDSKG IETLVEVLRS GFYLGFYHTE LSKLNERSYH DKCLPALKTI ANNPNFKLGT LEQNRVVSSY GKLIGNASSD VETITSAAKI FKQYNDNFST WVDNLSAGNA IYDIMQGVDY DIQSYLYDTR KAPKDTVWYQ KIDSYINELS RFALIGTVTE KNGWLINNGI YYTGRLGTFH STGTKGLQVV TDAMKMYPYL GEQYFVAAEQ IATNYGGKDA NGNVVNLDQI REDGKKKYLP KTYTFDDGTI VLKAGDKVTE EKVKRLYWAA KEVKAQFHRT VESDQPLEKG NADDVLTMVI YNSPAEYQFN RQLYGYETNN GGLYIEGTGT FFTYERTPEE SIYSLEELFR HEFTHYLQGR YEVPGLWGQG KIYENERLSW FEEGNAEFFA GATRTDNVVP RKSIIGGISS NPAERYTAER TLNAKYGTWD FYNYSFALQS YMYNKRYDMF DKVHDLIRKN DVTAYDAYRS ALSKDVNLNK EYQDYMQMLV DNRDKYNVPL VSDDYLATHA PKPVSDIVAE ITAEAKLSNV SVKKNKSQFF HTFTLQGTYT GTTAKGEYED WKSITQNVND TLKRLSAKEW TGYKTVTAYF VNYRVNASGQ FEYDVVFHGI NTEEGAVNKA PVAVINGPYS GNVNEAISFK SDGSKDEDGK ITSYKWEFGD GAVSNEQNPT HVYTKEGTYT ARLTVTDDKG LTNTVTTNVT VQKKEDNSVE KEPNNSFQTA NTLQFNQVLR ASLGNGDTSD FFEINVETAK NLQINVTKEN NIGVNWVLYS EADLNNYITY AQQEGNKLVG SYYTYPGKYY LHVYQYGGEF GNYTVEVK
|
| |