Gene BCZK3238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK3238 
SymbolcolA 
ID3027193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp3359868 
End bp3362924 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content35% 
IMG OID637547458 
Productcollagenase 
Protein accessionYP_084824 
Protein GI52142005 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAA AGTCAAGTGT GAATTTTGCA CGGTATGGAT ACATATGTGA ACGTTACCTT 
ATTTTTTCTC AATACAGTAT AAAAAGTAAA AATAGTAAAA TAATGGCGTG GGACACTAAA
AAATGTAAGG TAGGGAAGAG CATGAAAGGT TATTCAAAAA AAATGTTAGT AGGGGTAAGT
TTTGCTAGTT TAATGTTAGG GAGTTTTCAA GGGGGCGCAT TGGCAGAAGG TACAAAGGGA
GAGCAAGTTT CATATCGGAA TGTGCTCAAA ATGGAACCAG TTGGTGTACA ATTACCTGTG
CAAGAATTAG CTCATTCATC AAAAGTGCTG AAAAATAAGT CTTTTGAGAA AAGGCTACAA
TTTGCCGATT TGTCACAAAG GCCACCTGAA GTAAAAAAGG AAAGTAAGCA ATTAGCTGTA
GCGAAAACGT ATACAATTGC TGAATTAAAT CAATTGAGTA ATCAGCAGTT AGTAGATTTA
CTTGTAACAA TCGATTGGGA GCAAATTACT GGGCTATTTC AGTTTAACAA GGATAGTCTT
GCATTCTATC AAAATGATAG TAGGATACAG GCAATTATTG ATAAATTGAA CCAGCAAGGA
CAAGCGTATA CGAAAGATGA TTCCAAAGGG ATTGAAACTT TAGTAGAGGT ATTACGATCT
GGTTTTTATT TAGGATTTTA TCATACAGAA TTAAGTAAAC TAAATGAGCG AAGCTATCAT
GATAAATGCT TACCTGCATT AAAAACGATT GCGAATAACC CGAATTTCAA ACTAGGTACG
TTAGAACAAA ATAGAGTTGT ATCATCATAC GGAAAATTAA TAGGAAATGC TTCGAGTGAT
GTGGAAACGA TAACATCAGC TGCAAAGATT TTTAAACAAT ATAATGATAA TTTTTCTACA
TGGGTAGATA ATCTTTCAGC TGGAAATGCG ATTTACGATA TTATGCAAGG CGTTGACTAC
GATATTCAAT CGTATTTGTA CGATACGAGA AAAGCACCGA AAGATACAGT ATGGTATCAA
AAAATTGATA GCTATATTAA TGAATTAAGT CGTTTTGCTT TAATTGGAAC GGTGACAGAG
AAGAATGGTT GGCTTATTAA TAATGGTATT TATTATACAG GTAGACTTGG TACGTTCCAT
AGTACAGGGA CGAAAGGGTT GCAAGTTGTA ACAGATGCCA TGAAAATGTA TCCGTATTTA
GGGGAGCAAT ATTTCGTAGC GGCTGAGCAA ATTGCGACGA ATTATGGCGG GAAAGATGCA
AATGGGAATG TTGTGAATTT AGATCAAATA CGAGAAGATG GTAAGAAGAA ATATTTACCG
AAAACATATA CATTTGACGA TGGGACAATT GTTTTAAAAG CTGGAGATAA AGTGACAGAA
GAAAAAGTAA AACGTCTATA TTGGGCGGCA AAAGAAGTGA AGGCTCAATT CCATCGTACG
GTTGAAAGTG ACCAGCCGTT AGAAAAAGGG AATGCTGATG ATGTATTAAC GATGGTTATT
TATAATAGCC CAGCTGAATA TCAATTTAAC CGTCAATTGT ACGGGTATGA AACGAATAAC
GGCGGTCTTT ATATAGAAGG AACAGGTACG TTCTTTACTT ATGAGCGTAC GCCAGAAGAA
AGTATTTATA GTTTAGAGGA ATTGTTCCGG CACGAGTTCA CACATTACTT ACAAGGTAGA
TATGAAGTGC CAGGACTTTG GGGACAAGGT AAGATTTATG AGAATGAGAG ATTATCTTGG
TTTGAAGAAG GCAATGCAGA GTTTTTTGCA GGTGCAACGA GAACAGATAA TGTTGTACCG
AGAAAGAGCA TTATAGGAGG AATATCTTCA AATCCGGCAG AACGTTATAC GGCAGAGAGA
ACGTTAAATG CAAAGTACGG AACATGGGAT TTTTATAATT ATTCCTTCGC TTTACAATCG
TACATGTACA ATAAGAGATA TGATATGTTT GACAAAGTTC ATGATCTTAT TAGAAAAAAT
GATGTAACAG CATATGATGC ATATCGCTCT GCATTAAGTA AAGATGTGAA TTTAAATAAA
GAGTATCAAG ACTATATGCA AATGTTAGTC GACAATCGTG ATAAATATAA TGTTCCATTA
GTATCAGATG ATTATTTAGC AACTCACGCA CCGAAACCAG TCTCAGATAT TGTGGCAGAA
ATTACGGCAG AAGCAAAATT AAGTAATGTA TCAGTTAAGA AAAATAAATC ACAGTTCTTT
CATACATTTA CACTGCAAGG AACATATACA GGTACGACTG CAAAAGGAGA ATATGAAGAC
TGGAAATCAA TTACACAAAA CGTAAATGAT ACGTTAAAAC GTTTAAGTGC AAAAGAATGG
ACAGGCTATA AAACAGTAAC AGCTTATTTC GTAAATTACC GTGTGAATGC ATCAGGACAA
TTTGAATATG ACGTTGTATT CCATGGTATT AATACAGAAG AAGGCGCTGT GAATAAAGCG
CCAGTTGCGG TTATAAATGG TCCCTATAGT GGGAATGTAA ATGAAGCAAT TTCGTTTAAA
AGCGATGGAT CAAAAGATGA AGATGGGAAA ATCACTTCGT ATAAATGGGA GTTTGGTGAT
GGAGCAGTAA GTAATGAGCA AAATCCGACT CACGTGTATA CAAAAGAAGG AACATATACA
GCGAGATTAA CAGTAACAGA TGATAAAGGG TTAACGAATA CTGTTACAAC GAATGTAACG
GTTCAAAAGA AAGAAGATAA CAGTGTAGAA AAAGAACCAA ATAACTCATT CCAGACAGCA
AATACACTGC AATTCAATCA AGTTTTACGC GCAAGTTTAG GAAATGGTGA TACGAGTGAT
TTCTTTGAAA TAAATGTGGA AACGGCGAAA AATCTGCAAA TTAATGTAAC GAAGGAAAAT
AATATCGGAG TAAACTGGGT TCTTTATTCG GAAGCAGATT TAAATAACTA TATTACGTAT
GCACAGCAAG AAGGGAATAA ATTAGTAGGA AGTTATTATA CGTATCCAGG TAAGTATTAT
TTACATGTGT ATCAGTATGG TGGTGAGTTT GGGAATTATA CGGTAGAAGT GAAGTAG
 
Protein sequence
MNAKSSVNFA RYGYICERYL IFSQYSIKSK NSKIMAWDTK KCKVGKSMKG YSKKMLVGVS 
FASLMLGSFQ GGALAEGTKG EQVSYRNVLK MEPVGVQLPV QELAHSSKVL KNKSFEKRLQ
FADLSQRPPE VKKESKQLAV AKTYTIAELN QLSNQQLVDL LVTIDWEQIT GLFQFNKDSL
AFYQNDSRIQ AIIDKLNQQG QAYTKDDSKG IETLVEVLRS GFYLGFYHTE LSKLNERSYH
DKCLPALKTI ANNPNFKLGT LEQNRVVSSY GKLIGNASSD VETITSAAKI FKQYNDNFST
WVDNLSAGNA IYDIMQGVDY DIQSYLYDTR KAPKDTVWYQ KIDSYINELS RFALIGTVTE
KNGWLINNGI YYTGRLGTFH STGTKGLQVV TDAMKMYPYL GEQYFVAAEQ IATNYGGKDA
NGNVVNLDQI REDGKKKYLP KTYTFDDGTI VLKAGDKVTE EKVKRLYWAA KEVKAQFHRT
VESDQPLEKG NADDVLTMVI YNSPAEYQFN RQLYGYETNN GGLYIEGTGT FFTYERTPEE
SIYSLEELFR HEFTHYLQGR YEVPGLWGQG KIYENERLSW FEEGNAEFFA GATRTDNVVP
RKSIIGGISS NPAERYTAER TLNAKYGTWD FYNYSFALQS YMYNKRYDMF DKVHDLIRKN
DVTAYDAYRS ALSKDVNLNK EYQDYMQMLV DNRDKYNVPL VSDDYLATHA PKPVSDIVAE
ITAEAKLSNV SVKKNKSQFF HTFTLQGTYT GTTAKGEYED WKSITQNVND TLKRLSAKEW
TGYKTVTAYF VNYRVNASGQ FEYDVVFHGI NTEEGAVNKA PVAVINGPYS GNVNEAISFK
SDGSKDEDGK ITSYKWEFGD GAVSNEQNPT HVYTKEGTYT ARLTVTDDKG LTNTVTTNVT
VQKKEDNSVE KEPNNSFQTA NTLQFNQVLR ASLGNGDTSD FFEINVETAK NLQINVTKEN
NIGVNWVLYS EADLNNYITY AQQEGNKLVG SYYTYPGKYY LHVYQYGGEF GNYTVEVK