Gene pE33L466_0145 details


Gene Information

Locus tag: pE33L466_0145
Symbol: colA
ID: 3399644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_007103
Strand:
Start bp: 154606
End bp: 157488
Gene Length: 2883 bp
Protein Length: 960 aa
Translation table: 11
GC content: 33%
IMG OID: 637659979
Product: collagenase
Protein accession: YP_245643
Protein GI: 67078023
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
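
The coordinate and length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(154606, 157488)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the two results agree with the Gene Length and Protein Length fields.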


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA AATCAAAATT CACTCAAATG ATGCTAAGTA TTAGTACGAT GGCATTATCA 
TTTGGGAGTA TTCAAACACA GGTATCAGCG GAAGAAAAAG CACCATATAA TGTATTACAA
ATCAAACCAA TTGGGACAGA AACTTCAAAA GATGAAATTG TACATGCTAC AAAAGCGGAC
GAAATATTGA CTTTTGAAGA GCGTTTAAAA GTAGGCGATT TTTCACAACG TCCTACTCTG
GTTATGAAAC GTGATGAAAG TCAATTAAAG CAAAGCTACA CTCTGGCAGA ACTGAATAAA
ATGCCTGATA GCGAACTCAT TGATACGCTT TCAAAAATTT CTTGGAATCA AATTACTGAT
TTATTTCAAT CCACTCAAGA TACGAAGGCT TTTTATCAAA ATAAAGAACG TATGAACATT
ATCATTGATG AATTAGGACA ACGAGGAAGC GCTTTTACAA AAGAGGACTC AAAGGGAATC
GAAACATTTG TTGAACTATT ACGTTCTGCT TTTTATGTGG GATATTATAA TAATGAATTA
AGCTACTTAA AAGAAAGAGG CTTCCATGAC AAATGTTTAC CAGCATTAAA AGCAATTGCG
AAAAATCCAA ACTTTACATT AGGTACAGCG GAGCAAGATA GAGTAGTAGC TGCGTACGGA
AAATTAATTA GTAATGCTTC TAGTGATACT GAAACAGTAC AATATGCGGT AAATATTTTA
AAACAATATA ATGATAATCT TTCTACGTAT GTAAGTGATT ATACGAAAGG ACAAGCTGTA
TATGAAATTG TAAAAGGAAT TGATTATGAT ATACAGTCTT ATATGCAGGA TACGAATAAA
AAACCTAATG AAACAATGTG GTATGGAAAG ATTGATAACT TTATAAACGA GGTTAGTAGA
ATTGCTCTCA TACGGAATAT AACAACTGAA AATAGTTGGC TAATTAATAA TGGCATTTAT
TATGCAGGTC GTTTAGGGAA ATTTCATAGT AACCCATACA AAGGATTAGA AGTTATTACA
CAAGCGATGA GCTTGTATCC TCGTTTAAGT GGACCTTATT TTGTAGCAGT AGAACAAATT
AAAACAAACT ATGGTGGAAA AGATTATAGT GGAAATGCAG TAGATCTACA GAAAATACGT
GAAGAAGGGA AACGACAATA CTTACCTAAA ACATATACAT TTGATGACGG ATCAATTGTC
TTCAAGACGG GAGATAAAGT AACAGAAGAA AAAATTAAGA GATTATATTG GGCAGCCAAA
GAAGTAAAAG CACAATATCA CCGTGTAATT GGTAATGATA AAGCACTAGA ACCGGGTAAC
GCTGATGATG TACTAACGAT AGTAATTTAT AATAATCCAG ATGAATATCA ATTAAATAGA
CAATTATATG GATATGAAAC AAACAACGGT GGAATTTATA TTGAAGAGAA GGGGACCTTC
TTTACATATG AGCGTACGCC AAAGCAGAGT ATTTATAGTT TAGAAGAGTT ATTCCGTCAT
GAATTCACTC ATTATTTACA AGGAAGGTAT GAGGTTCCTG GTTTATTTGG AAGCGGAGAA
ATGTATCAAA ATGAACGATT AACTTGGTTC CAAGAAGGGA ATGCAGAATT TTTTGCAGGA
TCTACACGTA CAAATAATGT TGTTCCGCGT AAAAGTATGA TAAGTGGCTT GTCATCTGAT
CCAGCAAGCC GTTATACAGC AAAGCAAACT TTGTTCTCAA AATATGGATC ATGGGACTTT
TATAAGTATT CTTTTGCACT ACAGTCATAT TTGTATAATC ATCAATTTGA AACATTTGAT
AAACTTCAAG ATTTAATCCG TGCAAACGAT GTGAAAAATT ATGACTTATA TCGTGAATCA
TTAAGCAACA ATACACAATT GAATGCAGAA TATCAAACGT ATATGCAGCA GTTGATTGAT
AATCAAGATA AATATAATGT ACCGCAAGTA ACAAATGATT ATTTAATTCA ACACGCACCA
AAGCCGTTAG CTGAAGTGAA AAACGAAATT GTGGATGTAG CAAATATAAA AGATGAAAAA
ATTACTAAAC ACGAGTCGCA ATTCTTTAAT ACATTTACCG TGGAAGGCAA GTACACAGGT
GGTACATCAA AAGGTGAGTC TGAAGATTGG AAAACGATGA GTAAACAAGT AAATCAAGCT
TTGGAGCAGT TATCCCACAA AGGGTGGAGT GGTTATAAAA CAGTTACAGC CTATTTTGTA
AACTATCGTG TGAATGCAGC TAACCAGTTT GAATATGATA TTGTTTTTCA TGGTGTTGCA
ACAGAGGAAA AGGAAAAAAC AAATACTATA GTAAATATGA ATGGACCATA CAGCGGGATA
GTAAATGAAG AGATTCAATT TCATAGCGAT GGTACAAAAA GTGAAAATGG AAAAGTTATT
TCTTATCTAT GGAACTTTGG AGATGGTGCA ACAAGTACAG AAGCAAATCC TACCCATGTA
TATGGAGAAA AAGGAACATA CACTGTGGAA CTAACAGTGA AAGATAGTAG AGGAAAAGAA
AGCAAAGAAC AAACAAAAGT TACTGTAAAA CAAGATCCGC AAACAGGTGA ATCCCATGAA
GAGGAGAAGG TACTCCTGTT TAATACGCTT GTAAAAGGAA ATCTGGTTAC TCCTGATCAA
ACAGATGTTT ATACGTTTGA TGTTACAGAT CCAAAAGAAG TAGATATTTC TGTGGTAAAT
GAACAAAATA TTGGGATGAC ATGGGTACTT TATCATGAAT CAGACATGCA AAATTACGTA
GCTTGTGGTG AAGATGAAGG AGATGTTATA AAAGGGAAAT TCGCAGCAAA ACCAGGAAAA
TATTATTTGA ATGTGTATAA ATTTGATGAT AAAAATGGTG AATATTCATT ATTAGTAAAA
TGA
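
The GC content field can in principle be recomputed from the gene sequence as listed. The following is a minimal sketch, assuming the sequence is pasted as a string with the same space and line-break grouping used above; `gc_content` is an illustrative helper, not a function of the source database, and how the database rounds its reported percentage is not stated in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the 10-base grouping and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base groups of the gene sequence, as a small check.
print(gc_content("ATGAACAAGA AATCAAAATT"))
```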
 
Protein sequence
MNKKSKFTQM MLSISTMALS FGSIQTQVSA EEKAPYNVLQ IKPIGTETSK DEIVHATKAD 
EILTFEERLK VGDFSQRPTL VMKRDESQLK QSYTLAELNK MPDSELIDTL SKISWNQITD
LFQSTQDTKA FYQNKERMNI IIDELGQRGS AFTKEDSKGI ETFVELLRSA FYVGYYNNEL
SYLKERGFHD KCLPALKAIA KNPNFTLGTA EQDRVVAAYG KLISNASSDT ETVQYAVNIL
KQYNDNLSTY VSDYTKGQAV YEIVKGIDYD IQSYMQDTNK KPNETMWYGK IDNFINEVSR
IALIRNITTE NSWLINNGIY YAGRLGKFHS NPYKGLEVIT QAMSLYPRLS GPYFVAVEQI
KTNYGGKDYS GNAVDLQKIR EEGKRQYLPK TYTFDDGSIV FKTGDKVTEE KIKRLYWAAK
EVKAQYHRVI GNDKALEPGN ADDVLTIVIY NNPDEYQLNR QLYGYETNNG GIYIEEKGTF
FTYERTPKQS IYSLEELFRH EFTHYLQGRY EVPGLFGSGE MYQNERLTWF QEGNAEFFAG
STRTNNVVPR KSMISGLSSD PASRYTAKQT LFSKYGSWDF YKYSFALQSY LYNHQFETFD
KLQDLIRAND VKNYDLYRES LSNNTQLNAE YQTYMQQLID NQDKYNVPQV TNDYLIQHAP
KPLAEVKNEI VDVANIKDEK ITKHESQFFN TFTVEGKYTG GTSKGESEDW KTMSKQVNQA
LEQLSHKGWS GYKTVTAYFV NYRVNAANQF EYDIVFHGVA TEEKEKTNTI VNMNGPYSGI
VNEEIQFHSD GTKSENGKVI SYLWNFGDGA TSTEANPTHV YGEKGTYTVE LTVKDSRGKE
SKEQTKVTVK QDPQTGESHE EEKVLLFNTL VKGNLVTPDQ TDVYTFDVTD PKEVDISVVN
EQNIGMTWVL YHESDMQNYV ACGEDEGDVI KGKFAAKPGK YYLNVYKFDD KNGEYSLLVK
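
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce that, assuming the usual NCBI codon ordering (bases in the order T, C, A, G). Table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits; this sketch does not handle alternative start codons, which is harmless here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # ignore spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with unknown bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGAACAAGA AATCAAAATT CACTCAAATG"))  # MNKKSKFTQM
```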