Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | pE33L466_0145 |
Symbol | colA |
ID | 3399644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_007103 |
Strand | - |
Start bp | 154606 |
End bp | 157488 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637659979 |
Product | collagenase |
Protein accession | YP_245643 |
Protein GI | 67078023 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AATCAAAATT CACTCAAATG ATGCTAAGTA TTAGTACGAT GGCATTATCA TTTGGGAGTA TTCAAACACA GGTATCAGCG GAAGAAAAAG CACCATATAA TGTATTACAA ATCAAACCAA TTGGGACAGA AACTTCAAAA GATGAAATTG TACATGCTAC AAAAGCGGAC GAAATATTGA CTTTTGAAGA GCGTTTAAAA GTAGGCGATT TTTCACAACG TCCTACTCTG GTTATGAAAC GTGATGAAAG TCAATTAAAG CAAAGCTACA CTCTGGCAGA ACTGAATAAA ATGCCTGATA GCGAACTCAT TGATACGCTT TCAAAAATTT CTTGGAATCA AATTACTGAT TTATTTCAAT CCACTCAAGA TACGAAGGCT TTTTATCAAA ATAAAGAACG TATGAACATT ATCATTGATG AATTAGGACA ACGAGGAAGC GCTTTTACAA AAGAGGACTC AAAGGGAATC GAAACATTTG TTGAACTATT ACGTTCTGCT TTTTATGTGG GATATTATAA TAATGAATTA AGCTACTTAA AAGAAAGAGG CTTCCATGAC AAATGTTTAC CAGCATTAAA AGCAATTGCG AAAAATCCAA ACTTTACATT AGGTACAGCG GAGCAAGATA GAGTAGTAGC TGCGTACGGA AAATTAATTA GTAATGCTTC TAGTGATACT GAAACAGTAC AATATGCGGT AAATATTTTA AAACAATATA ATGATAATCT TTCTACGTAT GTAAGTGATT ATACGAAAGG ACAAGCTGTA TATGAAATTG TAAAAGGAAT TGATTATGAT ATACAGTCTT ATATGCAGGA TACGAATAAA AAACCTAATG AAACAATGTG GTATGGAAAG ATTGATAACT TTATAAACGA GGTTAGTAGA ATTGCTCTCA TACGGAATAT AACAACTGAA AATAGTTGGC TAATTAATAA TGGCATTTAT TATGCAGGTC GTTTAGGGAA ATTTCATAGT AACCCATACA AAGGATTAGA AGTTATTACA CAAGCGATGA GCTTGTATCC TCGTTTAAGT GGACCTTATT TTGTAGCAGT AGAACAAATT AAAACAAACT ATGGTGGAAA AGATTATAGT GGAAATGCAG TAGATCTACA GAAAATACGT GAAGAAGGGA AACGACAATA CTTACCTAAA ACATATACAT TTGATGACGG ATCAATTGTC TTCAAGACGG GAGATAAAGT AACAGAAGAA AAAATTAAGA GATTATATTG GGCAGCCAAA GAAGTAAAAG CACAATATCA CCGTGTAATT GGTAATGATA AAGCACTAGA ACCGGGTAAC GCTGATGATG TACTAACGAT AGTAATTTAT AATAATCCAG ATGAATATCA ATTAAATAGA CAATTATATG GATATGAAAC AAACAACGGT GGAATTTATA TTGAAGAGAA GGGGACCTTC TTTACATATG AGCGTACGCC AAAGCAGAGT ATTTATAGTT TAGAAGAGTT ATTCCGTCAT GAATTCACTC ATTATTTACA AGGAAGGTAT GAGGTTCCTG GTTTATTTGG AAGCGGAGAA ATGTATCAAA ATGAACGATT AACTTGGTTC CAAGAAGGGA ATGCAGAATT TTTTGCAGGA TCTACACGTA CAAATAATGT TGTTCCGCGT AAAAGTATGA TAAGTGGCTT GTCATCTGAT CCAGCAAGCC GTTATACAGC AAAGCAAACT TTGTTCTCAA AATATGGATC ATGGGACTTT TATAAGTATT CTTTTGCACT ACAGTCATAT TTGTATAATC ATCAATTTGA AACATTTGAT AAACTTCAAG ATTTAATCCG TGCAAACGAT GTGAAAAATT ATGACTTATA TCGTGAATCA TTAAGCAACA ATACACAATT GAATGCAGAA TATCAAACGT ATATGCAGCA GTTGATTGAT AATCAAGATA AATATAATGT ACCGCAAGTA ACAAATGATT ATTTAATTCA ACACGCACCA AAGCCGTTAG CTGAAGTGAA AAACGAAATT GTGGATGTAG CAAATATAAA AGATGAAAAA ATTACTAAAC ACGAGTCGCA ATTCTTTAAT ACATTTACCG TGGAAGGCAA GTACACAGGT GGTACATCAA AAGGTGAGTC TGAAGATTGG AAAACGATGA GTAAACAAGT AAATCAAGCT TTGGAGCAGT TATCCCACAA AGGGTGGAGT GGTTATAAAA CAGTTACAGC CTATTTTGTA AACTATCGTG TGAATGCAGC TAACCAGTTT GAATATGATA TTGTTTTTCA TGGTGTTGCA ACAGAGGAAA AGGAAAAAAC AAATACTATA GTAAATATGA ATGGACCATA CAGCGGGATA GTAAATGAAG AGATTCAATT TCATAGCGAT GGTACAAAAA GTGAAAATGG AAAAGTTATT TCTTATCTAT GGAACTTTGG AGATGGTGCA ACAAGTACAG AAGCAAATCC TACCCATGTA TATGGAGAAA AAGGAACATA CACTGTGGAA CTAACAGTGA AAGATAGTAG AGGAAAAGAA AGCAAAGAAC AAACAAAAGT TACTGTAAAA CAAGATCCGC AAACAGGTGA ATCCCATGAA GAGGAGAAGG TACTCCTGTT TAATACGCTT GTAAAAGGAA ATCTGGTTAC TCCTGATCAA ACAGATGTTT ATACGTTTGA TGTTACAGAT CCAAAAGAAG TAGATATTTC TGTGGTAAAT GAACAAAATA TTGGGATGAC ATGGGTACTT TATCATGAAT CAGACATGCA AAATTACGTA GCTTGTGGTG AAGATGAAGG AGATGTTATA AAAGGGAAAT TCGCAGCAAA ACCAGGAAAA TATTATTTGA ATGTGTATAA ATTTGATGAT AAAAATGGTG AATATTCATT ATTAGTAAAA TGA
|
Protein sequence | MNKKSKFTQM MLSISTMALS FGSIQTQVSA EEKAPYNVLQ IKPIGTETSK DEIVHATKAD EILTFEERLK VGDFSQRPTL VMKRDESQLK QSYTLAELNK MPDSELIDTL SKISWNQITD LFQSTQDTKA FYQNKERMNI IIDELGQRGS AFTKEDSKGI ETFVELLRSA FYVGYYNNEL SYLKERGFHD KCLPALKAIA KNPNFTLGTA EQDRVVAAYG KLISNASSDT ETVQYAVNIL KQYNDNLSTY VSDYTKGQAV YEIVKGIDYD IQSYMQDTNK KPNETMWYGK IDNFINEVSR IALIRNITTE NSWLINNGIY YAGRLGKFHS NPYKGLEVIT QAMSLYPRLS GPYFVAVEQI KTNYGGKDYS GNAVDLQKIR EEGKRQYLPK TYTFDDGSIV FKTGDKVTEE KIKRLYWAAK EVKAQYHRVI GNDKALEPGN ADDVLTIVIY NNPDEYQLNR QLYGYETNNG GIYIEEKGTF FTYERTPKQS IYSLEELFRH EFTHYLQGRY EVPGLFGSGE MYQNERLTWF QEGNAEFFAG STRTNNVVPR KSMISGLSSD PASRYTAKQT LFSKYGSWDF YKYSFALQSY LYNHQFETFD KLQDLIRAND VKNYDLYRES LSNNTQLNAE YQTYMQQLID NQDKYNVPQV TNDYLIQHAP KPLAEVKNEI VDVANIKDEK ITKHESQFFN TFTVEGKYTG GTSKGESEDW KTMSKQVNQA LEQLSHKGWS GYKTVTAYFV NYRVNAANQF EYDIVFHGVA TEEKEKTNTI VNMNGPYSGI VNEEIQFHSD GTKSENGKVI SYLWNFGDGA TSTEANPTHV YGEKGTYTVE LTVKDSRGKE SKEQTKVTVK QDPQTGESHE EEKVLLFNTL VKGNLVTPDQ TDVYTFDVTD PKEVDISVVN EQNIGMTWVL YHESDMQNYV ACGEDEGDVI KGKFAAKPGK YYLNVYKFDD KNGEYSLLVK
|
| |