Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCE_3539 |
Symbol | |
ID | 2747984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | - |
Start bp | 3290215 |
End bp | 3293130 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637280340 |
Product | collagenase, putative |
Protein accession | NP_979836 |
Protein GI | 42782589 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGCT TTGCTAGTTT CATGTTAGGG AGTTTTCAAG GGGGCGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGTTTC ATATCGGAAT GTGCTCAAAA TGGAGCCGGT TGGTGTACAA TTACCAGTGC AAGAATTAGC TCATTCATCA AAAGTGTTGG AAAATAAGTC TTTTGAGAAA AGGCTACAAT TTGCTGATTT GTCACAAAGA CCGCCTGAAG TAAAAAAGAA AAGTAAACAA TTAACCGCAG CGAAAACGTA TACAATTGCT GAATTAAATC AATTGAGCAA TCAGCAGTTA GTGGATTTAC TTGTAACAAT TGATTGGGAG CAAATTACTG GGCTATTTCA GTTTAATAAG GATAGTCTTG CATTCTATCA AAATGATAGT AGAATGCAGG CAATTATTGA TAAATTGAAG CAGCAAGGAC AAGCTTATAC GAAGGATGAT TCAAAAGGGA TTGAGACTTT AGTAGAGGTA TTACGCTCAG GGTTTTATCT AGGATTTTAT CATACAGAAT TAAGTAAACT AAATGATCGA AGCTATCATG ATAAATGCTT ACCTGCATTA AAAACGATTG CGAATAACCC GAATTTCAAA CTCGGTACGT TAGAACAAAA TAGAGTTGTC TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGAT AACATCAGCT GCAAAGATTT TTAAACAATA TAACGATAAT TTTTCTACAT TGGTAGACAA TCTATCAGCT GGAAATGCGA TTTACGATAT TATGCAAGGT GTTGATTACG ATATTCAATC GTATTTATAC GATACGAGAA AAGCACCGAA AGATACAATG TGGTATCAAA AAATTGATAG CTATATTAAT GAATTAAGTC GTTTTGCCTT AATTGGAACG GTGACAGCGA AAAATGGTTG GTTAATTAAT AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGGAAC GAAAGGGTTG CAAGTTGTAA CGGACGCAAT GAAAATGTAT CCGTATTTAG GAGAGCAATA TTTCGTAGCT GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCGA ATGGTAAAGT GGTTGATCTA GATCAAATAA GAGAAGATGG TAAGAAAAAA TATTTACCGA AAACGTATAC GTTCGATGAT GGAGCAATTG TGTTAAAAGC TGGAGATAAG GTGACAGAAG AAAAAGTAAA ACGTCTATAT TGGGCGGCAA AAGAAGTGAA GGCACAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA GAAAAAGGCA ATCCTGATGA TGTATTAACA ATGGTTATTT ATAATAGCCC ATCTGAATAT CAATTTAACC GTCAATTATA CGGATATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA ACAGGTACGT TCTTTACTTA TGAACGTACG CCAGAAGAAA GTATTTATAG TTTAGAAGAA TTGTTCCGAC ATGAGTTCAC ACATTACTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG GGACAAGGGA AGATGTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG GAATGCGGAG TTTTTTGCAG GGGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGCAT TATAGGAGGA ATATCTTCAA ATCCAGCAGA ACGTTATACA GCAGAGAGAA CGTTAAACGC AAAATATGGA ACGTGGGATT TCTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT GATATGTTTG ACAAAATTCA TGATCTTATT AGGAAAAATG ATGTAACAGC ATATGATGCA TACCGCTCTG CTTTAAGTAA AGATGCGAAT TTAAATAAAG AATATCARGA TTATATGCAA ATGTTAGTAG ATAACCGTGA GAAATATAAT GTTCCGTTAG TATCAGATGA TTATTTAGCA ACTCACGCAC CGAAACCAGT TTCAGATATT GCGGCAGAAA TTACAGCAGA AGCAAAATTA AATAATGTAT CAGTTAAGAA AAATAAATCA CAGTTCTTTC ATACATTTAC ACTACAAGGA ACATACACAG GTACTACTGC AAAAGGAGAA TATGAAGACT GGAAGACAAT TACACAAAAC GTGAATGATA CGTTAAAACG TTTAAGTGCG AAAGAATGGA CAGGCTATAA AACAGTAACA GCATACTTCG TAAACTATCG TGTGAATGCA GCAGGACAAT TTGAGTATGA TGTTGTATTC CATGGTATTA ATACAGAAGA AGGTGCTGTG AATAAAGCGC CAGTTGCGGT TATAAATGGT CCATATAGCG GAAAGGTAAA TGAAGCAATT TCGTTTAAAA GCGATGGATC AAAAGATGAA GATGGGAAAA TCATTTCGTA TAAATGGGAG TTTGGCGATG GAGCAGTAAG TGATGAGCAA AATCCGACTC ACGTGTATAC AAAAGAAGGA ACATATACAG CGAAATTAAC AGTAACAGAT GACAAAGGAT TAACGAATAC TGCTACAACG AATGTAACGG TTCAAAAGAA AGAAGATAAC AGTGTGGAAA AAGAGCCGAA TAACTCATTT CAAACAGCAA ATAAACTGCA GCTAAATCAA ATTTTACGTG CTAGTTTAGG AAATGGCGAT ACGAGTGATT TCTTTGAAAT TAATGTGGAT ACTGCTAAAA ACCTTCAAAT TAACGTAACG AATGAAAATA ATATCGGAAT GAACTGGGTT CTTTATTCGG AAGCAGATTT AAATAATTAT GTTACGTATG CACAGCAAGA AGGGAATAAG TTAGTAGGAA GTTACTACAC GTATCCAGGA AAGTATTACT TACATGTGTA TCAGTATAGC GGGGGAACAG GGAATTATAC GGTAGAAGTG AAATAG
|
Protein sequence | MKGYSKKVLV GVSFASFMLG SFQGGALAEG TKGEQVSYRN VLKMEPVGVQ LPVQELAHSS KVLENKSFEK RLQFADLSQR PPEVKKKSKQ LTAAKTYTIA ELNQLSNQQL VDLLVTIDWE QITGLFQFNK DSLAFYQNDS RMQAIIDKLK QQGQAYTKDD SKGIETLVEV LRSGFYLGFY HTELSKLNDR SYHDKCLPAL KTIANNPNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTM WYQKIDSYIN ELSRFALIGT VTAKNGWLIN NGIYYTGRLG TFHSTGTKGL QVVTDAMKMY PYLGEQYFVA AEQIATNYGG KDANGKVVDL DQIREDGKKK YLPKTYTFDD GAIVLKAGDK VTEEKVKRLY WAAKEVKAQF HRTVESDQPL EKGNPDDVLT MVIYNSPSEY QFNRQLYGYE TNNGGLYIEG TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKMYENER LSWFEEGNAE FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY DMFDKIHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNREKYN VPLVSDDYLA THAPKPVSDI AAEITAEAKL NNVSVKKNKS QFFHTFTLQG TYTGTTAKGE YEDWKTITQN VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA AGQFEYDVVF HGINTEEGAV NKAPVAVING PYSGKVNEAI SFKSDGSKDE DGKIISYKWE FGDGAVSDEQ NPTHVYTKEG TYTAKLTVTD DKGLTNTATT NVTVQKKEDN SVEKEPNNSF QTANKLQLNQ ILRASLGNGD TSDFFEINVD TAKNLQINVT NENNIGMNWV LYSEADLNNY VTYAQQEGNK LVGSYYTYPG KYYLHVYQYS GGTGNYTVEV K
|
| |