Gene Information |
Locus tag | BCAH187_A3542 |
Symbol | |
ID | 7076712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH187 |
Kingdom | Bacteria |
Replicon accession | NC_011658 |
Strand | - |
Start bp | 3287548 |
End bp | 3290463 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643451975 |
Product | putative microbial collagenase |
Protein accession | YP_002339486 |
Protein GI | 217960918 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
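The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the stated gene length) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, ASSUMING the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3287548, 3290463)
print(length_bp)                  # 2916
print(protein_length(length_bp))  # 971
```

Both results agree with the Gene Length (2916 bp) and Protein Length (971 aa) rows, and the gene being unspliced (Is gene spliced: No) is what makes the simple division by three applicable.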
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT CATGTTAGGG AGTTTTCAAG GGGACGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGTTTC ATATCGGAAT GTGCTCAAAA TGGAGCCGAT TGGTTTACAA TTACCAGTGC AAGAATTAGC TCATTCATCA AAAGTGCTAG AAAATAAGTC TTTTGAGAAA AGGCTAAAAT TTGCTGATTT GTCACAAAGA CCGCCTGAAG TGAAAAAGGA AAGTAAGCAA TTAGCTGTAG CGAAAACGTA TACAATTGCT GAATTAAATC AATTGAGCAA TCAGCAGTTA GTGGACTTAC TTGTAACAAT TGATTGGGAG CAAATTACTG GGCTATTTCA GTTTAATAAG GATAGTCTTG CATTCTATCA AAATGATAGT AGAATGCAGG CAATTATTGA TAAATTGAAC CAGCAAGGAC AAGCTTATAC GAAAGATGAT TCAAAAGGGA TTGAGACTTT AGTAGAGGTG TTACGATCTG GTTTTTATTT AGGATTTTAT CATACAGAAT TAAGTAAACT AAATGAGCGA AGCTATCATG ATAAATGCTT ACCTGCATTA AAAACGATTG CGAATAACTC GAATTTCAAG CTCGGTACGT TAGAACAAAA TAGAGTTGTC TCATCATATG GAAAATTAAT TGGAAATGCT TCAAGTGATG TGGAAACGAT CACATCAGCT GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT TGGTAGATAA TCTTTCAGCT GGAAATGCGA TTTACGATAT TATGCAAGGT GTTGATTACG ATATTCAATC GTATTTGTAC GATACGAGAA AAGCACCGGA AGATACAATA TGGTATCAAA AAATTGATAG CTATATTAAT GAATTAAGTC GTTTTGCCTT AATTGGAACG GTGACAGCGA AAAATGGTTG GTTAATTAAT AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGAAAC GAAAGGATTA CAAGTTGTAA CAGATGCCAT GAAAATCTAT CCGTATTTAG GTGAGCAATA TTTCGTAGCT GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCGA ATGGTAAAGT GGTTGATCTA GATCAAATAA GAGAAGATGG TAAGAAAAAA TATTTACCGA AAACGTATAC GTTCGATGAT GGAGCAATTG TGTTAAAAGC TGGAGATAAG GTGACTGAGG AAAAAGTAAA ACGTCTATAT TGGGCGGCAA AAGAAGTGAA GGCACAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA GAAAAAGGCA ATCCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAT CAATTTAACC GTCAATTATA CGGGTATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA ACAGGTACGT TCTTTACTTA TGAACGTACG CCAGAAGAAA GCATTTATAG TTTGGAAGAA TTGTTCCGAC ATGAGTTCAC GCATTATTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG GGACAAGGGA AGATGTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG AAATGCGGAG TTTTTTGCAG GTGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGTAT TATAGGAGGC ATATCTTCAA ATCCAGCAGA ACGTTATACA GCAGAGAGAA CGTTAAACGC AAAATATGGA ACGTGGGATT TCTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT 
GATATGTTTG ACAAAATTCA TGATCTTATT AGGAAAAATG ATGTAACAGC ATATGATGCA TACCGCTCTG CTTTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA ATGTTAGTAG ATAACCGTGA GAAATATAAT GTTCCATTAG TATCAGATGA TTATTTAGCG ACGCACGCAC CGAAACCAGT TTCAGATATT GCAGCAGAAA TTACAGCAGA AGCAAAATTA AGTAATGTAT CAGTTAAGAA AAACAAATCA CAGTTCTTTC ATACATTTAC ACTACAAGGA ACATATACAG GTACGACTGC AAAAGGAGAA TATGAAGACT GGAAGACAAT TACACAAAAC GTAAATGATA CGTTAAAACG TTTAAGTGCG AAAGAATGGA CAGGCTATAA AACAGTAACA GCTTATTTCG TAAATTACCG TGTGAATGCA GCAGGACAAT TTGAGTATGA TGTTGTATTC CATGGGATTA ATACAGAAGA AGGTGCTGTG AATAAAGCGC CAGTTGCGGT TATAAACGGC CCATATAGCG GAAATGTAAA TGCAGCAATT TCGTTTAAAA GCGATGGATC GAAAGATGAA GATGGAAAAA TTGTTGCTTA TAAATGGGAG TTTGGTGATG GTACTGTAAG TAATAAACAA AATCCGACTC ATGTATACAC AAAAGAAGGA ACATATACGG CAAAATTAAC AGTAACAGAT GACAAAGGAT TAACGAATAC TGCTACAACG AATGTAACGG TTCAAAAGAA AGAAGATAAT AGTGTAGAAA AAGAGCCGAA TAACTCATTC CAGACAGCAA ATAAACTGCA ATTCAATCAA GTTTTACGTG CTAGTTTAGG AAACGGTGAT ACGAGTGATT ACTTTGAAAT AAATGTGGAA ACGGCGAAAA ACCTGCAAAT TAATGTAACG AAGGAAAATA ATATTGGAGT AAACTGGGTT CTTTATTCGG AAGCAGATTT AAATAATTAT GTTACGTATG CCCAGCAAGA AGGAAATAAG TTAGTAGGAA GTTACTACAC GTATCCAGGG AAATATTATT TACATGTGTA TCAGTATGGC GGGGGAACAG GGAATTATAC GGTAGAAGTG AAATAG
|
Protein sequence | MKGYSKKVLV GVSFASFMLG SFQGDALAEG TKGEQVSYRN VLKMEPIGLQ LPVQELAHSS KVLENKSFEK RLKFADLSQR PPEVKKESKQ LAVAKTYTIA ELNQLSNQQL VDLLVTIDWE QITGLFQFNK DSLAFYQNDS RMQAIIDKLN QQGQAYTKDD SKGIETLVEV LRSGFYLGFY HTELSKLNER SYHDKCLPAL KTIANNSNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPEDTI WYQKIDSYIN ELSRFALIGT VTAKNGWLIN NGIYYTGRLG TFHSTETKGL QVVTDAMKIY PYLGEQYFVA AEQIATNYGG KDANGKVVDL DQIREDGKKK YLPKTYTFDD GAIVLKAGDK VTEEKVKRLY WAAKEVKAQF HRTVESDQPL EKGNPDDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKMYENER LSWFEEGNAE FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY DMFDKIHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNREKYN VPLVSDDYLA THAPKPVSDI AAEITAEAKL SNVSVKKNKS QFFHTFTLQG TYTGTTAKGE YEDWKTITQN VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA AGQFEYDVVF HGINTEEGAV NKAPVAVING PYSGNVNAAI SFKSDGSKDE DGKIVAYKWE FGDGTVSNKQ NPTHVYTKEG TYTAKLTVTD DKGLTNTATT NVTVQKKEDN SVEKEPNNSF QTANKLQFNQ VLRASLGNGD TSDYFEINVE TAKNLQINVT KENNIGVNWV LYSEADLNNY VTYAQQEGNK LVGSYYTYPG KYYLHVYQYG GGTGNYTVEV K
|
| |
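The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The following is a minimal sketch of that step, with a simple GC-percentage calculation as one possible use. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example deliberately uses only the first two blocks of the gene sequence rather than claiming a result for the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
head = clean_sequence("ATGAAAGGCT ATTCAAAAAA")
print(head)               # ATGAAAGGCTATTCAAAAAA
print(head[:3])           # ATG (start codon)
print(gc_percent(head))   # 25.0
```

Applying the same two functions to the complete gene sequence would give a value that can be compared against the GC content row, keeping in mind that the record reports a rounded percentage.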