Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH820_3538 |
Symbol | |
ID | 7190198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | - |
Start bp | 3367900 |
End bp | 3370815 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643556949 |
Product | putative microbial collagenase |
Protein accession | YP_002452488 |
Protein GI | 218904654 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 5.60161e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG AGTTTTCAAG GGGGCGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGCTTC ATATCGGAAT GTGCTCAAAA TGGAACCAGT TGGTGTACAA TTACCAGTAC AAGAATTAGC TCATTCATCA AAAGTACTTG AAAATAAGTC TTTTGAGAAG AGGTTACAAT TTGCTGATTT GTCGCAAAGA CCGCCTGAAT TGAAAAAGGA GAGTAAGCAA TTAGCTACAG CAAAAACTTA TACAATTGCT GAGTTAAATC AATTAAGCAA TCAGCAGTTA GTAGATTTAC TTGTAACAAT TGATTGGGAG CAAATTACTG GGCTATTTCA GTTTAACAAG GATAGTCTTG CCTTCTATCA AAATGATAGT AGAATGCAGG CAATTATTGA TAAATTGAAC CAGCAAGGAC AAGCGTATAC GAAAGATGAT TCAAAAGGGA TTGAGACTTT AGTAGAGGTA TTACGATCTG GTTTTTATTT AGGATTTTAT CATACAGAAT TAAGTAAACT AAATGAGCGA AGCTATCATG ATAAATGCTT ACCTGCACTA AAAACGATTG CGAATAACCC GAATTTCAAA CTAGGTACGT TAGAACAAAA TAGAGTTGTA TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGAT AACATCAGCT GCAAAGATTT TTAAACAATA TAATGATAAT TTTTCTACAT GGGTAGATAA TCTTTCAGCT GGAAATGCGA TTTACGATAT TATGCAAGGC GTTGACTACG ATATTCAATC GTATTTGTAC GATACGAGAA AAGCACCGAA AGATACAGTA TGGTATCAAA AAATTGATAG CTATATTAAT GAATTAAGTC GTTTTGCTTT AATTGGAACG GTGACAGAGA AGAATGCTTG GCTTATTAAT AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGGGAC GAAAGGGTTG CAAGTTGTAA CAGATGCCAT GAAAATGTAT CCGTATTTAG GGGAGCAATA TTTCGTAGCG GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCAA ATGGAAACGT TGTGAATTTA GATCAAATAC GAGAAGATGG TAAGAAGAAA TATTTACCGA AAACATATAC ATTTGACGAT GGGACAATTG TTTTAAAAGC TGGAGATAAA GTGACAGAAG AAAAAGTAAA ACGTCTATAT TGGGCGGCAA AAGAAGTGAA GGCTCAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA GAAAAAGGGA ATGCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCTGAATAT CAATTTAACC GTCAATTGTA CGGGTATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA ACAGGTACGT TCTTTACTTA TGAGCGTACG CCAGAAGAAA GTATTTATAG TTTAGAGGAA TTGTTCCGGC ACGAGTTCAC ACATTACTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG GGACAAGGTA AGATTTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG CAATGCAGAG TTTTTTGCAG GTGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGCAT TATAGGAGGA ATATCTTCAA ATCCGGCAGA ACGTTATACG GCAGAGAGAA CGTTAAATGC AAAGTACGGA ACATGGGATT TTTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT GATATGTTTG ACAAAGTTCA TGATCTTATT AGAAAAAATG ATGTAACAGC ATATGATGCA TATCGCTCTG CATTAAGTAA AGATGCGAAT TTAAATAAAG AGTATCAAGA CTATATGCAA ATGTTAGTCG ACAATCGTGA TAAATATAAT GTTCCATTAG TATCAGATGA TTATTTAGCA ACTCACGCAC CGAAACCAGT CTCAGATATT GTGGCAGAAA TTACGGCAGA AGCGAAATTA AGTAATGTAT CAGTTAAGAA AAATAAATCA CAGTTCTTTA ATACATTTAC ACTGCAAGGA ACATATACAG GTACGACTGC AAAAGGAGAA TATGAAGACT GGAAATCAAT TACACAAAAC GTAAATGATA CGTTAAAACG TTTAAGTGTA AAAGAATGGA CAGGCTATAA AACAGTAACA GCTTATTTCG TAAATTACCG TGTGAATGCA TCAGGACAAT TTGAATATGA CGTTGTATTC CATGGTATTA ATACAGAAGA AGGCGCTGTG AATAAAGCAC CAGTTGCGGT TATAAATGGT CCCTATAGTG GGAATGTAAA TGAAGCAATT TCGTTTAAAA GCGATGGATC AAAAGATGAA GATGGAAAAA TTGTTGCTTA TAAATGGGAG TTTGGTGATG GTACTGTAAG CAATGAACAA AATCCAACTC ACGTGTATAC AAAAGAAGGA ACATATACAG CGAGATTAAC AGTAACAGAT GATAAAGGGT TAACGAATAC TGTTACAACG AATGTAACAG TTCAAAAGAA AGAAGATAAC AGTGTAGAAA AAGAACCAAA CAATTCATTC CAGACAGCAA ATACACTGCA ATTCAATCAA GTTTTACGCG CAAGTTTAGG AAATGGTGAT ACGAGTGATT TCTTTGAAAT AAATGTGGAA ACGGCGAAAA ATCTGCAAAT TAATGTAACG AAGGAAAATA ATATCGGAGT AAACTGGGTT CTTTATTCGG AAGCAGATTT AAATAACTAT ATTACGTATG CCCAGCAAGA GGGGAATAAG TTAGTAGGAA GTTACTACAC GTATCCAGGT AAGTATTATT TACATGTGTA TCAGTATGGT GGTGGATTTG GGAATTATAC GGTAGAAGTG AAGTAG
|
Protein sequence | MKGYSKKVLV GVSFASLMLG SFQGGALAEG TKGEQASYRN VLKMEPVGVQ LPVQELAHSS KVLENKSFEK RLQFADLSQR PPELKKESKQ LATAKTYTIA ELNQLSNQQL VDLLVTIDWE QITGLFQFNK DSLAFYQNDS RMQAIIDKLN QQGQAYTKDD SKGIETLVEV LRSGFYLGFY HTELSKLNER SYHDKCLPAL KTIANNPNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA AKIFKQYNDN FSTWVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTV WYQKIDSYIN ELSRFALIGT VTEKNAWLIN NGIYYTGRLG TFHSTGTKGL QVVTDAMKMY PYLGEQYFVA AEQIATNYGG KDANGNVVNL DQIREDGKKK YLPKTYTFDD GTIVLKAGDK VTEEKVKRLY WAAKEVKAQF HRTVESDQPL EKGNADDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKIYENER LSWFEEGNAE FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY DMFDKVHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNRDKYN VPLVSDDYLA THAPKPVSDI VAEITAEAKL SNVSVKKNKS QFFNTFTLQG TYTGTTAKGE YEDWKSITQN VNDTLKRLSV KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGINTEEGAV NKAPVAVING PYSGNVNEAI SFKSDGSKDE DGKIVAYKWE FGDGTVSNEQ NPTHVYTKEG TYTARLTVTD DKGLTNTVTT NVTVQKKEDN SVEKEPNNSF QTANTLQFNQ VLRASLGNGD TSDFFEINVE TAKNLQINVT KENNIGVNWV LYSEADLNNY ITYAQQEGNK LVGSYYTYPG KYYLHVYQYG GGFGNYTVEV K
|
| |