Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B4748 |
Symbol | |
ID | 7186592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 532916 |
End bp | 535813 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643548326 |
Product | putative microbial collagenase |
Protein accession | YP_002444019 |
Protein GI | 218895608 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.369378 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGA AATCAAAGAT CAATAAAGTG ATGCTTAGCA TTAGTACAAT GGCTTTATCG TTAGGGGCAC TTCAAACTCA TGCAGTAGCG GAAGAAAAAG TACCGTATAA TGTGCTAAAA ACGAAACCGG TTGGAATTGA AAAGCCAGTA GATGAAGTTG GGCATGTTTC AAAAGTTGAT GAAACCTTAT CATTTCAAGA ACGTTTAAAA GTAGGCGATT TTTCACAGCG ACCAGCATCT ATTACGAAGA AAACGGCAGT AAAGCAAGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC AAAATGAATG ATCAAGAATT AGTTGAAACA TTAGGCAGTA TTAAGTGGCA CCAAATTACA GATTTATTCC AGTTTAATGA AGACACAAAG GCCTTTTATA AAGATAAAGG AAAAATGCAA GTCATTATAG ATGAATTAGC TCATAGAGGT AGTACATTTA CGAAAGATGA TTCAAAAGGA ATTCAAACGT TTACTGAAGT GTTGCGTTCC GCTTTTTATC TGGCATTTTA TAATAACGAA TTAAGTGAAT TAAATGAAAG AAGCTTCCAG GATAAATGTT TACCTGCTTT AAAAGCAATC GCAAAAAATC CAAACTTTAA GCTTGGTACA GATGAACAAG ATACAGTCGT ATCTGCATAC GGAAAATTAA TTAGTAATGC ATCAAGTGAT GTTGAAACAG TTCAATACGC ATCAAATATT TTAAAGCAAT ACAATGATAA TTTTACTACT TATGTAAATG ATCGAATGAA GGGACAAGCA ATATACGATA TTATGCAAGG GATTGACTAT GATATACAGT CGTATTTAGT TGAGGCCCGT AAAGAAGCGA ATGAAACGAT GTGGTATGGA AAAGTAGATG GGTTTATTAA TGAAATAAAT CGTATTGCTC TTTTAAATGA AGTAACGTCA GAAAATAAAT GGCTCGTTAA TAATGGTATT TATTTTGCAA GCCGTTTAGG GAAATTTCAT AGCAATCCGA ATAAAGGATT AGAGGTTGTT ACACAAGCAA TGCATATGTA CCCACACTTA AGTGAACCAT ATTTTGTTGC GATAGAACAA ATTACAACAA ATTATAATGG TAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA CGTAAAGAAG GAAAAGAGCA GTACTTACCA AAAACGTATA CATTCGATGA TGGATCAATT GTGTTCAAAA CAGGAGATAA AGTATCAGAA GAAAAAATTA AGAGATTATA TTGGGCTGCG AAGGAAGTAA AGGCACAGTA TCACCGTGTA ATTGGAAATG ACAAAGCATT AGAGCCAGGA AATGCGGATG ATGTGTTAAC AATCGTAATT TATAATAGTC CAGATGAATA TCAGTTAAAT AGACAATTGT ATGGGTATGA AACAAACAAC GGTGGAATTT ATATCGAAGA AACAGGAACA TTCTTTACAT ATGAGCGTAC ACCAGAGCAA AGTATTTATA GTTTAGAAGA GTTATTCCGT CATGAATTTA CTCATTATCT GCAAGGGAGA TATGAAGTTC CTGGTTTATT TGGAAGAGGA GATATGTATC AAAATGAAAG GTTAACTTGG TTCCAAGAAG GAAATGCAGA GTTTTTCGCA GGCTCTACTC GTACGAATAA CGTTGTACCA AGAAAGAGTA TAATTAGCGG ATTATCATCT GATCCTGCAA GCCGTTATAC TGCAGAGCGC ACACTATTTG CTAAATACGG TTCTTGGGAT TTCTATAATT ACTCGTTCGC ATTGCAGTCT TACTTATATA CGCATCAGTT TGAAACATTT GATAAAATTC AAGATTTAAT TCGTGCGAAT GACGTGAAAA ATTATGATGC ATATCGTGAA AATCTAAGTA AAGATCCTAA GTTAAATAAA GAGTATCAAG AGTATATGCA GCAGTTAATT AATAATCAAG ATACATACAC TGTACCAGAA GTAGCTGATG ATTATTTAGC TGAACATGCA ACGAAGTCGT TAACAGCGGT GAAGAAAGAA ATTAGTGATA CGTTGCCTAT GAAAGATACA AAAATGACAA AACATAATTC TCAATTCTTT AATACATTTA CATTAGAAGG TACGTATACA GGTAGTGTCA CAAAAGGTGA ATCAGAAGAT TGGAAAGCAA TGAGTAAAAG AGTAAATGAA TCTTTAGAAC AATTGGCGCA AAAAGAATGG AGTGGCTACA AAACTGTTAC AGCATACTTC GTCAATTATC GTGTGAATAG CTCAAATGAA TTTGAATATG ATGTAGTCTT CCATGGAATC GCAAAAGATG ATGGAGAAAA TAAAGCTCCG ACGGTTAATG TAAACGGGCC TTATAATGGA GTTGTAAAAG AGGGAATTCA ATTTAAAAGT GATGGCTCAA ACGATGAAGA TGGAAAAATT GTTTCTTATT TATGGGAATT TGGAGATGGA AGCACAAGTG CAGAAGTGAA TCCAGTACAT GTATATGAAA GAGAAGGTTC TTATAAAGTA TCGTTAAGAG TAAAAGATGA TAAAGGAAAA GAGAGCAGAA GCGAAACAAC TGTTACGATT AAAGATGGAA GTTTAACAGA ATCAGAACCA AATAATCGTC CAGAGGAAGC AAATCGTATC GGGCTAAATA GTACGATAAA AGGTAATCTT ATTGGCGGGG ACCACACTGA TGTTTATACA TTTAATGTAG CATCAGCGAA AGATATCGAC ATTTCTGTTT TAAATGAGTA TGGAATTGGG ATGACATGGG TACTTCACCA TGAATCAGAT ATGCAAAATT ATGCGGCTTA CGGTCAAGCT AATGGGAATC ATATAGAAGC AAAATTTAAT GCAAAACCAG GTAAGTATTA CTTGTATGTA TATAAATATG ATAATGGCGA TGGAACATAC GAATTGTCAG TAAAATAA
|
Protein sequence | MNKKSKINKV MLSISTMALS LGALQTHAVA EEKVPYNVLK TKPVGIEKPV DEVGHVSKVD ETLSFQERLK VGDFSQRPAS ITKKTAVKQV KESYSMADLN KMNDQELVET LGSIKWHQIT DLFQFNEDTK AFYKDKGKMQ VIIDELAHRG STFTKDDSKG IQTFTEVLRS AFYLAFYNNE LSELNERSFQ DKCLPALKAI AKNPNFKLGT DEQDTVVSAY GKLISNASSD VETVQYASNI LKQYNDNFTT YVNDRMKGQA IYDIMQGIDY DIQSYLVEAR KEANETMWYG KVDGFINEIN RIALLNEVTS ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPHL SEPYFVAIEQ ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA KEVKAQYHRV IGNDKALEPG NADDVLTIVI YNSPDEYQLN RQLYGYETNN GGIYIEETGT FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA GSTRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF DKIQDLIRAN DVKNYDAYRE NLSKDPKLNK EYQEYMQQLI NNQDTYTVPE VADDYLAEHA TKSLTAVKKE ISDTLPMKDT KMTKHNSQFF NTFTLEGTYT GSVTKGESED WKAMSKRVNE SLEQLAQKEW SGYKTVTAYF VNYRVNSSNE FEYDVVFHGI AKDDGENKAP TVNVNGPYNG VVKEGIQFKS DGSNDEDGKI VSYLWEFGDG STSAEVNPVH VYEREGSYKV SLRVKDDKGK ESRSETTVTI KDGSLTESEP NNRPEEANRI GLNSTIKGNL IGGDHTDVYT FNVASAKDID ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEAKFN AKPGKYYLYV YKYDNGDGTY ELSVK
|
| |