Gene BCG9842_B4748 details

Gene Information

Locus tag: BCG9842_B4748
Symbol:
ID: 7186592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 532916
End bp: 535813
Gene Length: 2898 bp
Protein Length: 965 aa
Translation table: 11
GC content: 34%
IMG OID: 643548326
Product: putative microbial collagenase
Protein accession: YP_002444019
Protein GI: 218895608
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
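
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(532916, 535813)
print(length)                     # 2898
print(protein_length_aa(length))  # 965
```

Both results agree with the Gene Length and Protein Length fields in this record.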


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.369378
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGA AATCAAAGAT CAATAAAGTG ATGCTTAGCA TTAGTACAAT GGCTTTATCG 
TTAGGGGCAC TTCAAACTCA TGCAGTAGCG GAAGAAAAAG TACCGTATAA TGTGCTAAAA
ACGAAACCGG TTGGAATTGA AAAGCCAGTA GATGAAGTTG GGCATGTTTC AAAAGTTGAT
GAAACCTTAT CATTTCAAGA ACGTTTAAAA GTAGGCGATT TTTCACAGCG ACCAGCATCT
ATTACGAAGA AAACGGCAGT AAAGCAAGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC
AAAATGAATG ATCAAGAATT AGTTGAAACA TTAGGCAGTA TTAAGTGGCA CCAAATTACA
GATTTATTCC AGTTTAATGA AGACACAAAG GCCTTTTATA AAGATAAAGG AAAAATGCAA
GTCATTATAG ATGAATTAGC TCATAGAGGT AGTACATTTA CGAAAGATGA TTCAAAAGGA
ATTCAAACGT TTACTGAAGT GTTGCGTTCC GCTTTTTATC TGGCATTTTA TAATAACGAA
TTAAGTGAAT TAAATGAAAG AAGCTTCCAG GATAAATGTT TACCTGCTTT AAAAGCAATC
GCAAAAAATC CAAACTTTAA GCTTGGTACA GATGAACAAG ATACAGTCGT ATCTGCATAC
GGAAAATTAA TTAGTAATGC ATCAAGTGAT GTTGAAACAG TTCAATACGC ATCAAATATT
TTAAAGCAAT ACAATGATAA TTTTACTACT TATGTAAATG ATCGAATGAA GGGACAAGCA
ATATACGATA TTATGCAAGG GATTGACTAT GATATACAGT CGTATTTAGT TGAGGCCCGT
AAAGAAGCGA ATGAAACGAT GTGGTATGGA AAAGTAGATG GGTTTATTAA TGAAATAAAT
CGTATTGCTC TTTTAAATGA AGTAACGTCA GAAAATAAAT GGCTCGTTAA TAATGGTATT
TATTTTGCAA GCCGTTTAGG GAAATTTCAT AGCAATCCGA ATAAAGGATT AGAGGTTGTT
ACACAAGCAA TGCATATGTA CCCACACTTA AGTGAACCAT ATTTTGTTGC GATAGAACAA
ATTACAACAA ATTATAATGG TAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA
CGTAAAGAAG GAAAAGAGCA GTACTTACCA AAAACGTATA CATTCGATGA TGGATCAATT
GTGTTCAAAA CAGGAGATAA AGTATCAGAA GAAAAAATTA AGAGATTATA TTGGGCTGCG
AAGGAAGTAA AGGCACAGTA TCACCGTGTA ATTGGAAATG ACAAAGCATT AGAGCCAGGA
AATGCGGATG ATGTGTTAAC AATCGTAATT TATAATAGTC CAGATGAATA TCAGTTAAAT
AGACAATTGT ATGGGTATGA AACAAACAAC GGTGGAATTT ATATCGAAGA AACAGGAACA
TTCTTTACAT ATGAGCGTAC ACCAGAGCAA AGTATTTATA GTTTAGAAGA GTTATTCCGT
CATGAATTTA CTCATTATCT GCAAGGGAGA TATGAAGTTC CTGGTTTATT TGGAAGAGGA
GATATGTATC AAAATGAAAG GTTAACTTGG TTCCAAGAAG GAAATGCAGA GTTTTTCGCA
GGCTCTACTC GTACGAATAA CGTTGTACCA AGAAAGAGTA TAATTAGCGG ATTATCATCT
GATCCTGCAA GCCGTTATAC TGCAGAGCGC ACACTATTTG CTAAATACGG TTCTTGGGAT
TTCTATAATT ACTCGTTCGC ATTGCAGTCT TACTTATATA CGCATCAGTT TGAAACATTT
GATAAAATTC AAGATTTAAT TCGTGCGAAT GACGTGAAAA ATTATGATGC ATATCGTGAA
AATCTAAGTA AAGATCCTAA GTTAAATAAA GAGTATCAAG AGTATATGCA GCAGTTAATT
AATAATCAAG ATACATACAC TGTACCAGAA GTAGCTGATG ATTATTTAGC TGAACATGCA
ACGAAGTCGT TAACAGCGGT GAAGAAAGAA ATTAGTGATA CGTTGCCTAT GAAAGATACA
AAAATGACAA AACATAATTC TCAATTCTTT AATACATTTA CATTAGAAGG TACGTATACA
GGTAGTGTCA CAAAAGGTGA ATCAGAAGAT TGGAAAGCAA TGAGTAAAAG AGTAAATGAA
TCTTTAGAAC AATTGGCGCA AAAAGAATGG AGTGGCTACA AAACTGTTAC AGCATACTTC
GTCAATTATC GTGTGAATAG CTCAAATGAA TTTGAATATG ATGTAGTCTT CCATGGAATC
GCAAAAGATG ATGGAGAAAA TAAAGCTCCG ACGGTTAATG TAAACGGGCC TTATAATGGA
GTTGTAAAAG AGGGAATTCA ATTTAAAAGT GATGGCTCAA ACGATGAAGA TGGAAAAATT
GTTTCTTATT TATGGGAATT TGGAGATGGA AGCACAAGTG CAGAAGTGAA TCCAGTACAT
GTATATGAAA GAGAAGGTTC TTATAAAGTA TCGTTAAGAG TAAAAGATGA TAAAGGAAAA
GAGAGCAGAA GCGAAACAAC TGTTACGATT AAAGATGGAA GTTTAACAGA ATCAGAACCA
AATAATCGTC CAGAGGAAGC AAATCGTATC GGGCTAAATA GTACGATAAA AGGTAATCTT
ATTGGCGGGG ACCACACTGA TGTTTATACA TTTAATGTAG CATCAGCGAA AGATATCGAC
ATTTCTGTTT TAAATGAGTA TGGAATTGGG ATGACATGGG TACTTCACCA TGAATCAGAT
ATGCAAAATT ATGCGGCTTA CGGTCAAGCT AATGGGAATC ATATAGAAGC AAAATTTAAT
GCAAAACCAG GTAAGTATTA CTTGTATGTA TATAAATATG ATAATGGCGA TGGAACATAC
GAATTGTCAG TAAAATAA
 
Protein sequence
MNKKSKINKV MLSISTMALS LGALQTHAVA EEKVPYNVLK TKPVGIEKPV DEVGHVSKVD 
ETLSFQERLK VGDFSQRPAS ITKKTAVKQV KESYSMADLN KMNDQELVET LGSIKWHQIT
DLFQFNEDTK AFYKDKGKMQ VIIDELAHRG STFTKDDSKG IQTFTEVLRS AFYLAFYNNE
LSELNERSFQ DKCLPALKAI AKNPNFKLGT DEQDTVVSAY GKLISNASSD VETVQYASNI
LKQYNDNFTT YVNDRMKGQA IYDIMQGIDY DIQSYLVEAR KEANETMWYG KVDGFINEIN
RIALLNEVTS ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPHL SEPYFVAIEQ
ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA
KEVKAQYHRV IGNDKALEPG NADDVLTIVI YNSPDEYQLN RQLYGYETNN GGIYIEETGT
FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA
GSTRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF
DKIQDLIRAN DVKNYDAYRE NLSKDPKLNK EYQEYMQQLI NNQDTYTVPE VADDYLAEHA
TKSLTAVKKE ISDTLPMKDT KMTKHNSQFF NTFTLEGTYT GSVTKGESED WKAMSKRVNE
SLEQLAQKEW SGYKTVTAYF VNYRVNSSNE FEYDVVFHGI AKDDGENKAP TVNVNGPYNG
VVKEGIQFKS DGSNDEDGKI VSYLWEFGDG STSAEVNPVH VYEREGSYKV SLRVKDDKGK
ESRSETTVTI KDGSLTESEP NNRPEEANRI GLNSTIKGNL IGGDHTDVYT FNVASAKDID
ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEAKFN AKPGKYYLYV YKYDNGDGTY
ELSVK
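
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch of that relationship, the code below builds the codon table from the standard amino-acid string (table 11 shares its codon-to-residue assignments with the standard code and differs in which codons may act as starts). The names `CODON_TABLE`, `clean`, and `translate` are illustrative assumptions, and the sketch does not handle alternative start codons such as GTG or TTG, which are not needed here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence and the final 18 bases.
print(translate("ATGAACAAGA AATCAAAGAT CAATAAAGTG"))  # MNKKSKINKV
print(translate("GAATTGTCAG TAAAATAA"))               # ELSVK
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.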