Gene BCB4264_A4838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCB4264_A4838 
Symbol 
ID7097748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus B4264 
KingdomBacteria 
Replicon accessionNC_011725 
Strand
Start bp4681641 
End bp4685534 
Gene Length3894 bp 
Protein Length1297 aa 
Translation table11 
GC content55% 
IMG OID643472347 
Productcollagen triple helix repeat domain protein 
Protein accessionYP_002369524 
Protein GI218231261 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0444265 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAATC GTGATAATAA TCGGAAGCAA AATTCGTTAA GCTCTAATTT TAGAATTCCA 
CCCGAACTTA TTGGGCCTAC TTTCCCCCCT GTTCCAACTG GATTTACAGG TATAGGGATT
ACTGGTCCAA CTGGTCCACA AGGCCCAACA GGACCTCAAG GACCAAGAGG ATTACAAGGT
CCGATGGGGG AGATGGGCCC GACAGGACCT CAAGGTGTGC AAGGGATACA AGGATCAGTT
GGGCCAATAG GTGCAACTGG ACCAGAAGGA CAGCAGGGGC CACAAGGATT GAGAGGACCA
CAAGGAGAAA CTGGAGCGAC AGGACCTCAA GGTGTGCAAG GGTTACAAGG TCCGATTGGT
CCAACGGGAG CGACTGGGGC ACAAGGTATA CAGGGGATAC AGGGATTACA GGGGCCAATT
GGAGCGACGG GACCTGAGGG ACCTCAAGGA ATTCAAGGCG TCCAAGGGTT ACCGGGTGCA
ACTGGTCCAC AAGGAATACA AGGAGCACAA GGGATACAAG GACCGAGTGG AAATACAGGT
GCAACCGGAG CGACTGGAGC AACTGGTCAA GGAATAACAG GACCGACTGG AATAACAGGT
CCAACTGGAA TAACTGGACC ATCTGGAGGA CCTCCTGGTC CGACAGGGCC AACTGGTGCG
ACAGGTCCGG GTGGCGGACC GAGTGGAAGT ACAGGTGCGA CTGGAGCAAC GGGGAGTACT
GGGGCTACAG GAAGTACAGG GGTAACAGGA GCAACGGGAA CTACAGGTCC GACTGGAAGT
ACGGGAGCAC AGGGCTTGCA AGGAATACAA GGAATTCAAG GGCCAATTGG CCCAACAGGT
TCAGAAGGAC CGCAGGGGAT TCAAGGTATT CCTGGTCCGA CGGGAGTAAC TGGTGAACAA
GGAATTCAAG GAGTTCAAGG TATTCAAGGG ATAACGGGAG CAACAGGAGA TCAAGGTCCG
CAAGGGATAC AGGGGGCTAT AGGTCCTCAA GGGGCCACAG GAGCAACAGG AGATCAAGGT
CCACAAGGAA TACAAGGAGT ACCAGGGCCA TCAGGAGCAA CAGGACCACA AGGAGTTCAA
GGGATACAAG GTCCAATGGG CGATATAGGA CCAACAGGTC CAGAAGGCCC AGAGGGACTT
CAGGGCCCGC AAGGAATACA AGGAGTACCG GGGCCAGCTG GAGCAACGGG TCCAGAGGGG
CCACAGGGGA TACAAGGAAT TCAAGGACCG GTAGGAGCAA CAGGCTCACA AGGTCCCCAA
GGAATTCAGG GAATTCAAGG TGTGCAAGGG ATAACGGGAG CAACAGGAGT ACAAGGAGCA
ACTGGAATTC AAGGTATACA AGGAGAAATA GGAGCAACGG GTCCAGAGGG TCCCCAAGGA
GTGCAAGGTG CTCAAGGGGC GATTGGTCCA ACAGGTCCAA TGGGCGCACA AGGAGTGCAA
GGAATACAAG GGATTCAAGG CCCAACAGGT GCACAAGGAG TGCAAGGTGC TCAAGGAATA
CAAGGAATTC AAGGTCCGAC GGGTGCAACA GGAGATACGG GAGCAACTGG GGCGACAGGA
GAAGGCACTA CAGGCCCAAC AGGAGTAACT GGTCCGACAG GGGTAACAGG ACCATCTGGA
GGACCAGCAG GACCGACCGG CCCAACGGGG CCATCAGGTC CGGCGGGAGT GACTGGTCCA
TCTGGTGGAC CACCTGGCCC GACAGGAGCA ACTGGGGCGA CAGGAGTAAC AGGGGATACC
GGGGCGACGG GCTCAACTGG AGTGACAGGA GCGACAGGAG AAACGGGAGC AACCGGAGTG
ACGGGTTTAC AAGGTCCGCA AGGAATCCAA GGTGTGCAGG GAGAGATAGG TCCGACGGGT
CCCCAAGGTG TTCAAGGTCC CCAAGGAATT CAAGGAGTAA CGGGGGCCAC TGGAGATCAA
GGTCCGCAAG GGATTCAAGG CCCACAAGGC GACATAGGTC CAACAGGCCC ACAAGGAATT
CAAGGCCCAC AAGGTTCTCA AGGAATCCAA GGAGCGACAG GGGGAACAGG AGCACAAGGC
CCACAGGGAA TCCAAGGTCC GCAAGGTGAC ATAGGTCCGA CTGGGCCACA AGGTCCAACT
GGAATCCAAG GGATACAAGG AGAGATAGGT CCAACAGGTC CAGAAGGCCC AGAGGGACTT
CAGGGCCCGC AAGGAATCCA AGGTGTGCCA GGGCCAGTTG GAGCAACGGG TCCTGAGGGT
CCTCAAGGGA TACAAGGCAT TCAAGGACCG GTAGGAGCAA CAGGCCCACA AGGTCCACAA
GGAATACAGG GAATACAAGG TGTGCAAGGG ATAACGGGAG CAACAGGAGC ACAAGGAGCA
ACTGGAATTC AAGGGATACA AGGGGAAATA GGAGCAACAG GTCCAGAGGG GCCCCAAGGA
GTGCAAGGTA TACAAGGGGC GATTGGTCCA ACAGGTCCGA TGGGCGCACA AGGAGTGCAA
GGTATACAAG GGATTCAAGG AGCAACAGGA GCACAAGGAG TGCAGGGACC ACAAGGAATT
CAAGGAGTGC AAGGTCCGAC GGGAGCAACA GGAGAAACGG GATCAACCGG AGCGACGGGA
GAAGGATCAA CCGGTCCAAC AGGAGTAACC GGTCCAACAG GGGTGACAGG CCCGTCAGGA
GGCCCAGCAG GACCGACCGG CCCAACGGGG CCATCAGGTC CGGCAGGAGT GACAGGTCCA
TCAGGTGGAC CACCTGGCCC GACAGGAGCA ACAGGTGCGA CAGGAGTAAC AGGAGATACC
GGGGCGACAG GCTCAACTGG AGTGACAGGA GCAACAGGAG CAACGGGATC AACCGGAGTG
ACGGGTTTAC AGGGTCCACA AGGAATCCAA GGTGTTCAAG GAGAGATAGG TCCAACCGGT
CCACAGGGTA TTCAAGGTCC ACAAGGAATA CAAGGAGTAA CGGGGGCAAC TGGAGCACAA
GGTCCCCAAG GAATTCAAGG CCCACAAGGC GACATAGGTC CAACCGGCCC ACAAGGAATT
CAAGGTCCAC AAGGTCCTCA AGGAATCCAA GGAACGACGG GGGCAACCGG AGCACAAGGC
CCACAGGGAA TCCAAGGTCC GCAAGGTGAC ATAGGTCCGA CCGGCCCACA AGGCCCACAA
GGAATTCAAG GCCCGCAAGG AATTCAAGGT CCAACGGGAG CTACAGGAGT AACCGGAGCG
ACAGGTCCAC AAGGGATTCA AGGTCCACAA GGAATTCAAG GCCCGCAAGG AATTCAAGGT
CCAACGGGAG TTACAGGAGC AACCGGAGCG ACAGGTCCAC AAGGAATTCA AGGCCCGCAA
GGAATTCAAG GTCCAACGGG AGCTACAGGA GCAACCGGAG CGACAGGTCC ACAAGGAATT
CAAGGCTCGC AAGGAATTCA AGGTCCAACG GGAGCTACAG GAGCAACCGG TTCACAAGGT
CCAACTGGAG ATACAGGTCC AACCGGAGCT GGAGCCACTG GAGCGACCGG GGCGACTGGA
GTTAGTACAA CTGCAACGTA TGCGTTTGCG AATAATACAT CAGGAACCGC TATTTCCGTT
TTATTAGGGG GTACGAACGT ACCGTTACCG AACAATCAAA ATATTGGCCC AGGAATAACC
GTTAGTGGTG GAAATACTGT ATTTACAGTT GCGAATGCAG GAAACTATTA TATAGCCTAT
ACAATTAATT TAACGGCAGG TTTACTTGTA AGTTCTCGTA TAACTGTAAA TGGCAGTCCG
CTTGCGGGAA CGATAAATGC CCCGACAGTG GCTACTGGTT CATTTAGTGC AACAATAATC
GCTAACTTGC CTGCTGGAGC TGCTGTTAGC TTACAGTTAT TTGGAGTAGT TGCAGTAGCT
ACATTATCTA CAGCAACGCC AGGGGCTACC CTAACTATTA TTAGATTAAG TTAA
 
Protein sequence
MKNRDNNRKQ NSLSSNFRIP PELIGPTFPP VPTGFTGIGI TGPTGPQGPT GPQGPRGLQG 
PMGEMGPTGP QGVQGIQGSV GPIGATGPEG QQGPQGLRGP QGETGATGPQ GVQGLQGPIG
PTGATGAQGI QGIQGLQGPI GATGPEGPQG IQGVQGLPGA TGPQGIQGAQ GIQGPSGNTG
ATGATGATGQ GITGPTGITG PTGITGPSGG PPGPTGPTGA TGPGGGPSGS TGATGATGST
GATGSTGVTG ATGTTGPTGS TGAQGLQGIQ GIQGPIGPTG SEGPQGIQGI PGPTGVTGEQ
GIQGVQGIQG ITGATGDQGP QGIQGAIGPQ GATGATGDQG PQGIQGVPGP SGATGPQGVQ
GIQGPMGDIG PTGPEGPEGL QGPQGIQGVP GPAGATGPEG PQGIQGIQGP VGATGSQGPQ
GIQGIQGVQG ITGATGVQGA TGIQGIQGEI GATGPEGPQG VQGAQGAIGP TGPMGAQGVQ
GIQGIQGPTG AQGVQGAQGI QGIQGPTGAT GDTGATGATG EGTTGPTGVT GPTGVTGPSG
GPAGPTGPTG PSGPAGVTGP SGGPPGPTGA TGATGVTGDT GATGSTGVTG ATGETGATGV
TGLQGPQGIQ GVQGEIGPTG PQGVQGPQGI QGVTGATGDQ GPQGIQGPQG DIGPTGPQGI
QGPQGSQGIQ GATGGTGAQG PQGIQGPQGD IGPTGPQGPT GIQGIQGEIG PTGPEGPEGL
QGPQGIQGVP GPVGATGPEG PQGIQGIQGP VGATGPQGPQ GIQGIQGVQG ITGATGAQGA
TGIQGIQGEI GATGPEGPQG VQGIQGAIGP TGPMGAQGVQ GIQGIQGATG AQGVQGPQGI
QGVQGPTGAT GETGSTGATG EGSTGPTGVT GPTGVTGPSG GPAGPTGPTG PSGPAGVTGP
SGGPPGPTGA TGATGVTGDT GATGSTGVTG ATGATGSTGV TGLQGPQGIQ GVQGEIGPTG
PQGIQGPQGI QGVTGATGAQ GPQGIQGPQG DIGPTGPQGI QGPQGPQGIQ GTTGATGAQG
PQGIQGPQGD IGPTGPQGPQ GIQGPQGIQG PTGATGVTGA TGPQGIQGPQ GIQGPQGIQG
PTGVTGATGA TGPQGIQGPQ GIQGPTGATG ATGATGPQGI QGSQGIQGPT GATGATGSQG
PTGDTGPTGA GATGATGATG VSTTATYAFA NNTSGTAISV LLGGTNVPLP NNQNIGPGIT
VSGGNTVFTV ANAGNYYIAY TINLTAGLLV SSRITVNGSP LAGTINAPTV ATGSFSATII
ANLPAGAAVS LQLFGVVAVA TLSTATPGAT LTIIRLS