Gene BCB4264_A0569 details


Gene Information

Locus tag: BCB4264_A0569
Symbol:
ID: 7097307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus B4264
Kingdom: Bacteria
Replicon accession: NC_011725
Strand:
Start bp: 540671
End bp: 542215
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 34%
IMG OID: 643468124
Product: phage major capsid protein, HK97 family
Protein accession: YP_002365330
Protein GI: 218234165
COG category: [R] General function prediction only
COG ID: [COG3740] Phage head maturation protease
        [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
            [TIGR01554] phage major capsid protein, HK97 family
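
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 540671
END_BP = 542215


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1545 514
```

Both results agree with the Gene Length (1545 bp) and Protein Length (514 aa) entries.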


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.361307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGG AACTTCGTAT TGCAGTCTCA GATTTACATA CAAATACAGA TGGAACAATG 
AACGTTTCCG GTTATGTTAA TAAAACAGAT CAACTGAGTA ATGTATTAGG AGTTACAAAG
CGATTTGTTG AAAAGATTGC TAAAGGTGCT TTCTCGCGCG CTATTCAATC AGGAAAAAAA
GACATTGAAT TCCTTGCTGA ACACAAGGGT GACTTAATTT TGGCGTCTAC TCGCAACGGC
TCTTTACGAT TAACAGAAGA TAATAAAGGT CTTTATATGG AAGCGACAAT TGCTTCTACA
TCATGGGGAA AAGATTATTA CGAATTAATT AACTCTGGCA TACTCAGAAA CATGTCATTT
GGTTTCCGTA CTATTAAAGA TAGTTGGAGA TTACTTGAAT CAAATCTTTA TGAACGAACA
ATTGAAGAGC TTGAATTATT TGAGGTTTCA GTTGTGAAGG ATCCTGCTTA TTCTCAATCG
ACAATTGCAG CCCGTGGTAT TCATCTTGTT AAAGATATAG AAGTCCCAAA AGAAGTCAAA
GTATACAATC AGCTAAAGGA GAAGAGAGAA ATGGAAAAGA CGGTAATTAG ATATGGTATT
GAAGAAAGAA CAATCGAACA ATTAGAAGTA GAACGATTTG AGTCTTTTGT TATGGGAAAA
CAAAATGGAC AAGAAAGAGA CATGAACGAA AATCGAAATA GCGAGGTAAG ATATAATACA
ACGGGAACAG CTGGTGGAGC AGTAATTCCT GAATCAGTTC ACAATCAAGT AATTAAGAAA
GTGGAAGAGT ATTCTCCTAT TTTTGAAATG GCTCGTAAGT TTCCATCAGT TGCAGGTACG
TTAACGATTG CAAGGGAAGA TAGTCTAGAT GATGCAGGTT TCGTTGGCGA AGGAATTAAT
TTAAAACAAT TAGCTATGGA TTTTGAAACT GTTAAATTGG AACAGAAGAG AGCTGGGGCA
TATATTCGGG CTTCTAATCA GTTATTAAAT GATACTGCAA TCTCAACATC AGATTATATT
GCAGACTTAT TATCACGTAA ATTAATAAAG GCCATAGAAA AATCAATTTT AGTTGGTTCT
GGTGGAAATG AATTCAATGG TATAGTAAAC GATACATTCG TTCCAACAGT GAAGGTAAAG
AAAATTGAAA TTGATGAATT AATGGATTTG CATAATAGCC TTCCTTACGA TTATGCAGAT
GGAAATGCCA CATTTATTAT GGCACGTAAA ACATATAACC AAATTGCTAA ATTAAAAGAT
GCATTAGGTC ATTCTTATGT ACAAAATGGA GTAGTAAATG GAAGACCCAC AAAAACGTTA
TTTGGTAAAG TGATTTATAT TACTGATGTT TTACCTGAAT CAACACCAGT AATTTTCGCA
AACTTTTATC ATGCTTATGC AATAATGATT AAACAAGCTG CGAGATTACA GCGAACAGTT
GATACTGAAA ACGCTCTAGC AGGAACAACT ACTTTCGTAT TGGATAGTTA TATGGATGGA
GCAGTCTATA ATCCACAAGC TATTGCTAAA TTGGTTATTG CTTAA
 
Protein sequence
MKMELRIAVS DLHTNTDGTM NVSGYVNKTD QLSNVLGVTK RFVEKIAKGA FSRAIQSGKK 
DIEFLAEHKG DLILASTRNG SLRLTEDNKG LYMEATIAST SWGKDYYELI NSGILRNMSF
GFRTIKDSWR LLESNLYERT IEELELFEVS VVKDPAYSQS TIAARGIHLV KDIEVPKEVK
VYNQLKEKRE MEKTVIRYGI EERTIEQLEV ERFESFVMGK QNGQERDMNE NRNSEVRYNT
TGTAGGAVIP ESVHNQVIKK VEEYSPIFEM ARKFPSVAGT LTIAREDSLD DAGFVGEGIN
LKQLAMDFET VKLEQKRAGA YIRASNQLLN DTAISTSDYI ADLLSRKLIK AIEKSILVGS
GGNEFNGIVN DTFVPTVKVK KIEIDELMDL HNSLPYDYAD GNATFIMARK TYNQIAKLKD
ALGHSYVQNG VVNGRPTKTL FGKVIYITDV LPESTPVIFA NFYHAYAIMI KQAARLQRTV
DTENALAGTT TFVLDSYMDG AVYNPQAIAK LVIA
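
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration. It builds the codon table by hand on the assumption that translation table 11 assigns the same amino acids as the standard genetic code and differs only in which start codons it permits. For brevity it checks only the first 60 bases of the gene against the first 20 residues of the protein, and computes GC content for that same fragment rather than the whole gene. All names are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and the matching start of the protein.
FIRST_DNA_LINE = "ATGAAAATGG AACTTCGTAT TGCAGTCTCA GATTTACATA CAAATACAGA TGGAACAATG"
FIRST_PROTEIN_BLOCKS = "MKMELRIAVS DLHTNTDGTM"

assert translate(FIRST_DNA_LINE) == clean(FIRST_PROTEIN_BLOCKS)
print(translate(FIRST_DNA_LINE))  # MKMELRIAVSDLHTNTDGTM
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the complete protein sequence and the GC content entry.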