Gene BCG9842_B3313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B3313 
Symbol 
ID7182769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp1884998 
End bp1886686 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content34% 
IMG OID643549738 
Productphage major capsid protein, HK97 family 
Protein accessionYP_002445408 
Protein GI218896997 
COG category[R] General function prediction only 
COG ID[COG3740] Phage head maturation protease
[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family
[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.000000000533544 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATGG AACTACGAGT TAATCAGACC AACATTGAAG CAAATGAAGA TGGCTCAATG 
ACTGTCAATG GCTATGTAAA TAAGACTGAA CAATTTAGTA AAATGTTGGG ACGAAACGAA
CAGTTTAAAG AAAAGATTTC ACGCGGTGTT TTTAAACGTG CAATTGAGAA GGCAAAAGAA
ATTCACTTTC TTGCTGAACA TGATGGTGAA AAGATTTTAT CTTCAACTAG AAACGGTTCT
TTGGAATTGT CAGAAGATAC AAATGGACTT TATATGTCTG CAACAATTAC ACCTACATCT
TGGGGCAAAG ATTATTATGA ATTAATTAAG TCAGGAATTT TAAAGAACAT GTCTTTCGGA
TTCCGCTCAA TTAAAGATTC ATGGAAAAAG ACTACTCAAG GTTATTTTGA AAGGACAATC
CATGAACTTG AATTATTTGA AGTTTCAGTT GTAAAAGACC CTGCATATTC TCAATCTTCA
ATTTCTGCTC GTGGAATTGA TGTTGTTGAA GAGGTTGAAG TACCTGACGA GGTTAAGAAG
AAAATAGTAA GAAATATTCA AGAAATGAGT CGCAAGGCTC TTATTGAATT GCGAAATGAC
TTGCTCGAAC AATCACAAAG TTATGAAACT CGTGGTCTTA CGGAAGAACG CGAATATGAG
CAATTAAAAT CTCAAATTCG AGATATTGAA ATACAACTTA AAAAAATAGA TAAGAAAGAG
GTTAGAAATA TGGTTGAATT ATTAAGTCCA AACGATACAA ATGTAGAACA ACGTGGTTTT
GAAGAATTTT TAAAAGGTCA CTTATATTCT GAAGAAGTTC GTGCAATTAC AACAGGTACG
TCACCAGGAC AACTAACAGT TCCAACTTCA ATTTCCGATC AAATTATTAA GAAATTAGAA
GAAGTAGCTC CATTATTTGC ACTATCTAAA CAGTTTCCAA GTGAACATGG TTATCTTGAA
GTGTTAAAAG AAACAGGTAT TGGTGGAGCT CAATGGCTAG GTGAAATGGA AAATGCAACT
CCAGCAGATT TCACAATGTC AAAAGTAAAA TTAGAACAAA AACGTTTAAC GGCAGCTATT
GAATTATCAC AACAACTAAT TAATGATGCT GGCTTTGATA TTGTGAGTTA TGCAATCAAT
GTTTTATCTC GACGTATTGC TTATTCAGTT AACCGAGCAA TTGTTAATGG AAATGGCGTT
GGACAAATGG AAGGTTTCTT AACTGCAACA TTGGCTTCAG AATCAGTGAT TAAAACAACT
GCAAATACAG TTACTACTGA TGATGTTTTA GGACTATTCA ACTCTATGAA TCCAGAACTA
ATCGAAGGTG CAGTGTTTGT TATGAACCGT AATACTTGGA ATGCGGTTTC AAAGCTGAAA
GATGCGGAAA ACCGATACTA TCTTGTAGAT TTCAAAAATG GTAATGGTTC TAAATATTAC
ACTATGCTTG GATTACCGGT AATGATTTCA GATGCCATGC CAGATATTGC AACAGAAAAC
AAAGCAATTG GTTTAATTAA TATGGGTGAA GCATATGGAA CTCTGATTAA GAAGGGAATT
GAAGTGCAAC ATGTTTATGC TGATAGCGCT CAAGCACTTC GTGGTTCTCA ATTAATCGTT
GCTTCTATCT ATCTTGATGG TAAAATTATC AATGAACAAG CAATTCGTTT ATTGTCTATT
GCTGCATAA
 
Protein sequence
MKMELRVNQT NIEANEDGSM TVNGYVNKTE QFSKMLGRNE QFKEKISRGV FKRAIEKAKE 
IHFLAEHDGE KILSSTRNGS LELSEDTNGL YMSATITPTS WGKDYYELIK SGILKNMSFG
FRSIKDSWKK TTQGYFERTI HELELFEVSV VKDPAYSQSS ISARGIDVVE EVEVPDEVKK
KIVRNIQEMS RKALIELRND LLEQSQSYET RGLTEEREYE QLKSQIRDIE IQLKKIDKKE
VRNMVELLSP NDTNVEQRGF EEFLKGHLYS EEVRAITTGT SPGQLTVPTS ISDQIIKKLE
EVAPLFALSK QFPSEHGYLE VLKETGIGGA QWLGEMENAT PADFTMSKVK LEQKRLTAAI
ELSQQLINDA GFDIVSYAIN VLSRRIAYSV NRAIVNGNGV GQMEGFLTAT LASESVIKTT
ANTVTTDDVL GLFNSMNPEL IEGAVFVMNR NTWNAVSKLK DAENRYYLVD FKNGNGSKYY
TMLGLPVMIS DAMPDIATEN KAIGLINMGE AYGTLIKKGI EVQHVYADSA QALRGSQLIV
ASIYLDGKII NEQAIRLLSI AA