Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B3313 |
Symbol | |
ID | 7182769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 1884998 |
End bp | 1886686 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643549738 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002445408 |
Protein GI | 218896997 |
COG category | [R] General function prediction only |
COG ID | [COG3740] Phage head maturation protease [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01543] phage prohead protease, HK97 family [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000000000533544 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAATGG AACTACGAGT TAATCAGACC AACATTGAAG CAAATGAAGA TGGCTCAATG ACTGTCAATG GCTATGTAAA TAAGACTGAA CAATTTAGTA AAATGTTGGG ACGAAACGAA CAGTTTAAAG AAAAGATTTC ACGCGGTGTT TTTAAACGTG CAATTGAGAA GGCAAAAGAA ATTCACTTTC TTGCTGAACA TGATGGTGAA AAGATTTTAT CTTCAACTAG AAACGGTTCT TTGGAATTGT CAGAAGATAC AAATGGACTT TATATGTCTG CAACAATTAC ACCTACATCT TGGGGCAAAG ATTATTATGA ATTAATTAAG TCAGGAATTT TAAAGAACAT GTCTTTCGGA TTCCGCTCAA TTAAAGATTC ATGGAAAAAG ACTACTCAAG GTTATTTTGA AAGGACAATC CATGAACTTG AATTATTTGA AGTTTCAGTT GTAAAAGACC CTGCATATTC TCAATCTTCA ATTTCTGCTC GTGGAATTGA TGTTGTTGAA GAGGTTGAAG TACCTGACGA GGTTAAGAAG AAAATAGTAA GAAATATTCA AGAAATGAGT CGCAAGGCTC TTATTGAATT GCGAAATGAC TTGCTCGAAC AATCACAAAG TTATGAAACT CGTGGTCTTA CGGAAGAACG CGAATATGAG CAATTAAAAT CTCAAATTCG AGATATTGAA ATACAACTTA AAAAAATAGA TAAGAAAGAG GTTAGAAATA TGGTTGAATT ATTAAGTCCA AACGATACAA ATGTAGAACA ACGTGGTTTT GAAGAATTTT TAAAAGGTCA CTTATATTCT GAAGAAGTTC GTGCAATTAC AACAGGTACG TCACCAGGAC AACTAACAGT TCCAACTTCA ATTTCCGATC AAATTATTAA GAAATTAGAA GAAGTAGCTC CATTATTTGC ACTATCTAAA CAGTTTCCAA GTGAACATGG TTATCTTGAA GTGTTAAAAG AAACAGGTAT TGGTGGAGCT CAATGGCTAG GTGAAATGGA AAATGCAACT CCAGCAGATT TCACAATGTC AAAAGTAAAA TTAGAACAAA AACGTTTAAC GGCAGCTATT GAATTATCAC AACAACTAAT TAATGATGCT GGCTTTGATA TTGTGAGTTA TGCAATCAAT GTTTTATCTC GACGTATTGC TTATTCAGTT AACCGAGCAA TTGTTAATGG AAATGGCGTT GGACAAATGG AAGGTTTCTT AACTGCAACA TTGGCTTCAG AATCAGTGAT TAAAACAACT GCAAATACAG TTACTACTGA TGATGTTTTA GGACTATTCA ACTCTATGAA TCCAGAACTA ATCGAAGGTG CAGTGTTTGT TATGAACCGT AATACTTGGA ATGCGGTTTC AAAGCTGAAA GATGCGGAAA ACCGATACTA TCTTGTAGAT TTCAAAAATG GTAATGGTTC TAAATATTAC ACTATGCTTG GATTACCGGT AATGATTTCA GATGCCATGC CAGATATTGC AACAGAAAAC AAAGCAATTG GTTTAATTAA TATGGGTGAA GCATATGGAA CTCTGATTAA GAAGGGAATT GAAGTGCAAC ATGTTTATGC TGATAGCGCT CAAGCACTTC GTGGTTCTCA ATTAATCGTT GCTTCTATCT ATCTTGATGG TAAAATTATC AATGAACAAG CAATTCGTTT ATTGTCTATT GCTGCATAA
|
Protein sequence | MKMELRVNQT NIEANEDGSM TVNGYVNKTE QFSKMLGRNE QFKEKISRGV FKRAIEKAKE IHFLAEHDGE KILSSTRNGS LELSEDTNGL YMSATITPTS WGKDYYELIK SGILKNMSFG FRSIKDSWKK TTQGYFERTI HELELFEVSV VKDPAYSQSS ISARGIDVVE EVEVPDEVKK KIVRNIQEMS RKALIELRND LLEQSQSYET RGLTEEREYE QLKSQIRDIE IQLKKIDKKE VRNMVELLSP NDTNVEQRGF EEFLKGHLYS EEVRAITTGT SPGQLTVPTS ISDQIIKKLE EVAPLFALSK QFPSEHGYLE VLKETGIGGA QWLGEMENAT PADFTMSKVK LEQKRLTAAI ELSQQLINDA GFDIVSYAIN VLSRRIAYSV NRAIVNGNGV GQMEGFLTAT LASESVIKTT ANTVTTDDVL GLFNSMNPEL IEGAVFVMNR NTWNAVSKLK DAENRYYLVD FKNGNGSKYY TMLGLPVMIS DAMPDIATEN KAIGLINMGE AYGTLIKKGI EVQHVYADSA QALRGSQLIV ASIYLDGKII NEQAIRLLSI AA
|
| |