Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH187_A4002 |
Symbol | |
ID | 7076749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH187 |
Kingdom | Bacteria |
Replicon accession | NC_011658 |
Strand | - |
Start bp | 3753065 |
End bp | 3754222 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643452428 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002339939 |
Protein GI | 217961371 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAG AATTAAGAGA ATTGTTAGCT AAGATTCAAA ATAAGAAAGC AGCAGCACGA GAACTTTTAG CTCAAAAAAA GCTGGAAGAA GCGGAGCAAC TTACAAATGA AATTAAGGAT TTACAGAAGG AGTTCGATAT CGCTTCAGCT TTGTATGAAG AAGAAGTAAA TAATATTCCA AATGACCCAA TTCCTCAACC ACAAGCAAAT ACAGTACAGC CTAATGAGGC GTTTGTTAAT GCAATGAAAG CAGCTGTAGG AAAACATAAA CTATCTGAGG ACGAAAAAGA GGTATTGAAT GCAACTACTA TGACTGAAGG TGTGCCATCT GATGGCGGTT TAACTGTACC AAAAGATATT CGTACAGCTA TTAAAGAATT ACGTCGTAGT GGTCCAGATG CACTTGAAAA TTACGTAAAT GTTGAGTCTG TTTCTACATT AACTGGATCT AGAGTTATTG AAGTAGAGGC AGAATATATC CCGTTTGACA ATGTGGATGA GGCAGCAGAT TTTCCATTGT TGGAAGCACC GAAGTTTGAA GATATTCAAT ATAGCGTTAA GAAAAAAGGT GGTATCTTGA AGTTTTCAAA AGAATTACTT GCAGATACGG CAGAAAATAT TCAAGCTTAC ATTAAAAAAT GGACATTTAA AAAATCTAAA GCTACTCGTA ATGCTTTAAT TTTAAAAGCT TTAACCGATA ATTTCAGTGC CACAAAGGTA GCGGTTAAAA CAGTTGACGA TTTAAAGGAT ATTTTTAATG TGAAGCTTGA TCCAGGTATT GAGCCAACAT CAAGTGCGAT CATGAATCAA GATGCTTTTA ATTATCTTGA TAAATTAAAA GATACGGATG GTAAATATAT TCTTCAACCA AACCCAACAA TGACAACACA AAAGCTGTTA TTTGGTAAAT ATCCAATTCG TGTTGTTAGT AATAAAACAT TAAAAACAGA TGCGGTGAAG AAGACAACAC CATTATATTT TGGTGATTTT AAAGAGGCTA TTACTATCTT TGATAGAGAG GCTTTATTCA TCGAGTTCTC AGAGCAAGCA TTAGACCTGT GGGGCAAAGA TTTAGTTGGT ATGAAGGTAC GTGAGCGCTT AGATGTAAAA GCTGTTGATA AAAAAGCGAT TGTTACTGGC GAAATCACGT TTGTTTAA
|
Protein sequence | MPKELRELLA KIQNKKAAAR ELLAQKKLEE AEQLTNEIKD LQKEFDIASA LYEEEVNNIP NDPIPQPQAN TVQPNEAFVN AMKAAVGKHK LSEDEKEVLN ATTMTEGVPS DGGLTVPKDI RTAIKELRRS GPDALENYVN VESVSTLTGS RVIEVEAEYI PFDNVDEAAD FPLLEAPKFE DIQYSVKKKG GILKFSKELL ADTAENIQAY IKKWTFKKSK ATRNALILKA LTDNFSATKV AVKTVDDLKD IFNVKLDPGI EPTSSAIMNQ DAFNYLDKLK DTDGKYILQP NPTMTTQKLL FGKYPIRVVS NKTLKTDAVK KTTPLYFGDF KEAITIFDRE ALFIEFSEQA LDLWGKDLVG MKVRERLDVK AVDKKAIVTG EITFV
|
| |