Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCE_0400 |
Symbol | |
ID | 2752365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | + |
Start bp | 416538 |
End bp | 417731 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637277189 |
Product | HK97 family phage major capsid protein |
Protein accession | NP_976728 |
Protein GI | 42779481 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA TTCTTGAATT GCGTGAGAAA CGCGCTAAAG CATGGGACGC AGCAAAGGCA TTCCTTGATT CAAAACGTGG CGGTGATGGA TTGTTATCCG CCGAGGACAC GACAACCTAT GAAAAAATGG AAGCCGATGT GGTGGCACTT GGTAAGGAAA TCGAACGTTT GGAACGCCAA GCATCTATCG ACTTAGAACT GTCGAAAGCA ACCAGTAACC CAATTACGAA CGAACCTACT AGAACTGGAG AGGAAAAGAC CGGACGCGCA AGTGCTGAAT ATAAAAAAGC TTTCTGGAAT GCGATGCGTG ACAATGTCAG CTATGAAGTA AGGAACGCTC TAAAGATTGG AACTGATTCT GAAGGTGGAT TCCTTGTGCC AGATGAGTTT GAGCGTACGC TAGTAGAAGC CCTAGAGGAA GAAAATATTT TCCGTAGGTT AGCCAATGTC ATCACAACAT CTTCTGGCGA CCGCAAGATT CCTGTTGTTG CAAGCAAAGG CTCTGCAAGC TGGATCGATG AAGAAGGAGC TATTCCTGAA AGTGATGATA GCTTCGGTCA AGTATCCATC GGTGCTTATA AACTGGCAAC GATGATTAAA GTCTCAGAGG AACTGCTAAA CGATTCCGTG TTCAATCTCG AAAGCTACAT CACAAGAGAA TTCGCACGAC GTATTGGTAA CAAGGAGGAG GAAGCCTTCT TTATAGGTGA CGGTACAGGA AAGCCAACAG GAATTCTGAA TGCTACTGGT GGTGGTCAAG TTGGGGTTAC TGCGGCAAGT GCCACTGCCA TCACTTTGGA TGAGGTATTA GATTTATTCT ACAGCTTAAA AGCACCGTAT CGAAATAAGG CAGTATTCGT AATGAATGAC GCCACTATAA AAGCTATCCG TAAATTAAAG GACGGAAACG GACAGTACTT ATGGCAACCT TCTGTCCAAG CGGGGACACC TGATACGATT CTTAACCGCC CGCTGTACAC CTCATCATAT GTACCTACTA TTGAAGCAGG TGCAAAGACT ATGGTATTCG GTGATTTTAG TTATTACTGG GTGGCAGACC GTCAAGGACG CGTATTCAAA CGATTAAATG AACTCTATGC TGTTACAGGT CAAGTAGGAT TTATTGCGAC TCAGCGAGTT GATGGAAAGC TTATCTTACC GGAGGCCGTT AAGGTACTCC AACAGAAAGC CTAA
|
Protein sequence | MSKILELREK RAKAWDAAKA FLDSKRGGDG LLSAEDTTTY EKMEADVVAL GKEIERLERQ ASIDLELSKA TSNPITNEPT RTGEEKTGRA SAEYKKAFWN AMRDNVSYEV RNALKIGTDS EGGFLVPDEF ERTLVEALEE ENIFRRLANV ITTSSGDRKI PVVASKGSAS WIDEEGAIPE SDDSFGQVSI GAYKLATMIK VSEELLNDSV FNLESYITRE FARRIGNKEE EAFFIGDGTG KPTGILNATG GGQVGVTAAS ATAITLDEVL DLFYSLKAPY RNKAVFVMND ATIKAIRKLK DGNGQYLWQP SVQAGTPDTI LNRPLYTSSY VPTIEAGAKT MVFGDFSYYW VADRQGRVFK RLNELYAVTG QVGFIATQRV DGKLILPEAV KVLQQKA
|
| |