Gene BCE_0400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCE_0400 
Symbol 
ID2752365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus ATCC 10987 
KingdomBacteria 
Replicon accessionNC_003909 
Strand
Start bp416538 
End bp417731 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content45% 
IMG OID637277189 
ProductHK97 family phage major capsid protein 
Protein accessionNP_976728 
Protein GI42779481 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TTCTTGAATT GCGTGAGAAA CGCGCTAAAG CATGGGACGC AGCAAAGGCA 
TTCCTTGATT CAAAACGTGG CGGTGATGGA TTGTTATCCG CCGAGGACAC GACAACCTAT
GAAAAAATGG AAGCCGATGT GGTGGCACTT GGTAAGGAAA TCGAACGTTT GGAACGCCAA
GCATCTATCG ACTTAGAACT GTCGAAAGCA ACCAGTAACC CAATTACGAA CGAACCTACT
AGAACTGGAG AGGAAAAGAC CGGACGCGCA AGTGCTGAAT ATAAAAAAGC TTTCTGGAAT
GCGATGCGTG ACAATGTCAG CTATGAAGTA AGGAACGCTC TAAAGATTGG AACTGATTCT
GAAGGTGGAT TCCTTGTGCC AGATGAGTTT GAGCGTACGC TAGTAGAAGC CCTAGAGGAA
GAAAATATTT TCCGTAGGTT AGCCAATGTC ATCACAACAT CTTCTGGCGA CCGCAAGATT
CCTGTTGTTG CAAGCAAAGG CTCTGCAAGC TGGATCGATG AAGAAGGAGC TATTCCTGAA
AGTGATGATA GCTTCGGTCA AGTATCCATC GGTGCTTATA AACTGGCAAC GATGATTAAA
GTCTCAGAGG AACTGCTAAA CGATTCCGTG TTCAATCTCG AAAGCTACAT CACAAGAGAA
TTCGCACGAC GTATTGGTAA CAAGGAGGAG GAAGCCTTCT TTATAGGTGA CGGTACAGGA
AAGCCAACAG GAATTCTGAA TGCTACTGGT GGTGGTCAAG TTGGGGTTAC TGCGGCAAGT
GCCACTGCCA TCACTTTGGA TGAGGTATTA GATTTATTCT ACAGCTTAAA AGCACCGTAT
CGAAATAAGG CAGTATTCGT AATGAATGAC GCCACTATAA AAGCTATCCG TAAATTAAAG
GACGGAAACG GACAGTACTT ATGGCAACCT TCTGTCCAAG CGGGGACACC TGATACGATT
CTTAACCGCC CGCTGTACAC CTCATCATAT GTACCTACTA TTGAAGCAGG TGCAAAGACT
ATGGTATTCG GTGATTTTAG TTATTACTGG GTGGCAGACC GTCAAGGACG CGTATTCAAA
CGATTAAATG AACTCTATGC TGTTACAGGT CAAGTAGGAT TTATTGCGAC TCAGCGAGTT
GATGGAAAGC TTATCTTACC GGAGGCCGTT AAGGTACTCC AACAGAAAGC CTAA
 
Protein sequence
MSKILELREK RAKAWDAAKA FLDSKRGGDG LLSAEDTTTY EKMEADVVAL GKEIERLERQ 
ASIDLELSKA TSNPITNEPT RTGEEKTGRA SAEYKKAFWN AMRDNVSYEV RNALKIGTDS
EGGFLVPDEF ERTLVEALEE ENIFRRLANV ITTSSGDRKI PVVASKGSAS WIDEEGAIPE
SDDSFGQVSI GAYKLATMIK VSEELLNDSV FNLESYITRE FARRIGNKEE EAFFIGDGTG
KPTGILNATG GGQVGVTAAS ATAITLDEVL DLFYSLKAPY RNKAVFVMND ATIKAIRKLK
DGNGQYLWQP SVQAGTPDTI LNRPLYTSSY VPTIEAGAKT MVFGDFSYYW VADRQGRVFK
RLNELYAVTG QVGFIATQRV DGKLILPEAV KVLQQKA