Gene BCAH820_3146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_3146 
Symbol 
ID7189229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp2980320 
End bp2981540 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content36% 
IMG OID643556557 
Producthemolysin BL lytic component L1 
Protein accessionYP_002452096 
Protein GI218904262 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones132 
Fosmid unclonability p-value0.537349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TTCCATTCAA AGTACTAACT TTAGCTACAT TAGCAACTGT TATAACTGCT 
ACTACCGGTA ACACTATTCA TGCATTTGCA CAAGAAACGA CTGCTCAAGA ACAAAAAGTA
GGCAATTATG CATTAGGCCC CGAAGGACTG AAGAAAGCAT TGGCTGAAAC AGGGTCTCAT
ATTCTAGTAA TGGATTTATA CGCAAAAACA ATGATTAAGC AACCAAATGT AAATTTATCT
AATATCGATT TAGGCTCAGA GGGGGGAGAG TTGCTCAAAA ATATTCACCT TAATCAAGAG
CTGTCACGAA TCAATGCGAA TTACTGGTTA GATACAGCGA AGCCACAGAT TCAAAAAACT
GCTCGTAATA TTGTAAATTA CGATGAACAA TTTCAAAATT ATTACGACAC ATTAGTAGAA
ACTGTACAAA AGAAAGATAA GGCAGGTCTA AAAGAGGGTA TAAATGATTT AATTACTACA
ATCAATACAA ATTCAAAAGA AGTTACAGAT GTGATTAAGA TGCTACAAGA CTTCAAAGGG
AAACTATATC AAAATTCTAC AGATTTTAAA AATAATGTTG GTGGTCCAGA TGGGAAAGGT
GGATTAACTG CAATATTAGC AGGTCAACAG GCAACGATTC CACAACTTCA AGCTGAAATT
GAGCAACTTC GTTCTACTCA GAAAAAACAT TTTGATGATG TATTAGCATG GTCAATTGGT
GGTGGATTGG GAGCAGCTAT TTTAGTTATT GCAGCTATTG GAGGAGCGGT AGTTATTGTT
GTAACTGGCG GTACAGCAAC ACCGGCTGTT GTTGGTGGAC TCTCGGCTCT TGGTGCAGCT
GGTATCGGTC TAGGAACTGC GGCTGGTGTC ACAGCATCTA AGCATATGGA CTCCTATAAT
GAAATTTCTA ACAAAATCGG AGAATTAAGT ATGAAAGCAG ATCGTGCTAA TCAAGCAGTT
CTTTCGCTTA CTAACGCGAA AGAAACATTG GCATATTTAT ATCAGACTGT AGATCAAGCG
ATATTGTCTC TAACAAATAT TCAAAAGCAA TGGAATACAA TGGGCGCAAA TTATACAGAT
TTATTGGATA ATATCGATTC TATGCAAGAC CACAAATTCT CTTTAATACC AGATGATTTA
AAAGCCGCTA AAGAAAGTTG GAATGATATT CATAAAGATG CAGAATTCAT TTCAAAAGAT
ATTGCTTTTA AACAGGAGTA G
 
Protein sequence
MKKFPFKVLT LATLATVITA TTGNTIHAFA QETTAQEQKV GNYALGPEGL KKALAETGSH 
ILVMDLYAKT MIKQPNVNLS NIDLGSEGGE LLKNIHLNQE LSRINANYWL DTAKPQIQKT
ARNIVNYDEQ FQNYYDTLVE TVQKKDKAGL KEGINDLITT INTNSKEVTD VIKMLQDFKG
KLYQNSTDFK NNVGGPDGKG GLTAILAGQQ ATIPQLQAEI EQLRSTQKKH FDDVLAWSIG
GGLGAAILVI AAIGGAVVIV VTGGTATPAV VGGLSALGAA GIGLGTAAGV TASKHMDSYN
EISNKIGELS MKADRANQAV LSLTNAKETL AYLYQTVDQA ILSLTNIQKQ WNTMGANYTD
LLDNIDSMQD HKFSLIPDDL KAAKESWNDI HKDAEFISKD IAFKQE