Gene BCZK1149 details


Gene Information

Locus tag: BCZK1149
Symbol:
ID: 3024079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_006274
Strand:
Start bp: 1251054
End bp: 1253003
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 40%
IMG OID: 637545381
Product: hypothetical protein
Protein accession: YP_082748
Protein GI: 52144080
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
            [TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family
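
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1251054, 1253003)
print(length)                  # 1950
print(protein_length(length))  # 649
```

Both results agree with the Gene Length and Protein Length fields of this record.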


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCAAA ATATATTACT TATAGGAGAT GGCATTCTTG CAGACTATGT ACATGATCAA 
TTATACAAAC AATATTCCAT CATTCGCCAA CATACAATTG CAGACGAACT CCCTGAAAAT
ATCGACCTCG CTCTCGTATT ACACGACGGC TCTCCTTCTA CTATTCACCA TGACGCTGAG
CTAACTTTCC GGTCAAATCA TATTCCGTGG CTACGTGGTT TTACTTCATT TGGTGAAGGG
ATTATCGGGC CTTATATTCA CCCTCCTGCA GCGGGATGTA CTCATTGTGC CGATGGACGA
CGCTTTATCG CTGGATTTGA TCAAAAAGAA ATGTGGGAGT TACAACGAAA ATATGCGGTA
AAAGAAGAAA ACGTAACGAG GCGTGATGTA CGTGCCACCC AAAATGGCAT TCTGCAAATG
TGCCATTTGA TTTGCGCAGA AACAGAGAAA ATATTAACTC ATAATCACCC TTCTTTAGAA
AATGAACTCA TTTTACTAAA CTTACAAACA CTGCAATGTA CGCGGCATTC TTTTCTTCCA
GATCCAATCT GCCCTGTATG TAGTAATTTA CCTGATGACA CGGCAGATGC AGCAGCAATT
TCATTACAAC CGAGCTTAAA AACAAGTGAT GCAACATATC GCTGTCGTTC CATTCATGAA
CTAAACACAT TTTTAACGAA AGACTATTTA GATTACCGAG TCGGTATGTT GAATGGAAAA
ATGCAGCATT CTTTATTACC ATTTGCTGAC GTTATTATAA ACATGCCATT ACTGTTTGGA
AATGAAGGGG TTGCAGGGCG CACTCATTCA TTTGCAATCA GTGAAGCAAC TGCTATTTTA
GAAGGTTTAG AACGATATTG TGGTATGTCA CCTCGAGGGA AAAAGACAAA TGTGTATGGT
AGTTTTCATG ATGTAGAGGA CCACGCGCTG AATCCCCTTA CGCTCGGTGT ACATACAAAT
GAACATTATA ATCGTGATGG TTTTCCATTT AAACCATTTG ATCCTGACTA TGAACAAAAC
TGGGTATGGG GATATTCACT ATCACAAAAC CGGCCAATTT TAGTTCCTGA ATCAATCGCT
TATTATAGCC TCGGTCATCG AGATGCTTAC GTATATGAAA CATCAAATGG ATGTGCCATT
GGTGGTAGTT TAGAAGAAGC AATTTTTCAC GGCATTTTAG AAATTGTAGA GCGTGACGCC
TTTTTGCTCA CTTGGTATGC TGAATTACCT CTTCACCGCC TTGATCTTAG TTCAGCACAT
GATACAGAAT TACAATTAAT GATTCAGCGG CTATACACGA TTACTGGTTA TGAATTACAT
GCATTTAACG CAACGATGGA ACACGGCATC CCGAGCTTAT GGGTAATTGC GAAAAATACG
CGTGAAAATG GAATGAATGT CGTTTGTGCT GGAGGCTCTC ATTTGGACCC AGTCCGTGCT
TTAAAGAGTG CCATTCACGA AATAGCAGGC ATGTTACTTA TAACAGACGA TGAACTTGAG
GAAAAAAGAG AGTACTATGA AAACTGCTTA CAAGACCCGT ATCTCGTAAA TAAAATGGAA
GACCATAGTA TGCTGTACGG ATTGAAAGAA GCAGAAGAAC GTCTTCACTT TCTTTTACGC
GGGGATGCCC CGGTGCAAAC GTTCCAGGAA ATGAATGCAT TACAATCAGT TGATCTAGAT
TTAACATCCG ATCTTCATCA ACTTTTAAAC CGTCTAGGGC AATCTGGACT TGAAGTAATC
GTTGTCGATC AAACAGTACC TCTTATAGAA AAAAACGGAT TACATTGTGT AAAAGTCATT
ATTCCAGGCA TGCTACCGAT GACATTTGGT CACCATCTCA CTCGACTTAC AGGGCTAGAT
CGAGTGTATA CCGTACCGAT GACACTTGGA TATACAGACG AACCTTTAAC GAATGAACAA
TTAAATCCAC ATCCGCACCC GTTTCCATAG
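
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be removed before the sequence is analyzed. As a minimal sketch, the hypothetical helpers below join the groups and compute GC content as the percentage of G and C bases; how the database itself rounds the reported GC content is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block to check
# the record-level GC content.
first_line = "ATGACTCAAA ATATATTACT TATAGGAGAT GGCATTCTTG CAGACTATGT ACATGATCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

The value printed for a single line will differ from the whole-gene figure, since GC content varies along the sequence.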
 
Protein sequence
MTQNILLIGD GILADYVHDQ LYKQYSIIRQ HTIADELPEN IDLALVLHDG SPSTIHHDAE 
LTFRSNHIPW LRGFTSFGEG IIGPYIHPPA AGCTHCADGR RFIAGFDQKE MWELQRKYAV
KEENVTRRDV RATQNGILQM CHLICAETEK ILTHNHPSLE NELILLNLQT LQCTRHSFLP
DPICPVCSNL PDDTADAAAI SLQPSLKTSD ATYRCRSIHE LNTFLTKDYL DYRVGMLNGK
MQHSLLPFAD VIINMPLLFG NEGVAGRTHS FAISEATAIL EGLERYCGMS PRGKKTNVYG
SFHDVEDHAL NPLTLGVHTN EHYNRDGFPF KPFDPDYEQN WVWGYSLSQN RPILVPESIA
YYSLGHRDAY VYETSNGCAI GGSLEEAIFH GILEIVERDA FLLTWYAELP LHRLDLSSAH
DTELQLMIQR LYTITGYELH AFNATMEHGI PSLWVIAKNT RENGMNVVCA GGSHLDPVRA
LKSAIHEIAG MLLITDDELE EKREYYENCL QDPYLVNKME DHSMLYGLKE AEERLHFLLR
GDAPVQTFQE MNALQSVDLD LTSDLHQLLN RLGQSGLEVI VVDQTVPLIE KNGLHCVKVI
IPGMLPMTFG HHLTRLTGLD RVYTVPMTLG YTDEPLTNEQ LNPHPHPFP
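
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard compact base ordering (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, which does not matter here because the gene starts with ATG. The `translate` helper is illustrative, not the database's own tool.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid code.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
print(translate("ATGACTCAAA ATATATTACT TATAGGAGAT"))  # MTQNILLIGD
print(translate("TTAAATCCAC ATCCGCACCC GTTTCCATAG"))  # LNPHPHPFP
```

The two outputs match the first ten and last nine residues of the protein sequence listed in this record.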