Gene BCZK0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0459 
Symbol 
ID3027175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp534407 
End bp537745 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table11 
GC content32% 
IMG OID637544676 
Productinternalin protein 
Protein accessionYP_082066 
Protein GI52144762 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein
[COG5386] Cell surface protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.695574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGTGG CTGCGACATT ATCGTTGCCG TTTGCGGTTT ATGCTACACC TATCTTAGCT 
GCTACTGCTG CTACAGAGAA TATGGCTGTA CAAAGTCCAA AAAAGCATGT TTTTGATGCG
GTAATAAAGG CTTATAAAGA TAACTCAGAT GAAGAGTCAT ATGCAACTGT ATATATAAAA
GATCCAAAGC TGACGATTGA AAATGGGAAA AGAATAATTA CAGCAACATT AAAAGATAGT
GATTTCTTTG ACTATCTGAA AGTCGAAGAT AGTAAAGAGC CAGGTGTCTT CCATGATGTA
AAGGTGCTTT CAGAAGATAA AAGAAAACAT GGAACGAAAG TTATACAATT TGAAGTAGGT
GAGTTAGGAA AAAGATATAA TATGCAAATG CATATTTTAA TTCCCACTTT AGGGTACGAT
AAGGAATTCA AAATTCAGTT TGAAGTAAAT ATGCGCACAT TTGTAGAAAG CGATATAGAA
GAGGATGAAG AAGAACAAAT TGAAGATACA CAAAATATCA TACGTGATAA ACGATTACAA
CAAGCAATTA ATAAAAATGT ATTAAATAGA AAAGATGTAA ATGAACCTAT ATTTGAAGAA
GATTTAAAAG AAATTAAAGA GCTAAATATA TATGCAGGTC AAGGAATTGA GAGTCTAAAA
GGTTTAGAGT ATATGGAAAA TCTAGAAAGA ATAACAATAC AAGGATCTGA TGTACGAAAT
ATAGCTCCTA TTTCACAACT AAAACGTTTA AAAGTAGTTG ATCTATCTTT TAATAAAATA
GAAAATGTTG AGCCGCTTGT AAACTTAGAA AAACTGGATA TACTAGAGCT ACAAAATAAT
AGAATTGCTG ACGTAACGCC ACTAAGTCAA CTTAAAAAGG TTAGGACAAT TAATTTATCA
GGTAATAAAA TTAGTGATAT AAAGCCTTTA TATAATGTTT CTTCTTTAAG AAAGTTATAT
GTAAGCAATA ATAAAATTAC TGATTTTACA GGCATTGAGC AATTGAATAA ATTAGGGACA
TTAGGGGTAG GAAGTAACGG GCTTGTAAAT ATTGAACCGA TTAGTCAGAT GAGTGGCATT
GTTGAACTTA ATCTTGAAAA AAATGATATT AAAGATATTA CATCATTATC TAAACTAACT
GGCTTACAAT CACTTAACTT GGAAGAAAAC TATGTTTCGG ATGTATCATC ACTTAGTAAT
TTGATTAATT TATATGAATT AAAACTTGCG ACAAATGAGA TTCGTGATAT AAGACCTATT
CAAGAATTAG GAAAACGAAT TAAGATTGAT GCTCAAAGGC AAAAGGTCTT TTTAGATGAA
GCCTATATGA ATGAAGAAGT GAAAATTCCT GTATATGATG TAAATGGGAC AGCACTTCAA
AATATTGAGT GGAAGAGTGA AGGCGGAAGT ATTACGAACG GAGTAATAAA GTGGAATAGC
CTTGGGGAAA AAATGTATGA ATTTAAGATG GATGCTGGCG AAAGTAAGAT AAGGTTCCAA
GGGAGGGTAA TACAAAATAT TGTTGAAAAA CGAGAAGAGA GTTCGAACGT AATTCAAGAT
ATGAAACTAA GACAATACAT GAATAAACAT AATTTTGAAC GGAAAAATGT AAATACCCCT
ATAACGAAAG AAGATTTATT AACAGTTAAG GCTTTGAAAA TTACGGATGG GAAAAAAGAG
GGGATAACAG ATTTTTCTGG ATTAGAATTC ATGACAAATA TGGAAGAATT GATATTACAA
AATGCTAATA TGAAAAATGT GAAATTTATC TCAAGTTTGA GAAATTTGAA GTCAGTAGAT
TTATCCTATA ATCAAATTGA AGATATTAAA CCGCTTCATT CATTAGAGAA TCTTGAAAAA
TTAAATATTA GCAATAACGG TATAAAAAAT GTTCCAGAAC TATTTAAGAT GCAGACATTA
AAAACTCTAG ACCTATCAAA TAATAAACTT GATAATGCTG CTTTGGATGG AATTTATCAA
TTGGAAAATC TAGATGCATT GTTAGTAAAT AATAATGAAA TCAATAATTT AGATGAGATT
GGCAAAGTTA GCAAATTGAA TAAGCTAGAA ATGATGGGCA ATAAAGTACG AGATATTTCT
CCATTAGCTA ACTTGAAAAA CTTACAGTGG TTAAATTTAG CCAATAATAA GATTCAAGAT
ATCTCTAGTT TATCCTCTAT ACTTGATTTA CTTAGTTTGA AATTAGCTGG AAACGAGATT
CGTGATGTAA GACCAATTAT TCAATTGGCT CAATGGATAA CAGTTGATAT TAAAAACCAA
AAAATTGTTT TAGAAGATGG ACAAATGAAT CAAGAAATCC AAATTCCTAT CTATGATTTA
GAGGGAGAAA TCTTTGAAGA TATTGAACTG AAGAGTGAAG CCGGTATCGT TACCGATAGA
GGAACAGTCG TATGGAAAAC TCCAGGAGAA AAAAATTATG TATTCTCCTT AAATGGTAAT
TATCACGGTC TATCTCTATT ATTCAGTGGT ACAGTTATGC AAAATATAGT AGCGAAAGAA
GAACCAAAAG AACCAGTGGA AGAAGTTGAA GGTTCGAAAG AAGAACCAAT AAAAGAAGCT
GAAGGATCAA AAGAAGAGCC AAAAGGGCCA GCAAAAGAAG TTGAAGGTCC GAAAGAAGAA
GTGAAAGAAC CGGCAAAAGA AGTTGAAGGT CCGAAAGAAG AAGTGAAAGA ACCGGCAAAA
GAAGTTGAAG GCCTGAAAGA AGAAGTAAAA GAACCGGCAA AAGAAGTTGA AGGCCCGAAA
GAAGAAGTAA AAGAACCGGC AAAAGAAGTT GAAGGCCCGA AAGAAGAAGT AAAAGAACCG
GCAAAAGAAG TTGAAGGCCC GAAAGAAGAA GTAAAAGAAC CAGCAAAAGA AGTTGAAGGT
CCGAAAGAAG AAGTGAGAGA ACCAACAAAA GAAGTTGAAG GTCCGAAAGA AGAAGTGAAA
GAACCAATGA AAGAAGTTGA AGGATCGAAA GAAGAAGTGA AAGAACCAAC GAAAGAAGCT
GAAGGATCGA AAGAAGAAGT GAAAGAGCCA ACAACAGAAG TTGAAGGATC GAAAGAAGTA
AAAGAACCAG GAAAAGAAGT TGAAGGTTCA AAAGATGCAA TAAATCAATC AGCAGTAGCT
CAAGAAACAA ACGTGAACAA TCAAGTTGGG AAAGAAAAAG TAGTAGAGAA TCAAAACATG
AAAGAAAATA AACCAGCTGT TACTAAGCAA GAAGAAAGTA AGAAATCACT AGGAGCAACA
GGTGGACAAG AGAATACATC AACATTACTT TCAGGCTTAG CACTAGTTCT TTCAGCATTG
AGTATGTTTG TATTTAGAAA GAGATTATTT AAGAAATAA
 
Protein sequence
MLVAATLSLP FAVYATPILA ATAATENMAV QSPKKHVFDA VIKAYKDNSD EESYATVYIK 
DPKLTIENGK RIITATLKDS DFFDYLKVED SKEPGVFHDV KVLSEDKRKH GTKVIQFEVG
ELGKRYNMQM HILIPTLGYD KEFKIQFEVN MRTFVESDIE EDEEEQIEDT QNIIRDKRLQ
QAINKNVLNR KDVNEPIFEE DLKEIKELNI YAGQGIESLK GLEYMENLER ITIQGSDVRN
IAPISQLKRL KVVDLSFNKI ENVEPLVNLE KLDILELQNN RIADVTPLSQ LKKVRTINLS
GNKISDIKPL YNVSSLRKLY VSNNKITDFT GIEQLNKLGT LGVGSNGLVN IEPISQMSGI
VELNLEKNDI KDITSLSKLT GLQSLNLEEN YVSDVSSLSN LINLYELKLA TNEIRDIRPI
QELGKRIKID AQRQKVFLDE AYMNEEVKIP VYDVNGTALQ NIEWKSEGGS ITNGVIKWNS
LGEKMYEFKM DAGESKIRFQ GRVIQNIVEK REESSNVIQD MKLRQYMNKH NFERKNVNTP
ITKEDLLTVK ALKITDGKKE GITDFSGLEF MTNMEELILQ NANMKNVKFI SSLRNLKSVD
LSYNQIEDIK PLHSLENLEK LNISNNGIKN VPELFKMQTL KTLDLSNNKL DNAALDGIYQ
LENLDALLVN NNEINNLDEI GKVSKLNKLE MMGNKVRDIS PLANLKNLQW LNLANNKIQD
ISSLSSILDL LSLKLAGNEI RDVRPIIQLA QWITVDIKNQ KIVLEDGQMN QEIQIPIYDL
EGEIFEDIEL KSEAGIVTDR GTVVWKTPGE KNYVFSLNGN YHGLSLLFSG TVMQNIVAKE
EPKEPVEEVE GSKEEPIKEA EGSKEEPKGP AKEVEGPKEE VKEPAKEVEG PKEEVKEPAK
EVEGLKEEVK EPAKEVEGPK EEVKEPAKEV EGPKEEVKEP AKEVEGPKEE VKEPAKEVEG
PKEEVREPTK EVEGPKEEVK EPMKEVEGSK EEVKEPTKEA EGSKEEVKEP TTEVEGSKEV
KEPGKEVEGS KDAINQSAVA QETNVNNQVG KEKVVENQNM KENKPAVTKQ EESKKSLGAT
GGQENTSTLL SGLALVLSAL SMFVFRKRLF KK