Gene BCZK0789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0789 
Symbolsap 
ID3023334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp901355 
End bp903799 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content35% 
IMG OID637545026 
ProductS-layer protein sap precursor 
Protein accessionYP_082393 
Protein GI52144435 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0200088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGA CTAACTCTTA CAAAAAAGTA ATCGCTGGTA CAATGACAGC AGCAATGGTA 
GCAGGTGTTG TTTCTCCAGT AGCAGCAGCA GGTAAAACAT TCCCAGACGT TCCAGCTGAT
CACTGGGGAA TTGATTCTAT CAACTACTTA GTAGAAAAAG GCGCAGTTAC AGGTAACGAC
AAAGGAATGT TCGAACCTGG AAAAGAATTA ACTCGTGCAG AAGCAGCTAC AATGATGGCT
CAAATCTTAA ACTTACCAAT CGATAAAGAT GCTAAACCAT CTTTCGCTGA CTCTCAAGGC
CAATGGTACA CTCCATTCAT CGCAGCTGTA GAAAAAGCTG GCGTTATTAA AGGTACAGGC
AACGGCTTTG AGCCAAACGG AAAAATCGAC CGCGTTTCTA TGGCATCTCT TCTTGTAGAA
GCTTACAAAT TAGATACTAA AGTAAACGGT ACTCCAGCAA CTAAATTCAA AGATTTAGAA
ACATTAAACT GGGGTAAAGA AAAAGCTAAC ATCCTAGTTG AATTAGGAAT CTCTGTTGGT
ACTGGTGATC AATGGGAGCC TAAGAAATCT GTAACTAAAG CAGAAGCAGC TCAATTTATT
GCTAAGACTG ACAAGCAGTT CGGTACAGAA GCAGCAAAAG TTGAATCTGC AAAAGCTGTT
ACAACTCAAA AAGTAGAAGT TAAATTCAGC AAAGCTGTTG AAAAATTAAC TAAAGAAGAT
ATCAAAGTAA CTAACAAAGC TAACAACGAT AAAGTACTAG TTAAAGAGGT AACTTTATCA
GAAGATAAAA AATCTGCTAC AGTTGAATTA TATAGTAACT TAGCAGCTAA ACAAACTTAC
ACTGTAGATG TAAACAAAGT TGGTAAAACA GAAGTAGCTG TAGGTTCTTT AGAAGCAAAA
ACAATCGAAA TGGCTGACCA AACAGTTGTA GCTGATGAGC CAACAGCATT ACAATTCACA
GTTAAAGATG AAAACGGTAC TGAAGTTGTT TCACCAGAGG GTATTGAATT TGTAACTCCA
GCTGCAGAAA AAATTAATGC AAAAGGTGAA ATCACTTTAG CAAAAGGTAC TTCAACTACT
GTAAAAGCTG TTTATAAAAA AGACGGTAAA GTAGTAGCTG AAAGTAAAGA AGTAAAAGTT
TCTGCTGAAG GTGCTGCAGT AGCTTCAATC TCTAACTGGA CAGTTGCAGA ACAAAATAAA
GCTGACTTTA CTTCTAAAGA TTTCAAACAA AACAATAAAG TTTACGAAGG CGACAACGCT
TACGTTCAAG TAGAATTGAA AGATCAATTT AACGCAGTAA CAACTGGAAA AGTTGAATAT
GAGTCGTTAA ACACAGAAGT TGCTGTAGTA GATAAAGCTA CTGGTAAAGT AACTGTATTA
TCTGCAGGAA AAGCACCAGT AAAAGTAACT GTAAAAGATT CAAAAGGTAA AGAACTTGTT
TCAAAAACAG TTGAAATTGA AGCTTTCGCT CAAAAAGCAA TGAAAGAAAT TAAATTAGAA
AAAACTAACG TAGCGCTTTC TACAAAAGAT GTAACAGATT TAAAAGTAAA AGCTCCAGTA
CTAGATCAAT ACGGTAAAGA GTTTACAGCT CCTGTAACAG TGAAAGTACT TGATAAAGAT
GGTAAAGAAT TAAAAGAACA AAAATTAGAA GCTAAATATG TGAACAAAGA ATTAGTTCTG
AATGCAGCAG GTCAAGAAGC TGGTAATTAT AAAGTTGTAT TAACTGCAAA ATCTGGTGAA
AAAGAAGCAA AAGCTACATT AGCTCTAGAA TTAAAAGCTC CAGGTGCATT CTCTAAATTT
GAAGTTCGTG GTTTAGAAAA AGAATTAGAT AAATATGTTA CTGAGGAAAA CCAAAAGAAT
GCAATGACTG TTTCAGTTCT TCCTGTAGAT GCAAATGGAT TAGTATTAAA AGGTGCAGAA
GCAGCTGAAC TAAAAGTAAC AACAACAAAC AAAGAAGGTA AAGAAGTAGA CGCAACTGAT
GCACAAGTTA CTGTACAAAA TAACAGTGTA ATTACTGTTG GTCAAGGTGC AAAAGCTGGT
GAAACTTATA AAGTAACAGT TGTACTAGAT GGTAAATTAA TCACAACTCA TTCATTCAAA
GTTGTTGATA CAGCACCAAC TGCTAAAGGA TTAGCAGTAG AATTTACAAG CACATCTCTT
AAAGAAGTAG CTCCAAATGC TGATTTAAAA GCTGCACTTT TAAATATCTT ATCTGTTGAT
GGTGTACCTG CGACTACAGC AAAAGCAACA GTTTCTAATG TAGAATTTGT TTCTGCTGAC
ACAAATGTTG TAGCTGAAAA TGGTACAGTT GGTGCAAAAG GTGCAACATC TATCTATGTG
AAAAACCTGA CAGTTGTAAA AGATGGAAAA GAGCAAAAAG TAGAATTTGA TAAAGCTGTA
CAAGTTGCAG TTTCTATTAA AGAAGCAAAA CCTGCAACAA AATAA
 
Protein sequence
MAKTNSYKKV IAGTMTAAMV AGVVSPVAAA GKTFPDVPAD HWGIDSINYL VEKGAVTGND 
KGMFEPGKEL TRAEAATMMA QILNLPIDKD AKPSFADSQG QWYTPFIAAV EKAGVIKGTG
NGFEPNGKID RVSMASLLVE AYKLDTKVNG TPATKFKDLE TLNWGKEKAN ILVELGISVG
TGDQWEPKKS VTKAEAAQFI AKTDKQFGTE AAKVESAKAV TTQKVEVKFS KAVEKLTKED
IKVTNKANND KVLVKEVTLS EDKKSATVEL YSNLAAKQTY TVDVNKVGKT EVAVGSLEAK
TIEMADQTVV ADEPTALQFT VKDENGTEVV SPEGIEFVTP AAEKINAKGE ITLAKGTSTT
VKAVYKKDGK VVAESKEVKV SAEGAAVASI SNWTVAEQNK ADFTSKDFKQ NNKVYEGDNA
YVQVELKDQF NAVTTGKVEY ESLNTEVAVV DKATGKVTVL SAGKAPVKVT VKDSKGKELV
SKTVEIEAFA QKAMKEIKLE KTNVALSTKD VTDLKVKAPV LDQYGKEFTA PVTVKVLDKD
GKELKEQKLE AKYVNKELVL NAAGQEAGNY KVVLTAKSGE KEAKATLALE LKAPGAFSKF
EVRGLEKELD KYVTEENQKN AMTVSVLPVD ANGLVLKGAE AAELKVTTTN KEGKEVDATD
AQVTVQNNSV ITVGQGAKAG ETYKVTVVLD GKLITTHSFK VVDTAPTAKG LAVEFTSTSL
KEVAPNADLK AALLNILSVD GVPATTAKAT VSNVEFVSAD TNVVAENGTV GAKGATSIYV
KNLTVVKDGK EQKVEFDKAV QVAVSIKEAK PATK