Gene BCZK0919 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0919 
SymbolyhaN 
ID3022024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp1030857 
End bp1033781 
Gene Length2925 bp 
Protein Length974 aa 
Translation table11 
GC content39% 
IMG OID637545154 
Producthypothetical protein 
Protein accessionYP_082521 
Protein GI52144307 
COG category[S] Function unknown 
COG ID[COG4717] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGG AAAAACTCCA TATTTATGGG TACGGAAAAT TAGAAAATGT GGAAATGGAT 
CTTTCAATGC TGACGGTGTT ATACGGTGAA AATGAAGCGG GAAAATCGAC AATTCGCTCG
TTTATGAAAA GTATTTTGTT CGGCTTTCCG ACGAGAGGAC AGCGCCGTTA TGAACCGAAA
GAAGGCGGCA AGTATGGCGG GGCGATGACT GTACAAACAG AGAAGTACGG CCGTTTGAAA
ATTGAACGAT TGCCAAAGAC GGCCGCTGGG GAGGTAACTG TTTATTTTGA AGACGGGAAA
ACGGGTGGCG AAGAAATTTT ACACGATATA TTAACCGGGA TGAATGAAAG TTTATTTGAA
TCGGTCTTTT CTTTTGATAT GCATGGCCTT CAAAATATTC ATCAGCTTGG CGAAGCGGAT
ATCGGCAATT ATTTATTTTC GGCAAGTGCA GTCGGAAGTG ATGCATTATT GCAGCTAGAT
AAAAAGCTAG AAAAAGAAAT GGATCAGCGC TTTAAGCCGA GTGGTCGTAA GCCAGAAATT
AATGTGTCAC TGCAAGAGAT GAAAAAGCTT GAAGAGAAGA TGAAAGAGTG GCAAGGGAAA
ATTGGCACGT ATGAAAAGCA AGTCGAGCAG TTAAAAGAAA GTGAAGAGAA GCTTGTTTCT
GTTCGCGCGG AAAAAGAGAA TGCAGAAAAA CGAAAGCAGG ATTATGAAAT ATTAGCAGCG
CTTGAACCTC TCGTTATTGA AAAACGTACG TATGAGAAAG TGTTAGAAAA TGAGAATGTG
CAATTTCCTG TAAATGGAAT GGCGCGTTAT GAAGCGATTA AGGCGAAGAT GGAGCCGCTT
CAGTTGCAAG TTGATTCACT TCATAAAAAA ATTGAGAATG TGCAATCAGA AATAGAATCG
ATTCAAATAG ATGAAGAATT TTTACAAAAA GAAAGTTATG TAGAAGAACT TCGTATGCAG
CATATGTCTT ACGAAAATGC ACGCCAAGAA ATGCGTGATA TAACAGGGAC GATTACGAAT
ATAAAAGAAG AGATTGCAGA ACTAGAGCAA CAAATCGGTG CTACTTTTGA AAAAGAAACA
GTCCTTTCGT TTGATATGAG TTTGGCAACG AAAGAGTTAA TTACGCAAGC AGTGCAAAAG
GCGCGCGAAT TAGAAACGCA AAAAGCACAG CTTGATGATC GTTTTAAAGT AGCGCAAGAG
CAATTAGAAG AACAAGAAGA AAATATAAGA CAGATTCAGA AGCAAATGTT AGCGGATGAA
GAGCGAAATA CGTTAGTTGA GAAAGAAAAA TCGTTCCAAG ATGCGGCGTT TATCGGTATG
GGCGCTGAGA GAATGAAGCG CAAGTATGAG GAAAAAGCAG GAGCGGCGAT GCAAAAGAAA
AAACAGTGGC AAAGAGTTTG TCTTCTGTTA CTTCTTATTA ACACCGGCGT TTTATTCACA
AGCTTATTTA TAGACAATCG CCTACTCTTA TTTATTAGTG TCATTGTGTT TGTAGCGATT
GTTCTTGCCC TCGTTTTATA TAAAGATCCG TCAAGTGGAT TACAAGAAGA ACTTCTTACT
CTTCAGCAAA GTGCTGGCGG GAGACAAAGT GAAGAAGCGA TGACTGTACG CTACCAGTTA
GAAAAAGACG AAGAGATTCG TAAGTTATTT GAGCGTGAGT CTTATAAATT GCAGCAAATG
GAGCGAGCGT ATGATAAAGT CGTTTCATCG TATGAGGAAT GGGAGAGAGA AACGTTCCGC
ACGAGCGAAC AAGTAAGTGT GTATAAAAAG CGCTATACGT TCCCTGAATT TTATACGTAT
GCGCACATAT TGCCGGCGTT TGAGCGTATG GAAAAAATGC AGCAATTATA TCGTGAATTA
GAGAAACAAG GCAAGCGAAA ATCTTCATTA TATGAAATGA TTTCGCAATT TGAACATAAA
CTAGAAACTG TTATCGGTAG CGCGGAGTAT AGTAAGCTGC ACGAGGCGCA AAGTCGTATG
CAAAATGAGA AAGAGAAGCG CCAAACTTGT AAGCAGTTAA AAGAAAAACT GGCGGAATGG
CAAGAAGAAT ATGAGTTTAT GCAAGAGCAA TTAAAGCAAT TACTAGTAGA ACGAGACAGT
TTATGGCATA TCGCAGAGTC TACAAATGAA GAGATGTTTT TAGAGGCAGG TAAACTAGCG
GAAAAACGTG AAGATGCAGA GAAACAAGTG GGGCGTTTAT TACCGCAAAT TGATCTGTTA
GAACAGCGTT TAACGAGTTT ATCATTAGCT GAACATTATG AAGCTGACGG TTATGATGAA
AAATTAAAGC AAGAACTGAC AACCGCGCAC AACTGTCTGG CACAAGAAAA AGAACTGACA
GAGCGTATTG CGAAACATCG TATGGAAATT GCGAATTTAG AAGAAGGTAG TACGTACGGT
GATTTAATGC ATGAATGGGA AATGAAAAAA GCGCAAGTGC GTGAACAAGT AAAGAAGTGG
GCTGCGTATG CGGCTGCAAA GACAGTGTTA ACGAAAACGA AGCAATATTA TCATGAAGTA
CATCTTCCTC GTATTTTACA AAAATCAGAA GAGTATTTCG TCTACTTAAC AGGCGGACGA
TATAGTAAAA TCTTTTCACC GTCAGAGGCG GAGCCGTTTA TTGTAGAGCG TAATGATGGT
ATGCGTTTTT ATAGTCATGA ACTAAGCCAA GCGACAGCTG AGCAGTTGTA TTTATCGCTG
AGATTTGCGT TAGCAAAAAC ATTTGAGCAT GATTATCCAT TTATTATTGA TGATAGTTTC
GTGCATTTTG ACGCGGTAAG GACAAATCGT ACAATTGAAC TAATAAAGGA AATAGCGCAA
GATAGACAAG TCATATTCTT TACATGTCAT GCGCATTTAC TCGCGTATTT TACAGAAAAA
CAGATTATAA AATTAACACA TATGCGTAAA GAAAATGAGT TGTAG
 
Protein sequence
MRMEKLHIYG YGKLENVEMD LSMLTVLYGE NEAGKSTIRS FMKSILFGFP TRGQRRYEPK 
EGGKYGGAMT VQTEKYGRLK IERLPKTAAG EVTVYFEDGK TGGEEILHDI LTGMNESLFE
SVFSFDMHGL QNIHQLGEAD IGNYLFSASA VGSDALLQLD KKLEKEMDQR FKPSGRKPEI
NVSLQEMKKL EEKMKEWQGK IGTYEKQVEQ LKESEEKLVS VRAEKENAEK RKQDYEILAA
LEPLVIEKRT YEKVLENENV QFPVNGMARY EAIKAKMEPL QLQVDSLHKK IENVQSEIES
IQIDEEFLQK ESYVEELRMQ HMSYENARQE MRDITGTITN IKEEIAELEQ QIGATFEKET
VLSFDMSLAT KELITQAVQK ARELETQKAQ LDDRFKVAQE QLEEQEENIR QIQKQMLADE
ERNTLVEKEK SFQDAAFIGM GAERMKRKYE EKAGAAMQKK KQWQRVCLLL LLINTGVLFT
SLFIDNRLLL FISVIVFVAI VLALVLYKDP SSGLQEELLT LQQSAGGRQS EEAMTVRYQL
EKDEEIRKLF ERESYKLQQM ERAYDKVVSS YEEWERETFR TSEQVSVYKK RYTFPEFYTY
AHILPAFERM EKMQQLYREL EKQGKRKSSL YEMISQFEHK LETVIGSAEY SKLHEAQSRM
QNEKEKRQTC KQLKEKLAEW QEEYEFMQEQ LKQLLVERDS LWHIAESTNE EMFLEAGKLA
EKREDAEKQV GRLLPQIDLL EQRLTSLSLA EHYEADGYDE KLKQELTTAH NCLAQEKELT
ERIAKHRMEI ANLEEGSTYG DLMHEWEMKK AQVREQVKKW AAYAAAKTVL TKTKQYYHEV
HLPRILQKSE EYFVYLTGGR YSKIFSPSEA EPFIVERNDG MRFYSHELSQ ATAEQLYLSL
RFALAKTFEH DYPFIIDDSF VHFDAVRTNR TIELIKEIAQ DRQVIFFTCH AHLLAYFTEK
QIIKLTHMRK ENEL