Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK0919 |
Symbol | yhaN |
ID | 3022024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 1030857 |
End bp | 1033781 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637545154 |
Product | hypothetical protein |
Protein accession | YP_082521 |
Protein GI | 52144307 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG AAAAACTCCA TATTTATGGG TACGGAAAAT TAGAAAATGT GGAAATGGAT CTTTCAATGC TGACGGTGTT ATACGGTGAA AATGAAGCGG GAAAATCGAC AATTCGCTCG TTTATGAAAA GTATTTTGTT CGGCTTTCCG ACGAGAGGAC AGCGCCGTTA TGAACCGAAA GAAGGCGGCA AGTATGGCGG GGCGATGACT GTACAAACAG AGAAGTACGG CCGTTTGAAA ATTGAACGAT TGCCAAAGAC GGCCGCTGGG GAGGTAACTG TTTATTTTGA AGACGGGAAA ACGGGTGGCG AAGAAATTTT ACACGATATA TTAACCGGGA TGAATGAAAG TTTATTTGAA TCGGTCTTTT CTTTTGATAT GCATGGCCTT CAAAATATTC ATCAGCTTGG CGAAGCGGAT ATCGGCAATT ATTTATTTTC GGCAAGTGCA GTCGGAAGTG ATGCATTATT GCAGCTAGAT AAAAAGCTAG AAAAAGAAAT GGATCAGCGC TTTAAGCCGA GTGGTCGTAA GCCAGAAATT AATGTGTCAC TGCAAGAGAT GAAAAAGCTT GAAGAGAAGA TGAAAGAGTG GCAAGGGAAA ATTGGCACGT ATGAAAAGCA AGTCGAGCAG TTAAAAGAAA GTGAAGAGAA GCTTGTTTCT GTTCGCGCGG AAAAAGAGAA TGCAGAAAAA CGAAAGCAGG ATTATGAAAT ATTAGCAGCG CTTGAACCTC TCGTTATTGA AAAACGTACG TATGAGAAAG TGTTAGAAAA TGAGAATGTG CAATTTCCTG TAAATGGAAT GGCGCGTTAT GAAGCGATTA AGGCGAAGAT GGAGCCGCTT CAGTTGCAAG TTGATTCACT TCATAAAAAA ATTGAGAATG TGCAATCAGA AATAGAATCG ATTCAAATAG ATGAAGAATT TTTACAAAAA GAAAGTTATG TAGAAGAACT TCGTATGCAG CATATGTCTT ACGAAAATGC ACGCCAAGAA ATGCGTGATA TAACAGGGAC GATTACGAAT ATAAAAGAAG AGATTGCAGA ACTAGAGCAA CAAATCGGTG CTACTTTTGA AAAAGAAACA GTCCTTTCGT TTGATATGAG TTTGGCAACG AAAGAGTTAA TTACGCAAGC AGTGCAAAAG GCGCGCGAAT TAGAAACGCA AAAAGCACAG CTTGATGATC GTTTTAAAGT AGCGCAAGAG CAATTAGAAG AACAAGAAGA AAATATAAGA CAGATTCAGA AGCAAATGTT AGCGGATGAA GAGCGAAATA CGTTAGTTGA GAAAGAAAAA TCGTTCCAAG ATGCGGCGTT TATCGGTATG GGCGCTGAGA GAATGAAGCG CAAGTATGAG GAAAAAGCAG GAGCGGCGAT GCAAAAGAAA AAACAGTGGC AAAGAGTTTG TCTTCTGTTA CTTCTTATTA ACACCGGCGT TTTATTCACA AGCTTATTTA TAGACAATCG CCTACTCTTA TTTATTAGTG TCATTGTGTT TGTAGCGATT GTTCTTGCCC TCGTTTTATA TAAAGATCCG TCAAGTGGAT TACAAGAAGA ACTTCTTACT CTTCAGCAAA GTGCTGGCGG GAGACAAAGT GAAGAAGCGA TGACTGTACG CTACCAGTTA GAAAAAGACG AAGAGATTCG TAAGTTATTT GAGCGTGAGT CTTATAAATT GCAGCAAATG GAGCGAGCGT ATGATAAAGT CGTTTCATCG TATGAGGAAT GGGAGAGAGA AACGTTCCGC ACGAGCGAAC AAGTAAGTGT GTATAAAAAG CGCTATACGT TCCCTGAATT TTATACGTAT GCGCACATAT TGCCGGCGTT TGAGCGTATG GAAAAAATGC AGCAATTATA TCGTGAATTA GAGAAACAAG GCAAGCGAAA ATCTTCATTA TATGAAATGA TTTCGCAATT TGAACATAAA CTAGAAACTG TTATCGGTAG CGCGGAGTAT AGTAAGCTGC ACGAGGCGCA AAGTCGTATG CAAAATGAGA AAGAGAAGCG CCAAACTTGT AAGCAGTTAA AAGAAAAACT GGCGGAATGG CAAGAAGAAT ATGAGTTTAT GCAAGAGCAA TTAAAGCAAT TACTAGTAGA ACGAGACAGT TTATGGCATA TCGCAGAGTC TACAAATGAA GAGATGTTTT TAGAGGCAGG TAAACTAGCG GAAAAACGTG AAGATGCAGA GAAACAAGTG GGGCGTTTAT TACCGCAAAT TGATCTGTTA GAACAGCGTT TAACGAGTTT ATCATTAGCT GAACATTATG AAGCTGACGG TTATGATGAA AAATTAAAGC AAGAACTGAC AACCGCGCAC AACTGTCTGG CACAAGAAAA AGAACTGACA GAGCGTATTG CGAAACATCG TATGGAAATT GCGAATTTAG AAGAAGGTAG TACGTACGGT GATTTAATGC ATGAATGGGA AATGAAAAAA GCGCAAGTGC GTGAACAAGT AAAGAAGTGG GCTGCGTATG CGGCTGCAAA GACAGTGTTA ACGAAAACGA AGCAATATTA TCATGAAGTA CATCTTCCTC GTATTTTACA AAAATCAGAA GAGTATTTCG TCTACTTAAC AGGCGGACGA TATAGTAAAA TCTTTTCACC GTCAGAGGCG GAGCCGTTTA TTGTAGAGCG TAATGATGGT ATGCGTTTTT ATAGTCATGA ACTAAGCCAA GCGACAGCTG AGCAGTTGTA TTTATCGCTG AGATTTGCGT TAGCAAAAAC ATTTGAGCAT GATTATCCAT TTATTATTGA TGATAGTTTC GTGCATTTTG ACGCGGTAAG GACAAATCGT ACAATTGAAC TAATAAAGGA AATAGCGCAA GATAGACAAG TCATATTCTT TACATGTCAT GCGCATTTAC TCGCGTATTT TACAGAAAAA CAGATTATAA AATTAACACA TATGCGTAAA GAAAATGAGT TGTAG
|
Protein sequence | MRMEKLHIYG YGKLENVEMD LSMLTVLYGE NEAGKSTIRS FMKSILFGFP TRGQRRYEPK EGGKYGGAMT VQTEKYGRLK IERLPKTAAG EVTVYFEDGK TGGEEILHDI LTGMNESLFE SVFSFDMHGL QNIHQLGEAD IGNYLFSASA VGSDALLQLD KKLEKEMDQR FKPSGRKPEI NVSLQEMKKL EEKMKEWQGK IGTYEKQVEQ LKESEEKLVS VRAEKENAEK RKQDYEILAA LEPLVIEKRT YEKVLENENV QFPVNGMARY EAIKAKMEPL QLQVDSLHKK IENVQSEIES IQIDEEFLQK ESYVEELRMQ HMSYENARQE MRDITGTITN IKEEIAELEQ QIGATFEKET VLSFDMSLAT KELITQAVQK ARELETQKAQ LDDRFKVAQE QLEEQEENIR QIQKQMLADE ERNTLVEKEK SFQDAAFIGM GAERMKRKYE EKAGAAMQKK KQWQRVCLLL LLINTGVLFT SLFIDNRLLL FISVIVFVAI VLALVLYKDP SSGLQEELLT LQQSAGGRQS EEAMTVRYQL EKDEEIRKLF ERESYKLQQM ERAYDKVVSS YEEWERETFR TSEQVSVYKK RYTFPEFYTY AHILPAFERM EKMQQLYREL EKQGKRKSSL YEMISQFEHK LETVIGSAEY SKLHEAQSRM QNEKEKRQTC KQLKEKLAEW QEEYEFMQEQ LKQLLVERDS LWHIAESTNE EMFLEAGKLA EKREDAEKQV GRLLPQIDLL EQRLTSLSLA EHYEADGYDE KLKQELTTAH NCLAQEKELT ERIAKHRMEI ANLEEGSTYG DLMHEWEMKK AQVREQVKKW AAYAAAKTVL TKTKQYYHEV HLPRILQKSE EYFVYLTGGR YSKIFSPSEA EPFIVERNDG MRFYSHELSQ ATAEQLYLSL RFALAKTFEH DYPFIIDDSF VHFDAVRTNR TIELIKEIAQ DRQVIFFTCH AHLLAYFTEK QIIKLTHMRK ENEL
|
| |