Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B4250 |
Symbol | |
ID | 7182703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 1001348 |
End bp | 1004272 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643548815 |
Product | hypothetical protein |
Protein accession | YP_002444486 |
Protein GI | 218896075 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.00000724853 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAATTG AAAAACTCCA TATTTATGGG TACGGAAAAT TAGAAAATGT GGAAATGGAT CTTTCCTTAC TGACGGTGTT ATACGGTGAA AATGAAGCGG GGAAATCGAC AATTCGCTCG TTTATGAAAA GTATTTTGTT CGGTTTTCCG ACGAGAGGAC AGCGCCGTTA TGAGCCAAAA GAAGGTGGGA AGTATGGCGG AGCGATGACT GTACAAACAG AGAAGTACGG TCGTTTGAAA ATTGAACGAT TGCCAAAGAC GGCATCTGGC GAGGTGACCG TTTATTTTGA AGACGGGAAA ACGGGCGGCG AGGAAATTTT AAACGATATA TTAAGCGGGA TGAATGAAAG TTTATTTGAA TCGATCTTTT CATTTGATAT GCATGGTCTT CAAAATATTC ATCAGCTCGG CGAAGCGGAT ATTGGCAATT ATTTATTTTC GGCAAGTGCA GTCGGAAGCG ATGCGCTATT ACAGTTAGAT AAAAAGCTAG AAAAAGAAAT GGATCAACGC TTTAAACCGA GTGGGCGTAA ACCGGAAATT AACGTGTCAC TGCAAGAGAT GAAGAAACTT GAAGAGAAGA TGAAAGAGTG GCAAGGGAAA ATTGGCACGT ATGAAAAACA AGTCGAGCAA TTAAAAGAAA GTGAAGAGAA ACTTGCTTCT GTTCGTATGG AGAAAGAGAG TGCAGAAAAA CGAAAGCAAG ATTATGAAAT ATTAGCAGCG CTTGAGCCAC TCGTTATTGA AAAGCGTGCG CATGAGAAAG TGTTAGAAAA TGAGAGCGGG CAGTTTCCTG TAAACGGAAT GGCGCGCTAT GAAGCGATTA AGGCGAAGAT GGAGCCGCTC CAATTACAAG TTGACTCGCT CCATAAAAAA ATAGAGACGG TGCAATCGGA AATTGCCTCG ATTCAAATAG ATGAAGAGTT TTTACAAAAA GAAAGTTATG TAGAAGAACT TCGTATGCAA CATATGTCTT ACGAAAATGC ACGCCAAGAA ATGCGTGATA TGACAGGAAG TATTGCGAAT ATAAAAGAAG AAATTGCAGA ACTAGAACAA CAAATTGGTG CTACTTTTGA AAAGGAAGCG GTTCTTGCGT TTGACATGAG TTTGGCAACG AAAGAATTAA TTACGCAAAC CGTACAAAAA GCGCGTGAAT TAGAAACGCA AAAAGCACAG CTTGATGATC GTTTTAAAGT AGCGCAAGAG CAATTAGAAG AACAAGAAGA AAATATAAGA CAGATTCAGA AACAAATGTT AGTTGATGAA GAGCGAGATA CGTTAGTTGA GAAAGAGAAA TCGTTCCAAG ATGCGGCGTT TATCGGTATG GGCGCAGAGA GAATGAAGCG CAAGTATGAG GAAAAAGCAG GAGCAGCTAT GCAAAAGAAA AAGCAGTGGC AAAGAGTTTG TCTTCTGTTA CTTCTTATTA ACACGCTTGT TTTATTCACG AGTTTATTTA TTGATAACCG CCCGCTCTTA TTTATTAGTG TTATTGTTTT TGTAGGGATT GTTCTTGCCC TTGTTTTATA TAAAGATCCG TCAAGTGGAT TACAAGAAGA GCTTCTTACT CTTCAGCAAA GTGCTGGCGG GAGACAAAGT GAAGAAGCGC TGTCTGTGCG CTATCAGTTA GAAAAAGATG AAGAGATTCG GAAGCTATTT GAGCGTGAGT CTTATAAATT GCAGCAAATG GAGCGAGCTT ATGATAAAGT CGTTTCATCG TATGAGGAAT GGGAGAGAGA AACGTTCCAG GCGGGAGAAC AAGTAGATGC ATATAAAACG CGCTATATGT TCCCAGAATT TTATACGTAT GCACACATAT TGCCAGCGTT TGAGCGTATT GAAAAAATGC AGCAATTATA TCGGGAGTTA GAAAAAGAAG GCACGCGAAA ATCTTCATTA TATGAAATGA TTTCGCATTT TGAACAGAAA CTAGAAGCTG TTATCGGTAG TGCGGAGTAT AGTAAGCTAC ACGGGGCACA AAGTCGTATG CAAAATGAGA AAGAGAAGCG CCAAACTTGT AAGCAGTTAA AAGAAAAACT GGCGGAATGG CAAGAAGAAT ATGAGTTTAT GCAAGAACAA TTGAAACAAC TGTTAGTAGA ACGAAATGCA TTATGGAATA TTGCAGAGAC TACAGATGAA GAGATGTTTT TAGAAGCGGG TAAACTGGCG GAAAAACGTG AAGATGCTGA GAAACAAGTA GTGCGTTTAT TACCGCAAAT TGATCTGTTA GAACAGCGTT TAAAGAGTTT ATCATTAGCT GAACATTACG AAGCTGACGG TTATGAAGAG AAATTAAAGA GAGAAATTAC AGTCGTACAA AACTGTCTGG CGCAAGAGAA AGAATTGACA GAGCGCATCG CGAAACATCG TATGGAAATT GCGAATTTAG AAGAAGGTAG TACGTACGGC GATTTAATGC ATGAGTGGGA AATGAAAAAA GCGCAAGTAC GTGAACAAGT GAAGAAGTGG GCTGCTTATG CGGCCGCAAA GACAGTATTA ACGAAAACGA AGCAATATTA TCATGAAGTG CATCTGCCTC GTATTTTACA AAAATCAGAA GAGTATTTCG TCTATTTAAC GGGCGGACGC TATAGTAAAA TCTTTTCACC GTCAGAGGCG GAGCCGTTCG TTGTTGAACG TAACGACGGT ATACGTTTTT ACAGTCATGA ACTAAGCCAA GCGACAGCGG AGCAGTTATA TTTATCGCTG AGATTTGCAC TTGCGAAAAC ATTTGAACAT GATTACCCAT TTATTATTGA TGATAGTTTC GTGCACTTTG ATGCGGTAAG GACGGATCGA ACGATTGAAC TAATAAAGGA AATCGCAAAA GATAGACAAG TCATCTTCTT TACATGTCAT GCGCATTTAC TCGCGTATTT TACAGAAAAA CAGATTATAA AATTAACACA TAAGCGTAAA GAAAATGAGT TGTAA
|
Protein sequence | MRIEKLHIYG YGKLENVEMD LSLLTVLYGE NEAGKSTIRS FMKSILFGFP TRGQRRYEPK EGGKYGGAMT VQTEKYGRLK IERLPKTASG EVTVYFEDGK TGGEEILNDI LSGMNESLFE SIFSFDMHGL QNIHQLGEAD IGNYLFSASA VGSDALLQLD KKLEKEMDQR FKPSGRKPEI NVSLQEMKKL EEKMKEWQGK IGTYEKQVEQ LKESEEKLAS VRMEKESAEK RKQDYEILAA LEPLVIEKRA HEKVLENESG QFPVNGMARY EAIKAKMEPL QLQVDSLHKK IETVQSEIAS IQIDEEFLQK ESYVEELRMQ HMSYENARQE MRDMTGSIAN IKEEIAELEQ QIGATFEKEA VLAFDMSLAT KELITQTVQK ARELETQKAQ LDDRFKVAQE QLEEQEENIR QIQKQMLVDE ERDTLVEKEK SFQDAAFIGM GAERMKRKYE EKAGAAMQKK KQWQRVCLLL LLINTLVLFT SLFIDNRPLL FISVIVFVGI VLALVLYKDP SSGLQEELLT LQQSAGGRQS EEALSVRYQL EKDEEIRKLF ERESYKLQQM ERAYDKVVSS YEEWERETFQ AGEQVDAYKT RYMFPEFYTY AHILPAFERI EKMQQLYREL EKEGTRKSSL YEMISHFEQK LEAVIGSAEY SKLHGAQSRM QNEKEKRQTC KQLKEKLAEW QEEYEFMQEQ LKQLLVERNA LWNIAETTDE EMFLEAGKLA EKREDAEKQV VRLLPQIDLL EQRLKSLSLA EHYEADGYEE KLKREITVVQ NCLAQEKELT ERIAKHRMEI ANLEEGSTYG DLMHEWEMKK AQVREQVKKW AAYAAAKTVL TKTKQYYHEV HLPRILQKSE EYFVYLTGGR YSKIFSPSEA EPFVVERNDG IRFYSHELSQ ATAEQLYLSL RFALAKTFEH DYPFIIDDSF VHFDAVRTDR TIELIKEIAK DRQVIFFTCH AHLLAYFTEK QIIKLTHKRK ENEL
|
| |