Gene Information |
Locus tag | BCG9842_A0020 |
Symbol | |
ID | 7169550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011774 |
Strand | + |
Start bp | 84308 |
End bp | 89284 |
Gene Length | 4977 bp |
Protein Length | 1658 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643559078 |
Product | virion structural protein, putative |
Protein accession | YP_002454582 |
Protein GI | 218847851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
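The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (neither is stated in the record). The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 84308
end_bp = 89284

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 4977 1658
```

Under these assumptions the result matches the listed Gene Length (4977 bp) and Protein Length (1658 aa).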
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.987095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00780122 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTCTTG AAAGAGAAAA ACAGGGGTTA GATTTGCATG ATTATAAAAG CAAACTCTCT AATAATACCT ATTCTATGCG TTTAGGAAAA GAAGTGCCAG AAGGTGATGT TAATCTAGCT TATATTCATG TGCCAAAGGT ACAGATTGAA GAAAATTTAT CATTAATTGA TACTTCATAT ACAGCCGATA ACGTAATCAC AAAAGATCAA CTGGAATCAA TTGTGGTTGC GAATGAAGCA GGGGAATTGG AATACGTTGA TATTGTTAAT GAGGATGAAA TAAAACCTCG GCCACCTAAA GGAGTATTTC CAAGTGATAA AATTAATGTT ACTCGGAAAT TCAAAAAGAA CGAGTATAAG ACTGAAAGTG CTCTTTATTA TAAATTCGAA ATAGATTTTC ATTATGACAG TAAAACCGCT ATCGCAAATG AAAAAGGTGT ATTTAAAAAA GAAAAATATA CAGGGCAGCA AATAGAATTA ACGGATGAGA ATGGAAACCT GTTAGATGAT TCGTATAAGT ATGATATTTA TGTGATTCCT TATAAGGAAA ATCCAAGAAT TTATAGTATA CAAGTCTATC TTCATAAGAA CACAGACAAA AATAATACCA TTAAAATTCG CTATAATCAC ATAGATAAGA TCGTAAAAGA TTCCAGCATA CAAGCTGTAG AAAAAAGCAT AGAGTTTTAT ACAGATAAAA ATAATGAAAT TCAAGTAGAT AAATTATCTA GACAAATGCT TGAAGGTGGC AAGTTGCGAA TTATCAATGG TATAAGTGCA TTTGATCAAA AAACTGAAGA AGAAGTTCGC CAAGCGTCCA TAGATAACCC AGATAAAGAA ATTTTTGCGG TAGTCTCTCA TAGTAATGGG GAAGGTCATA AAGTGATTGT CGCTCAAAGA TCTGAATCAG ACCCTCGAAC ACCTAAAATA TTTTCACATC GAATTGTTGC GAAATATAAA AATGACGATG GAAAAGAAAT GCAAGTCAGT GTCGGACACA TCACAGATTG GGTAATGAAT TACGATGCAT TGCTTACAAA TGAAAAAGAA GAATACACTG GTGAGTGGAA GAACATCGGC TTGGCTCTAG ATGGTGGGAA AATAAATGCA AAAGATATGA TTGAACTATC TTTACCAATG GGTACACCAA GTGTTCCAAT GGATGCCAAA TATTTCATCG AAGACGGAAA AGGGAATCTT CTATATAACG TAACAACCAT TGTAGATAAT AATGAAATTG AAAGCCAAAT CAACGAGGTT ATGTCGCATG CTGCAGAAGC AAAAGTAAAG CAATTGAATG TGAAACCCTG GAAAAATGCA CTTCAAGATA ATGTGAAGAT TAAAGATGAA CCAATCCTCC ATCGCTGTAC TATTATCCCA GAGCGTCAAA AGACAAAATG GGGATTTACA TGGGAAGCAA ACGGACAAGG CTTTACGGAA AAGAAGATGG ATTATAAAAC AAACTGGCAA GTATGTGCAG ATGTTGCGTT TAAAAAAGAA AGAGAAACTA AAGTATTAGA TGTACTAGAT AAAAACAAAT GGTCTACGAT AGGAATTTCT AATGATTTAA GTAAATGGCA GTACTCCTAT ATTGCATCTA TGAAAAAGAA TGTAATTCAG TATTTAGAAA ACCAAAATGA TGTTTGTGGA TTCTACCAAA GAAATGAAAT GATTAACGGT AAAACCATTA ATCTGATGGA GAAAACAGAT TATCAATTTT CTGTAAAAGT AAAAATGGAT GATTCCATTG ATGATGATTG TATTGGACTT ATGTTTCGTG TACAAGATGC ACAAAACTAT 
TACATGTTTG TTTGGGAAAA AGATGAAAAA TCAACAGCAG AGAAAACCTA TACGAAAGAA CATGGGAAAG ATGTTGCTGG TAAAATACAA CCATGTGATC GTGCTATTCT TGACGAATAC GGCTTTACTA TGAAAACTTT TAGTCCAGTT AATAATAATA TGTGGAACAC CACTAACAGT AGGGAAGCTT ATTTAAGTAG TGGATTTGGA AGAAGTCATA AACGAATTCT AAAAGCTTTA CCAAGTGTTT TGCCACCATC TCCAAATCAA GATTCATGGA ATAGTGGAAG CAGGTATCCT ACAGATAAAA CGAACTGTTC ATTTAAGGAT ATAACAAATA TGACTGAATC GTATGCCAGT AAGGGCTGGG AGCATGGAAA AGATTATAAG TTAACCGTAG TTGTAACTGG TGATTGGTTC CGTATTTTCA TTTCTGATAA CCCCGAATCA GACGAGCTAG GACAATTAGT TTGCCAAGCG AAAGATAATA CGCATCAAAA AGGCTCATAT GGTATTTTTT CTGCTTCTCA GAGAAATACA CTATGGTATG ATTTTAAAAT GGCTGACGTG ATTGTAGATA CAGTTTGCAC AGAGAAAAAA GATATCTTAT TAACAGATAC GAAGGATAAA AAGCTTTCTA ATTATGCAGT AGAGGACTTG ATGGCTGCAT CCATTAAAAG TAAAGCTAAA CAACTAGGCG ATGCAACTTA TGAGGTCTTC ACATATTACG GGAAATCCGA TGGTGAATTT ACTATTAGCA TAGACCCTAG GACACACTTC GTTTATGGGC GTTCCAATGA TCCTATTACA GGTAATGTGG TCAGATCAAA GTGGACAACT AAACAAAATG GTCTAACCAT AAAAGGAACA GGCTTTATAG AATATCATGC AGATGGACAT TATACAATTT CCACTGTTCC AAGTGTGCTC CCTACTGAGC AGATACCATC AACTGTAAAA GACTTCACAT GGAATGAACC AGTTCTATTA ACTGGAGAGA ATGTTTCTAT CCAATTAGAG CATCCAAACA AGTTAAAGGT TATACCAAAA GTTCCTCCAA TACGAGTAAT AGGAAAACCC TATACATTAG AGGATAATGA GATATTAAAA GCAGATGGTC TGAAAAGCAT TATAGAGATA TTTGGTGACA TAGGAGTTTA TCAGTTTCTT GAAATACCTA AAGATATTCC ATTAGATGAA ATCTGTTTGC GAATTGAACG TGGAAAAGTG AACGGAATAT CAGCTGATGG GACTGCTTCC GTAGAGAATG CAGAGTATCG AGTTAATTAC AGGTTTAGAT GTACTAAAAA TGGATTTATT CGATTACCGG TGGATCAGTT TCAAGATCAA ATAGGTGTCA ATCGCCTTCG ATTAAAAAGC ATATTAACTG ATAAAGGTGA GCTAAATTCT TCTATTCATG TGGATGTAGT AGCTTGGACT ACTTTTCAAC AATTATCAGC AGTCCCTTTA TTCGCTATTA AAGTAGAAGA GCAAAGAAAA ATTGAAATTG AAAAGCCTAG GATTGAGCAA CAAAATATAG AACAAGAAAA TTGGTATATT CGAGTTAAAA GTGGAAGGTT TTTAAAGAGA TTGCGATTGC CTTATCATGA GTTAAATTCA ATAGAAAGAC CACCTGAGAT ATACATTTCG TATCCGCAGC TATTGGGAAT GGTTAAATCG CCAAATGATA TCGTAGAGGT TGATTTGGAA TACAGCCTTC CAGAATATAC GAATCAAGAA TTCTATAATA ATGCTACTAT ACTGGTAGAT AAAGAACGGC CTATGATATT AAATGAGTAT TCCATACAAA 
CAAGGTATAC TCCTATTTTA TTAAGCTCAC CAAAAAATAC TAGTTATCTA GAAGTATATT CAATTAGAGG AAATAATAAA CGAAATCTCC GAGTAGCTGA TGTGGATGCA ATGAAGGGAA TTGTTTATTT ATTAGATCGT ATAGGTGAGC AAGATGAAGT ATATATAAAA TACGCATACA AAGAAGAATG GTATACATAT CGAGGGTTCG AAAAGAATGA ATCGTTTTTC CATTTAGATT TAAATCCAAC ACCAGGTCAT CGTCACACAG TAGCTGAAAA TGGTTTCCAT AGATGGATTC CTATTGAAAG TGATACAGAG CCATTTAAAG TAAAGAAAGA AGGAACTAGC AATCTAGAAT TACTAGTAAA ACAAATTCAT GTGTACATAT GCCCTTCTGC AGTTAGATTA GTTGATTTAG CTAACCCTAG TTTGCATGGC AAATTTGTAA CAGGGTCAGT TCGTAAGAAG GTACTTTTCC ATACAGATGA AGAGTTTTGG TTTAATCAAA AAGACCAGAA ATATAATCCA ACCTTGCTTC GATTAGGAAA AGCAATGCTA CAGTCCAATA GCACATCAAA AGACAACATA ACCATTCTTG ATACAAGAAC AAAGGGTGGA GGACTGGATG AATCTTTATC AAGGGAAATT ATTAAACAAG TTAATAAAGA GTCACTATAC CATTGGGACA TTGGCTACTT TGATGGAGAA GCTTACCAAG AGAATGGGGT TATAATTATT AGGCTTCCAA AAAGTATATT GAAATCAGAG GATAATCCTA ATGGATTTAT TGAATCCGAA ATACAAGAAG CCGTAGCCAA ACATAAAGCT TATGGTACAT TGCCTGTTAT TGAATTTTAT AATGAATTCA AGCAGGAAGA TGATTATAAT ATTCTTCCAA ATCATGAGTT CTTATATAAT CAGCATATAG GATATTATAA TAGCTCTATA AGTAAAGGTT CGTTTGAAGT AATTAATGCA TTCTTAGGAA CAGGGGATAA TTATGCTTTA CGTATCAATA ATGATGCTGA GTATGGCATT ACTTTACCTG GTCACTTAAT CAATCATAGT ACATATGGAA TTGAGATTAA AGCTATAAAA GAGCAAATGG CAACAAAGCG TAGTTTAGGT CAGTTAGAGA TATCCTATGA TGATGGGACG AAAGAAACAA TTGAGTTAGC GCAAATCAAT CAGCCTCAAT GGATGGTTTA CAAACAATCA ATAAAAGTAA AGTCTAGCGT ACATAAGATT GATATCATCT TAAATAAATC TAAAGACATC CGAAAAGGAT CGCTTCTTAT TGATTACGTT AAACTAAATG CAATGCCAGT GATATCAGAA ATATCTACAG AAATATATGA AATCTAG
|
Protein sequence | MSLEREKQGL DLHDYKSKLS NNTYSMRLGK EVPEGDVNLA YIHVPKVQIE ENLSLIDTSY TADNVITKDQ LESIVVANEA GELEYVDIVN EDEIKPRPPK GVFPSDKINV TRKFKKNEYK TESALYYKFE IDFHYDSKTA IANEKGVFKK EKYTGQQIEL TDENGNLLDD SYKYDIYVIP YKENPRIYSI QVYLHKNTDK NNTIKIRYNH IDKIVKDSSI QAVEKSIEFY TDKNNEIQVD KLSRQMLEGG KLRIINGISA FDQKTEEEVR QASIDNPDKE IFAVVSHSNG EGHKVIVAQR SESDPRTPKI FSHRIVAKYK NDDGKEMQVS VGHITDWVMN YDALLTNEKE EYTGEWKNIG LALDGGKINA KDMIELSLPM GTPSVPMDAK YFIEDGKGNL LYNVTTIVDN NEIESQINEV MSHAAEAKVK QLNVKPWKNA LQDNVKIKDE PILHRCTIIP ERQKTKWGFT WEANGQGFTE KKMDYKTNWQ VCADVAFKKE RETKVLDVLD KNKWSTIGIS NDLSKWQYSY IASMKKNVIQ YLENQNDVCG FYQRNEMING KTINLMEKTD YQFSVKVKMD DSIDDDCIGL MFRVQDAQNY YMFVWEKDEK STAEKTYTKE HGKDVAGKIQ PCDRAILDEY GFTMKTFSPV NNNMWNTTNS REAYLSSGFG RSHKRILKAL PSVLPPSPNQ DSWNSGSRYP TDKTNCSFKD ITNMTESYAS KGWEHGKDYK LTVVVTGDWF RIFISDNPES DELGQLVCQA KDNTHQKGSY GIFSASQRNT LWYDFKMADV IVDTVCTEKK DILLTDTKDK KLSNYAVEDL MAASIKSKAK QLGDATYEVF TYYGKSDGEF TISIDPRTHF VYGRSNDPIT GNVVRSKWTT KQNGLTIKGT GFIEYHADGH YTISTVPSVL PTEQIPSTVK DFTWNEPVLL TGENVSIQLE HPNKLKVIPK VPPIRVIGKP YTLEDNEILK ADGLKSIIEI FGDIGVYQFL EIPKDIPLDE ICLRIERGKV NGISADGTAS VENAEYRVNY RFRCTKNGFI RLPVDQFQDQ IGVNRLRLKS ILTDKGELNS SIHVDVVAWT TFQQLSAVPL FAIKVEEQRK IEIEKPRIEQ QNIEQENWYI RVKSGRFLKR LRLPYHELNS IERPPEIYIS YPQLLGMVKS PNDIVEVDLE YSLPEYTNQE FYNNATILVD KERPMILNEY SIQTRYTPIL LSSPKNTSYL EVYSIRGNNK RNLRVADVDA MKGIVYLLDR IGEQDEVYIK YAYKEEWYTY RGFEKNESFF HLDLNPTPGH RHTVAENGFH RWIPIESDTE PFKVKKEGTS NLELLVKQIH VYICPSAVRL VDLANPSLHG KFVTGSVRKK VLFHTDEEFW FNQKDQKYNP TLLRLGKAML QSNSTSKDNI TILDTRTKGG GLDESLSREI IKQVNKESLY HWDIGYFDGE AYQENGVIII RLPKSILKSE DNPNGFIESE IQEAVAKHKA YGTLPVIEFY NEFKQEDDYN ILPNHEFLYN QHIGYYNSSI SKGSFEVINA FLGTGDNYAL RINNDAEYGI TLPGHLINHS TYGIEIKAIK EQMATKRSLG QLEISYDDGT KETIELAQIN QPQWMVYKQS IKVKSSVHKI DIILNKSKDI RKGSLLIDYV KLNAMPVISE ISTEIYEI
|
| |
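The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is only an illustration, not the database's own pipeline: it builds a codon table from the standard NCBI amino-acid ordering (translation table 11 assigns the same amino acids to elongation codons as the standard code and differs only in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon order T, C, A, G for each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTCTCTTG AAAGAGAAAA ACAGGGGTTA"
print(translate(first_blocks))  # MSLEREKQGL
```

The output agrees with the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the full protein, provided the record contains no unusual codons.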