Gene Information |
Locus tag | BCG9842_A0099 |
Symbol | |
ID | 7169583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011774 |
Strand | + |
Start bp | 26841 |
End bp | 29762 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643559037 |
Product | tail sheath subunit |
Protein accession | YP_002454541 |
Protein GI | 218847878 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
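The coordinate and length fields above can be cross-checked against each other. The short sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the reported values but are not stated by the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 26841
end_bp = 29762
gene_length_bp = 2922
protein_length_aa = 973

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: a complete CDS is a whole number of codons, the last being a stop.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 2922 0 973
```

Under these assumptions the span, the reported gene length, and the reported protein length all agree.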
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAACT TTGAACAGTA TGAAAACTTA CCGGGTGTAA AAGTATCTTA TGAAGATGGC AATCTTTATA CTGGTAAACA GGTAGAAGAC TCAAGAACAC AGGCTATTTT AATTATGGGT ACTGCTATTG ACGGTCCGGT TGGAGAGCCA GTATCAGTTA ATCAAATTGG TGGCCCTAAA GCTGCTGAAA AAATGTTTGG TGGTTTACTA GAGAAAAAGA CTATAGTTGA AAATGGTGTA GAAAAAACAG TTCGTGTTCC TCATCAGGGC ACACTAATAC GTGCAATGTG GGAAGCAATA CGTGCTGGTA ACGAAGATAT CCGTTTACTT CGTATCTCTG GTAAAACAGC AATGTCGGAG ATCCTTGCTA AGGATGAAAA TAGTGAAGTT ACAGAGCCGT TAGCTGATAT GTTGGGTAAT AATTTAATTC CAGGTAATGT TGCTTTTACG AAACGATTAA ACATTGAAAA AGATCAACGA TTAGTAAAAA TTGAAAAAAT TGAAGAGTTT GAAGGAACTG ATACAGCAGT GGATCCAGTT AAAACATTCC CAGATTCCAC TGGCTATCAA AATGTAGATA TTACACCTGG ATCCGAAACA ATTTATTTTA CAAAAGATAA GTTTCGTCCT AAAAATACTA TTAAGGTTAT TTACAAAGCT AAAAAACGTA ATTATTCAGA AGTAACACGA AACATGGATG GTAAAACTAA TTCATCAACT TTAGGATTAT TAACTCAAAA TCCTACAATG ACAAATTACT TTTCTTCAGA AGTAGGAAAT TGGTCAGATG AGCCTATTCA TCAAGTAAAT GTTTATACAA AAGATGCTGA AGGTAGAGTC AATGCTATTC CAATGGTTAA TACATCTGGA GAACGCTTAT GGCGTATTGG TAAAGGCGAT TCAGCGGTGA AAAACGAGCT TACTGATATC ATTACAGCAG AAGAATATAA ACAAGGCGGT ATTCGTTTTA CTGAAGCATA TCAACAAGAA GTAGCTAAGG GTATATACCC AGCACTAACG GCGGCTTTAT TAGTAGTGGC AGATTACTTC TATTACAACG ACTTAGAGAT ACAAAAATCT GATGAGTATG TAGTACCAGG CGCTGAAAAA GAGTCAGTAT TAAAACATAC ACCAGTAAGT GATTCTCTTG AAGTATATTA CGAGTTAAAC GGAAAACGTA TCACATTAAA GCCAAATGAT CATTTTACTG TTATTTATCC AGATGGAAAA GAAAGAATAA AAGTAATGAT TAAAGCTGGT GTTGCTCCTG TGGGTGCTAA ATTGTTTGCA CATTATAAAA CAGGTGAAGG TGCTACCCAA GGTGCAAAAA TTAGTATTTT AGGTAAACAC GGTGGTAAAG TGTACGGTGG TATTCAGGAT ATTCGTGATT TTGAAAGCCT TTATGGTGTT AAATACACTG TTGAATATGA AGTAAATGAA AATGGTACAC TTGATATGGA CAATCGTGTA GTGCGCTTTA TCAAACCTTC AGATAAAAAA TTAACATCAA ATGATACTGA AATCCGCTTT AGAACAAAAG AAATGCGTGG TATTCGCACA ATTCGTGAGT TTGCGAATTA TATTAATAGC TTGCCACAAA ATAACATTGT TCGATTGGAA GTTCCTGTGA ACACTGGTGA TGTAGCGATA ACGGGTTTAA TGGTTACGGA TTACAATGTA AGTCCAATTG ATGGACGTTA TGATTATCGT CCAATTAACT TAGGTGAAAA ATATAACGAA GATGAAGGGA AATATTCGTT ATATGTAGAT GATAATAAAG CCGAAAATGA TGCAGGACGT 
TTCCAGTGGT TAGGTGGAGA TGGTTTTTAT GATACTACTA ATTTAATGGC TATGAAAGAA TTATATGATT CTCTAGGCGG AAAATATGAG TTGGTTCCAG CTACTTTAGA TGAATATGAT CTAGTAGAAC AAGGAATATA TAGTAAATTA GAAAACTATT CGGTTGATCT TATTTGGCTA TGTGATGCAT TTGCTAACAC TACAATTGGT GCTATTGCTG ATGATGGTTC AACTTATATA GATAATAATC GAAACTTTGC TACTCAATTA GCACAACATT GTGCAATGGT TAGTGCAAAA ACATACGAAA CAATTGGTGT AATTGGTGTA GCGCCGGCCC CTGAAATGGG ATTGCGTGAG GTCCAGCAAT ACATTGACTT ATTAACTAAG GGAATTGGGG TTGACGAAGA ATCAGCTCAA TACTGGTATT CTCGTGGTAT TAATCCTAAT TATTTAAATG CTCATTATAT GTATAATCTT GCAACAAATG AGCGTATTTT CAATGACGAA GGTGAACCAA TTGATATAGG ACGTTATGCT AACGTTGTAT TCGGTCCTGA AACAGGATTA GCGCATGAAA AGCTTGGCAC ATATATTGCG AGTGGCGCAG GTATTTACGC CGCTCTAATT TCTCAATTGC GTCCTGAGGT GTCTACAACA AATAAACCAA TTGCAGTTTC TGGTTTACGT TACAATTTAT CTGAAGCACA GCATAATCAA TTGGCTGGTG GACATTACGT TACATTTGAG AATAAAGTGA GCATAAATGG CAACCGTTCT GTCGTTGTAA AAGATGGTGT AACAGCAGCG GGACCAATGA GTGATTATCA ACGCTTATCT ACAGTACGTA TTACACACGC TACAGTGCAA GTAATTCGCA AATTAGCGGA TAAATTTATT GGTTTACCAA ATGGTGTAGC TCAGCGTAAT AGTTTGGCAA CAGAAATTCA GGCTGGTTTA GATAAATTAA AAGATTTTGG TGTTTTAACT AACTTTAAAT TTTCTATCTT TACTTCAGCT AAAGATCGTG TGTTAGGTAA CGCATATATT CAACTTGAAT TAGTACCGGC ATATGAAACT CGTAAAATCT ATACAAGTGT AGCACTACGT GCAAGCTTAT AA
|
Protein sequence | MSNFEQYENL PGVKVSYEDG NLYTGKQVED SRTQAILIMG TAIDGPVGEP VSVNQIGGPK AAEKMFGGLL EKKTIVENGV EKTVRVPHQG TLIRAMWEAI RAGNEDIRLL RISGKTAMSE ILAKDENSEV TEPLADMLGN NLIPGNVAFT KRLNIEKDQR LVKIEKIEEF EGTDTAVDPV KTFPDSTGYQ NVDITPGSET IYFTKDKFRP KNTIKVIYKA KKRNYSEVTR NMDGKTNSST LGLLTQNPTM TNYFSSEVGN WSDEPIHQVN VYTKDAEGRV NAIPMVNTSG ERLWRIGKGD SAVKNELTDI ITAEEYKQGG IRFTEAYQQE VAKGIYPALT AALLVVADYF YYNDLEIQKS DEYVVPGAEK ESVLKHTPVS DSLEVYYELN GKRITLKPND HFTVIYPDGK ERIKVMIKAG VAPVGAKLFA HYKTGEGATQ GAKISILGKH GGKVYGGIQD IRDFESLYGV KYTVEYEVNE NGTLDMDNRV VRFIKPSDKK LTSNDTEIRF RTKEMRGIRT IREFANYINS LPQNNIVRLE VPVNTGDVAI TGLMVTDYNV SPIDGRYDYR PINLGEKYNE DEGKYSLYVD DNKAENDAGR FQWLGGDGFY DTTNLMAMKE LYDSLGGKYE LVPATLDEYD LVEQGIYSKL ENYSVDLIWL CDAFANTTIG AIADDGSTYI DNNRNFATQL AQHCAMVSAK TYETIGVIGV APAPEMGLRE VQQYIDLLTK GIGVDEESAQ YWYSRGINPN YLNAHYMYNL ATNERIFNDE GEPIDIGRYA NVVFGPETGL AHEKLGTYIA SGAGIYAALI SQLRPEVSTT NKPIAVSGLR YNLSEAQHNQ LAGGHYVTFE NKVSINGNRS VVVKDGVTAA GPMSDYQRLS TVRITHATVQ VIRKLADKFI GLPNGVAQRN SLATEIQAGL DKLKDFGVLT NFKFSIFTSA KDRVLGNAYI QLELVPAYET RKIYTSVALR ASL
|
| |
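The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be checked, the code below strips that whitespace, computes a GC fraction, and translates a CDS. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and are not part of any database tooling. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation. Because this gene starts with ATG, the sketch does not handle alternative start codons.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGTCAAACT TTGAACAGTA TGAAAACTTA"
print(translate(prefix))  # MSNFEQYENL
```

Pasting the full gene sequence into `translate` should let the result be compared with the listed protein sequence, and `gc_fraction` with the reported GC content. Neither comparison is asserted here.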