Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH820_5172 |
Symbol | |
ID | 7191278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011773 |
Strand | - |
Start bp | 4872107 |
End bp | 4875352 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643558583 |
Product | wall-associated protein |
Protein accession | YP_002454094 |
Protein GI | 218906260 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.000000000000236947 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGAAGA AAAAGACGAG TAGGGTATTA GCAGCTGGTA TATGTATATC TACATTGTTA TCACCAGTAG CTTTTGAAGC ATCGAAAGGA TATGCCGCAC CTTTGGAAGA GAATAAGGGC GGAAATTTAG AAGAGGTTAA AGAGAATAAA TTTGAGCAGA GAGTATTTCA GTTGCCGGGG AAAGGTAGTG TAGAAGAAAA CAGAGATCGT TTAAAGATGC AATTTGCCTT TTCACCTAAT GAACCAACTG GGATTTATGC GAAACCAGAT GAAGAAATTG TAGTTGAAAT AAAGGGAAGC CAATCTATTA AAGCGTTTAT AGGGACGCGA GCTTATGACA AGGAAGGGCC AAAAGAGTTC GACTTGAATC CTGGTAAAAA TATTATTTCT TCGCCCAGCG GCGGTATTTT ATACTTTTAT AATATGAATA ATACCGGGGA AGTTACAGCG ACTGTTACAA GTGGAGGTAC TCATTTTCCT CTATTTATAC TAGGGAAACA TACGAAAAAA GATTGGGATG CAATGCTAGA AAAATATAAA AACCCTTACG CAATAGAATT AAAGGGTGAT AGAAGTTTAA TAACTACAAG TTATGAAAAA GTAGAAAAAA ATATGAAAAA AACGGATCCA ACAGAGTTAA TGAAAAAACA TGATGAAGCA ATTCGGATTG AAAATGCATT ATCTGGATTA TCGGAAGATG GCATTGGAGT TGCGCATTCA GGGAAACATT ATATTCAATT TATTGAAGCA AAATATCCAA CTAGCCCTTT TATGTATGCG AATAATTACC TTACAGGATA CGCAGAAGAT TCAATAGAAT TTGTGCTAGA TATTGAAAAA TTCACAAAAG ATGGTTGGGG ACCATGGCAT GAAGTAGGAC ACATACATCA GCAGGTACCT TGGTTATCCG AGGGAATGGG TGAGACGACT GTTAACATTT ATAGCTTAGC TGTTCAGTTG GCCTTTGGAA ATAAATCGCG AATGGAAGTA GATGGGCGTT ATGAAGAGGC ATTTGCTTAT TTGAACCAAC CAGATGACCA AAAGAATTTT GATAAAGCAG ATCCAATTAT TATGTTTTGG CAGTTGCATT TAATTTATGG AGATCAATTC TATCCGAAGC TACATCAAAT GTATCGAGTG CTATCTGATA CAGAATATTC TATGTTAGAT ACTGAAGAAG TTATTTCGAG TAGAGAGAAA AAGCAAATGT TCATATACAT GGCTTCAAAA GCATCGGGAC AAAATTTAAT TTCTTACTTT GCGAAATGGG GATTACATGC AGAACCGGAT ACGATAGAAA AAGTAAATAA ACTACAATTA CCTGAACCGA AAAATGAAAT ATGGTTATCG AGAGATAGTA ATCCGATTCG TGAAAAACAA GTGGAAGCAT ACAAAGTTCC TTATGGGGAA GCAGTTAATA CAGTACCAGA TATTTTAATA GGTACAGAAT TTGATGAGAA AAAGGCAAGT GAACTTGTAA AAAATCTAGG GGCAAATGTA AAAACGACAG GGAAAATAGT ATGGCCTAAA CAAGAAAATG GAAAACAAAC TGTGAATGTA GAAATTGTGG ATGCAGAAGG AAATGTAAAT GCAATTCCTG TACCAGTTAA CGTTGTGTAT GGGGATAGTA TGGCATTTAA AACATATTGG AATACTAACG GTGTGTTAAC GTTACGTCAT GAAGATAAGA AATTCAATAT GACATTAGTA AGAAATATAT TGAATCATAG TTATCGAGAT AAAAAATATG TAGGTGTTAC GATCTATGAT GCAAACGGAA ATGAAAAGAA GAATGTTTCA GCCGAAGGAC ATGAGGGACT AAAGAATTTC GTGAAAGAGT TAGATGGAAC GTCGTTCGAG TATGGAGATA TCGTTAAGGT ATATCATATC CAGCCAGGTT ATTTAGAATG GTATGATGAC AATAAACGTG TAGACCAAGG GGAAGCTAAA AAGAAAAAGG AAAAACTATT TAAAATTACC CCGCAAGGAT ATGAATTAAT TGATGGTTTA CAAGAAGTAA CGGCAGTACC GCAAAAAGTA GTAATTGGAA CAGATGTTGA AAAATTAGAA GCAAAAGACT TTGTTCAAGT GAAAGACGGA GAAGTAGTAG GTTTTGTAGA AAAACCAATT ACAACAAAAA TCGGCGAACA AACGGTAAAA GTAGAAACGA AAGACCGTTT CGGAAATAAG CAAGTGACAG AAGTTCCTGT AGAAGTAATT TACGAAGATA GCATTATGTT CTTCGGTACA TGGTATGGTG GAACGAATAT TAAATCGATT GTTACATTAA ATCATGAAGA AAAGAAATTT AGCGTAACAG ATTCAGCAGG TAAAATGCAT ACTGCATTTG CAGATGAGAA ATATATGGGA ATGACTGTAT ATGATAAAGA TGGAAAAGAG AAAAAAGCTT TGTCTGTAAA AGCATCTGAG AATACGAAAG TATTTGCAGA ACAATTCAAT GGAATGACAT TTGAATACGG AGATGTAGTG AAAGTATATC AAAGAGAATT TGATAGATTT AAAGTGTACA AAAAGAACGA ACTAGTGGAT ACGCAGTATG GTGTACATGA AGTGTTCTTT AAAGTAACAG AGCAAGGTTT CGAGAGAGTG GAAGCTCAAC AAGAAGTAAC GGCAGTACCG CAAAAAGTAG TCATCGGAAC AGACACTGAA AAATTAGAAG CGAAAGACTT TGTTCAAGTG AAAGACGGAG AAGTAGTAGG ATTTGTAGAA AAGCTGAATA CAACGAAAAT CGGTGAACAA AAAGTAAAAG TAGAAACGAA AGGCCGTTTT GGAAATAAAA AAGTGACAGA AGTACCTGTA GAAGTGATTT ACGGAGATAG CATTATGTTC TTCGGTACAT GGTATGGTGG AACGAATGTT AAGTCGATTA TCACGTTAAA TCATGAAGAT AAAAAGTTAA GTACAATAGG TTCAGAAGGT CCAGTTCATA CTCAATTTAA GAATGAACAA TATATGGGTC TTGCTGTATT CGGTAAAGAT GGAAAAGAGA AGAAGCAAAT GATCTTAGAA GGCATGGAAA ATACAAAGGC ATTTGCGGAA CAATTTAATG GCATGTCATT TGAGTATGGT GATGTAGTAA AAGTGTATCA AGCAGAGTTT GATCGCTTTA AAGTATATAA AAACAATACA TTAATCGATA CAACATATGG TGTGAATGAT GTATTCTTTA AAATTACGGA AAAAGGTTTT GAAAGAACAG AAGGAATACA GGGAAAGACA GTTTAA
|
Protein sequence | MSKKKTSRVL AAGICISTLL SPVAFEASKG YAAPLEENKG GNLEEVKENK FEQRVFQLPG KGSVEENRDR LKMQFAFSPN EPTGIYAKPD EEIVVEIKGS QSIKAFIGTR AYDKEGPKEF DLNPGKNIIS SPSGGILYFY NMNNTGEVTA TVTSGGTHFP LFILGKHTKK DWDAMLEKYK NPYAIELKGD RSLITTSYEK VEKNMKKTDP TELMKKHDEA IRIENALSGL SEDGIGVAHS GKHYIQFIEA KYPTSPFMYA NNYLTGYAED SIEFVLDIEK FTKDGWGPWH EVGHIHQQVP WLSEGMGETT VNIYSLAVQL AFGNKSRMEV DGRYEEAFAY LNQPDDQKNF DKADPIIMFW QLHLIYGDQF YPKLHQMYRV LSDTEYSMLD TEEVISSREK KQMFIYMASK ASGQNLISYF AKWGLHAEPD TIEKVNKLQL PEPKNEIWLS RDSNPIREKQ VEAYKVPYGE AVNTVPDILI GTEFDEKKAS ELVKNLGANV KTTGKIVWPK QENGKQTVNV EIVDAEGNVN AIPVPVNVVY GDSMAFKTYW NTNGVLTLRH EDKKFNMTLV RNILNHSYRD KKYVGVTIYD ANGNEKKNVS AEGHEGLKNF VKELDGTSFE YGDIVKVYHI QPGYLEWYDD NKRVDQGEAK KKKEKLFKIT PQGYELIDGL QEVTAVPQKV VIGTDVEKLE AKDFVQVKDG EVVGFVEKPI TTKIGEQTVK VETKDRFGNK QVTEVPVEVI YEDSIMFFGT WYGGTNIKSI VTLNHEEKKF SVTDSAGKMH TAFADEKYMG MTVYDKDGKE KKALSVKASE NTKVFAEQFN GMTFEYGDVV KVYQREFDRF KVYKKNELVD TQYGVHEVFF KVTEQGFERV EAQQEVTAVP QKVVIGTDTE KLEAKDFVQV KDGEVVGFVE KLNTTKIGEQ KVKVETKGRF GNKKVTEVPV EVIYGDSIMF FGTWYGGTNV KSIITLNHED KKLSTIGSEG PVHTQFKNEQ YMGLAVFGKD GKEKKQMILE GMENTKAFAE QFNGMSFEYG DVVKVYQAEF DRFKVYKNNT LIDTTYGVND VFFKITEKGF ERTEGIQGKT V
|
| |