Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCE_0966 |
Symbol | |
ID | 2751735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | + |
Start bp | 989264 |
End bp | 992212 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637277792 |
Product | collagen adhesin domain-containing protein |
Protein accession | NP_977289 |
Protein GI | 42780042 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAT ATTTAAAAAG AATATCTGTG ATTTGTTTTA TTTTTACGGT TCTTATTGGA CAAATTTTTA TGCCTATTAT AGGGCATGCT CAAGAATTAA ATACGACAGG ATTTGTGGAT AGTTTTTCAT TTGAAAAGAC GAAATTAAAT TATGGAGAAA AGAATACGAT ACATGTAAAC TTTAGTGAAA AGCCTGGAAA GAAGATGAAA TCTGGGGATA CATTAACGTT AGCACTTCCA CCAGAATTAA AAGGTTATAG TGGCACAATT CCATTAAAGG ATGATTCAGG GCGTATTTTT GGTACGTGCC AGATTAATGC AAGTAATGTA GTTTGTACAT TTAATGATAC GGTAGAAAAG CTTGAAAATA TTAGAGGGAA CTTTAATTTC ACTGTTCAAG GTACGAATGT TGAAGCCGGA AAGACGAAAG ATGTACAAAC GAATTTAGGG ACAGATTTAG AAAAACAAAT GGTAAGTATT ACGCATCCAA AAGGAGAAGG TACAGAACCG GGGATCTTTT TCTATAAGTC TGGTGATATT CAGCCAGACA AAAGTAATGA AGTGCGTTGG TTTTTAAATA TAAATTTAAA GAAACAATAT TTACATGACA ACATCGTTTT AAAAGATACG TTACAAGAAG GACAAACACT AAATAAAGAT AGCTTTACCA TCACTATCAA TAATAAAGAA TATTTGTCTC TTAAACAATT TCAAGACCGA GGTTATGGAT ACATTAAGCT TATTAGTGAT AACTCATTTG AAGTTGTAAT TTATAGGCAT ATGGCGAACG CTACGTCGTT TACCGTTTTC TATACATCAA CGATTACTGA TAGCGGGAAA AAGTTGAAAT ATCTACAGAA TGACTACAAG CTTGATTATC AAATTTTATA TGAGAAACCT AGTACTGAAT CTAATAGTGT AAAGGTTGAA AATATATCAT TTGGCGGCGG GGCTGAAGGG GTTTTACCTG CGAAAGGAAC GCTGCAAATT GTAAAACATA TTGAAGGAGA CGAGAAAAAG TTTATTCCAG GTGTTTCTTT TAAATTGTTT ACAGAGTCAG GACAGCAAAT TGGTGATTCC TATACAACAA ATCAAGACGG AATAGTTGAA GCACCAAACC TTACTCCAGG TAATTATTAC GTACAAGAAA TATCTGCTCC GAACTATGTA GAGTTTGATT CACAAGCGAA AATTCCTTTC ACAATTAAGA CGGATGCTAC AAATGGAATA AAACTTATGG TTCCAAATAA GTTAAAAACT ACATCTGTTG CAGGAACGAA AACGTGGGAA GGCGATAAAG TAAACGATCG CCCAAAAACG ATTAAAGTAG ATTTACTGCA AAATGGTAAA GTCATTGCAA CGAAAGAAGT TACGGCAGAA AATGATTGGA AATATGAGTT TGGAAAGTTA CCAGCAGTTG ATAGTGAAGG AAAAGCTCAT ACATATGAAG TGAAAGAACA ACCAGTATCA GGATATCAGT CGAAAGTGAA CGGATATGAT ATAACAAATA TAAAGATACA GGAAGCAATA GAAGTAGAAG AGCAAAATAA AGAAGAAACA ACAGAAGAAC TAGAAGACTT AGAAAAACCG GAAGAACCAA AGGTAACAGA AGAACCGAAT GTATTAGAAA AACCAGAGGT AAAAGAGAAG CCGGAGATCT GGGTAAAACC GATTGAAGAA GAAAATAAAG AAGAAACAAC AGAAGAACTA GAAGACTTAG AAAAACCGGA AGAGCCAAAG GTAACAGAAG AACCGAATGT ATTAGAAAAA CCAGAGGTAA AAGAGAAGCC GGAGATCTGG GTAAAACCGA TTGAGGAAGA AAATAAAGAA GAAACAACAG AAGAACTAGA AGACTTAGAA AAACCGGAAG AACCAAAGGT AACGGAAGAG CCGAATGTGC TAGAAAAACC AGAGGTAACA GAAAAACCAG AAATCTGGGT GAAACCAGAG GAAGAAGAAA ATAAAGAAGA AACAACAGAA GAACTAGAAG ACTTAGAAAA ACCGGAAGAG CCAAAGGTAA CAGAAGAACC GAATGTATTA GAAAAACCAG AGGTAAAAGA GAAGCCGGAG ATCTGGGTAA AACCGATTGA GGAAGAAAAT AAAGAAGAAA CAACAGAAGA ACTAGAAGAC TTAGAAAAAT CGGAAGAACC AAAGGTAACG GAAGAGCCGA ATGTGCTAGA AAAACCAGAG GTAAAAGAGA AGCCAGAGAT CTGGGTAAAA CCGATTGAAG AAGAAAATAA AGAAGAAACA ACAGAAGAAA TGGAAGATCT AGAAAAACCG GAAGAGCCAA AGGTAACGGA AGAGCCGAAT GTGTTAGAGA AGCCAGAGGT AAAAGAGCAA CCGGAGATCT TGGTAACACC GATCGAAGAA GAAAATAAAG AAGAAACAAC AGAAGAAATG GAAGACTTAG AAAAACCGGA AGAGCCAAAG GTAACAGAAG AACCGAATGT ATTAGAAAAA CCAGAGGTAA AAGAGAAGCC GGAAATCTGG GTAAACCTAG AGGAAGTAGA AAATAAAGAA GGAACAACGG AAGAAATAAC AGAAGAACTA GAAGGTTTAT TAAAGCCGGA AGAGCTAAAG GTGAAAGAAG AGCCGAATGT GTTAGAGAAG CCAGAGGTAA AAGAGCAACC GGAGATCTTG GTAACACCGA TCGAAGAAGA AAATAAAGAA GAAACAACAG AAGAAATGAA AGACTTATTA AAGCCGGAAG AACCAAAGGT GAAAGAAGAA CCGAATGTGT TAGAGCAACC GGAAGTATCA AGCAAACCAG AAGTACAAAA CAAACAAGAT GTACAAGATA AATCTGAAGT AACATCTAAT GAAGAAGATA ATAAGCAATT AAAAGTACTT CCTCAAACAG GTGGAGCATC AACTGAAGCT ACTTCTGTTA TCGCGGGGAT GTTAACATTA ATTTTAGGTG CAAGGTTGTT TAGACGTTCA AAAAATTAA
|
Protein sequence | MSKYLKRISV ICFIFTVLIG QIFMPIIGHA QELNTTGFVD SFSFEKTKLN YGEKNTIHVN FSEKPGKKMK SGDTLTLALP PELKGYSGTI PLKDDSGRIF GTCQINASNV VCTFNDTVEK LENIRGNFNF TVQGTNVEAG KTKDVQTNLG TDLEKQMVSI THPKGEGTEP GIFFYKSGDI QPDKSNEVRW FLNINLKKQY LHDNIVLKDT LQEGQTLNKD SFTITINNKE YLSLKQFQDR GYGYIKLISD NSFEVVIYRH MANATSFTVF YTSTITDSGK KLKYLQNDYK LDYQILYEKP STESNSVKVE NISFGGGAEG VLPAKGTLQI VKHIEGDEKK FIPGVSFKLF TESGQQIGDS YTTNQDGIVE APNLTPGNYY VQEISAPNYV EFDSQAKIPF TIKTDATNGI KLMVPNKLKT TSVAGTKTWE GDKVNDRPKT IKVDLLQNGK VIATKEVTAE NDWKYEFGKL PAVDSEGKAH TYEVKEQPVS GYQSKVNGYD ITNIKIQEAI EVEEQNKEET TEELEDLEKP EEPKVTEEPN VLEKPEVKEK PEIWVKPIEE ENKEETTEEL EDLEKPEEPK VTEEPNVLEK PEVKEKPEIW VKPIEEENKE ETTEELEDLE KPEEPKVTEE PNVLEKPEVT EKPEIWVKPE EEENKEETTE ELEDLEKPEE PKVTEEPNVL EKPEVKEKPE IWVKPIEEEN KEETTEELED LEKSEEPKVT EEPNVLEKPE VKEKPEIWVK PIEEENKEET TEEMEDLEKP EEPKVTEEPN VLEKPEVKEQ PEILVTPIEE ENKEETTEEM EDLEKPEEPK VTEEPNVLEK PEVKEKPEIW VNLEEVENKE GTTEEITEEL EGLLKPEELK VKEEPNVLEK PEVKEQPEIL VTPIEEENKE ETTEEMKDLL KPEEPKVKEE PNVLEQPEVS SKPEVQNKQD VQDKSEVTSN EEDNKQLKVL PQTGGASTEA TSVIAGMLTL ILGARLFRRS KN
|
| |