Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B0792 |
Symbol | comEC |
ID | 7183891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 4275374 |
End bp | 4277695 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643552235 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_002447904 |
Protein GI | 218899493 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.92497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0000000434572 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCAGGAAC AATGGGGGTA TGTTGCAATC TCGTTTATAA TAGGGATTGC AATTGCCTTT TCCTCGTCAC TTTTATTGAT AGTTTTTTGC CTCTTTTTAA ATATTTTCTT CTGTTTATAT CGCACTTCGC GTAAAACCTT CTTTTTTTGC ATTGTAGCGT GCTTTAGTGG CGCTATATAC ACTTCTTATG TTCAAGGGTG GAATAAGTCT CTAGGAGAGT CCTATGGAGT TACAAGAGGT GTTATACAAA ATACACCGCT TGTTAATGGC GATCGCCTAT CTTTTCAATT TGAAGATCAC AATAAAAATA TAGTGCAATT AAGTTATAGA ATAAAATCAG CTCTGGAAAA GGAAAAAATG CAACAATTAT ACGCAGGGAT ATCATGTGTA TTCGAGGGTG AGAGGAAAGA ACCACAAACA GCTCGGAATT TTCATGGTTT TAATTACCGT GATTATTTAT ACAAGCAACA TATTCATTTC ACATTTGAGG CTACATATAT TTCTGAATGC TATAAAACAT CACTGTCGCT TATGCAATGG ATTCTCCTTT TGAGGCAACA AGCAATCTCG AAAGTTACAG AAATGTTTCC AGAGCAATCA GGTGCTTTTA TGAATGCGTT ACTATTTGGT GATAGACAAC AAATGACGTT TGAAGTGGAA GAGCAATATC AACAGTTTGG TCTTATACAT TTGTTAGCAA TTTCAGGTTC GCACATTGTA TTGTTAATGG GGATAGTATA TTTTATTTTG CTAAGAAGTG GTGCGACGAG GGAGGTAGCA ACAATTTGCC TTATCGTCTG TATTCCTTTG TATATGGTTT TAGCAGGGGC GTCACCATCT GTTATAAGGG CTTCTATAGC AGGAGTTTTG ATGTTGATTG CTTTTATGTG CTCGATTCGT TTATCTAGTT TAGATGCTTT AAGTGTAACA GCTATATGTA TGCTTATATA TGACCCTTAT CTTGTTTTTA ATATTGGATT TCAGTTTTCT TTTGTAGGTA GCTTCGCTTT ACTTTTGTCT GCTCCGTTCT TATTAGGGAG TAGCAACGGA GTAATCCGAA ATTCTATTTA TATTTCTATC ATTTCACAGC TCGTTAGTAC TCCGATCTTG TTATATCACT TTGGTTATTT TTCTCCATAT AGCATTTTCC TCAATCTTTT ATACGTTCCG TTTTTATCTA TCATTGTATT ACCATGTAGC ATTATTATTT TACTGTGTAT GCCAATTGTC CCGTTTCTCG CAAAAGGATT TGCGGATGTG TTATCAATAG GCTTGAGTAT GTCTAATGTT CTTCTAAGTT ATTGTGAAAG TTTACCATTT AACCAGCTTA TTTTCGGGAA AACGTCCTTC AATCTTGTAG CTGTATATTG CACGAGTATT GTCAGTATAT TGATAGTTTG GGAAAGGAGA GCGCCGAAAA AAATGGTTTG TATAGTTGTG AGTATATTTC TTTTTATTAG CACGTGTCAT TATGTATATC CATATTTTAG AGGAAGCGGA AGTGTAACAT TTATTGATGT TGGACAAGGT GATGCAATAT TAATTCGCCT TCCGTATGAT AAAGGGGTTT ATCTTATTGA TACAGGTGGA ACGCTTCGTG TCAATAAGGA AGAGTGGCAA CAAAAAAAGC ATGAATTTTC TGTTGGAACT GATGTTTTAA TACCCTATTT ACAAAAAGAA GGTATTAAAA CAATTGATAA ATTAATCGTA ACGCATGGAG ATGCAGATCA TATAGGTGCT GCGCAAGAAT TATTATCATA TATACCTGTA AAAGAAGTTG TATTTGGCCG GAAGGGAAAA GATGCAGTAT TAGAAAAAGT AGTAAAGAAA CAGGCATTAG AAAAGGAAAT GAAAATAACT GAAGTGGGGG AAGGGGAAAG TTGGAGTGTA AATGAAGCTG AGTTTTTCGT TTTAGCTCCA AAAGGAAAAG AAAAAAGTGA AAATAATGCT TCAATTGTAA TATGGGCAAA ATTTGGAGGA TTGACGTGGC TATTTACAGG CGATTTAGAA GAAGAAGGAG AGCAGTTTTT AGTATCTGCG TATCCAGATT TGCGAGTGGA TGTTTTAAAA GTTGCTCATC ATGGAAGTAA TACGTCATCT ATAGAGTCTT TTTTGAACGT TATACAGCCC CGTGTAGCGA TTATATCTGT TGGTAAGCAG AATAGATATG GACACCCGCA TAAAGAAGTA ATAGAACGTT TGGGGAAGAT GGGGAGTGCG ATATGGCGCA CGGACAAGCA AGGTGCTATT TCCTATGTTT TTAAAGGGGA AAATGGAACC TTTCAAAGTA AAATCACATA TGATAAGACA TATAATAGGT GA
|
Protein sequence | MQEQWGYVAI SFIIGIAIAF SSSLLLIVFC LFLNIFFCLY RTSRKTFFFC IVACFSGAIY TSYVQGWNKS LGESYGVTRG VIQNTPLVNG DRLSFQFEDH NKNIVQLSYR IKSALEKEKM QQLYAGISCV FEGERKEPQT ARNFHGFNYR DYLYKQHIHF TFEATYISEC YKTSLSLMQW ILLLRQQAIS KVTEMFPEQS GAFMNALLFG DRQQMTFEVE EQYQQFGLIH LLAISGSHIV LLMGIVYFIL LRSGATREVA TICLIVCIPL YMVLAGASPS VIRASIAGVL MLIAFMCSIR LSSLDALSVT AICMLIYDPY LVFNIGFQFS FVGSFALLLS APFLLGSSNG VIRNSIYISI ISQLVSTPIL LYHFGYFSPY SIFLNLLYVP FLSIIVLPCS IIILLCMPIV PFLAKGFADV LSIGLSMSNV LLSYCESLPF NQLIFGKTSF NLVAVYCTSI VSILIVWERR APKKMVCIVV SIFLFISTCH YVYPYFRGSG SVTFIDVGQG DAILIRLPYD KGVYLIDTGG TLRVNKEEWQ QKKHEFSVGT DVLIPYLQKE GIKTIDKLIV THGDADHIGA AQELLSYIPV KEVVFGRKGK DAVLEKVVKK QALEKEMKIT EVGEGESWSV NEAEFFVLAP KGKEKSENNA SIVIWAKFGG LTWLFTGDLE EEGEQFLVSA YPDLRVDVLK VAHHGSNTSS IESFLNVIQP RVAIISVGKQ NRYGHPHKEV IERLGKMGSA IWRTDKQGAI SYVFKGENGT FQSKITYDKT YNR
|
| |