Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B5645 |
Symbol | comFA |
ID | 7186510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 5054262 |
End bp | 5055386 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643553082 |
Product | comF operon protein 1 |
Protein accession | YP_002448723 |
Protein GI | 218900312 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4098] Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0073075 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.000000000065503 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGAGGG TAAGTGAATG TGCTGTACTT GTTCGCGGGA TTGCTGAAAG AAAGAGAGAA AAGAAATTAA ACTTGTTACA GTGGAACGGG AAGTTGTCTA CTGGTCAGAA TTTGGCGGCA CAAGGTGTTG TAGAGGCTAT TAAGCAAAAA GAATCATTTT TTATTTGGGC TGTATGCGGG GCTGGGAAAA CAGAGATGTT GTTTTACGGT ATTAACGAAG CGCTTCAAAA AGGAGAAAGA GTTTGTATCG CAACGCCGAG AACGGATGTT GTTCTGGAAT TAGCACCGAG ATTACAAGAA GTATTTCCAT ATATAAAGGT AGCGGCTTTA TATGGAGGGA GTGTGGATAA AGAAAAAGAT GCAGTACTAG TCGTTGCGAC CACGCATCAA TTATTGCGTT ATTATAGGGC GTTTCATGTC ATGGTTGTAG ATGAGATAGA TGCTTTTCCA TATTGTGCAG ATCAAATGTT ACAGTACGCG GTAAAACAAG CGATGAAAGA AAGGGCGGCG CGTATTTATT TAACTGCGAC TCCAGATGAA ACGTGGAAGC GAAAATTTAG AAAAGGTGAA CAAAAAGGTG TTATTGTTTC TGGACGATAT CACCGTCATC CTTTGCCAGT TCCTTTATTT TGTTGGTGCG GGAATTGGAA AAAAAGCCTC ATTCATAAAA GAATTCCTCG AGTTTTACTA CAGTGGTTAC AAACATACTT AAATAAAAAA CATCCTATTT TTTTATTTGT CCCCCATGTG CGATATATAG AAGAGATAAG CCTGTTGTTA AAATCATTAA ACAAGCGAAT TGAAGGTGTA CATGCAGAAG ATCCGGGGAG AAAAGAAAAA GTAGCGGCTT TCAGAAAAGG AGAAATCCCA TTATTAGTTA CAACGACAAT TTTAGAGCGA GGCGTAACGG TGAAAAATTT GCAAGTAGCG GTTTTAGGGG CTGAAGAAGA AATATTCTCA GAAAGTGCAC TCGTACAAAT TGCGGGCCGA GCAGGGCGGA GCTTTGAAGC ACCGTATGGA GAGGTCATTT ATTTTCACTA TGGTAAGACA GAGGCGATGG TGCGCGCGAA AAAACATATT CAAGGTATGA ATAAAAATGC CAAAGAACAA GGATTGATCG ATTAA
|
Protein sequence | MGRVSECAVL VRGIAERKRE KKLNLLQWNG KLSTGQNLAA QGVVEAIKQK ESFFIWAVCG AGKTEMLFYG INEALQKGER VCIATPRTDV VLELAPRLQE VFPYIKVAAL YGGSVDKEKD AVLVVATTHQ LLRYYRAFHV MVVDEIDAFP YCADQMLQYA VKQAMKERAA RIYLTATPDE TWKRKFRKGE QKGVIVSGRY HRHPLPVPLF CWCGNWKKSL IHKRIPRVLL QWLQTYLNKK HPIFLFVPHV RYIEEISLLL KSLNKRIEGV HAEDPGRKEK VAAFRKGEIP LLVTTTILER GVTVKNLQVA VLGAEEEIFS ESALVQIAGR AGRSFEAPYG EVIYFHYGKT EAMVRAKKHI QGMNKNAKEQ GLID
|
| |