Gene Information |
Locus tag | GWCH70_0310 |
Symbol | |
ID | 7977428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 356716 |
End bp | 357717 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644797303 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002948503 |
Protein GI | 239825879 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
| |
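The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with one untranslated stop codon; neither convention is stated in the record, but both are consistent with the listed lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 356716
end_bp = 357717

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1002 333
```

Both results agree with the Gene Length (1002 bp) and Protein Length (333 aa) fields.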
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.753182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA CGCTGTATAT TTTTCAAAGC GGAGAGCTTC GCCGCAAAGA CAATAGTTTG TATTTCGAAA CAGAAGAGCG GAAACGATAT ATTCCTGTAG AGAACACTAA CGATATTTAT ATTTTCGGCG AAGTAGATGT ATCAAAAAAG TTTCTTGAGT TTGCGGCGCA AAAAGAGATT ATTGTTCATT ATTTCAACCA TCATGGGTAT TATGTCGGCT CTTTTTATCC ACGGGAACAC TTAAATGCTG GTCATGTAAT TTTAAAACAA GCGGAACATT ACATTGATTC GAACAAACGG CTCGATTTAG CACAGCGGTT TGTGGAAGGT GCTATTCAGC AAATGACGAG AGTAATCAAA TACTACCGAA CTCGCCTTTC TGAGGCAGAT GTGTTAGCCC GTGCGTTAGA AGTGATCGAA CAAGAAACAG AACGGATGAG ACAAGCGCAG TCGGTGGAGC AGCTAATGGC AGCGGAAGGG CATATTCGGG AAATGTATTA TTCCACCTTT GATGTCATTA TTCGTCATCC CGATTTCGTG TTTGAAAAAC GGACGAAGCG CCCTCCGCAT AACCGTCTAA ATGCTCTTAT TAGCTTTGGT AATTCGATTG TGTATACGAT GTGTCTTAGC GAAATTTACA AAACGTATTT AGACCCACGG ATCGGTTTTT TGCACAGCAC GAATTTTCGA CGGTTTTCAC TCAATCTCGA TGTGGCAGAA ATTTTTAAAC CGATTATCGT GGACCGCCTT ATTTTCACTC TTGTGAATAA GAAAATGCTA TCGGCAAAGC ACTTTGAAAA ATTGACAGAT GGGATTTTGC TTAATGAAGA AGGACGAAAA TTATTCGTCA GTGAACTTGA GAAAAAAATG CAGACGACGG TGCAGCACCG TCACCTTGGC AAATCCGTTT CCTATCGCCG ATTGATTCGG CTAGAGCTAT ATAAAATTCA AAAACATTTG TTAGGTGAAA AAACGTACGA GCCATACGCG GCGAAATGGT AG
|
Protein sequence | MKKTLYIFQS GELRRKDNSL YFETEERKRY IPVENTNDIY IFGEVDVSKK FLEFAAQKEI IVHYFNHHGY YVGSFYPREH LNAGHVILKQ AEHYIDSNKR LDLAQRFVEG AIQQMTRVIK YYRTRLSEAD VLARALEVIE QETERMRQAQ SVEQLMAAEG HIREMYYSTF DVIIRHPDFV FEKRTKRPPH NRLNALISFG NSIVYTMCLS EIYKTYLDPR IGFLHSTNFR RFSLNLDVAE IFKPIIVDRL IFTLVNKKML SAKHFEKLTD GILLNEEGRK LFVSELEKKM QTTVQHRHLG KSVSYRRLIR LELYKIQKHL LGEKTYEPYA AKW
|
| |
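The Sequence fields can be related to the GC content and Translation table entries with a short script. This is a minimal sketch, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard set of assignments, which NCBI translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). Only the first 30 bases of the gene sequence are used as input; pasting the full sequence works the same way.

```python
# Standard codon assignments (bases ordered T, C, A, G), shared by
# NCBI translation table 11 for all 64 codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
gene_prefix = "ATGAAAAAGA CGCTGTATAT TTTTCAAAGC"
print(translate(gene_prefix))  # MKKTLYIFQS
print(gc_fraction(gene_prefix))
```

The translated prefix matches the first ten residues of the protein sequence in the record. Running `gc_fraction` on the full gene sequence should give a value close to the listed 41%, assuming that figure was computed over the gene itself.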