Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2008 |
Symbol | |
ID | 7978963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2066880 |
End bp | 2068685 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644798833 |
Product | CRISPR-associated CXXC_CXXC protein Cst1 |
Protein accession | YP_002950003 |
Protein GI | 239827379 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTAC AACTCCGTAT GGGAGAATGG ATGATAACAA TGGGGCTTGT CGGGCTTTAT CGTGTATTTG AATATGGACT GAAAAACGAA ATTATTGACC AGCAATACCG ACGCAGCATT TCTATCCGAC CGTGGGGATT GGAACTGGAT GTAGACGTGC TGTCACAATT GCCAAAAGCC TATTTTTTAT ATTTAATGGA TGAATACAGC GTGGCAAAGC GGGATAGCGA AAAATTGCAG CTTTATATGA AGCAGGCGGA AAAAGAGGCA CAGTTTTCAT ATGCGTTATC AGCCATCAAA AAAACGATAA CAGATACAGG AAAAAAAGTG CTCAAATATT TTTCGCATCC ACCTCTAGCC AATGCTTTAG AAGCTTTAAA ACAAGTAAAA AAGCCGGAAA ATTTTGAACT GCTAGCGACA TGTGCCCATA CGTTCGAAAC GGCACTGTAT GAACCAAAAA TTAACGAGAA ACTGACCCTT AACTATTTTA AAGCGGCGAT ATTAAAGGCG TTTTTTGGAC AAGTTTCTTT CCTGAACGTC TCGAAAAACA GCTTGGATTT ACAAGGGCAT ATTCAAGAAT TCCAAAAAGA TTATATTGCA CCTGCACAGT ACGACTTACA ATTTGAGCAC ATATTAGCGG ATGCCAAATC TTCAGAGGAG ATTATCGCGT TTTTAGACAA GCATAAAGAT TATCAGCCAT TTAAAGTATT GAAAAGAGCG TGGAAAAACA AAACATTAGA AGATATTCAA GATTATGTGA AGAATACCAT AACCAAATGC TTGCTTTTGC ACGACCGTCT CGCTTTTTAC AACTTTGAAG AAATGGTATT TGCTCCAATC GGGGTAGCAA AAAACAATGC GTTTAATTTT AACTGGAATT TGGAAGATCA TCAGCCTAAA CCAATTTCTT CTTTGGCAAA ACTAGTATTA TTTTTTGCTC CAGCTGGTGC AGCTATTTAT TTGAAAAAAG AAGGGGTTAA TGAACAGGGG GAATATCGTA CGTATGCGGG ATTTGTCCAA TCCGATGCTG CCTTTTCAGA AATATTACAG AAAAACAATC ATTTCAAACA GTTAAAACAA CAGAAAGAAC CGTTTGATCG TATTGTTAGC AAACTAGTAC AAGGCATCAC AAAAGAAGCA GAGTACGTGG CAGATCACTT ATTTTTTTTA GAGTTTTCCT CTGACTACGA TAGTAAAAAA ACACATTTGC ATTACTACCA CTTGCCGATG TATTTAGCAT ACTATTTTAA AAACTGTAAG GGGAAACTCG ATTATATTCA ACCTTATGAA TATCGCGAGC ATTTTGTCCA GTATGTGCTT CGTGGGGAAG ACCCTGCCAT TGTTATTTAT TCATACTTGC GTTATTGCAT CGAAAAAGGC AATTCGCACA TCGGTCCTTA TATCACAGTT CGTGAGCGGA ATCGAATCCT GCAATTGAAA AAAGGAGTGA AAGATATGTC AAGTACGGAT AAACGCGTAT ATGCGGTATT TCGAAGCGGG CAGGAAATTC GCAAAGCGCT CGAACAAGCA GCCGCATCAA AGACATCGGA ACAATATTCC GCTAGTTCGA ATAAAAAGGT GAATGCGATT GCGTACCGTC TATTAAATGC GGCAAAAGCA GGAAATCAAA AAAGTTTTAT GGATACATTG TTCCGGCTGC ACATGTCAGC GGAAAAGCCG ATATCTCCTA TTTTCTTGAA TGCATTGCAT GAACGTGACT TGGACTTTGC AACAGTAGCC AATGCCTTTA TTGCTGGATT GTTATCTAGT GGATTACAAG AAGAACAGGA GGATGTCGTA TCATGA
|
Protein sequence | MKVQLRMGEW MITMGLVGLY RVFEYGLKNE IIDQQYRRSI SIRPWGLELD VDVLSQLPKA YFLYLMDEYS VAKRDSEKLQ LYMKQAEKEA QFSYALSAIK KTITDTGKKV LKYFSHPPLA NALEALKQVK KPENFELLAT CAHTFETALY EPKINEKLTL NYFKAAILKA FFGQVSFLNV SKNSLDLQGH IQEFQKDYIA PAQYDLQFEH ILADAKSSEE IIAFLDKHKD YQPFKVLKRA WKNKTLEDIQ DYVKNTITKC LLLHDRLAFY NFEEMVFAPI GVAKNNAFNF NWNLEDHQPK PISSLAKLVL FFAPAGAAIY LKKEGVNEQG EYRTYAGFVQ SDAAFSEILQ KNNHFKQLKQ QKEPFDRIVS KLVQGITKEA EYVADHLFFL EFSSDYDSKK THLHYYHLPM YLAYYFKNCK GKLDYIQPYE YREHFVQYVL RGEDPAIVIY SYLRYCIEKG NSHIGPYITV RERNRILQLK KGVKDMSSTD KRVYAVFRSG QEIRKALEQA AASKTSEQYS ASSNKKVNAI AYRLLNAAKA GNQKSFMDTL FRLHMSAEKP ISPIFLNALH ERDLDFATVA NAFIAGLLSS GLQEEQEDVV S
|
| |