Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2005 |
Symbol | |
ID | 7978960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2063099 |
End bp | 2065279 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644798830 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_002950000 |
Protein GI | 239827376 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 [TIGR01596] CRISPR-associated endonuclease Cas3-HD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTTT ATGCGAAATC GGAAACAAAA GAGACGATCC GTGAACACAC GGATCGTCTC CTTGATAATT TGTGGTTATT AAAGAACAGT TATGGGCACA AATTTGTTCG CATGGATGAA AGAATGTGGG AATTGCTCCG GCTTGCGGTG GAATATCACG ACGTTGGCAA AGCAAACACC GTATTTCAAA ATAAAATACG CCGTGCGATT CGTGAGGAGA TGCTTGAAAC CGACTGCGAT GCAGATGTGC CACATAACTA TTTGTCTGTC GGATTGATTC CGTTTTCACG GCTTGATTTG ACGAAAGAAG AAAGCCGCCT GCTCATTCAT GCGGTCGGTT ACCATCATGA GCGGGATAAG CCTCCTGATA AGGAGATGAT TCGCCATATT CTTGAGACGG ATATGAAGAA TCAGCGAAAC GCGATTTCCG AGCATATGAA ATTGCCGCTT GCCGAGAAAC TATCATTTCG ATTTGTGGAT CGCCTGTTAA AACGGTACAC CATTCATGAC GGCGATCAGT TTTGGAACTA CGTATTGTTA AAAGGGCTTC TTCATCGTAT AGACCATGCA GCATCCGCGC ATCTTCCAGT AGAAATCGAT ATCGATCAGT CCATTGGTGC AAGTGTGCGC AACTATATGA ATCAGCAAGG ATTTCGCAAA AACGCTTTGC AGCAATTTAC CGAAGCCAAT CAAGACAAAA ATGTCGTTGT TGTTGCACAA ACGGGGATGG GAAAAACGGA AGCAGCTTTG TTATGGATTG GCGATGATAA AGCGTTTTTC ACCTTGCCGC TTCGGGTAAG CATTAACGCC ATTTATGAAA GAATTCGGGA CAAAATGGGA TACGATGCGG TCGGATTATT ACATTCAACA AGTGTTCATT ATTTGGCAGA AAAAGGAGAA GAAAACTGGG AAGCAATAAA ACAACAATCC CAACAGCTGT CAACGAAGCT GCTGATGACA ACGATTGATC AAATTCTGAA ATTTCCGTTT TATTATCGCG GGTTTGAAAA AGAGTTAGCG ACAATGGCGA ACTCTAAAAT CGTGATTGAT GAAATTCAAG CATATGATCC GAAAATCGCT GCCATGCTTA TTAAAGCATT GGAAATGATT CATAACGTCG GCGGTAAATT TATGATCATG ACCGCTACGT TGCCAACATT ATATCTCGAT GAATTAAAGA AAAGAAACAT CATTAATGAG GAACATAGCG TATTTGGCGA ATTTATCGAT ACAAGCAAAC ACCGTCATCG CATTCAACTG CACGCAAAAG AGATAACAGA AGGGTTGGAG GATATCGTCC GCAAGGCGCA AACATCCCAA GTGCTTGTCA TTGTCAACAC GGTACGCCGT GCTATCGAGT TGTATCAGAG CTTATCGGAA CATGAACAAA ATATCCCTGT TTATTTGCTT CATTCGCAAT TTACCCAAGA GGATCGACAG TTGTTGGAAC GGAAAATAAA AGAATTTAAT GATAAGAAAA TAAACGGAAT TTGGATTACG ACACAACTTG TAGAAGCAAG TATTGATATT GACTTTGATT ATTTATTTAC CGAAATGTCT ACCTTAGATA GTTTATTTCA GCGGCTAGGG CGCTGTTATC GTAAACGTAT ATTAGACGAG GAACGATGTA ATGTGCATAT TTTTACCGAA AACATTTCCG GTGTGCCGCG AGTATATAAC GAACATCTCA TAAATGAAAG CATAAAGCTA TTGCAGCCGT ACGACGGGCA CATTCTGGAT GAACGCACCA AAGTAGAAAT GGTAAAGCAA TTGTACGAGC GGAAACGGTT AATCGGAACG GCATTTTTAC AAGAATTCGA GGATGCCTTA TATATGTTTG ATAACCTAGA TCCGTATGGA ATGAACAAGA AAGAAGCACA AAGAAAATTG CGGGATATTC AAAATATTCA AGTCGTTCCT CGACGAATAT ATGATGGGAT GATTGATTTG TTAGAAGCGT ATGCCGCTTG CCGTGATGTG AAGGAGCGGC TTCGGCTGCG AATGGAAATA GAAAAGAAAA CGGTAAGCGT TCAACGTTTT ATTGCAGAAA AATACGTATC AGAGCGGCTG CCAAAACCGT TTGAACATAT ATACATCATT GATGTCGAAT ACGATTTCCA ACGAGAAACA TCGAAGGGAA AAGGAATTTT ATTAGATACC CCGATGTCTA CCTTTTACTA A
|
Protein sequence | MTFYAKSETK ETIREHTDRL LDNLWLLKNS YGHKFVRMDE RMWELLRLAV EYHDVGKANT VFQNKIRRAI REEMLETDCD ADVPHNYLSV GLIPFSRLDL TKEESRLLIH AVGYHHERDK PPDKEMIRHI LETDMKNQRN AISEHMKLPL AEKLSFRFVD RLLKRYTIHD GDQFWNYVLL KGLLHRIDHA ASAHLPVEID IDQSIGASVR NYMNQQGFRK NALQQFTEAN QDKNVVVVAQ TGMGKTEAAL LWIGDDKAFF TLPLRVSINA IYERIRDKMG YDAVGLLHST SVHYLAEKGE ENWEAIKQQS QQLSTKLLMT TIDQILKFPF YYRGFEKELA TMANSKIVID EIQAYDPKIA AMLIKALEMI HNVGGKFMIM TATLPTLYLD ELKKRNIINE EHSVFGEFID TSKHRHRIQL HAKEITEGLE DIVRKAQTSQ VLVIVNTVRR AIELYQSLSE HEQNIPVYLL HSQFTQEDRQ LLERKIKEFN DKKINGIWIT TQLVEASIDI DFDYLFTEMS TLDSLFQRLG RCYRKRILDE ERCNVHIFTE NISGVPRVYN EHLINESIKL LQPYDGHILD ERTKVEMVKQ LYERKRLIGT AFLQEFEDAL YMFDNLDPYG MNKKEAQRKL RDIQNIQVVP RRIYDGMIDL LEAYAACRDV KERLRLRMEI EKKTVSVQRF IAEKYVSERL PKPFEHIYII DVEYDFQRET SKGKGILLDT PMSTFY
|
| |