Gene GWCH70_2005 details


Gene Information

Locus tag: GWCH70_2005
Symbol:
ID: 7978960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2063099
End bp: 2065279
Gene Length: 2181 bp
Protein Length: 726 aa
Translation table: 11
GC content: 41%
IMG OID: 644798830
Product: CRISPR-associated helicase Cas3
Protein accession: YP_002950000
Protein GI: 239827376
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3; [TIGR01596] CRISPR-associated endonuclease Cas3-HD
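
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state its convention, but this is the one under which the numbers agree) and that the gene length counts the terminal stop codon while the protein length does not. The function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon that
    is counted in the gene length but not in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Start bp and End bp taken from the record above.
gene_len, protein_len = cds_lengths(2063099, 2065279)
print(gene_len, protein_len)  # 2181 726
```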


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTTTT ATGCGAAATC GGAAACAAAA GAGACGATCC GTGAACACAC GGATCGTCTC 
CTTGATAATT TGTGGTTATT AAAGAACAGT TATGGGCACA AATTTGTTCG CATGGATGAA
AGAATGTGGG AATTGCTCCG GCTTGCGGTG GAATATCACG ACGTTGGCAA AGCAAACACC
GTATTTCAAA ATAAAATACG CCGTGCGATT CGTGAGGAGA TGCTTGAAAC CGACTGCGAT
GCAGATGTGC CACATAACTA TTTGTCTGTC GGATTGATTC CGTTTTCACG GCTTGATTTG
ACGAAAGAAG AAAGCCGCCT GCTCATTCAT GCGGTCGGTT ACCATCATGA GCGGGATAAG
CCTCCTGATA AGGAGATGAT TCGCCATATT CTTGAGACGG ATATGAAGAA TCAGCGAAAC
GCGATTTCCG AGCATATGAA ATTGCCGCTT GCCGAGAAAC TATCATTTCG ATTTGTGGAT
CGCCTGTTAA AACGGTACAC CATTCATGAC GGCGATCAGT TTTGGAACTA CGTATTGTTA
AAAGGGCTTC TTCATCGTAT AGACCATGCA GCATCCGCGC ATCTTCCAGT AGAAATCGAT
ATCGATCAGT CCATTGGTGC AAGTGTGCGC AACTATATGA ATCAGCAAGG ATTTCGCAAA
AACGCTTTGC AGCAATTTAC CGAAGCCAAT CAAGACAAAA ATGTCGTTGT TGTTGCACAA
ACGGGGATGG GAAAAACGGA AGCAGCTTTG TTATGGATTG GCGATGATAA AGCGTTTTTC
ACCTTGCCGC TTCGGGTAAG CATTAACGCC ATTTATGAAA GAATTCGGGA CAAAATGGGA
TACGATGCGG TCGGATTATT ACATTCAACA AGTGTTCATT ATTTGGCAGA AAAAGGAGAA
GAAAACTGGG AAGCAATAAA ACAACAATCC CAACAGCTGT CAACGAAGCT GCTGATGACA
ACGATTGATC AAATTCTGAA ATTTCCGTTT TATTATCGCG GGTTTGAAAA AGAGTTAGCG
ACAATGGCGA ACTCTAAAAT CGTGATTGAT GAAATTCAAG CATATGATCC GAAAATCGCT
GCCATGCTTA TTAAAGCATT GGAAATGATT CATAACGTCG GCGGTAAATT TATGATCATG
ACCGCTACGT TGCCAACATT ATATCTCGAT GAATTAAAGA AAAGAAACAT CATTAATGAG
GAACATAGCG TATTTGGCGA ATTTATCGAT ACAAGCAAAC ACCGTCATCG CATTCAACTG
CACGCAAAAG AGATAACAGA AGGGTTGGAG GATATCGTCC GCAAGGCGCA AACATCCCAA
GTGCTTGTCA TTGTCAACAC GGTACGCCGT GCTATCGAGT TGTATCAGAG CTTATCGGAA
CATGAACAAA ATATCCCTGT TTATTTGCTT CATTCGCAAT TTACCCAAGA GGATCGACAG
TTGTTGGAAC GGAAAATAAA AGAATTTAAT GATAAGAAAA TAAACGGAAT TTGGATTACG
ACACAACTTG TAGAAGCAAG TATTGATATT GACTTTGATT ATTTATTTAC CGAAATGTCT
ACCTTAGATA GTTTATTTCA GCGGCTAGGG CGCTGTTATC GTAAACGTAT ATTAGACGAG
GAACGATGTA ATGTGCATAT TTTTACCGAA AACATTTCCG GTGTGCCGCG AGTATATAAC
GAACATCTCA TAAATGAAAG CATAAAGCTA TTGCAGCCGT ACGACGGGCA CATTCTGGAT
GAACGCACCA AAGTAGAAAT GGTAAAGCAA TTGTACGAGC GGAAACGGTT AATCGGAACG
GCATTTTTAC AAGAATTCGA GGATGCCTTA TATATGTTTG ATAACCTAGA TCCGTATGGA
ATGAACAAGA AAGAAGCACA AAGAAAATTG CGGGATATTC AAAATATTCA AGTCGTTCCT
CGACGAATAT ATGATGGGAT GATTGATTTG TTAGAAGCGT ATGCCGCTTG CCGTGATGTG
AAGGAGCGGC TTCGGCTGCG AATGGAAATA GAAAAGAAAA CGGTAAGCGT TCAACGTTTT
ATTGCAGAAA AATACGTATC AGAGCGGCTG CCAAAACCGT TTGAACATAT ATACATCATT
GATGTCGAAT ACGATTTCCA ACGAGAAACA TCGAAGGGAA AAGGAATTTT ATTAGATACC
CCGATGTCTA CCTTTTACTA A
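
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any analysis. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_fraction`) join the blocks and compute the GC fraction. Applied to the full listing, the result should come out close to the 41% GC content given in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a 10-base-grouped listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, pasted as displayed.
first_line = "ATGACGTTTT ATGCGAAATC GGAAACAAAA GAGACGATCC GTGAACACAC GGATCGTCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```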
 
Protein sequence
MTFYAKSETK ETIREHTDRL LDNLWLLKNS YGHKFVRMDE RMWELLRLAV EYHDVGKANT 
VFQNKIRRAI REEMLETDCD ADVPHNYLSV GLIPFSRLDL TKEESRLLIH AVGYHHERDK
PPDKEMIRHI LETDMKNQRN AISEHMKLPL AEKLSFRFVD RLLKRYTIHD GDQFWNYVLL
KGLLHRIDHA ASAHLPVEID IDQSIGASVR NYMNQQGFRK NALQQFTEAN QDKNVVVVAQ
TGMGKTEAAL LWIGDDKAFF TLPLRVSINA IYERIRDKMG YDAVGLLHST SVHYLAEKGE
ENWEAIKQQS QQLSTKLLMT TIDQILKFPF YYRGFEKELA TMANSKIVID EIQAYDPKIA
AMLIKALEMI HNVGGKFMIM TATLPTLYLD ELKKRNIINE EHSVFGEFID TSKHRHRIQL
HAKEITEGLE DIVRKAQTSQ VLVIVNTVRR AIELYQSLSE HEQNIPVYLL HSQFTQEDRQ
LLERKIKEFN DKKINGIWIT TQLVEASIDI DFDYLFTEMS TLDSLFQRLG RCYRKRILDE
ERCNVHIFTE NISGVPRVYN EHLINESIKL LQPYDGHILD ERTKVEMVKQ LYERKRLIGT
AFLQEFEDAL YMFDNLDPYG MNKKEAQRKL RDIQNIQVVP RRIYDGMIDL LEAYAACRDV
KERLRLRMEI EKKTVSVQRF IAEKYVSERL PKPFEHIYII DVEYDFQRET SKGKGILLDT
PMSTFY
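
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard NCBI amino-acid string; translation table 11 assigns the same amino acids as table 1 and differs only in its permitted start codons, which does not matter here because this gene begins with ATG. The `translate` function is an illustrative helper, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; identical for NCBI
# translation tables 1 and 11 (they differ only in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACGTTTT ATGCGAAATC GGAAACAAAA"))  # MTFYAKSETK
```

Passing the complete gene sequence to `translate` should yield the 726-residue protein listed above, ending in PMSTFY before the TAA stop codon.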