Gene GWCH70_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2008 
Symbol 
ID7978963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2066880 
End bp2068685 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content39% 
IMG OID644798833 
ProductCRISPR-associated CXXC_CXXC protein Cst1 
Protein accessionYP_002950003 
Protein GI239827379 
COG category 
COG ID 
TIGRFAM ID[TIGR01908] CRISPR-associated CXXC_CXXC protein Cst1 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAC AACTCCGTAT GGGAGAATGG ATGATAACAA TGGGGCTTGT CGGGCTTTAT 
CGTGTATTTG AATATGGACT GAAAAACGAA ATTATTGACC AGCAATACCG ACGCAGCATT
TCTATCCGAC CGTGGGGATT GGAACTGGAT GTAGACGTGC TGTCACAATT GCCAAAAGCC
TATTTTTTAT ATTTAATGGA TGAATACAGC GTGGCAAAGC GGGATAGCGA AAAATTGCAG
CTTTATATGA AGCAGGCGGA AAAAGAGGCA CAGTTTTCAT ATGCGTTATC AGCCATCAAA
AAAACGATAA CAGATACAGG AAAAAAAGTG CTCAAATATT TTTCGCATCC ACCTCTAGCC
AATGCTTTAG AAGCTTTAAA ACAAGTAAAA AAGCCGGAAA ATTTTGAACT GCTAGCGACA
TGTGCCCATA CGTTCGAAAC GGCACTGTAT GAACCAAAAA TTAACGAGAA ACTGACCCTT
AACTATTTTA AAGCGGCGAT ATTAAAGGCG TTTTTTGGAC AAGTTTCTTT CCTGAACGTC
TCGAAAAACA GCTTGGATTT ACAAGGGCAT ATTCAAGAAT TCCAAAAAGA TTATATTGCA
CCTGCACAGT ACGACTTACA ATTTGAGCAC ATATTAGCGG ATGCCAAATC TTCAGAGGAG
ATTATCGCGT TTTTAGACAA GCATAAAGAT TATCAGCCAT TTAAAGTATT GAAAAGAGCG
TGGAAAAACA AAACATTAGA AGATATTCAA GATTATGTGA AGAATACCAT AACCAAATGC
TTGCTTTTGC ACGACCGTCT CGCTTTTTAC AACTTTGAAG AAATGGTATT TGCTCCAATC
GGGGTAGCAA AAAACAATGC GTTTAATTTT AACTGGAATT TGGAAGATCA TCAGCCTAAA
CCAATTTCTT CTTTGGCAAA ACTAGTATTA TTTTTTGCTC CAGCTGGTGC AGCTATTTAT
TTGAAAAAAG AAGGGGTTAA TGAACAGGGG GAATATCGTA CGTATGCGGG ATTTGTCCAA
TCCGATGCTG CCTTTTCAGA AATATTACAG AAAAACAATC ATTTCAAACA GTTAAAACAA
CAGAAAGAAC CGTTTGATCG TATTGTTAGC AAACTAGTAC AAGGCATCAC AAAAGAAGCA
GAGTACGTGG CAGATCACTT ATTTTTTTTA GAGTTTTCCT CTGACTACGA TAGTAAAAAA
ACACATTTGC ATTACTACCA CTTGCCGATG TATTTAGCAT ACTATTTTAA AAACTGTAAG
GGGAAACTCG ATTATATTCA ACCTTATGAA TATCGCGAGC ATTTTGTCCA GTATGTGCTT
CGTGGGGAAG ACCCTGCCAT TGTTATTTAT TCATACTTGC GTTATTGCAT CGAAAAAGGC
AATTCGCACA TCGGTCCTTA TATCACAGTT CGTGAGCGGA ATCGAATCCT GCAATTGAAA
AAAGGAGTGA AAGATATGTC AAGTACGGAT AAACGCGTAT ATGCGGTATT TCGAAGCGGG
CAGGAAATTC GCAAAGCGCT CGAACAAGCA GCCGCATCAA AGACATCGGA ACAATATTCC
GCTAGTTCGA ATAAAAAGGT GAATGCGATT GCGTACCGTC TATTAAATGC GGCAAAAGCA
GGAAATCAAA AAAGTTTTAT GGATACATTG TTCCGGCTGC ACATGTCAGC GGAAAAGCCG
ATATCTCCTA TTTTCTTGAA TGCATTGCAT GAACGTGACT TGGACTTTGC AACAGTAGCC
AATGCCTTTA TTGCTGGATT GTTATCTAGT GGATTACAAG AAGAACAGGA GGATGTCGTA
TCATGA
 
Protein sequence
MKVQLRMGEW MITMGLVGLY RVFEYGLKNE IIDQQYRRSI SIRPWGLELD VDVLSQLPKA 
YFLYLMDEYS VAKRDSEKLQ LYMKQAEKEA QFSYALSAIK KTITDTGKKV LKYFSHPPLA
NALEALKQVK KPENFELLAT CAHTFETALY EPKINEKLTL NYFKAAILKA FFGQVSFLNV
SKNSLDLQGH IQEFQKDYIA PAQYDLQFEH ILADAKSSEE IIAFLDKHKD YQPFKVLKRA
WKNKTLEDIQ DYVKNTITKC LLLHDRLAFY NFEEMVFAPI GVAKNNAFNF NWNLEDHQPK
PISSLAKLVL FFAPAGAAIY LKKEGVNEQG EYRTYAGFVQ SDAAFSEILQ KNNHFKQLKQ
QKEPFDRIVS KLVQGITKEA EYVADHLFFL EFSSDYDSKK THLHYYHLPM YLAYYFKNCK
GKLDYIQPYE YREHFVQYVL RGEDPAIVIY SYLRYCIEKG NSHIGPYITV RERNRILQLK
KGVKDMSSTD KRVYAVFRSG QEIRKALEQA AASKTSEQYS ASSNKKVNAI AYRLLNAAKA
GNQKSFMDTL FRLHMSAEKP ISPIFLNALH ERDLDFATVA NAFIAGLLSS GLQEEQEDVV
S