Gene GWCH70_0305 details

Gene Information

Locus tag: GWCH70_0305
Symbol:
ID: 7977424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 350424
End bp: 352145
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 38%
IMG OID: 644797298
Product: CRISPR-associated protein, TM1802 family
Protein accession: YP_002948498
Protein GI: 239825874
COG category:
COG ID:
TIGRFAM ID: [TIGR02556] CRISPR-associated protein, TM1802 family
            [TIGR02591] CRISPR-associated protein, Csh1 family
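
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(350424, 352145)
print(length)                  # 1722
print(protein_length(length))  # 573
```

Both results agree with the Gene Length and Protein Length fields of the record.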


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGATAG AGGCAATAGC ACAAATAGGA CGTATAGAGT TGGGAGGAAA GGATCAAGCA 
GCTATCGATC AGTTCATTGA AAATCCAGGC TATCCTAATT GCATACATAT TCTCCTTCAA
AGAGGGGACA ACGGTGCATT TGTTTGGGAA GACTGCGAAG TAGAGGAGCA AAAATCTGAT
TTTCGTAAAT ATTTATTTCG CAGCGGCTCT TCACGTGGGG TGAATTACTC GCCTACAGCC
AAAATCACAA CAGTAGAGAA TACGTACCAG CAAAAAGTGT TAGGATGGTT TAAGAAGATT
TGCAAAATCA ATGGGAGTGA CTTTTTCCGA GGTATTCAAG AAACATTAGA AAAAGAAAAA
GATCGCATTT TACAAAAATT GCAAAGTAAG TTAGCGCTTT CAAACGAAAG GACTATTCTT
TCTTTAAAAG TGGATGGGTC CTATCTCTAT GATATTCCTG AGTTTCGAGA TGCTTTTATG
TTTTTGATTA ATGAAAAAGA TTTGGAATTA TCCGCCTCTA ATCAAGTTTG TTCCATTTGT
GGACAGCGAA AAGATACTGT TATTGGCAAA ATGTCGGTGT TCCGTTTTTA TACGCTTGAT
AAGCCGGGAT TTATCACCGG AGGATTGGAT GAAGAAAAAG CGTGGAGAAA TTATCCGGTT
TGTCTCTCTT GTAAGTCATA TATTGAAGAA GGAAGAAGAT TTATTGAGGA GAATCTCCGC
TTCCAATTTT ATGGATTTTC CTATTTGCTT ATTCCTAAGC TGATTGTTCC TGTTGCAGAG
GATAATGATA GTTTTGCTGA GGTGTTGGAA TATTTATCTT CCCATAAAAA AGAAGTGCGG
ATTCATCAGG AGCAAGTTGA ATCGTTTATG ACAACGGAAG AAGATGTCAT GGACATCTTG
AAGGATATGA AAAATGTCGT TTCTATCCAG CTATTATTTT TGCAAAAAAT CCAAAGTGCG
GAGCGAATTT TGCTTTTGAT CGAGGATGTG TTGCCTTCAA GAATCAGTGA GCTGTTTAAA
GCCAAGTCCT ATGTCGAACA ACTTCTATAT GCGGAAGGAA ATCATCCATT TCATTTTGGA
TTTATCCGGA CTTTCTTCCA TAACTCGTAT GACAAATATT TTTTGCACAT CGTCGAGTCG
GTATTCAAAA AAAGGCCGCT TTCTATTTCT TTCTTAACAA AATTCATTAT GGAACAGTTG
CGTAAAGAAT TACCAGCCTA TGAATCAGGA GAGAATTCCT TCTTTTATAA AACGAGACAG
GCTGTTGCAG TGATTTTATT TTTAGAGCAG GCAGGTGTAT TGCCAGTCAA AGGAGGAGCT
GCCATGAGCG AAAAATTTGA TGTGTTGTTT GAAAAGTACG GGAAGCAGCT AGATACTCCG
GAAAAGAAAG CTGTTTTTCT GCTAGGTTCA TTGACGCAAA TGCTTCTGGA CATTCAACAA
AACAAGCGGG GCAGCAAGCC TTTTCTAAAG CAATTAATGG GGCTGAAAAT GGATGAGCGG
ACCGTAAAAG GGCTGCTTCC GAAAGTAATC AATAAGTTGG AAGAGTATGA ATCCTATCAT
TTTATCCATA AGCAATTAGC TGAAGAAATT TCTACTTTAT TTTTGCTTTC ATCCCCGCGG
TGGAAAATGT CAGTAGATGA ATTGAATTTT TATTTTGCTT GCGGCATGAA TCTTGTGTCC
AAAGTGAAAG AGTCATTAAT AATAGCGAAG GAGGAAGCAT AA
 
Protein sequence
MLIEAIAQIG RIELGGKDQA AIDQFIENPG YPNCIHILLQ RGDNGAFVWE DCEVEEQKSD 
FRKYLFRSGS SRGVNYSPTA KITTVENTYQ QKVLGWFKKI CKINGSDFFR GIQETLEKEK
DRILQKLQSK LALSNERTIL SLKVDGSYLY DIPEFRDAFM FLINEKDLEL SASNQVCSIC
GQRKDTVIGK MSVFRFYTLD KPGFITGGLD EEKAWRNYPV CLSCKSYIEE GRRFIEENLR
FQFYGFSYLL IPKLIVPVAE DNDSFAEVLE YLSSHKKEVR IHQEQVESFM TTEEDVMDIL
KDMKNVVSIQ LLFLQKIQSA ERILLLIEDV LPSRISELFK AKSYVEQLLY AEGNHPFHFG
FIRTFFHNSY DKYFLHIVES VFKKRPLSIS FLTKFIMEQL RKELPAYESG ENSFFYKTRQ
AVAVILFLEQ AGVLPVKGGA AMSEKFDVLF EKYGKQLDTP EKKAVFLLGS LTQMLLDIQQ
NKRGSKPFLK QLMGLKMDER TVKGLLPKVI NKLEEYESYH FIHKQLAEEI STLFLLSSPR
WKMSVDELNF YFACGMNLVS KVKESLIIAK EEA
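
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a rough sketch, not the tool that generated the record: the `translate` helper is hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which this sketch does not handle; the gene here starts with ATG, so that does not matter for this record).

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same codon-to-amino-acid
# assignments as the standard code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon.

    Hypothetical helper: alternative start codons are not treated specially.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGTTGATAG AGGCAATAGC ACAAATAGGA CGTATAGAGT TGGGAGGAAA GGATCAAGCA"
last_line = "AAAGTGAAAG AGTCATTAAT AATAGCGAAG GAGGAAGCAT AA"

print(translate(first_line))  # MLIEAIAQIGRIELGGKDQA
print(translate(last_line))   # KVKESLIIAKEEA
```

Each full 60-nucleotide line of the gene sequence holds exactly 20 codons, so every line begins in frame and can be translated on its own, as done here for the first and last lines.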