Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0305 |
Symbol | |
ID | 7977424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 350424 |
End bp | 352145 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644797298 |
Product | CRISPR-associated protein, TM1802 family |
Protein accession | YP_002948498 |
Protein GI | 239825874 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02556] CRISPR-associated protein, TM1802 family [TIGR02591] CRISPR-associated protein, Csh1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGATAG AGGCAATAGC ACAAATAGGA CGTATAGAGT TGGGAGGAAA GGATCAAGCA GCTATCGATC AGTTCATTGA AAATCCAGGC TATCCTAATT GCATACATAT TCTCCTTCAA AGAGGGGACA ACGGTGCATT TGTTTGGGAA GACTGCGAAG TAGAGGAGCA AAAATCTGAT TTTCGTAAAT ATTTATTTCG CAGCGGCTCT TCACGTGGGG TGAATTACTC GCCTACAGCC AAAATCACAA CAGTAGAGAA TACGTACCAG CAAAAAGTGT TAGGATGGTT TAAGAAGATT TGCAAAATCA ATGGGAGTGA CTTTTTCCGA GGTATTCAAG AAACATTAGA AAAAGAAAAA GATCGCATTT TACAAAAATT GCAAAGTAAG TTAGCGCTTT CAAACGAAAG GACTATTCTT TCTTTAAAAG TGGATGGGTC CTATCTCTAT GATATTCCTG AGTTTCGAGA TGCTTTTATG TTTTTGATTA ATGAAAAAGA TTTGGAATTA TCCGCCTCTA ATCAAGTTTG TTCCATTTGT GGACAGCGAA AAGATACTGT TATTGGCAAA ATGTCGGTGT TCCGTTTTTA TACGCTTGAT AAGCCGGGAT TTATCACCGG AGGATTGGAT GAAGAAAAAG CGTGGAGAAA TTATCCGGTT TGTCTCTCTT GTAAGTCATA TATTGAAGAA GGAAGAAGAT TTATTGAGGA GAATCTCCGC TTCCAATTTT ATGGATTTTC CTATTTGCTT ATTCCTAAGC TGATTGTTCC TGTTGCAGAG GATAATGATA GTTTTGCTGA GGTGTTGGAA TATTTATCTT CCCATAAAAA AGAAGTGCGG ATTCATCAGG AGCAAGTTGA ATCGTTTATG ACAACGGAAG AAGATGTCAT GGACATCTTG AAGGATATGA AAAATGTCGT TTCTATCCAG CTATTATTTT TGCAAAAAAT CCAAAGTGCG GAGCGAATTT TGCTTTTGAT CGAGGATGTG TTGCCTTCAA GAATCAGTGA GCTGTTTAAA GCCAAGTCCT ATGTCGAACA ACTTCTATAT GCGGAAGGAA ATCATCCATT TCATTTTGGA TTTATCCGGA CTTTCTTCCA TAACTCGTAT GACAAATATT TTTTGCACAT CGTCGAGTCG GTATTCAAAA AAAGGCCGCT TTCTATTTCT TTCTTAACAA AATTCATTAT GGAACAGTTG CGTAAAGAAT TACCAGCCTA TGAATCAGGA GAGAATTCCT TCTTTTATAA AACGAGACAG GCTGTTGCAG TGATTTTATT TTTAGAGCAG GCAGGTGTAT TGCCAGTCAA AGGAGGAGCT GCCATGAGCG AAAAATTTGA TGTGTTGTTT GAAAAGTACG GGAAGCAGCT AGATACTCCG GAAAAGAAAG CTGTTTTTCT GCTAGGTTCA TTGACGCAAA TGCTTCTGGA CATTCAACAA AACAAGCGGG GCAGCAAGCC TTTTCTAAAG CAATTAATGG GGCTGAAAAT GGATGAGCGG ACCGTAAAAG GGCTGCTTCC GAAAGTAATC AATAAGTTGG AAGAGTATGA ATCCTATCAT TTTATCCATA AGCAATTAGC TGAAGAAATT TCTACTTTAT TTTTGCTTTC ATCCCCGCGG TGGAAAATGT CAGTAGATGA ATTGAATTTT TATTTTGCTT GCGGCATGAA TCTTGTGTCC AAAGTGAAAG AGTCATTAAT AATAGCGAAG GAGGAAGCAT AA
|
Protein sequence | MLIEAIAQIG RIELGGKDQA AIDQFIENPG YPNCIHILLQ RGDNGAFVWE DCEVEEQKSD FRKYLFRSGS SRGVNYSPTA KITTVENTYQ QKVLGWFKKI CKINGSDFFR GIQETLEKEK DRILQKLQSK LALSNERTIL SLKVDGSYLY DIPEFRDAFM FLINEKDLEL SASNQVCSIC GQRKDTVIGK MSVFRFYTLD KPGFITGGLD EEKAWRNYPV CLSCKSYIEE GRRFIEENLR FQFYGFSYLL IPKLIVPVAE DNDSFAEVLE YLSSHKKEVR IHQEQVESFM TTEEDVMDIL KDMKNVVSIQ LLFLQKIQSA ERILLLIEDV LPSRISELFK AKSYVEQLLY AEGNHPFHFG FIRTFFHNSY DKYFLHIVES VFKKRPLSIS FLTKFIMEQL RKELPAYESG ENSFFYKTRQ AVAVILFLEQ AGVLPVKGGA AMSEKFDVLF EKYGKQLDTP EKKAVFLLGS LTQMLLDIQQ NKRGSKPFLK QLMGLKMDER TVKGLLPKVI NKLEEYESYH FIHKQLAEEI STLFLLSSPR WKMSVDELNF YFACGMNLVS KVKESLIIAK EEA
|
| |