Gene Information |
Locus tag | Cphamn1_1250 |
Symbol | |
ID | 6374927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1355547 |
End bp | 1358099 |
Gene Length | 2553 bp |
Protein Length | 850 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642683748 |
Product | CRISPR-associated protein, Csm1 family |
Protein accession | YP_001959663 |
Protein GI | 189500193 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
|
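The length fields above follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1355547
end_bp = 1358099

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2553
print(protein_length)  # 850
```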
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00529102 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCGATA CAGAAAGAGA CTCGTTTTAC CTTGGCGCAC TGCTGCACGA TATCGGCAAG TTTATACAGC GAGCCGAGAT CGATGAGTGG AAAGAGCGTG CTGACGAATA TGTCAAAAGC GGTGATGCAT CGAAAGGTGA TTATACACAC AAGAGGTATT CAGCGGCTTT TATCAAGAAG TTTCGCAATA AAACGGGGAT TTTTCCGCAA GATGCTGCTG CTGAATCTTA TGCCCTCTGG CATCATCGCG GGAAAGATCG CTCAAAATTT GACAACGAAA GCATCAATAA AAAAGGTGTT CCGCTAAAGC TGATTCATAT TGCCGATATC TCGGCTTCCG AAGAACGTCA GACCGATGCA GAAATTGATT CGCAGGATTA CAACCTTGCA AAACTCGAAT CCATATTCTG CAATGTTGCT CTCAGAAACA GTGATGAACA ACTAAACAGC TTTCCCGAAA AGCGATATAT AAACCTTGGA ACTCTGACGT GCAAATATGA ATCAGCGTTC CCTTTGTCGA AAGAGCCTGC ATATCATGAA CAGAATTACA CAACGCTTGT TAATGAATTT CTTGCGGGTT TTGAAAAATC AATAGGAAAA ACCAAAGAAC TTCAACCGTT CCTTGTCAAG TACCTTCATG CTGTTCCAGC CCAGACTCCT TTTAAAAAAA AGGATCATCA AGAACCATAT AGGCCAGACA TAAATCTTTT CGACCATTCA CGGGTGACAG CGGCCATAGC ACTTTGTTTG TATGACGAGT GGATACAAGG GGATTGGAGA GGAAAGGATA AACAGATTCT GGATGACACG GAAAATGGAT ATAAAAGTTC GGGTTTTTCT GCGCCGTGTC TGCTTGTTTC CGGCGACATA TCGGGCATAC AGGATTTCAT CTTCAACGTT CCGTCGAAAG GTGCGGCGAA AACACTGAAA GCGCGTTCGT TCTTTGTCCA GATGCTTGCC GATGTGTGTG TGCAAAAAAT TCAGGATACG CTTGATCTGA AACCAGCTAA TCTGCTCTAC AACGGCGGAG GGCAGTTTTA TTTCCTCGTG CCGAAATGCA GGGCCCAGGA TTTGAATGAT TGCAAGGAGG ACATTGCCCG TGCTTTGATA GATGAGGAGT TGTTTTTATC AATTGCGAGT GTCGAAGTGA AGGTGTTCGA TTTTATGAAC AATTTCGGCA AAAAATGGAA GGGTGTCAAT GATAAGTTGA GCATCGAGAA GCTCAGAAAG TTCAAGGGGC AAAAGCGGAA AGATGTTTTT GGCCCTTTCA AACAGGTGAT ACGTGAAAAT AATGAGAGAG ACCCGTTTTT CGAAATTACG GACAAGAAGT TCAGGAAGAA TGACTATTAC CGCATCTTAG CGGAGAGTTC AGGTGTCAAG GTGCCAGAAG AAGAAGAGAA GAAATCCTGG GTTGATATTC TCGGAAAGCT TGGATACCGT TTTGAGTTTC AAGTAATGGG ATCGACTGAT GCCGTTGCTT TCAATACCAC AGATTTCGAG GAAAAGTGTG CCGGTTTCAG ATTTTCTGTC AAAGATTTGC CGCATTGGAG GACTGTGGCA TCGATAGAGC AGTTCAAACA GGATGTCGAG AACTGTGGCC GTTCAGTTGA AGAGTATTAT GATAAAGATA AAGACGGTAA GAGATGGGGG TTAAAACCCG ATAACATCAT TACCTATTCA CAGCTTGCCT TCAAGGCGTA TAAGGAAACC GGTACCCACA AGCTCGGTAT CCTGAAAATG GATGTTGATA ATCTGGGACA AATTTTCTCC GATGGATTTC CGGAGGAGAT CAGAACTCCT 
TCACGGATGA TGTCCCTGTC GCGATCCTTG CAGTGGTTTT TCGAAGGCTA CATGAACACC TTGCTTGAAG ACGAGGAGTT CAGGGATTAT TTGTATCCGA TCTTTTCCGG CGGCGACGAT CTGTTCATCG TCGGCGCATG GCACAAAGTG TTTGACATTG CCTTGAGGAT TCAGAAGGAT TTCCGGCAGT TTGTCTGCGA GAATCCTTCG GTAACCCTGT CGGCATCCCT GCTGGTCGTT GACGAGCATT ATCCGGTGTC GCGATTCGCC GTGCTTGCAG AAGAGCGTCT GCACGAAGCC AAATACGGGA GTCTTGACAA GAACTCGGTC AACGTGTTCG GTCAGACGTT GAGCTGGGTG GAGTTCGGGA GAGCCTGCGA GATCAAAGAG AAACTTGTCA GAATGGTTCT GGAGCTGAAG GAGCCGAAAG CGATCATTCA AAAGGTGCTT CAGGGGTGCA AAGGGCTCGA GGTGCTGTGC GATCGTGCCG TCAGACATCG GAATGTGAGT AGCGAGAGGA ACCTTCAGGG CTTGTCGGTA CTGGACCGGG AAAAGCCTGC CGGTGAAAAG GTGTGGCAGA TGGCCTGGTT CCTGAGAGAC ATCGAAAAGG AAGAGTCCCG ACCGATTGCC GAGGAGATTG TCGGAGAGTA CGAACGGGTT GTCTTTGCTG CCATGAAGGG CGAAACAGTG AACCCGATGT ATATTGCTGT TGGAGCCCGC TGGGCTGAAT TTAGCTGTAG AAAATCACTA TAA
|
Protein sequence | MSDTERDSFY LGALLHDIGK FIQRAEIDEW KERADEYVKS GDASKGDYTH KRYSAAFIKK FRNKTGIFPQ DAAAESYALW HHRGKDRSKF DNESINKKGV PLKLIHIADI SASEERQTDA EIDSQDYNLA KLESIFCNVA LRNSDEQLNS FPEKRYINLG TLTCKYESAF PLSKEPAYHE QNYTTLVNEF LAGFEKSIGK TKELQPFLVK YLHAVPAQTP FKKKDHQEPY RPDINLFDHS RVTAAIALCL YDEWIQGDWR GKDKQILDDT ENGYKSSGFS APCLLVSGDI SGIQDFIFNV PSKGAAKTLK ARSFFVQMLA DVCVQKIQDT LDLKPANLLY NGGGQFYFLV PKCRAQDLND CKEDIARALI DEELFLSIAS VEVKVFDFMN NFGKKWKGVN DKLSIEKLRK FKGQKRKDVF GPFKQVIREN NERDPFFEIT DKKFRKNDYY RILAESSGVK VPEEEEKKSW VDILGKLGYR FEFQVMGSTD AVAFNTTDFE EKCAGFRFSV KDLPHWRTVA SIEQFKQDVE NCGRSVEEYY DKDKDGKRWG LKPDNIITYS QLAFKAYKET GTHKLGILKM DVDNLGQIFS DGFPEEIRTP SRMMSLSRSL QWFFEGYMNT LLEDEEFRDY LYPIFSGGDD LFIVGAWHKV FDIALRIQKD FRQFVCENPS VTLSASLLVV DEHYPVSRFA VLAEERLHEA KYGSLDKNSV NVFGQTLSWV EFGRACEIKE KLVRMVLELK EPKAIIQKVL QGCKGLEVLC DRAVRHRNVS SERNLQGLSV LDREKPAGEK VWQMAWFLRD IEKEESRPIA EEIVGEYERV VFAAMKGETV NPMYIAVGAR WAEFSCRKSL
|
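As a rough consistency check between the two sequences, the sketch below translates the first 30 and the last 12 nucleotides of the gene sequence. It assumes the codon assignments of translation table 11, which match the standard genetic code for elongation codons (the tables differ in permitted start codons). The helper name `translate` and the constant names are hypothetical, and only short fragments of the sequence are used here rather than the full 2553 bp.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring spaces; stops appear as '*'."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First three 10-nt groups of the gene sequence listed above.
gene_start = "ATGTCCGATA CAGAAAGAGA CTCGTTTTAC"
# Final 12 nucleotides of the gene sequence (in frame, including the stop).
gene_end = "AAATCACTATAA"

print(translate(gene_start))  # MSDTERDSFY
print(translate(gene_end))    # KSL*
```

The first fragment reproduces the opening ten residues of the protein sequence, and the second reproduces its last three residues followed by the stop codon.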