Gene Information |
Locus tag | Cpha266_2050 |
Symbol | |
ID | 4568734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2371523 |
End bp | 2373811 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 639766631 |
Product | CRISPR-associated Csm1 family protein |
Protein accession | YP_912486 |
Protein GI | 119357842 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
|
|
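The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon; the record does not state either point explicitly. The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2371523
END_BP = 2373811
GENE_LENGTH_BP = 2289
PROTEIN_LENGTH_AA = 762


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

Under these assumptions the listed gene length (2289 bp) equals End bp minus Start bp plus one, and the protein length (762 aa) equals the 763 codons of the CDS minus the stop codon.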
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACGA GTATTACGGG AAATGATATC ATCAAACTTG GTTTTGATCT TCTTCGAGAA GGCACTCTAA GTATTGTGGA TATCGTTGAT GCATTTAAAC TTGCAGGCAT TCAGTCAACT GAACTCAAAC TCTCTTCGCT CAAATCTCCT TTTGGGGGTG ACAGATTTTT TGAACCGAAA CAGCTTTCTG CAACAGAAAT TAACTATCCG GTTTATGACC GAGTTACCGA TTTTTCAGAA ATAATACCTC AAGATGACGA TCAAGGCTTA ACTGCGTTAG AACGTTTGGG ATCGTTTGTG GCAACAGATG CCGATGAAAA ATATTCCGTT TTTGACCTCT TTAAGACTGT TGCCGCCATT CAGGATTGTT TACGTAATCC TGAAGAGGAT AAACCGTTTT TGCTGGTAAG TGCAGATTTT TCCGGTATTC AGGATACGGT TTATACCATT TCCTCAAAAG GTGCTCTCAA GACGCTTCGT GCCCGCTCTT TCATGCTTGA GTTGCTCACC GAGCATATCA TCTACGAAAT TTTATCAGAA GTTGAATCTG AACGATATGC TGTTGTGTTC TCTGGTGGAG GCGGATTTGG CTTGCTTTTG CCAAACAGTG AAGAGTCCAT CCGAACGATC AATGATTACC GTACCCGGAT CAACGAATGG GCATTTGATG AGTTTTCAGG TCGCTTTTTC ATCGCAATGG ACGCATTGCC TTTCGGGAGG GAGGAACTTC TCTCCGGAAT GAGTTTTCGA AATATCCGAC AAACCCAGGC CGACAATCTC GATCGTCTGA AGAGGCGCAA GTTTATCGAT CAGTTTGAGC AACTGTTTAC ACCATCCATG CCAAAACAGT TGACGGTAAA TACCGAATGC CAGATTACAC GAAGGGACGA TATGCCGGAC GAGTTTATGT TTGATCTGGA AACCGAAACC CGTATGAGTT TGGTGCCGGC TAAAGAACGT GAAGATGATA AATGGATATG GGTTTCTGAA AGCTGTTATC ACCAGTTCTG GATTGGTGAC AAGCTTGCAA CCGCACAATC TGTTGTTCGA TCTCAAAAGA CACCGGAAGG AAAGGAACCA TATTTCAAAT ATCCGGATGC AACGTGGAGC AAAGAGAACA AGAGTTGGGT CTATTATCAA ATAGATGATC AACAAACAAA ATCAGCAGAT TGCTGGTTCA TGAATGACTG GACAGCCGGA CAGCCGATTT TGTATGCAAA TTATGTGAGA AAGCATGGCG AGTTGTCAGA ATATGCTAAA AAACTGGAAA ACGAAAGCCT GAAAGAGGAG AATCCTACAG CCAAGCCCAA TCATACAGCA ACATTTCAGG GACTTGCGGC AAGTTCTTGT GGAGCTGATC TTATAGGCGC ATTAAGGATG GATGTTGATG ACATGGGTAA CCTGTTCAGT AGCATAGGGT CACTCACCGA ACTTTCTGCC AAGTCGCGGA TGCTTAATCT CTTCTTTAAA GTTTACCTCA ATCAAATATG CGCTGCAAAT TTAGGTGGCG GATTTTCTTA CACGGACATT GCTAAGAAAA ATTACTCGGT TAAAAACAGC CATGGTGATA TAGGGAGGAA TGTTTCGGTG ATCTATGCTG GAGGCGATGA CCTATTTATT CTCGGTGCTT GGGACGAAAC AACAGAACTC GCCTTTGACA TTCAACGCTG CTTTTCTCTT TTCACAGGAG AAACATTGAA CAAAGAGAAA AAGACAGTTG TTGAAGGACT TGGAATTTCG GGAGGACTTA CCTTGCACCA GCCAAAATTT CCTCTGTACC AGATGGCCCA AAAATCAGGT 
GAAGCAGAAC ACGGGGCAAA AAATGATATA GAAATCCAGA ATGCTGAGAT AGAAAAAAAC AGAATTTCTC TTTTCTTTGA TGATTCAAAA CGTCAATGCA GATTAAAAAT CCAAGAGCCT TATCGTTATA TGCTTTCAAT GAAATGGGAT TTGAGCAGTG CTTTTCTACT TCCCTTGATG AAAACATATC GCGAATGCGG AAATGTTGCA TTCCAAGATG GACGAATGGT CTTGGAAATT GAAAAATTCA GCTATCAGAC CATTGAAAAG TGGTTTGCTG TCATTGAAAA ATATCAGGAA AGCTCCATGC TCTATCTGCC AACGATGGCA AGAGTGATGA AGCAGGTTGA AGAGAATCCA AGGATGGATG CTTCGTTGTT CAAAACATTG CTTGGCTTCC TTTATACAAA TGATGAAAGT AAGAAAAACT GGATTTCACA TTTACATGTA GCTCTGAATT GGCTCACATA CCTAAGGAGG AAAAATTAA
|
Protein sequence | MTTSITGNDI IKLGFDLLRE GTLSIVDIVD AFKLAGIQST ELKLSSLKSP FGGDRFFEPK QLSATEINYP VYDRVTDFSE IIPQDDDQGL TALERLGSFV ATDADEKYSV FDLFKTVAAI QDCLRNPEED KPFLLVSADF SGIQDTVYTI SSKGALKTLR ARSFMLELLT EHIIYEILSE VESERYAVVF SGGGGFGLLL PNSEESIRTI NDYRTRINEW AFDEFSGRFF IAMDALPFGR EELLSGMSFR NIRQTQADNL DRLKRRKFID QFEQLFTPSM PKQLTVNTEC QITRRDDMPD EFMFDLETET RMSLVPAKER EDDKWIWVSE SCYHQFWIGD KLATAQSVVR SQKTPEGKEP YFKYPDATWS KENKSWVYYQ IDDQQTKSAD CWFMNDWTAG QPILYANYVR KHGELSEYAK KLENESLKEE NPTAKPNHTA TFQGLAASSC GADLIGALRM DVDDMGNLFS SIGSLTELSA KSRMLNLFFK VYLNQICAAN LGGGFSYTDI AKKNYSVKNS HGDIGRNVSV IYAGGDDLFI LGAWDETTEL AFDIQRCFSL FTGETLNKEK KTVVEGLGIS GGLTLHQPKF PLYQMAQKSG EAEHGAKNDI EIQNAEIEKN RISLFFDDSK RQCRLKIQEP YRYMLSMKWD LSSAFLLPLM KTYRECGNVA FQDGRMVLEI EKFSYQTIEK WFAVIEKYQE SSMLYLPTMA RVMKQVEENP RMDASLFKTL LGFLYTNDES KKNWISHLHV ALNWLTYLRR KN
|
| |