Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2061 |
Symbol | |
ID | 4569494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2392804 |
End bp | 2393847 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639766642 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_912497 |
Protein GI | 119357853 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTGAGG CTTCTCTTTT TCTTCGCACG CTCTATATCC AGGAGCAGGG TTCGATGCTC AGGGTTGAGA ACGGGTGTTT TCGCGTGACG TGCGGGCACG ATGACGATGT TGCCGAACTG CTTGAGGTGC AGTCAATCAA GGTTGGGCAG ATTGTGCTGT TCGGCGCCTG TATGATTACT CCGGCTGCCA TCCGGCATTG CCTCATGAAC CGCATTCCCG TTGTGCTGCT TTCGCAGCAT GGCGAGTATT TCACCCGTCT TGAATCGACC GATGATGTCA ATATCGATCT TGAGCGGTTA CAGTTTCAGC GATCTGCTGA AGAGTCGTTT CCGCTGGAGT GTTCGCGAAC TATCGTGCGG GCGAAGCTGC ATAATTCCGG GGTTTTGCTC AGGCGTCATG CTGAGTCATC GGGTTCAGAG GCGCTCCGGC ACGCTGCTAC GCAACTGCGG CAACTGGAGG AGCATGTTGA TCGTGCCGAT TCGATCGATG CGGTGCGGGG CTACGAAGGG AGCGGTGCGG CGACATATTT CGGCGTGTTC GAGGATTTTT TTGATACCGG GGGGTTCATC TTCAGAGAGC GGGTCAAGCG TCCGCCGACC GATCCGGTCA ATGCGATGCT GAGTTTCGGG TACAGTCTGC TTTTCAACAA CATTTTTTCG ATGGCAAGAT TGCATCGGCT GCACCCTTAC GTCGGGTTTC TGCATGCCGA CAAGCCCGCT CATCCTGCAC TTGTGAGCGA TCTGATCGAG GAGTTCCGCA CGCTTGTTGA CGGTCTCGTG ATCGCGCTTA TCAACAAGCG GCTCATCAGC CCGGAGGAGT TTACCGTTGC GCGGCATGAT GACGGAAAAC CCAAAGGGTG CTACCTCTCG GATGGAGCGC GCAAAACTTT TCTTCGCGAG TTCGAAAACC TCATGCACCG GACAACGACC CACCCGGCAA CGGGCTATGA GGTAACCTGC AGGAGGTGTC TTGACTTGCA GGTGGGGGAG TTTGCCCTTT ATCTCAAAGG GGAGAAACCG TATACGCCAT ATCTGAGGAG GTGA
|
Protein sequence | MSEASLFLRT LYIQEQGSML RVENGCFRVT CGHDDDVAEL LEVQSIKVGQ IVLFGACMIT PAAIRHCLMN RIPVVLLSQH GEYFTRLEST DDVNIDLERL QFQRSAEESF PLECSRTIVR AKLHNSGVLL RRHAESSGSE ALRHAATQLR QLEEHVDRAD SIDAVRGYEG SGAATYFGVF EDFFDTGGFI FRERVKRPPT DPVNAMLSFG YSLLFNNIFS MARLHRLHPY VGFLHADKPA HPALVSDLIE EFRTLVDGLV IALINKRLIS PEEFTVARHD DGKPKGCYLS DGARKTFLRE FENLMHRTTT HPATGYEVTC RRCLDLQVGE FALYLKGEKP YTPYLRR
|
| |