Gene Information |
Locus tag | Rcas_3303 |
Symbol | |
ID | 5540801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 4295005 |
End bp | 4296021 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640895421 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001433372 |
Protein GI | 156743243 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
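The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(4295005, 4296021)
print(length, protein_length(length))  # prints "1017 338"
```

The result matches the Gene Length (1017 bp) and Protein Length (338 aa) rows, which supports the inclusive-coordinate assumption for this record.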
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.597515 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.540685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCTCA TCGTAGACCA ATTCGGCGTA TTTATCTCAA AGCACCAGGG GCGCATTCGC GTCATGAAAG AGAAGGAACG TCTATCGGAA GTGCCGATCA TGCACCTGGA ACAAATCCTC ATTTGCAGCG ACGGTGTCGG TCTCAGCAGC GACGTGGTGC GTGCCTGCGC CGAAGAAGGG ATTCCGATCC ACTTCCTCAA CAGCGCCAAC GGCGGCGACT ATGGCACGTT CGTGCATAGC GGCATCACCG GCATGGCGCT CACCCGCCGT GCTCAATTGC GAGCCGGTGA CGACGAGCGC GGATTGCGTC TGGCGCAGGC GTTCGCCAGC GGCAAAATCC AGAGCCAGGC GAACATGCTG CGCTACGCCG CCAAAAATCG CAAAGAGAAC GACCCGGACC TGCACAACGA CCTGATGCGC ACCGCCACCG AAATCCTCGA CGCCCTCCCG CCGCTGCGCG CCGTGCGCGG CGTCCTTACC GACGAAACCC GCGCTGCGCT GATGGGCTTC GAGGGAATGG CCGGCGCGCG CTACTGGACA GCCGTGGCGC GCATCATTCC CGACGACCTC GGTTGGCCCG GACGCGAAAC GCGCGGTGCG CGCGACCGCT TCAATCAGGC GCTCAACTAC GGCTACGGCG TTCTCCAGTC TCAGGTGCGC ACTGCCCTGA TCCTTGCCGG GTTGGACCCC AATGCCGGGT TCCTCCACGC CGACCGACCG GGCAAGCCGA GCCTGACCCT TGACCTGATC GAAGAGTTTC GCCAGGCAGT CGTCGACCGC ACCCTCATCG GGCTGGTCAA CCGCCAGTTC GAGATTGTCC AGCGCGACGA CGGACTGCTC GACGAGGACA CCCGCAAACG CATCGCCGAG AAAATCCTCG AACGCCTGAA CAGCACTGAG CTCTACGAAG GCAAGCGTCA GCCGCTCCGT CACATTCTCC AATGCCAGGC GCGCCACATC GCCACCTTTG TGCGCGGCGA ACGTCCCACC TATGAACCGT TTGTGATGGG GTGGTGA
Protein sequence | MHLIVDQFGV FISKHQGRIR VMKEKERLSE VPIMHLEQIL ICSDGVGLSS DVVRACAEEG IPIHFLNSAN GGDYGTFVHS GITGMALTRR AQLRAGDDER GLRLAQAFAS GKIQSQANML RYAAKNRKEN DPDLHNDLMR TATEILDALP PLRAVRGVLT DETRAALMGF EGMAGARYWT AVARIIPDDL GWPGRETRGA RDRFNQALNY GYGVLQSQVR TALILAGLDP NAGFLHADRP GKPSLTLDLI EEFRQAVVDR TLIGLVNRQF EIVQRDDGLL DEDTRKRIAE KILERLNSTE LYEGKRQPLR HILQCQARHI ATFVRGERPT YEPFVMGW
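The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be stripped before any computation. The following is a minimal sketch of how the GC content and the translation could be reproduced. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code and that the gene starts with ATG (as this one does), so alternative start codons are not handled. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
fragment = "ATGCACCTCA TCGTAGACCA ATTCGGCGTA"
print(translate(fragment))    # prints "MHLIVDQFGV"
print(gc_fraction(fragment))  # prints 0.5
```

The translated fragment matches the first block of the protein sequence. The 0.5 GC fraction applies only to this 30-base fragment; the 63% figure in the record refers to the full gene, which can be checked by passing the complete gene sequence to `gc_fraction`.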