Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2676 |
Symbol | |
ID | 5540158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3458387 |
End bp | 3459385 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640894798 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001432765 |
Protein GI | 156742636 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCC TTTACATCCA GGAACAGGGC GTGATGGTGC GCAAACGCGA TAATCAGGTG CTGGTCACCA GAGACGGTCA GACGCTCCAC GATGTTCCGC TGGCGAAGAT TGACCAGGTG GTACTGATGG GGCGTGGTGT GCAGATCTCG ACAGCGCTGC TCATCGACCT GCTCGAACGC GGCATTCCGG TCACGCTCAC CAATCAGCAC GGTAGTCGCC ACTACGCCAC GCTCACGGCA GGACCGTCGC GGTTCGGCGA TCTGCGCACC GGGCAGATGC AGTACGTCAA CACTCCTTCA CGCGCGCTGG AACTGGCGCG CGCGATTGTG ATCGTCAAGC TGACGAATCA GCGCCGCCTC CTGGCGACGA CCGGCTGGCC CGCTGCGGCA TCCGCCATGC AGCAGATCGA CGCAGCGCTG ACAGCGGCGT CTCAGGCGCA GAATGTGGAC ATATTGCGCG GGCATGAAGG CGCTGCCGCC GCCGCCTACT TCGGCGCCTG GCGCGCATCG CTGCCGCCTG CCTGGGGATT CGGCGGGCGC GCCTTCTACC CGCCGCCCGA CCCGATCAAT GCCATGCTGT CGTTCGGCTA CACCCTGGCG CTCCATGATG TCATCACCGC CGTGCAGATC ACGGGTCTCG ACCCTTACCT GGGCACATTC CACGTCATCG AAACCGGTCG CCCATCACTG GCGCTCGATC TGCTGGAAGA GTTCCGCCCG GTGATCGTCG ACCGCATGGT GCTCGACATC GTGCGCACCA ATGCCATCGG ACGCGAGCGC TTTCACCGTC CGCAGGAACG ACCCGAAGCG GTCTACCTCG ATGCTGAAGG GCGCGCCTTC CTTGTGCAGC GGTACGAAAC GCTTCTCCAG ACGAAGGTGC GGTTGCCCGG CGGCGAGCAG ACGCCGATGC GCCGGGTCAT CCTGCTGCAG GCGCAGGCGA TCGCGCGCGT GCTGCGCGGC GAACAGGAGC GATATACCGG ATTCAGTCTC AATTCTTGA
|
Protein sequence | MPTLYIQEQG VMVRKRDNQV LVTRDGQTLH DVPLAKIDQV VLMGRGVQIS TALLIDLLER GIPVTLTNQH GSRHYATLTA GPSRFGDLRT GQMQYVNTPS RALELARAIV IVKLTNQRRL LATTGWPAAA SAMQQIDAAL TAASQAQNVD ILRGHEGAAA AAYFGAWRAS LPPAWGFGGR AFYPPPDPIN AMLSFGYTLA LHDVITAVQI TGLDPYLGTF HVIETGRPSL ALDLLEEFRP VIVDRMVLDI VRTNAIGRER FHRPQERPEA VYLDAEGRAF LVQRYETLLQ TKVRLPGGEQ TPMRRVILLQ AQAIARVLRG EQERYTGFSL NS
|
| |