Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1984 |
Symbol | |
ID | 5539462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2542652 |
End bp | 2543671 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640894119 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001432090 |
Protein GI | 156741961 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.424178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCC TCTACGTGAC CGAACAGGGC AGCGAGATCG GCTGCGACGG TGAACGGTTG GCAGTGCGGC GCGACAACGC CATCATCGCC AGCATACCGC TGATCAAGAT CGAGGACATC GTCATTATCG GCAACGTCGG ACTGAGCACC CCCGCCATCA AGCGCATGCT CGACAACGGC ATCAACGTCA CGTTTCTGAC GGTACACGGG CGCTATCAGG GGCGATTGGT CGGCAGCGTC AGCGCCCATG CCGCGCTGCG CGCCGCACAA TATCGGCGCG CCGACGACCG CGCCTGGTCG CTGCGCCTGG CGCAGCGGTT CGTCGAAGGC AAACTGCGCA ACTGCCGCGC CCTGCTGCGC CGTTTTGCGC GCAACCGCGC CGACGCGCCA GCGGAAGCCG GGCAAGCCGC CGACGACCTC GACCGCTTCA TTGATCGCGT GCCGCGCACC ACCACCCTCA ACGCACTTAT GGGAGTGGAA GGCAGCGCCA CTGCGCGCTA CTTCGCCGGC GTCCGCGCCC TGATCGGAGC GGAATGGCGC TTCGAGGCGC GCATCCGGCG TCCACCGCCG GATCGTGTGA ACGCCCTGCT CTCGTTTGGC TACACCCTGC TGGTTCACAA AATGCTCGGC GCCGTTGAAG CAGCCGGCTT CGACCCGTAT CTCGGCTACC TCCACCACAT CGACTACGGA CGACCATCAC TGGCGCTCGA CCTGATCGAA GAGTTCCGCC CTATCCTGGT CGACTCGCTA GTAATCCGTT GTTGCAACGA CGGGCGCATC GCGTTCGACG ACTTCACCGA AACGCCGGAT GGCGACTATC CGGTCTTGCT GAGTGATGAC GGCAAACGGC GCTTTGTCGC CGCATTCGAG GAACGCATGC GCACTGAAGC CACCCATCCC GACGGCGCCG ACGGACGACC GGGGAAAGTC AGTTACCTGC GCTGTCTGGC GCTTCAGGCG CGGCGTCTGG CGCGAGCGGT GCAGGGAGGA GCGGACTACG AACCGTTTGC TGTGCGATAA
|
Protein sequence | MDTLYVTEQG SEIGCDGERL AVRRDNAIIA SIPLIKIEDI VIIGNVGLST PAIKRMLDNG INVTFLTVHG RYQGRLVGSV SAHAALRAAQ YRRADDRAWS LRLAQRFVEG KLRNCRALLR RFARNRADAP AEAGQAADDL DRFIDRVPRT TTLNALMGVE GSATARYFAG VRALIGAEWR FEARIRRPPP DRVNALLSFG YTLLVHKMLG AVEAAGFDPY LGYLHHIDYG RPSLALDLIE EFRPILVDSL VIRCCNDGRI AFDDFTETPD GDYPVLLSDD GKRRFVAAFE ERMRTEATHP DGADGRPGKV SYLRCLALQA RRLARAVQGG ADYEPFAVR
|
| |