Gene Information |
Locus tag | Rru_A0347 |
Symbol | |
ID | 3834634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 419071 |
End bp | 419952 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637824430 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_425439 |
Protein GI | 83591687 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
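The start, end, gene length, and protein length fields in this record should agree with one another, and a short calculation can confirm that. The sketch below is illustrative only: the constant and function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 419071
END_BP = 419952
GENE_LENGTH_BP = 882
PROTEIN_LENGTH_AA = 293


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks print True when the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 419952 - 419071 + 1 gives 882 bp, and 882 / 3 gives 294 codons, one of which is the stop codon, leaving 293 amino acids.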
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAGCG GAAGGCTGGG ACTTGAAAAA GCGCGCATTC CCCATGCCGA CCGCCATGGG CTGGTCTGGC TTGATCGCGG CCGTCTGGAG GTCGAAGACG GCTGTCTGCG CTTTGTGACC GCCGGTGGCG GTGAACTGGC GGCGGGGGAT TATCAGATCC CCCATCAGGC GGTATCGATC ATCCTGCTGG GGCCGGGGTC AAGCGTTACC CATGATGCGC TGCGGCTGCT GGCCCGCCAT GGCTGCGCCT TGGCCGCCAT TGGCCAGGGG GCCGTACGGT TTTATACCGC GCCGCCCCTG ATGCCCGACA GTTCCGCCCT GGCGCGGGCG CAGGTAACGC TTTGGGCCGA TCCGAAGACC CGCATGGAGG TGGCGCGCGC CATGTATGCC ATGCGCTTTG GCGAGATCGT GCGAACGCGC GACATCGAGG TTCTGCGCGG GCAGGAAGGC GCCCGCATCA AGCGAAGCTA CCAGCTGGCG GCGGAACGCT ATGGCATACC CTGGCGGGGG CGAAGCTATG ATCGGGCCGA TCCCGAGGCC GGGGATGAGG CCAATCAGGC GATCAACCAC GCCGCAACGG CCATGACCGC CGCCGCCTCG GTCGCCGTGG CGGCGGTGGG GGCGATCCCC CAACTGGGGT TCGTCCACGA GGATTCCGGC CAGTCCTTCG TTCTCGATAT CGCCGATCTT TATCGCCACG ATATCACCCT GGATATCGCC TTTGGCGCCG TTAAGGAGGC GGAAAAATCC GGTGATCCCC TTGAGCGACT CACCCGCCAG CGGGCGGCCA AGCTGTTTCG CCAACGCGGC GTTATTCCCT CGATGATCGA CAGGATCAAA ACGCTTCTGG GTCTTGGGGC CGAGACCGAG GAGATCGCTT GA
|
Protein sequence | MLSGRLGLEK ARIPHADRHG LVWLDRGRLE VEDGCLRFVT AGGGELAAGD YQIPHQAVSI ILLGPGSSVT HDALRLLARH GCALAAIGQG AVRFYTAPPL MPDSSALARA QVTLWADPKT RMEVARAMYA MRFGEIVRTR DIEVLRGQEG ARIKRSYQLA AERYGIPWRG RSYDRADPEA GDEANQAINH AATAMTAAAS VAVAAVGAIP QLGFVHEDSG QSFVLDIADL YRHDITLDIA FGAVKEAEKS GDPLERLTRQ RAAKLFRQRG VIPSMIDRIK TLLGLGAETE EIA
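The gene and protein sequences above are printed in space-separated blocks of ten, so they need the whitespace stripped before any analysis. The following is a minimal sketch, not the database's own tooling: the function names are hypothetical, the codon table is built from the amino acid assignments of NCBI translation table 11, and alternative start codons are not handled (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    first_blocks = "ATGCTGAGCG GAAGGCTGGG ACTTGAAAAA"
    print(translate(first_blocks))
    print(gc_fraction(first_blocks))
```

Translating the first three blocks of the gene sequence gives MLSGRLGLEK, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked the same way.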