Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0833 |
Symbol | |
ID | 3834407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 987840 |
End bp | 988874 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637824921 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_425921 |
Protein GI | 83592169 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0219387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC TGTTGAACAC CGTTTACGTC ACCACCGAGG GAACGGGGTT GCGCAAGGAC GGTGAAAATC TGGTCGCCGA GCTTGACGGC GTGCAAAAGG GCCGGGTTCC CTTGCATATG GTGGGCTCTG TGGTGGTTTT TGGCGGCACC TATGTTTCGC CCGGGCTAAT GGGGGCTTGC GCCGCCCATG GCATCACCAT CGTGCTGTTG GACCGGGTTG GTCGTTTTCA GGCGCGGGTC GAGGGGCCGG TCGCCGGCAA TGTGCTGCTG CGCCGCGCCC AATACAAAGC CTCCGAGGCG CCCGAGGATA TCGTCAAAAG CCTGATCTTG GGCAAAGTCT CCAATCAGCG GGCGGTTCTG CTGCGGGCGC TGCGCGATCA CGGTGCCGAT TTTCCGGCGG CCGAAGCCTT GGCGGTGAAA GACGCCATCG ATCGCATGGC CCATATCCTG CGCAAGGTCG GCGCCTCGGC CGAGGATGCC GACCACCTGC GCGGCGCCGA AGGCGAGGCG GCCAGCCTTT ATTTCGGGGT TTTTGGCCAG TTGATCCGCT CTCCCGATGG CGATTTCGCC TTTCGTGGTC GCTCGCGCCG TCCGCCGCTC GATCCGACGA ATGCCCTGCT GTCGTTTCTA TATACCCTGT TGACCCACGA TTGCCGCAGT GCTTGCGAAA GTGTCGGACT TGATCCCGCC GTGGGATTTC TGCACCGCGA TCGCCCGGGC CGCCCGAGCC TCGCCCTTGA TCTGATGGAA GAATTGCGGC CGGTTCTGGT TGATCGCTTG GCGTTGTCCT TGATCAATCG CCGTCAGCTT CGGGCGACGG ATTTCCAGCG CCTGGACGGC GGCGCCGTTC TTTTGACCGA CGAGGCGCGC AAGACGGTGC TAAGCGCTTG GCAGGAGCGC AAGAAACAGG AACGGCGCCA TCCCTTTCTT GAGGAAAGCG CCCCGCTTGG CCTTGTGCCC TATCTTCAGG CCCAGATGCT TGCCCGCCAC CTGCGCGGGG ATCTTGACGC CTATCCGCCG TGGTTCTGGA AGTAG
|
Protein sequence | MKKLLNTVYV TTEGTGLRKD GENLVAELDG VQKGRVPLHM VGSVVVFGGT YVSPGLMGAC AAHGITIVLL DRVGRFQARV EGPVAGNVLL RRAQYKASEA PEDIVKSLIL GKVSNQRAVL LRALRDHGAD FPAAEALAVK DAIDRMAHIL RKVGASAEDA DHLRGAEGEA ASLYFGVFGQ LIRSPDGDFA FRGRSRRPPL DPTNALLSFL YTLLTHDCRS ACESVGLDPA VGFLHRDRPG RPSLALDLME ELRPVLVDRL ALSLINRRQL RATDFQRLDG GAVLLTDEAR KTVLSAWQER KKQERRHPFL EESAPLGLVP YLQAQMLARH LRGDLDAYPP WFWK
|
| |