Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0169 |
Symbol | |
ID | 3834147 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 200286 |
End bp | 201227 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637824247 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_425261 |
Protein GI | 83591509 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATC CCGCCTTCGT CCCCTTGCGG CCCATCGCCA TCAAGGACCG CTCGTCGATC GTTTTTCTTC AGCGCGGCCA ACTCGACGTA GTGGACGGCG CCTTCGTTCT GATCGATCAA GAGGGGGTGC GCGTGCAGAT CCCCGTGGGC GGGCTGGCCT GCCTGATGCT GGAGCCGGGA ACGCGCATCA CCCATGCCGC CATCGTTCTC TGCGCGCGGG TGGGATGTCT GGTGATCTGG GTCGGCGAAC GCGGGACCCG TCTTTACGCC GCCGGGCAGC CCGGCGGCGC CAGGGCTGAC AGATTATTGT TCCAGGCGCG CAACGCCCTT GATGAAACCG CCCGTCTGAA TGTCGTTCGC GAAATGTATC GGCGCCGCTT TGACGACGAC CCGCCCGCCC GCCGGTCGGT GGACCAATTG CGCGGCATGG AGGGCGTGCG GGTGCGCGAG ATCTATCGCC TGCTCGCCAA AAAATACGCT GTGGACTGGA ACGCCCGGCG CTACGATCAC AACGATTGGG ATGGCGCCGA TATCCCCAAC CGCTGTCTGT CGGCGGCCAC CGCCTGTCTT TACGGGTTGT GCGAAGCGGC CATTCTGGCG GCGGGCTATG CCCCGGCCAT CGGTTTTCTC CATCGCGGCA AGCCGCAAAG CTTCGTTTAC GACGTCGCCG ACCTCTATAA GGTCGAAACC GTCGTTCCCA CCGCCTTTTC GATCGCCGCG AAGATCGCCG CCGGCAAGGG CGACGACAGC CCGCCCGAGC GTCAGGTCCG TATCGCCTGC CGCGACCAGT TCCGCAAATC CGGTCTGTTG GAAAAGATCA TTCCCGACAT CGAGGAGATC CTGCGCGCCG GGGGCCTGGA ACCGCCCCTT GACGCCCCCG AGGCCGTCGA TCCGGTTATC CCGCCAGAGG AGCCTTCGGG TGATGATGGT CATCGTGGTT GA
|
Protein sequence | MADPAFVPLR PIAIKDRSSI VFLQRGQLDV VDGAFVLIDQ EGVRVQIPVG GLACLMLEPG TRITHAAIVL CARVGCLVIW VGERGTRLYA AGQPGGARAD RLLFQARNAL DETARLNVVR EMYRRRFDDD PPARRSVDQL RGMEGVRVRE IYRLLAKKYA VDWNARRYDH NDWDGADIPN RCLSAATACL YGLCEAAILA AGYAPAIGFL HRGKPQSFVY DVADLYKVET VVPTAFSIAA KIAAGKGDDS PPERQVRIAC RDQFRKSGLL EKIIPDIEEI LRAGGLEPPL DAPEAVDPVI PPEEPSGDDG HRG
|
| |