Gene Information |
Locus tag | Rru_A0179 |
Symbol | |
ID | 3833870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 211593 |
End bp | 213335 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637824257 |
Product | CRISPR-associated Cas1/Cas4 family protein |
Protein accession | YP_425271 |
Protein GI | 83591519 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1468] RecB family exonuclease; [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR00372] CRISPR-associated protein Cas4 |
|
|
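The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed sequence ends in TAG, which is consistent with that). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(211593, 213335)
print(length)                     # 1743, matching "Gene Length"
print(protein_length_aa(length))  # 580, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.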
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCCCT CCGACACACC GCCTTCCGCG GAAGATCTCC CCTCCCAGGG CGAATTGGCC CTGTTCGCCC CACCGGCGAC CGCCGAGGAT GCCCTGGTGC CCGCCAGCAT GGTCAACGCC TGGATCTATT GTCCCCGGCT GGCTGTTTTG GAATGGGGGC GTGGCGAAAA GGCCCGCAGT GTGGATCTCA TCGCCGGCCT GCGCGCCCAT CAAGCCACGG AAAGCGGCCC GACTCCCGCC CTCCCCGATC CCATGGTCCT GCGAGAAGAC CAAAGCCTGA AAACCCGCAG ATTATCGCTG TCTTCGGAGC GCCTTGGGCT GACGGCCGAG CTTGATCTTC TCGATGTCGA AGAGGGGATG GTGATCCCGG TCGAGATCAA GGTCGGCAAA CGCCCCTCCG TCGACGAGGG GGCTTATCTG CCCGAACGCG CCCAGGTTTG CGCCCAGGCC CTTTTGCTCC GCGAGGCGGG CTACACCTGC CTTGAAGGAG CGCTGTGGTT CGCCGAAAGC CGCGAGCGTG TAACGGTGGA TCTGACCGAG GCGTTGGTCA CCGCCACCCT GGTCGCCACA TCGGATCTGC GCTTGACCGT GGCCAGCGGC CGGCTGCCGC CGCCTTTGGA TCATTCGGCC AAATGCCCCC GCTGTTCGCT TTTGCCGATC TGCCTGCCCG ATGAAATCGC TTGGTTCCGC AAGGGATCGA TCGCGCGAAC CCCTCCGCCC CCGGCTTCTC CGGCCTTGCC ACTTTACGGA CAGACCCCCG GGGCCCGCAT CGGCAAGAAA GACTATACGC TTGTCATCCA GGTCGAGGGC GAGGCCGACC GCAGCTTGGC GCTCGACGAG ATATCGGAGG TGGTTCTGGC CGGCCCGGTG TCTCTGACCA CACCGGCCAT TCATGAATTG CTGCGTCGCG AGATCCCCGT GGCCTGGATG TCCTCGGGCT TTTGGTTCCT GGGATCCACC GGCGGCCAGG GCCCCAGAAG CGCCGCGGTT CGTACGGCCC AATACGCCCT TGCCGGTGAC GAAAGGCGAC GCCAAGCCTT CGCCCGCGAT CTGGTTTCGG CGAAAATCCG CAATGGCCGG ACCTTGTTGC GCCGCAACTG GCGGGGAGCG GAAGCCGAGC GCCAGATCGC GTTGGACCGC TTGGCCCGCC TTGCCGAGCG GGCGACCACG GCCGAAACCA CCGCCTGTTT ATTGGGGATC GAGGGGGAAG CCGCTGCGGT CTATTTCCGC GCCTTCCCAC AGCTTTTCAC CCAGGCCGTA ACCACCCTTC CGGCCTTTGC CTTCGAGCGC CGCAACCGTC GCCCGCCGGC CGATCCGGTA AACGCCTGCC TGTCGCTGTG CTACGCCGTG CTCACCCGCA CCTTATCGTC GGCGTTGAGC ATCGCCGGCC TTGATCCGTG GAAGGGCTTC TATCACACCG AGCGCCCCGG TCGCCCGGCC CTCGCCCTTG ATCTGATCGA ATCCTTTCGC CCCGTTCTGG CCGATTCCAC CGTCTTGATG GTTTTAAACA ATGGCGAGAT CGGCACCAAT GATTTCCTGT ACGCTGGCGG CGGCTGCGCT TTGAAGCCAA ACGCCCGACG CGGCCTGATC GCGGCCTATG AACGCCGATT GGACCAGGAA ACCACCCACC CGGTCTTTGG GTATCAGCTT TCCATGCGGC GCCTGATTCA GGTGCAGGCC CGCTTGCTCG CCCGCTTCGT CTCTGGCGAT ATTCCCCGCT ACCCTCATTA TTGTCCTCGT TAG
|
Protein sequence | MAPSDTPPSA EDLPSQGELA LFAPPATAED ALVPASMVNA WIYCPRLAVL EWGRGEKARS VDLIAGLRAH QATESGPTPA LPDPMVLRED QSLKTRRLSL SSERLGLTAE LDLLDVEEGM VIPVEIKVGK RPSVDEGAYL PERAQVCAQA LLLREAGYTC LEGALWFAES RERVTVDLTE ALVTATLVAT SDLRLTVASG RLPPPLDHSA KCPRCSLLPI CLPDEIAWFR KGSIARTPPP PASPALPLYG QTPGARIGKK DYTLVIQVEG EADRSLALDE ISEVVLAGPV SLTTPAIHEL LRREIPVAWM SSGFWFLGST GGQGPRSAAV RTAQYALAGD ERRRQAFARD LVSAKIRNGR TLLRRNWRGA EAERQIALDR LARLAERATT AETTACLLGI EGEAAAVYFR AFPQLFTQAV TTLPAFAFER RNRRPPADPV NACLSLCYAV LTRTLSSALS IAGLDPWKGF YHTERPGRPA LALDLIESFR PVLADSTVLM VLNNGEIGTN DFLYAGGGCA LKPNARRGLI AAYERRLDQE TTHPVFGYQL SMRRLIQVQA RLLARFVSGD IPRYPHYCPR
|
| |
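The sequences above are displayed in space-separated groups of 10 characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and the example input is only the first two groups of the gene sequence, not the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the display groups and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two display groups of the gene sequence in this record.
example = "ATGGCCCCCT CCGACACACC"
dna = clean_sequence(example)
print(len(dna))          # 20
print(gc_fraction(dna))  # 0.7 for this 20-base fragment only
```

Passing the full gene sequence to the same functions would give a whole-gene value to compare with the reported 65%.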