Gene Rru_A0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A0833 
Symbol 
ID3834407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp987840 
End bp988874 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content63% 
IMG OID637824921 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_425921 
Protein GI83592169 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0219387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TGTTGAACAC CGTTTACGTC ACCACCGAGG GAACGGGGTT GCGCAAGGAC 
GGTGAAAATC TGGTCGCCGA GCTTGACGGC GTGCAAAAGG GCCGGGTTCC CTTGCATATG
GTGGGCTCTG TGGTGGTTTT TGGCGGCACC TATGTTTCGC CCGGGCTAAT GGGGGCTTGC
GCCGCCCATG GCATCACCAT CGTGCTGTTG GACCGGGTTG GTCGTTTTCA GGCGCGGGTC
GAGGGGCCGG TCGCCGGCAA TGTGCTGCTG CGCCGCGCCC AATACAAAGC CTCCGAGGCG
CCCGAGGATA TCGTCAAAAG CCTGATCTTG GGCAAAGTCT CCAATCAGCG GGCGGTTCTG
CTGCGGGCGC TGCGCGATCA CGGTGCCGAT TTTCCGGCGG CCGAAGCCTT GGCGGTGAAA
GACGCCATCG ATCGCATGGC CCATATCCTG CGCAAGGTCG GCGCCTCGGC CGAGGATGCC
GACCACCTGC GCGGCGCCGA AGGCGAGGCG GCCAGCCTTT ATTTCGGGGT TTTTGGCCAG
TTGATCCGCT CTCCCGATGG CGATTTCGCC TTTCGTGGTC GCTCGCGCCG TCCGCCGCTC
GATCCGACGA ATGCCCTGCT GTCGTTTCTA TATACCCTGT TGACCCACGA TTGCCGCAGT
GCTTGCGAAA GTGTCGGACT TGATCCCGCC GTGGGATTTC TGCACCGCGA TCGCCCGGGC
CGCCCGAGCC TCGCCCTTGA TCTGATGGAA GAATTGCGGC CGGTTCTGGT TGATCGCTTG
GCGTTGTCCT TGATCAATCG CCGTCAGCTT CGGGCGACGG ATTTCCAGCG CCTGGACGGC
GGCGCCGTTC TTTTGACCGA CGAGGCGCGC AAGACGGTGC TAAGCGCTTG GCAGGAGCGC
AAGAAACAGG AACGGCGCCA TCCCTTTCTT GAGGAAAGCG CCCCGCTTGG CCTTGTGCCC
TATCTTCAGG CCCAGATGCT TGCCCGCCAC CTGCGCGGGG ATCTTGACGC CTATCCGCCG
TGGTTCTGGA AGTAG
 
Protein sequence
MKKLLNTVYV TTEGTGLRKD GENLVAELDG VQKGRVPLHM VGSVVVFGGT YVSPGLMGAC 
AAHGITIVLL DRVGRFQARV EGPVAGNVLL RRAQYKASEA PEDIVKSLIL GKVSNQRAVL
LRALRDHGAD FPAAEALAVK DAIDRMAHIL RKVGASAEDA DHLRGAEGEA ASLYFGVFGQ
LIRSPDGDFA FRGRSRRPPL DPTNALLSFL YTLLTHDCRS ACESVGLDPA VGFLHRDRPG
RPSLALDLME ELRPVLVDRL ALSLINRRQL RATDFQRLDG GAVLLTDEAR KTVLSAWQER
KKQERRHPFL EESAPLGLVP YLQAQMLARH LRGDLDAYPP WFWK