Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0453 |
Symbol | |
ID | 3833787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 530734 |
End bp | 534255 |
Gene Length | 3522 bp |
Protein Length | 1173 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637824537 |
Product | CRISPR-associated endonuclease Csn1 family protein |
Protein accession | YP_425545 |
Protein GI | 83591793 |
COG category | [S] Function unknown |
COG ID | [COG3513] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01865] CRISPR-associated protein, Csn1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACCCA TCGAGCCCTG GATTCTCGGC CTTGATATCG GCACCGACAG CCTTGGCTGG GCGGTCTTCT CCTGCGAGGA AAAGGGGCCG CCGACGGCCA AGGAGCTTCT TGGCGGCGGG GTGCGACTGT TCGACAGTGG CCGTGACGCC AAGGATCACA CCTCGCGCCA GGCCGAGCGC GGCGCCTTTC GCCGGGCGCG GCGGCAAACG CGGACTTGGC CCTGGCGGCG CGACCGGCTG ATCGCCCTGT TCCAGGCCGC CGGGCTTACC CCCCCTGCCG CGGAAACCCG CCAGATCGCC CTGGCCCTGC GCCGCGAGGC GGTAAGCCGC CCCCTGGCGC CCGACGCCCT GTGGGCGGCG CTTTTGCACC TCGCCCACCA CCGGGGCTTT CGCAGCAACC GCATCGACAA ACGCGAGCGC GCCGCCGCCA AGGCCTTGGC GAAGGCCAAG CCGGCAAAGG CGACGGCCAA GGCGACCGCC CCGGCCAAGG AGGCCGACGA CGAAGCCGGC TTTTGGGAGG GGGCGGAAGC GGCGCTGCGC CAGCGCATGG CGGCCAGCGG CGCCCCCACC GTTGGCGCCC TGCTGGCCGA TGACCTGGAT CGCGGCCAAC CGGTGCGCAT GCGCTATAAC CAAAGCGACC GCGACGGCGT GGTCGCCCCC ACCCGCGCCC TGATCGCCGA GGAACTGGCC GAAATCGTCG CCCGGCAAAG CTCCGCCTAC CCCGGCCTGG ATTGGCCGGC CGTGACCCGT CTGGTGCTCG ACCAGCGGCC CTTGCGCAGC AAGGGGGCGG GCCCTTGCGC CTTCCTGCCC GGCGAAGACC GCGCCTTGCG CGCCCTGCCG ACGGTGCAGG ATTTCATCAT CCGCCAGACG CTGGCCAACC TGCGCCTGCC TAGCACCTCC GCCGATGAGC CGCGCCCCTT GACGGACGAG GAGCACGCCA AGGCTTTGGC CCTGCTGTCG ACGGCGCGGT TCGTAGAATG GCCCGCCCTG CGCCGCGCCC TCGGCCTGAA GCGCGGCGTG AAATTCACCG CCGAAACCGA ACGCAATGGC GCCAAACAGG CCGCGCGCGG CACGGCGGGC AATCTGACCG AAGCCATTCT CGCCCCCTTG ATCCCGGGTT GGTCGGGGTG GGATCTTGAT CGCAAAGACC GGGTTTTTTC CGATCTTTGG GCGGCGCGCC AGGATCGTTC GGCCCTGCTG GCGCTGATTG GCGATCCCCG GGGACCAACT CGGGTGACGG AGGACGAGAC CGCCGAAGCG GTGGCCGATG CCATCCAGAT CGTTCTGCCG ACCGGTCGGG CCAGCCTGTC GGCCAAGGCC GCCCGGGCGA TCGCCCAGGC CATGGCCCCG GGGATCGGTT ATGACGAGGC GGTAACCCTG GCCCTGGGGC TTCACCATTC CCACCGTCCG CGCCAGGAGC GTTTGGCGCG CCTGCCCTAT TACGCCGCCG CCCTGCCCGA TGTCGGGCTT GATGGCGATC CCGTCGGCCC GCCGCCAGCC GAAGACGACG GGGCGGCCGC CGAGGCCTAT TACGGCCGCA TCGGCAATAT CTCGGTCCAC ATCGCCCTCA ACGAAACCCG CAAGATCGTC AACGCCCTGC TCCATCGCCA TGGCCCCATC CTGCGGCTGG TCATGGTGGA AACCACCCGC GAGTTGAAGG CCGGCGCCGA CGAGCGCAAG CGGATGATCG CCGAACAGGC CGAGCGCGAA CGCGAAAACG CCGAGATCGA CGTGGAACTG CGCAAAAGCG ATCGGTGGAT GGCGAATGCC CGCGAACGCC GCCAGCGCGT CCGCTTGGCC CGGCGCCAAA ACAACCTCTG CCCCTATACC TCCACCCCCA TCGGCCACGC CGATCTGCTT GGCGACGCCT ATGACATCGA CCATGTCATC CCGCTGGCCC GGGGTGGGCG CGATAGCCTT GACAATATGG TGCTGTGCCA ATCCGACGCG AACAAGACCA AGGGCGACAA AACCCCCTGG GAGGCCTTCC ACGACAAACC GGGCTGGATC GCCCAGCGCG ATGATTTCCT GGCGCGCCTG GATCCGCAAA CGGCCAAGGC CCTGGCATGG CGCTTCGCCG ATGACGCCGG CGAACGGGTC GCCCGCAAGA GCGCCGAAGA CGAGGACCAG GGCTTTTTAC CCCGGCAATT GACCGATACC GGGTATATCG CCCGCGTCGC CCTGCGCTAC CTGAGCCTTG TCACCAATGA GCCCAACGCC GTGGTCGCCA CCAATGGCCG CCTGACCGGT CTGCTGCGCT TGGCCTGGGA CATCACCCCC GGACCGGCGC CCCGCGACCT GTTGCCCACG CCGCGCGACG CCCTGCGCGA CGATACCGCC GCCCGCCGTT TCCTTGACGG CCTGACCCCG CCGCCTCTCG CCAAGGCGGT TGAAGGGGCG GTGCAGGCGC GCCTTGCCGC CCTGGGCCGT AGCCGGGTGG CCGATGCGGG TTTGGCCGAC GCGCTGGGCC TGACCCTGGC CAGCCTGGGC GGCGGTGGCA AGAACCGCGC CGACCATCGC CATCATTTCA TCGATGCCGC GATGATCGCC GTCACCACCC GCGGCCTTAT CAATCAGATC AATCAGGCAA GCGGGGCGGG CCGCATTCTC GATCTGCGGA AATGGCCCCG CACCAATTTC GAACCCCCTT ACCCGACCTT CCGCGCCGAG GTCATGAAAC AGTGGGATCA TATCCACCCG TCGATCCGCC CCGCCCATCG CGATGGCGGT TCGCTGCACG CCGCCACGGT TTTCGGCGTC CGTAACCGAC CCGACGCCCG CGTTCTGGTT CAGCGCAAAC CGGTGGAAAA GCTGTTTCTT GACGCCAACG CCAAGCCCTT GCCCGCCGAC AAGATCGCGG AAATCATCGA CGGCTTCGCC TCGCCGCGCA TGGCCAAACG GTTCAAGGCC TTGCTGGCCC GCTATCAGGC GGCCCACCCC GAGGTTCCCC CGGCCCTGGC CGCCCTGGCC GTTGCCCGCG ACCCGGCTTT CGGCCCGCGC GGCATGACCG CCAATACGGT GATCGCCGGC CGCTCGGACG GAGACGGCGA GGACGCCGGC CTGATCACCC CCTTTCGCGC CAACCCGAAA GCCGCCGTGC GCACCATGGG CAATGCCGTT TACGAGGTGT GGGAAATCCA GGTGAAGGGC CGGCCCCGGT GGACCCACCG GGTTCTGACT CGCTTCGACA GGACCCAACC CGCCCCGCCG CCGCCGCCGG AGAACGCCCG TTTGGTGATG CGCCTGCGCC GGGGCGATCT GGTCTATTGG CCCCTGGAGA GCGGGGACAG GCTGTTCCTG GTGAAAAAGA TGGCCGTGGA TGGCCGCCTC GCCCTGTGGC CCGCCCGATT AGCCACCGGA AAAGCGACGG CTCTTTATGC ACAACTGTCT TGTCCAAACA TCAACTTGAA TGGCGACCAG GGGTATTGCG TTCAAAGTGC CGAGGGCATC CGAAAGGAAA AAATCCGAAC AACCTCATGC ACCGCCCTGG GCCGACTCAG ACTTTCCAAA AAAGCGACAT AG
|
Protein sequence | MRPIEPWILG LDIGTDSLGW AVFSCEEKGP PTAKELLGGG VRLFDSGRDA KDHTSRQAER GAFRRARRQT RTWPWRRDRL IALFQAAGLT PPAAETRQIA LALRREAVSR PLAPDALWAA LLHLAHHRGF RSNRIDKRER AAAKALAKAK PAKATAKATA PAKEADDEAG FWEGAEAALR QRMAASGAPT VGALLADDLD RGQPVRMRYN QSDRDGVVAP TRALIAEELA EIVARQSSAY PGLDWPAVTR LVLDQRPLRS KGAGPCAFLP GEDRALRALP TVQDFIIRQT LANLRLPSTS ADEPRPLTDE EHAKALALLS TARFVEWPAL RRALGLKRGV KFTAETERNG AKQAARGTAG NLTEAILAPL IPGWSGWDLD RKDRVFSDLW AARQDRSALL ALIGDPRGPT RVTEDETAEA VADAIQIVLP TGRASLSAKA ARAIAQAMAP GIGYDEAVTL ALGLHHSHRP RQERLARLPY YAAALPDVGL DGDPVGPPPA EDDGAAAEAY YGRIGNISVH IALNETRKIV NALLHRHGPI LRLVMVETTR ELKAGADERK RMIAEQAERE RENAEIDVEL RKSDRWMANA RERRQRVRLA RRQNNLCPYT STPIGHADLL GDAYDIDHVI PLARGGRDSL DNMVLCQSDA NKTKGDKTPW EAFHDKPGWI AQRDDFLARL DPQTAKALAW RFADDAGERV ARKSAEDEDQ GFLPRQLTDT GYIARVALRY LSLVTNEPNA VVATNGRLTG LLRLAWDITP GPAPRDLLPT PRDALRDDTA ARRFLDGLTP PPLAKAVEGA VQARLAALGR SRVADAGLAD ALGLTLASLG GGGKNRADHR HHFIDAAMIA VTTRGLINQI NQASGAGRIL DLRKWPRTNF EPPYPTFRAE VMKQWDHIHP SIRPAHRDGG SLHAATVFGV RNRPDARVLV QRKPVEKLFL DANAKPLPAD KIAEIIDGFA SPRMAKRFKA LLARYQAAHP EVPPALAALA VARDPAFGPR GMTANTVIAG RSDGDGEDAG LITPFRANPK AAVRTMGNAV YEVWEIQVKG RPRWTHRVLT RFDRTQPAPP PPPENARLVM RLRRGDLVYW PLESGDRLFL VKKMAVDGRL ALWPARLATG KATALYAQLS CPNINLNGDQ GYCVQSAEGI RKEKIRTTSC TALGRLRLSK KAT
|
| |