Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3153 |
Symbol | |
ID | 3836599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 3638948 |
End bp | 3641653 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637827268 |
Product | virulence factor protein |
Protein accession | YP_428235 |
Protein GI | 83594483 |
COG category | [S] Function unknown |
COG ID | [COG4458] Uncharacterized protein conserved in bacteria, putative virulence factor |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.490604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGATA CCACCAAGGA TCTGGGACGG GCCATTGACG CTCTCGACAA GGGGGCCGAG AAGGCCCGGT TGTGGATCGA CGCCCTGCGC GAAAGCGCGC CGTCGGTTTC CCTGCAGGCC GAAAGCCTGA GCAATGCGGC GCGCCGCGCC CGGCTGGCCT GTAAGCGGCT GGCCCATGCC GCCGACCGCA ACAATTGCGT GGGCGTCTTT GGCCCCAGTC AGGCCGGAAA ATCCTATCTG GTCTCGGCCC TGGCCCGCAC CAAGGGCGGG CGGCTGACCA TCCGCCTGGG CGCCGAGAGC CGGGATTTCC TGCGCGAGAT CAATCCGCCG GGCGATCGCG AGTCCACCGG GCTGGTCACC CGGTTCACCA TCCATGCCAA TGACATCGAC CCGGCCTATC CGGTGGAGCT GCGGCTGCTG AGCGAAACCG ATGTGGTCAA GATCCTGGCC AACAGCTTCT TTCAGGATTT CGATCCCAAC AGCATGACCA TCGCGCCGCT GGAGGAAGAC GATATCCGCG CCGCCCTGCG CGAGGCGGCG GCGGCGGCGA CCGCCAAGCC GGCGGCCCAT CTCACCGAGA TCGCCCTGTT CGAACTCGAT GAATATTTCC ATCAGAACTT CAAGAAGCGC ATCGGCGCCC TTGATCGCGC CGATTTCTGG GCCGGGCTGA TCCGCCATGG CGGGCGCCTG CCGATCGCGG CGCGGGCCCG GCTATTCTCG GTGCTCTGGG GGCGGGTGGA GGCCTTCACC AAGCTGTATA TCCATCTGGC CGGCGCCCTG GAGGCGATCG GCAATCCCGA AGAGGCGCGC GCCGCCATCA GCGGCCTGAT CCCGCGCGAG ATCGGCTCGC CGCCCAGGGC CAATTCGATC ATCGATGTCG CCGTGCTCAA CCGCCTGGGC ACCAGCGAAG ACGGCTCCGA CCCCATCGCC CTGCTGCCGG TGGTGGCCGG CAAGTCCGGC GCCCCGGTCA GTCTGCCGCG CGCCACCCTG ACGGCGCTGA TCGCCGAAGT GCGGCTGGTC ATCGAGCACC AGCCCTGGCC GTTTTTCGAG CATACCGACC TTCTGGACTT CCCCGGCGCC CGGTCGCGGC TGAAGCTTCT TCAGATGCCC AAGGAGGCCG AGGAAGAGGC CCGCCAGACC CGCGAACTGT ATCTGCGCGG CAAGATCGCC TATCTGTTCC AGCGCTATAC CGACGAATTG GAACTGACCT CGATGCTGCT GTGCATGCCG CCCAGCGTCG CCGAGGTCAA GGATCTGGCC ATGATGGTAA GGTCGTGGAT CAACGTGACC CACGGCGAAA CCCCGGCCAA GCGCAAGGCG GTGCGCAACG CCTTGTTCCT GGTGCTGACC AAGCACGATC TGGAATTTCT GGAAAAGGGC GGTGAAACCC CGGACTCGCG CGCCGGCAAA TGGGACCGCC GCCTTCACGC CTCGCTGCTG GAGCTTTATG GCAAGGACGA CTGGCCCGGC GATTGGGACG GCAAGCCCTT TGACAACACG GTGTTCCTGC GCAATCCCAG CATGAAGCAG GTGCATCTGA TGCGCTATCG CGACGAGGCG ACGCTTGACG AGGAGGGGCC CGTCGATAGC CCGGTGTTTC GCGAATACCG CGACGCCTTC CTGGCCTCGG CCGACGTCGC CCGCCATTTC GCCGATAAGA CGGCGGTGTG GGACGCGGCG ATGACGGCCA ATGACGGCGG CGTCGCTTAT CTGGTCGACC GGCTGGTCGC CGTGCTTGAT CCCGGCTTGA AGAGGCATCA GGCCTCGGAA CGGCTGGCGA CCACCGCCCG GGCGCTGGAA GAGCCCTTGC GCGCCTTCCA TTACGCCGAG GGCGACGAGG CGAAGCGCGC CAAGGACGCC GCCCTGGTCG ATCTGCGCCG CCGGTTGTTC AGCCAAATCC GCGAAAGCGA CCATCGGTCC TTCGCCGCCC TTCTCGCCGG GCTGATGGCC GATCCGGCCC AAGTGCGCGG CCTCTACATG AATGTCGCGG AGATGCGCGA GGACGAGTTG AACGAGATCG CCGACGGGGC GGTTGCCGAT GAGCCGGTGG TCGAGGACGA CGACGATCCC TGGGCCGAGG CCGGCGCCGA CCCGGTTCCG GCCAAAAAGG CCGCTCCCCC CCGCCGCAAG GACCGCCCCG AGGTCTTCGC CAGTCAGGTG ATGAACCAGT GGGCTGGCGG CTTGCGGGCC TTTCAGCGCA ACGAGGTCGC CCTGGCGGTT CTCGGCCTGA GCGCCGCCAC CGTCGGCCCG ATCATCGACG AGATGCTGGT CGGGGCCAAT CGCCTGGGGC TGCAAGAGCG CATCGCCGAG GCGGCGCGCG AGGAAACCCG GGCGGTGGGA ACGCGTTGGA GCAGTGTCGC CGACCGGGTC ACCGGGATCG CCGCCAATAC CATCAACGAT TTCGTCGCCT ATCTCGATTA TGCCGCCCTG CCGGTCGATC AGCGCCCGGG GGTTCCCGAA CCGCCGAAGG AGCGCAAGCG CGGGATCTTC ACCACCGCGT CGTTGAAGCA CCCGGGGCCG GTCTTGGGCG ACGAGCCCGA AGAGATCGAG AAGGCCTTTT TCCTTGATTG GGGCGTGGCC CTGCGCGCCT TTGGATGCGA TAACGTGGGC CATGCCGCCG GACGGGAAAT TTCCGATGAG AAGAACCGGG AATTGGGGGC CATCCTCGAC ATGATCGACG TGTCATGGCT GGTTGCGGCC GAATAG
|
Protein sequence | MADTTKDLGR AIDALDKGAE KARLWIDALR ESAPSVSLQA ESLSNAARRA RLACKRLAHA ADRNNCVGVF GPSQAGKSYL VSALARTKGG RLTIRLGAES RDFLREINPP GDRESTGLVT RFTIHANDID PAYPVELRLL SETDVVKILA NSFFQDFDPN SMTIAPLEED DIRAALREAA AAATAKPAAH LTEIALFELD EYFHQNFKKR IGALDRADFW AGLIRHGGRL PIAARARLFS VLWGRVEAFT KLYIHLAGAL EAIGNPEEAR AAISGLIPRE IGSPPRANSI IDVAVLNRLG TSEDGSDPIA LLPVVAGKSG APVSLPRATL TALIAEVRLV IEHQPWPFFE HTDLLDFPGA RSRLKLLQMP KEAEEEARQT RELYLRGKIA YLFQRYTDEL ELTSMLLCMP PSVAEVKDLA MMVRSWINVT HGETPAKRKA VRNALFLVLT KHDLEFLEKG GETPDSRAGK WDRRLHASLL ELYGKDDWPG DWDGKPFDNT VFLRNPSMKQ VHLMRYRDEA TLDEEGPVDS PVFREYRDAF LASADVARHF ADKTAVWDAA MTANDGGVAY LVDRLVAVLD PGLKRHQASE RLATTARALE EPLRAFHYAE GDEAKRAKDA ALVDLRRRLF SQIRESDHRS FAALLAGLMA DPAQVRGLYM NVAEMREDEL NEIADGAVAD EPVVEDDDDP WAEAGADPVP AKKAAPPRRK DRPEVFASQV MNQWAGGLRA FQRNEVALAV LGLSAATVGP IIDEMLVGAN RLGLQERIAE AAREETRAVG TRWSSVADRV TGIAANTIND FVAYLDYAAL PVDQRPGVPE PPKERKRGIF TTASLKHPGP VLGDEPEEIE KAFFLDWGVA LRAFGCDNVG HAAGREISDE KNRELGAILD MIDVSWLVAA E
|
| |