Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_2731 |
Symbol | |
ID | 5085234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2768843 |
End bp | 2770777 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640484294 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001168923 |
Protein GI | 146278764 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.92 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTCCT GTGCCATCGC GGCCGCCGCT CGTGGGTCCG CCGCGCCGGT CATTCTCGTT ACTGCTTTAC CGAAGCTCGT CCCTCGGTCA CTCTGTCGTG ATCGGACGGA TGCCGGGGCT CCGTCGTGCG ACGTGGTGGT TGACCGGATG ACGGGCGTGC CTCCTGCCTT GCGGATGTAT CTTTTCGGGC CTTTCACCCT GCTGGGGCCC GACGGCGAGG AACTGACCCC GAAATCCCGC AAGTCACGAG CGATCCTTGC CATGCTGGCG GTCGCGCCCC GCGGCTCGCG CTCGCGCGTG TGGCTGCGGG ACAAGCTCTG GAGCGATCGG GGCGAGGATC AGGCGTCCGC GAGCCTTCGG CAGGCGCTGC TCGATATCCG CAAGTCGCTG GGGCCGCGCG CGGCCCATGT GCTGACGGCC GACAAGAACA CCGTCTCGCT CGATCTGGCG GCGGTCTCGG TCGATGCGCT CGAGTTCGCC TCGCGCGAGC GGCGGGGAGA GGAGGCGGGG ACGACCGAAC ATTTCCTCGA GGGGATCGAC GTGCGCGATC CCGAGTTCGA GGACTGGCTG TCGCTCGAAC GGCAGAGCTG GTTTGCCCGG CTCGAGGAGG CGGGGTTCGA GGCCGACCTG GCCCCGCGGC CCGCGCCGTC CGAGGCGCGG CGCGAGGCTT CGCAGGCGGT GCCGCGCACC TCGCCCCCGG CCATGCCGCA GGGGCCGTCA CAGGGGATTG CCCGGCAGGA GGATGAGGCG GGATGGCGCG TGGCGATGAT GCCTCCCGTC ATCCTCGGCG GTGATCCCGC GGCCGCGGTG CTGCAGGCCG ACGTGCAGCG TGCGCTGCGC CGCGCGCTGA TGGAGACGGG CGACCTGCGC CTCGTGGACA TGGGGCCGCT GGGGTTCGGC GACACGGGAT TGGCCGGAGG CGCGCTCGCG GCACGGCTGC CCGAGCTGAT CCACCTGAGC GTTCAGGTCC GTGTCCTGTC GGACGTCAGC TATTTTCGTC TCGGGATCGT GCTGCAGAAC CCGGCGGACA ACGGGCTGGT CTGGGCAGAC GAGATGGTGG TGCCCCGCCG GGAAGCGGCC ACCGAGGCAA GTTTCGCCAT GCCGCTGATC GTGCGGGCCA CGGAGGAGGC CATGCTGTAT TTCCTCCGCC GGCACGGAGG AGAGGCGGCC GAGGCCGACG GCCGCATCGC CGCTGCGGTG GCCTCGATGT TCCGGCTGGC GCGCGGCGAT CTGGACCGTT CGGAGCAGAT CCTGCGCCGG CATCTCGACC GGACGCCTAC GGCGCAGGGC TATGCCTGGC TGGCCTTCCT GAACACCTTC CGCGTGGGTC AGAGGTTCAA CCCCGCCGAT GCGCCGCTGA TCGAGGAGAC GCAGCATCTC GCGCGCCGGG CGCTCGCGCT CGAGCCCGGC AACGCGCTGG TGTCGGCGCT GGTCGGACAT ATCCACTCCT ACCTGTTCGG CGAGTTCGAC TATGCGGCGG CATTGTTCGA GCAGTCCTTG CGGGTCAATC CGGCGCAGAC GCTGGCGTGG GATCTCTACG CCATGCTGCA CGCCTATGCG GGCCAGCCGA AGCGGGCGCT GGCCATGGCG CGCTGGGCGC GGCATCTGGG GGCCTTCAGC CCGCACCGCT ACTATTTCGA GACGACGCGG GCGATCACCG GCAATTTCTC GGGCGATCAC CGGACGGCCA TCGACGCCGG CCAGTCGGCG CTGGCCGAAC GGCCGGACTT CAACTCGCTG CTGCGGGTGC TTGTCTCGTC GAACGCCCAT CTCGACCGGC CCGACGAGGC GCGGCTGTTC CTCGAGCGCC TGTTGCAGGT CGAGCCGAAC TTCTCGATCG CCTCGTTGCG CGAGGGGGGC TACCCCGGCC TCGACACCGA GGGGGGGCGG CATTTTCTGG ACGGGCTGAT GAAGGCCGGC GTGCGCAGGC ACTGA
|
Protein sequence | MLSCAIAAAA RGSAAPVILV TALPKLVPRS LCRDRTDAGA PSCDVVVDRM TGVPPALRMY LFGPFTLLGP DGEELTPKSR KSRAILAMLA VAPRGSRSRV WLRDKLWSDR GEDQASASLR QALLDIRKSL GPRAAHVLTA DKNTVSLDLA AVSVDALEFA SRERRGEEAG TTEHFLEGID VRDPEFEDWL SLERQSWFAR LEEAGFEADL APRPAPSEAR REASQAVPRT SPPAMPQGPS QGIARQEDEA GWRVAMMPPV ILGGDPAAAV LQADVQRALR RALMETGDLR LVDMGPLGFG DTGLAGGALA ARLPELIHLS VQVRVLSDVS YFRLGIVLQN PADNGLVWAD EMVVPRREAA TEASFAMPLI VRATEEAMLY FLRRHGGEAA EADGRIAAAV ASMFRLARGD LDRSEQILRR HLDRTPTAQG YAWLAFLNTF RVGQRFNPAD APLIEETQHL ARRALALEPG NALVSALVGH IHSYLFGEFD YAAALFEQSL RVNPAQTLAW DLYAMLHAYA GQPKRALAMA RWARHLGAFS PHRYYFETTR AITGNFSGDH RTAIDAGQSA LAERPDFNSL LRVLVSSNAH LDRPDEARLF LERLLQVEPN FSIASLREGG YPGLDTEGGR HFLDGLMKAG VRRH
|
| |