Gene Information |
Locus tag | Rsph17029_2958 |
Symbol | |
ID | 4895699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 3114820 |
End bp | 3116754 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640113561 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001044832 |
Protein GI | 126463718 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
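The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3114820
END_BP = 3116754
GENE_LENGTH_BP = 1935
PROTEIN_LENGTH_AA = 644


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length (1935 bp) and protein length (644 aa) agree with the listed coordinates.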
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.30707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.853466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTGCG GGACAGCATC CGCGGTCGGC GCCGCAAGGG TGCAGCCGTT CATCATCGTA AATGCTTTAC CCGGCGTCGC GGCTCGGTCA CTGTGGCGTG AGCGGGCGGT TGCCGGGGCA GCGTCCGAGG AACCAGCGGC AGACCGAATG ACGGGAGCGT CTCCTGACCT GCGGATGTAT CTGTTCGGGC CCTTCACCCT GCTGGGACCC GACGGAGAGG AACTGACTCC GAAATCCCGC AAGTCGCGCG CCATCCTCGC CATGCTGGCG GTGGCGCCTC GGGGCTCGCG CTCGCGCGTC TGGCTGCGCG ACAAGCTCTG GAGCGACCGA GGTGAGGATC AGGCCTCGGC GAGCCTCCGG CAGGCGCTTC TGGACATCCG CAAGTCGCTC GGGCCCCGCG CGGCGCATGT GCTGACGGCG GACAAGAACA CGGTCTCGCT GGATCTGGCG GCGGTGGCGG TGGATGCGCT GGAACTCGCA GCCCGCGAGC GGCGGGGCGA GGAGGGCAGC ACGGAGCATT TCCTCGAGGG GATCGACGTG CGCGATCCCG AATTCGAGGA CTGGCTGGCG CTGGAGCGCC AGAGCTGGTT CGCGCGCCTG GAGGAAGCCG GTTTCGAAGC CGATCTCGCG CCGCGCCCGG CGCCCTCGGA GGCGCGACGC GAGGCGTCGC AGGCGGTGCC GCGCACCTCG CCGCCGGCTG CGCCGCAGCC GCCGCAATCG CTGCTGCGGC AGGAGGACGA GGGCGGCTGG CGCGTGGCGA TGATGCCGCC CGTGATCCTC GGCGGCGATC CGGCGGCGGC GGTGCTGCAG GCCGATGTGC AGCGTGCGCT GCGCCGCGCG CTTGTCGAGA CGGGCGATCT GCGGCTGGTC GACATGGCGC CCGTGGCGCT GGGCGATCTG GGGGCCGGCC TTGCGGGGAG CGGGCTGCTC GCGCGTCTGC CCGAACAGGT GCATCTGAGC GTGCAGGTCC GGGTTCTGGC GGATCACAGC TACCTCCGCG TCGGGATCGT GCTGCAGAAC CCGGCCGACA ATGCGCTGGT CTGGTCGGAC GAGGTGATCG TGCCGCGGCG CGAGTCGATC GGCGAGGCGA GCTTTGCCAT GCCGCTGATC GTGCGCGCGA CCGAAGAGGC GACGCTGCAT TTCCTGCGCC GCCACGGCAC CGAGGGGGCC GAGGCCGAGG GGCGGATCGC GGCCGCCGTG GCCTCGATGT TCCGGCTGGC GCGGGGCGAT CTCGACCGGT CCGAGGAGAT CCTGCGCCGG CATCTCGACC GGACCCCGAC CGCGCAGGGC TATGCCTGGC TGGCCTTCCT CAACACCTTC CGTGTGGGCC AGCGCTTCAA TCCCGCCGAC GCGCCGCTGA TCGAGGAGAC GCAGCATCTC GCACGCCGCG CGCTGGGGCT CGAGCCCAAC AATGCGCTGG TCTCGGCGCT GGTGGGCCAC ATCCACTCCT ACCTCTTCGG CGAGTTCGAC TATGCCGCAG CGCTCTTCGA GCAGTCCCTG CGGGTCAATC CGGCGCAGAC GCTGGCCTGG GATCTCTATG CCATGCTCCA TGCCTATGCC GGCCAGCCGA AGCGGGCGCT CGCCATGGCG CGCTGGGCGC GTCATCTGGG CGCCTTCAGC CCGCATCGCT ACTATTTCGA GACCACCCGC GCGATCACCG GAAACTTCTC GGGCGATCAT CAGACGGCGA TCGACGCCGG ACAGTCGGCG CTGGCCGAGC GGCCGGACTT CAACTCGCTC CTGCGGGTGC TGGTCTCGTC GAACGCCCAT CTCGACCGGC CCGAGGAAGC GCGGCTGTTC 
CTCGAGCGGC TGTTGCAGGT CGAGCCGAAC TTCTCGGTCG CCTCGCTGCG GGACGCGGGC TATCCGGGCC TCGATACCGA GGGCGGGCGG CATTTTCTGG ATGGTCTCGT GAAGGCGGGC GTGCGCAAGC ACTGA
|
Protein sequence | MLCGTASAVG AARVQPFIIV NALPGVAARS LWRERAVAGA ASEEPAADRM TGASPDLRMY LFGPFTLLGP DGEELTPKSR KSRAILAMLA VAPRGSRSRV WLRDKLWSDR GEDQASASLR QALLDIRKSL GPRAAHVLTA DKNTVSLDLA AVAVDALELA ARERRGEEGS TEHFLEGIDV RDPEFEDWLA LERQSWFARL EEAGFEADLA PRPAPSEARR EASQAVPRTS PPAAPQPPQS LLRQEDEGGW RVAMMPPVIL GGDPAAAVLQ ADVQRALRRA LVETGDLRLV DMAPVALGDL GAGLAGSGLL ARLPEQVHLS VQVRVLADHS YLRVGIVLQN PADNALVWSD EVIVPRRESI GEASFAMPLI VRATEEATLH FLRRHGTEGA EAEGRIAAAV ASMFRLARGD LDRSEEILRR HLDRTPTAQG YAWLAFLNTF RVGQRFNPAD APLIEETQHL ARRALGLEPN NALVSALVGH IHSYLFGEFD YAAALFEQSL RVNPAQTLAW DLYAMLHAYA GQPKRALAMA RWARHLGAFS PHRYYFETTR AITGNFSGDH QTAIDAGQSA LAERPDFNSL LRVLVSSNAH LDRPEEARLF LERLLQVEPN FSVASLRDAG YPGLDTEGGR HFLDGLVKAG VRKH
|
| |
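The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any computation. The sketch below is one possible way to do that, and to compute GC content and check for a terminal stop codon; the helper names are hypothetical, and the stop codon set is the standard one for translation table 11 (TAA, TAG, TGA).

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    seq = clean_sequence(raw)
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS_TABLE_11


if __name__ == "__main__":
    # Toy input: the first block and the last two blocks of the gene
    # sequence in this record, not the full 1935 bp gene.
    fragment = "ATGTTGTGCG GTGCGCAAGC ACTGA"
    print(clean_sequence(fragment))
    print(ends_with_stop(fragment))
    print(round(gc_percent(fragment), 1))
```

Passing the full gene sequence from the record to `gc_percent` would give a value to compare with the reported GC content field.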