Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_1299 |
Symbol | |
ID | 3720959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007493 |
Strand | + |
Start bp | 3067938 |
End bp | 3069872 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640072526 |
Product | hypothetical protein |
Protein accession | YP_354380 |
Protein GI | 77464876 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.135233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGTGCG GGACAGCATC CGCGGTCGGC GCCGCAAGGG TGCAGCCGTT CATCATCGTA AATGCTTTAC CCGGCGTCGC GGCTCGGTCA CTGTGGCGTG AGCGGGCGGT TGCCGGGGCA GCGTCCGAGG AACCAGCGGC TGGCCGAATG ACGGGAGCGT CTCCTGACCT GCGGATGTAT CTGTTCGGGC CCTTCACCCT GCTGGGACCC GACGGAGAGG AACTGACTCC GAAATCCCGC AAATCGCGCG CCATCCTCGC CATGCTGGCG GTGGCGCCTC GGGGCTCGCG CTCGCGCGTC TGGCTGCGCG ACAAGCTCTG GAGCGACCGG GGCGAGGATC AGGCCTCGGC GAGCCTCCGG CAGGCGCTTC TGGATATCCG CAAGTCGCTG GGGCCCCGCG CCGCGCATGT GCTGACGGCG GACAAGAACA CGGTCTCGCT GGATCTGGCG GCGGTGGCGG TGGATGCGCT GGAACTCGCC GCTCGCGAGC GGCGGGGCGA GGACGGCAGC ACGGAGCATT TCCTCGAAGG GATCGACGTG CGGGATCCCG AATTCGAGGA CTGGCTGGCG CTGGAGCGCC AGAGTTGGTT CGCGCGCCTC GAAGAGGCCG GCTTCGAGGC CGATCTCGCG CCGCGCCCGG CGCCCTCGGA GGCGCGGCGC GAGGCGTCGC AGGCGGTGCC GCGCACCTCG CCGCCGGCTG CGCCGCAGCC GCCGCAATCG CTGCTGCGGC AGGAGGACGA GGGCGGCTGG CGCGTGGCTA TGATGCCGCC CGTCATCCTC GGCGGCGATC CGGCGGCGGC GGTGCTGCAG GCCGATGTGC AGCGCGCGCT GCGCCGCGCG CTCGTCGAGA CGGGCGATCT GCGGCTGGTC GACATGGCGC CCGTGGCGCT GGGTGATCTG GGGGCCGGCC TTGCGGGGGG CGGGCTGCTC GCGCGTCTGC CCGAACAGGT GCATCTGAGC GTGCAGGTCC GGGTTCTGGC GGATCACAGC TACCTCCGCG TCGGGATCGT GCTGCAGAAC CCGGCCGACA ATGCGCTGGT CTGGTCCGAC GAGGTGATCG TGCCGCGGCG CGAGTCGATC GGCGAGGCGA GCTTTGCCAT GCCGCTGATC GTGCGCGCGA CCGAAGAGGC GACGCTGCAT TTCCTGCGCC GCCACGGCAC CGAGGGGGCC GAGGCCGAGG GGCGGATCGC GGCCGCCGTG GCCTCGATGT TCCGGCTGGC GCGGGGCGAT CTCGACCGGT CCGAGGAGAT CCTGCGCCGG CATCTCGACC GGACCCCGAC CGCGCAGGGC TATGCCTGGC TGGCCTTCCT CAACACCTTC CGTGTGGGCC AGCGCTTCAA TCCCGCCGAC GCGCCGCTGA TCGAGGAGAC GCAGCATCTC GCACGCCGCG CGCTGGGGCT CGAGCCCAAC AATGCGCTGG TCTCGGCGCT GGTGGGCCAC ATCCACTCCT ACCTCTTCGG CGAGTTCGAC TATGCCGCGG CGCTCTTCGA GCAGTCCCTG CGGGTCAATC CGGCGCAGAC GCTGGCCTGG GATCTCTATG CCATGCTCCA TGCCTATGCC GGCCAGCCGA AGCGGGCGCT CGCCATGGCG CGCTGGGCGC GTCATCTGGG CGCCTTCAGC CCGCATCGCT ACTATTTCGA GACCACCCGC GCGATCACCG GAAACTTCTC GGGCGATCAT CAGACGGCGA TCGACGCCGG ACAGTCGGCG CTGGCCGAGC GGCCGGACTT CAACTCGCTG CTGCGGGTGC TGGTCTCGTC GAACGCCCAT CTCGACCGGC CCGAGGAAGC GCGGCTGTTC CTCGAGCGGC TGTTGCAGGT CGAGCCGAAC TTCTCGGTCG CCTCGCTGCG GGACGCGGGC TATCCGGGCC TCGATACCGA GGGCGGGCGG CATTTTCTGG ATGGTCTCGT GAAGGCGGGC GTGCGCAAGC ACTGA
|
Protein sequence | MLCGTASAVG AARVQPFIIV NALPGVAARS LWRERAVAGA ASEEPAAGRM TGASPDLRMY LFGPFTLLGP DGEELTPKSR KSRAILAMLA VAPRGSRSRV WLRDKLWSDR GEDQASASLR QALLDIRKSL GPRAAHVLTA DKNTVSLDLA AVAVDALELA ARERRGEDGS TEHFLEGIDV RDPEFEDWLA LERQSWFARL EEAGFEADLA PRPAPSEARR EASQAVPRTS PPAAPQPPQS LLRQEDEGGW RVAMMPPVIL GGDPAAAVLQ ADVQRALRRA LVETGDLRLV DMAPVALGDL GAGLAGGGLL ARLPEQVHLS VQVRVLADHS YLRVGIVLQN PADNALVWSD EVIVPRRESI GEASFAMPLI VRATEEATLH FLRRHGTEGA EAEGRIAAAV ASMFRLARGD LDRSEEILRR HLDRTPTAQG YAWLAFLNTF RVGQRFNPAD APLIEETQHL ARRALGLEPN NALVSALVGH IHSYLFGEFD YAAALFEQSL RVNPAQTLAW DLYAMLHAYA GQPKRALAMA RWARHLGAFS PHRYYFETTR AITGNFSGDH QTAIDAGQSA LAERPDFNSL LRVLVSSNAH LDRPEEARLF LERLLQVEPN FSVASLRDAG YPGLDTEGGR HFLDGLVKAG VRKH
|
| |