Gene Rsph17025_2731 details


Gene Information

Locus tag: Rsph17025_2731
Symbol:
ID: 5085234
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 2768843
End bp: 2770777
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 70%
IMG OID: 640484294
Product: SARP family transcriptional regulator
Protein accession: YP_001168923
Protein GI: 146278764
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
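
The coordinate and length fields above are mutually consistent, which can be checked with a minimal Python sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2768843
end_bp = 2770777

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1935 644
```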


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.92
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCCT GTGCCATCGC GGCCGCCGCT CGTGGGTCCG CCGCGCCGGT CATTCTCGTT 
ACTGCTTTAC CGAAGCTCGT CCCTCGGTCA CTCTGTCGTG ATCGGACGGA TGCCGGGGCT
CCGTCGTGCG ACGTGGTGGT TGACCGGATG ACGGGCGTGC CTCCTGCCTT GCGGATGTAT
CTTTTCGGGC CTTTCACCCT GCTGGGGCCC GACGGCGAGG AACTGACCCC GAAATCCCGC
AAGTCACGAG CGATCCTTGC CATGCTGGCG GTCGCGCCCC GCGGCTCGCG CTCGCGCGTG
TGGCTGCGGG ACAAGCTCTG GAGCGATCGG GGCGAGGATC AGGCGTCCGC GAGCCTTCGG
CAGGCGCTGC TCGATATCCG CAAGTCGCTG GGGCCGCGCG CGGCCCATGT GCTGACGGCC
GACAAGAACA CCGTCTCGCT CGATCTGGCG GCGGTCTCGG TCGATGCGCT CGAGTTCGCC
TCGCGCGAGC GGCGGGGAGA GGAGGCGGGG ACGACCGAAC ATTTCCTCGA GGGGATCGAC
GTGCGCGATC CCGAGTTCGA GGACTGGCTG TCGCTCGAAC GGCAGAGCTG GTTTGCCCGG
CTCGAGGAGG CGGGGTTCGA GGCCGACCTG GCCCCGCGGC CCGCGCCGTC CGAGGCGCGG
CGCGAGGCTT CGCAGGCGGT GCCGCGCACC TCGCCCCCGG CCATGCCGCA GGGGCCGTCA
CAGGGGATTG CCCGGCAGGA GGATGAGGCG GGATGGCGCG TGGCGATGAT GCCTCCCGTC
ATCCTCGGCG GTGATCCCGC GGCCGCGGTG CTGCAGGCCG ACGTGCAGCG TGCGCTGCGC
CGCGCGCTGA TGGAGACGGG CGACCTGCGC CTCGTGGACA TGGGGCCGCT GGGGTTCGGC
GACACGGGAT TGGCCGGAGG CGCGCTCGCG GCACGGCTGC CCGAGCTGAT CCACCTGAGC
GTTCAGGTCC GTGTCCTGTC GGACGTCAGC TATTTTCGTC TCGGGATCGT GCTGCAGAAC
CCGGCGGACA ACGGGCTGGT CTGGGCAGAC GAGATGGTGG TGCCCCGCCG GGAAGCGGCC
ACCGAGGCAA GTTTCGCCAT GCCGCTGATC GTGCGGGCCA CGGAGGAGGC CATGCTGTAT
TTCCTCCGCC GGCACGGAGG AGAGGCGGCC GAGGCCGACG GCCGCATCGC CGCTGCGGTG
GCCTCGATGT TCCGGCTGGC GCGCGGCGAT CTGGACCGTT CGGAGCAGAT CCTGCGCCGG
CATCTCGACC GGACGCCTAC GGCGCAGGGC TATGCCTGGC TGGCCTTCCT GAACACCTTC
CGCGTGGGTC AGAGGTTCAA CCCCGCCGAT GCGCCGCTGA TCGAGGAGAC GCAGCATCTC
GCGCGCCGGG CGCTCGCGCT CGAGCCCGGC AACGCGCTGG TGTCGGCGCT GGTCGGACAT
ATCCACTCCT ACCTGTTCGG CGAGTTCGAC TATGCGGCGG CATTGTTCGA GCAGTCCTTG
CGGGTCAATC CGGCGCAGAC GCTGGCGTGG GATCTCTACG CCATGCTGCA CGCCTATGCG
GGCCAGCCGA AGCGGGCGCT GGCCATGGCG CGCTGGGCGC GGCATCTGGG GGCCTTCAGC
CCGCACCGCT ACTATTTCGA GACGACGCGG GCGATCACCG GCAATTTCTC GGGCGATCAC
CGGACGGCCA TCGACGCCGG CCAGTCGGCG CTGGCCGAAC GGCCGGACTT CAACTCGCTG
CTGCGGGTGC TTGTCTCGTC GAACGCCCAT CTCGACCGGC CCGACGAGGC GCGGCTGTTC
CTCGAGCGCC TGTTGCAGGT CGAGCCGAAC TTCTCGATCG CCTCGTTGCG CGAGGGGGGC
TACCCCGGCC TCGACACCGA GGGGGGGCGG CATTTTCTGG ACGGGCTGAT GAAGGCCGGC
GTGCGCAGGC ACTGA
 
Protein sequence
MLSCAIAAAA RGSAAPVILV TALPKLVPRS LCRDRTDAGA PSCDVVVDRM TGVPPALRMY 
LFGPFTLLGP DGEELTPKSR KSRAILAMLA VAPRGSRSRV WLRDKLWSDR GEDQASASLR
QALLDIRKSL GPRAAHVLTA DKNTVSLDLA AVSVDALEFA SRERRGEEAG TTEHFLEGID
VRDPEFEDWL SLERQSWFAR LEEAGFEADL APRPAPSEAR REASQAVPRT SPPAMPQGPS
QGIARQEDEA GWRVAMMPPV ILGGDPAAAV LQADVQRALR RALMETGDLR LVDMGPLGFG
DTGLAGGALA ARLPELIHLS VQVRVLSDVS YFRLGIVLQN PADNGLVWAD EMVVPRREAA
TEASFAMPLI VRATEEAMLY FLRRHGGEAA EADGRIAAAV ASMFRLARGD LDRSEQILRR
HLDRTPTAQG YAWLAFLNTF RVGQRFNPAD APLIEETQHL ARRALALEPG NALVSALVGH
IHSYLFGEFD YAAALFEQSL RVNPAQTLAW DLYAMLHAYA GQPKRALAMA RWARHLGAFS
PHRYYFETTR AITGNFSGDH RTAIDAGQSA LAERPDFNSL LRVLVSSNAH LDRPDEARLF
LERLLQVEPN FSIASLREGG YPGLDTEGGR HFLDGLMKAG VRRH
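
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal Python sketch for working with them: it strips the spacing, computes GC content, and translates a coding sequence. The function names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments; table 11 differs in the set of permitted start codons, which this sketch does not handle specially.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as displayed above.
first_row = "ATGCTGTCCT GTGCCATCGC GGCCGCCGCT CGTGGGTCCG CCGCGCCGGT CATTCTCGTT"
print(translate(first_row))  # prints: MLSCAIAAAARGSAAPVILV
```

The translated first row matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would allow a comparison with the reported GC content field.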