Gene Rsph17029_2958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2958 
Symbol 
ID4895699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp3114820 
End bp3116754 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content70% 
IMG OID640113561 
ProductSARP family transcriptional regulator 
Protein accessionYP_001044832 
Protein GI126463718 
COG category[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.30707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.853466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTGCG GGACAGCATC CGCGGTCGGC GCCGCAAGGG TGCAGCCGTT CATCATCGTA 
AATGCTTTAC CCGGCGTCGC GGCTCGGTCA CTGTGGCGTG AGCGGGCGGT TGCCGGGGCA
GCGTCCGAGG AACCAGCGGC AGACCGAATG ACGGGAGCGT CTCCTGACCT GCGGATGTAT
CTGTTCGGGC CCTTCACCCT GCTGGGACCC GACGGAGAGG AACTGACTCC GAAATCCCGC
AAGTCGCGCG CCATCCTCGC CATGCTGGCG GTGGCGCCTC GGGGCTCGCG CTCGCGCGTC
TGGCTGCGCG ACAAGCTCTG GAGCGACCGA GGTGAGGATC AGGCCTCGGC GAGCCTCCGG
CAGGCGCTTC TGGACATCCG CAAGTCGCTC GGGCCCCGCG CGGCGCATGT GCTGACGGCG
GACAAGAACA CGGTCTCGCT GGATCTGGCG GCGGTGGCGG TGGATGCGCT GGAACTCGCA
GCCCGCGAGC GGCGGGGCGA GGAGGGCAGC ACGGAGCATT TCCTCGAGGG GATCGACGTG
CGCGATCCCG AATTCGAGGA CTGGCTGGCG CTGGAGCGCC AGAGCTGGTT CGCGCGCCTG
GAGGAAGCCG GTTTCGAAGC CGATCTCGCG CCGCGCCCGG CGCCCTCGGA GGCGCGACGC
GAGGCGTCGC AGGCGGTGCC GCGCACCTCG CCGCCGGCTG CGCCGCAGCC GCCGCAATCG
CTGCTGCGGC AGGAGGACGA GGGCGGCTGG CGCGTGGCGA TGATGCCGCC CGTGATCCTC
GGCGGCGATC CGGCGGCGGC GGTGCTGCAG GCCGATGTGC AGCGTGCGCT GCGCCGCGCG
CTTGTCGAGA CGGGCGATCT GCGGCTGGTC GACATGGCGC CCGTGGCGCT GGGCGATCTG
GGGGCCGGCC TTGCGGGGAG CGGGCTGCTC GCGCGTCTGC CCGAACAGGT GCATCTGAGC
GTGCAGGTCC GGGTTCTGGC GGATCACAGC TACCTCCGCG TCGGGATCGT GCTGCAGAAC
CCGGCCGACA ATGCGCTGGT CTGGTCGGAC GAGGTGATCG TGCCGCGGCG CGAGTCGATC
GGCGAGGCGA GCTTTGCCAT GCCGCTGATC GTGCGCGCGA CCGAAGAGGC GACGCTGCAT
TTCCTGCGCC GCCACGGCAC CGAGGGGGCC GAGGCCGAGG GGCGGATCGC GGCCGCCGTG
GCCTCGATGT TCCGGCTGGC GCGGGGCGAT CTCGACCGGT CCGAGGAGAT CCTGCGCCGG
CATCTCGACC GGACCCCGAC CGCGCAGGGC TATGCCTGGC TGGCCTTCCT CAACACCTTC
CGTGTGGGCC AGCGCTTCAA TCCCGCCGAC GCGCCGCTGA TCGAGGAGAC GCAGCATCTC
GCACGCCGCG CGCTGGGGCT CGAGCCCAAC AATGCGCTGG TCTCGGCGCT GGTGGGCCAC
ATCCACTCCT ACCTCTTCGG CGAGTTCGAC TATGCCGCAG CGCTCTTCGA GCAGTCCCTG
CGGGTCAATC CGGCGCAGAC GCTGGCCTGG GATCTCTATG CCATGCTCCA TGCCTATGCC
GGCCAGCCGA AGCGGGCGCT CGCCATGGCG CGCTGGGCGC GTCATCTGGG CGCCTTCAGC
CCGCATCGCT ACTATTTCGA GACCACCCGC GCGATCACCG GAAACTTCTC GGGCGATCAT
CAGACGGCGA TCGACGCCGG ACAGTCGGCG CTGGCCGAGC GGCCGGACTT CAACTCGCTC
CTGCGGGTGC TGGTCTCGTC GAACGCCCAT CTCGACCGGC CCGAGGAAGC GCGGCTGTTC
CTCGAGCGGC TGTTGCAGGT CGAGCCGAAC TTCTCGGTCG CCTCGCTGCG GGACGCGGGC
TATCCGGGCC TCGATACCGA GGGCGGGCGG CATTTTCTGG ATGGTCTCGT GAAGGCGGGC
GTGCGCAAGC ACTGA
 
Protein sequence
MLCGTASAVG AARVQPFIIV NALPGVAARS LWRERAVAGA ASEEPAADRM TGASPDLRMY 
LFGPFTLLGP DGEELTPKSR KSRAILAMLA VAPRGSRSRV WLRDKLWSDR GEDQASASLR
QALLDIRKSL GPRAAHVLTA DKNTVSLDLA AVAVDALELA ARERRGEEGS TEHFLEGIDV
RDPEFEDWLA LERQSWFARL EEAGFEADLA PRPAPSEARR EASQAVPRTS PPAAPQPPQS
LLRQEDEGGW RVAMMPPVIL GGDPAAAVLQ ADVQRALRRA LVETGDLRLV DMAPVALGDL
GAGLAGSGLL ARLPEQVHLS VQVRVLADHS YLRVGIVLQN PADNALVWSD EVIVPRRESI
GEASFAMPLI VRATEEATLH FLRRHGTEGA EAEGRIAAAV ASMFRLARGD LDRSEEILRR
HLDRTPTAQG YAWLAFLNTF RVGQRFNPAD APLIEETQHL ARRALGLEPN NALVSALVGH
IHSYLFGEFD YAAALFEQSL RVNPAQTLAW DLYAMLHAYA GQPKRALAMA RWARHLGAFS
PHRYYFETTR AITGNFSGDH QTAIDAGQSA LAERPDFNSL LRVLVSSNAH LDRPEEARLF
LERLLQVEPN FSVASLRDAG YPGLDTEGGR HFLDGLVKAG VRKH