Gene Rsph17029_3159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3159 
Symbol 
ID4898721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp183130 
End bp184134 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content68% 
IMG OID640113761 
ProductAraC family transcriptional regulator 
Protein accessionYP_001045031 
Protein GI126463918 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0609208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACCCG CAGCCCCACG GCGTCATGAG CTTTCCATTG CACTGATATT GCAGGACAAG 
TTCACGATTG CCGCTTTCTC CGGTTTCATC GATGCGCTGC GGCTGGCGGC CGACGATGCT
GCAAAAAGCC GACAGATCCG CGTCGCCTGG AAGGTCTTCG CCCAGCACCG CAGCCCGGTC
ATGGCCAGCT GCGGGCTGCG CGTGGCCACC GAGGACGGGC TGCCCGTTCC CGAGGACTAT
GACTATATCG CCATCTGCGG CGGCAATTCC TTTGCCGACG CCGCGCCCGC GCCGCAGCTG
GCCCAGCTCA TCCAGCGCGC CCACCGCGCG CGGGTGGGGC TCCTCGGCAT CTGTACCGGC
AGTTTCGCCA TCGCCCACGC AGGGCTGATC GGCGATCGGC GCTTCTGCAT CCACTGGAAC
GTGGCCGAGC CCTTCAAGGC GCTCTTCCCG CGCGCCCATA TCTCGGTGGA TCGGATCTTC
ATCGACGAGG GCGACGTCAT CACCTGCGCG GGCTCGACCG CGGCCATCGA CCTCGCGCTC
TATCTCGTCA TGCGCCACTG CGGACAGGAT CGGGCGCAGC AGGTGATGCG GCACATGATG
CTCTCGCAGA TGCGCCCCGC CACCATGCCG CAGGCCCATT TCTACCAGCT CCCTCCGGGC
GACAGCCACC CGCGCCTGCG CCGCGCGCTG CATTTCATGG AGCAGCAGCT CGACCGTCCG
CCCTCGGTCG GGGCCATCGC GCGCTATTGC GGCGTCTCGG TCCGCCAGCT CGAGCGGATC
TTCCGTCAGG CGCTGGGGCA GACGCCGAAC GCGGCCTTCC GCCAGATGCG GCTGAATTAC
GGGCGCTACC TGCTTTCGGC AGGCACGCTG CCCGTCACCG AGATCGCCCA TATCGCGGGC
TTCTCCGATG CGGCCCATTT CTCGCGCGAG TTCCGCCGTG CCTTTCACGA GACGCCGAGC
GCCCACCGCC GCGCGCGCAG CCACGGTCCC GACGCGGGCT CATGA
 
Protein sequence
MKPAAPRRHE LSIALILQDK FTIAAFSGFI DALRLAADDA AKSRQIRVAW KVFAQHRSPV 
MASCGLRVAT EDGLPVPEDY DYIAICGGNS FADAAPAPQL AQLIQRAHRA RVGLLGICTG
SFAIAHAGLI GDRRFCIHWN VAEPFKALFP RAHISVDRIF IDEGDVITCA GSTAAIDLAL
YLVMRHCGQD RAQQVMRHMM LSQMRPATMP QAHFYQLPPG DSHPRLRRAL HFMEQQLDRP
PSVGAIARYC GVSVRQLERI FRQALGQTPN AAFRQMRLNY GRYLLSAGTL PVTEIAHIAG
FSDAAHFSRE FRRAFHETPS AHRRARSHGP DAGS