Gene Rsph17029_0737 details


Gene Information

Locus tag: Rsph17029_0737
Symbol:
ID: 4895850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 746181
End bp: 747158
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 72%
IMG OID: 640111321
Product: helix-turn-helix domain-containing protein
Protein accession: YP_001042622
Protein GI: 126461508
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
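
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(746181, 747158)
print(length)                  # 978
print(protein_length(length))  # 325
```

Both values agree with the Gene Length and Protein Length fields listed in the record.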


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.396346
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGACG GGAACGCCAG TTTCCGGACG CAGTGTTTCA AGGGCGGCGA TGCGGCGCAG 
CCCCTGCCGT CGATCATGTT CGCCGAACGG CGCAGGCTGG CCATCCTCGG CGAAAACGGG
TTCGTCGAGA CCTGCGTGCG GACCATCGAG GGCAGCGACA TCGTCTTCGG CAGTGTCCGG
TCGTCCGGGC ATGTGATCGA GCTTCGCGAG CCGGATCGGC TGACCCTCCT TCTGCCGCGG
GCGGGGCGCC TGCGGGTGCG GATCGGGCCT GCCGAGCATG GCGTGACGCC GGGCTGCCCC
ATGGCCTTCC GGCCGGGCGA GCGGGTGACC GACGCCACCG CCGGCCGCGA CGGGCTCTTC
GCCGCGATCA CGCTGCAGGT GCCCGCCGCG CGGGTCCGGG CGCTGGCCGA GGCGGCCGAG
CTACCGCTGC AGGATCTGCT CGGCCCGGAT GCCGTGGCCC TGCGCGCCCG GCTCGAGGCT
TCGGCGCTGG AGGGCATGGC CCGGCTGGCC TGCGACCTCT TCCTGCGGCC GAAGACCGCC
CTTCCGCCCG GCGTCGCTCT GGCGATCACC GACTTCGTGG ATGCGCAGCT GCTGGCCCTG
ATGGACGGCC GGCCTGCTCC GGCCCGGTGC CGCGTCCTGT CGGCCTTCCA CCGCGTGCGC
GCGGCCGAAG AGATCATGCA TGCCCACAGC GAAGAGCCGC TCTCCATGCT CGATCTCGCA
CGACGTCTGG ATATCGGCCT GCGCAGCCTG CAGCTGGCCT TCCGCGAGGT GCATGACGGC
CTCTCGCCGC GCGAGGTCTA CAGCCGGATC CGGCTGGACC GCGCGCGGCA GCGGCTGCTG
GCGGCTTCGG GGGCCGATCG GGTGACGACC ATCGCGCTCG ACAGCGGCTT CGGTCATCTC
GGGCGGTTCG CCATGGCCTA TGCGCGCACC TTCGGCGAGT TGCCGAGTGA GACGCTTGCC
CGCCGCCGCA GGATTTGA
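
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any computation. A minimal sketch of doing that and computing a GC percentage follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example feeds in only the first printed line, so its GC value is not the whole-gene figure listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGCCAGACG GGAACGCCAG TTTCCGGACG CAGTGTTTCA AGGGCGGCGA TGCGGCGCAG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this line only
```

Passing all of the sequence lines through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.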
 
Protein sequence
MPDGNASFRT QCFKGGDAAQ PLPSIMFAER RRLAILGENG FVETCVRTIE GSDIVFGSVR 
SSGHVIELRE PDRLTLLLPR AGRLRVRIGP AEHGVTPGCP MAFRPGERVT DATAGRDGLF
AAITLQVPAA RVRALAEAAE LPLQDLLGPD AVALRARLEA SALEGMARLA CDLFLRPKTA
LPPGVALAIT DFVDAQLLAL MDGRPAPARC RVLSAFHRVR AAEEIMHAHS EEPLSMLDLA
RRLDIGLRSL QLAFREVHDG LSPREVYSRI RLDRARQRLL AASGADRVTT IALDSGFGHL
GRFAMAYART FGELPSETLA RRRRI
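
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string; translation table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons, which this sketch does not model. The `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino acids used by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, in the record's block format.
print(translate("ATGCCAGACG GGAACGCCAG TTTCCGGACG"))  # MPDGNASFRT
```

The output matches the first ten residues of the listed protein sequence. Translating the final printed line of the gene, `CGCCGCCGCA GGATTTGA`, gives `RRRRI`, which matches the end of the protein.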