Gene RSP_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_2026 
Symbol 
ID3719360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007493 
Strand
Start bp624857 
End bp625834 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content72% 
IMG OID640070190 
ProductAraC family transcriptional regulator 
Protein accessionYP_352078 
Protein GI77462574 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGACG GAAACGCCAG TTTCCGGACG CAGCGTTTCA CGGGCGGCGA TGCGGCGCAG 
CCCCTGCCGT CGATCATGTT CGCCGAACGG CGCAGGCTGG CCATCCTCGG CGAAAACGGG
TTCGTCGAGA CCTGCGTGCG GACCATCGAG GGCAGCGACA TCGTCTTCGG CAGTGTCCGG
TCGTCCGGGC ATGTGATCGA GCTTCGCGAG CCGGATCGGT TGACCCTCCT TCTGCCGAGG
GCGGGGCGCC TGCGGGTGCG GATCGGGTCT GCCGAGCATG GCGTGACGCC GGGCTGCCCC
ATGGCCTTCC GGCCGGGCGA GCGGGTGACC GACGCCACCG CCGGCCGCGA CGGGCTCTTC
GCCGCGATCA CGCTGCAGGT GCCCGCCGCG CGGGTCCGGG CGCTGGCCGA GGCGGCCGAG
CTACCGCTGC GGGGTCTGCT CGGCCCGGAT GCCGTGGCCC TGCGCGCCCG GCTCGAGGCT
TCGGCGCTGG AGGGCATGGC CCGGCTGGCC TGCGACCTCT TCCTGCGGCC GAAGACCGCC
CTTCCGCCCG GCGTCGCTCT GGCGATCACC GACTTCGTGG ATGCGCAGCT GCTGGCCCTG
ATGGACGGCC GGCCTGCTCC GGCCCGATGC CGCGTCCTGT CGGCCTTCCA CCGCGTGCGG
GCGGCCGAAG AGATCATGCA TGCCCACAGC GAAGAGCCGC TCGCCATGCT CGATCTCGCA
CGACGTCTGG ATATCGGCCT GCGCAGCCTG CAGCTGGCCT TCCGCGAGGT GCATGACGGC
CTCTCGCCGC GCGAGGTCTA CAGCCGGATC CGGCTGGACC GCGCGCGGCA GCGGCTGCTG
GCGGCTTCGG GGGCCGATCG GGTGACGACC ATCGCGCTCG ACAGCGGCTT CGGTCATCTC
GGGCGGTTCG CCATGGCCTA TGCGCGCACC TTCGGCGAGC TGCCGAGCGA GACGCTTGCC
CGCCGCCGCA GGATTTGA
 
Protein sequence
MPDGNASFRT QRFTGGDAAQ PLPSIMFAER RRLAILGENG FVETCVRTIE GSDIVFGSVR 
SSGHVIELRE PDRLTLLLPR AGRLRVRIGS AEHGVTPGCP MAFRPGERVT DATAGRDGLF
AAITLQVPAA RVRALAEAAE LPLRGLLGPD AVALRARLEA SALEGMARLA CDLFLRPKTA
LPPGVALAIT DFVDAQLLAL MDGRPAPARC RVLSAFHRVR AAEEIMHAHS EEPLAMLDLA
RRLDIGLRSL QLAFREVHDG LSPREVYSRI RLDRARQRLL AASGADRVTT IALDSGFGHL
GRFAMAYART FGELPSETLA RRRRI