Gene RPB_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2046 
Symbol 
ID3909861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2324746 
End bp2325726 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content59% 
IMG OID637883939 
ProductAraC family transcriptional regulator 
Protein accessionYP_485664 
Protein GI86749168 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGA GGAATCACTA TTCGACGGCC GCGCTAACCG GGTCTGAAAG CATTGCAGCG 
TGGCGGCAGG CCATGGCTGA AGTGTATTAT CGCCTCGATA TTCAGGCCCG CCACGATGAC
CGCGTGGTCG GCGAACTGAT CGACGTTCAA CTCGGCTCGC TGGGACTGTC CAACTTCAAG
GCGGATGCGC AACGGGTGAT CAGGCGCAAG GAGTCCGCGA AGATCGACGG GTCCGAAGAC
TTCGTCTTTC TGTTTCCGAT CCGAAAGGGC TTGCAATACG AGCAGCGGGG ACGCTCGGGG
CTAGCCATGC CCGGAACCGT CTTTCTCCTC AACTCTGCCG AAAATTACGT CATCGACGTT
CCGGACGGGT CCGAAAACAT CACGATCAAG GTCGACCGCC GCCTTCTGAT CGATCGGGTC
AAAGGGATCG ACGGCCTGTG CGCCTCGATG AATATCGCCA ACTCACAACT CGTTCCGGTT
GTGACAACGC TCGGCGCGCA ATTGCTCAAT CTTCCGCCGG GAGAGCACGC CGACCGACTC
CAGCAGTCGG TGATCGACCT GATCTGCCTG ATGCTGGACT TGCGGGAATC GGCACAAGAC
AAGACCTTCA TCAGGCAGAC GCTGGCCTCG TCGCTGTACC ATCGGATCGA TGCTTACCTG
CAGCGCAACC TGCACGATTG CGACCTGTCG CCCGATCACG CCGCGCGAGA GCACAAGATC
TCGGTCCGCT ACCTGCACAA GGTGTTTCAC TTTCACGGCA CCTCGTTCGG CCAGCGCCTC
CTCGAACTCA GATTGCAGCG TGCGCATTAC GTCATTTCAA GACATGGCGC CACCACCACC
ATCAATCTTG GTCAGGTGGC CTATGAATGC GGGTTCACGA GCCAGTCCTA TTTCTCGACC
TGCTATCGGA AACGCTTTGG CTTGACGCCA CGCCAGACCG GGAAGTCCGA TCGACAATCC
GCCGCCGACG CCGGAAGCTG A
 
Protein sequence
MNQRNHYSTA ALTGSESIAA WRQAMAEVYY RLDIQARHDD RVVGELIDVQ LGSLGLSNFK 
ADAQRVIRRK ESAKIDGSED FVFLFPIRKG LQYEQRGRSG LAMPGTVFLL NSAENYVIDV
PDGSENITIK VDRRLLIDRV KGIDGLCASM NIANSQLVPV VTTLGAQLLN LPPGEHADRL
QQSVIDLICL MLDLRESAQD KTFIRQTLAS SLYHRIDAYL QRNLHDCDLS PDHAAREHKI
SVRYLHKVFH FHGTSFGQRL LELRLQRAHY VISRHGATTT INLGQVAYEC GFTSQSYFST
CYRKRFGLTP RQTGKSDRQS AADAGS