Gene RPB_0135 details


Gene Information

Locus tag: RPB_0135
Symbol:
ID: 3908106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 146884
End bp: 148467
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 67%
IMG OID: 637882017
Product: GntR family transcriptional regulator
Protein accession: YP_483758
Protein GI: 86747262
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
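
The length fields above are mutually consistent, and the arithmetic can be checked directly. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state this explicitly); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 146884
end_bp = 148467

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the final codon is a stop codon
# and does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1584 527
```

Under that assumption, the computed values match the listed Gene Length (1584 bp) and Protein Length (527 aa).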


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.868038
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGAAAGC TCGCACAGCT CGCTTATCAG GAAGCCGGTC GCAACGGTTC GCTCGATCGT 
TCGGCAAATG CCTCCGCGAA TGCCGCATCG TTAGGCAACA GCGATGCGCT GTTCTGGGGC
GCCTTATTCC GCGGAATGGA TCGCGGCGGG TCGTTTCTGC AACTGCAGAT CCGCCAGATC
ATCGTCACCG CGATCGAAGA CGGCCGGCTG CCGCTCGGGA TGCGGATGCC GTCGAGCCGC
GATCTCGCCG CGGTGCTCAA GGTGTCGCGC AACACCGTTG TGATCGCCTA CGAGCAGCTG
GTCGACCAGA ACTTCCTGGT GTCGCGGCAG CGCAGCGGCT ACTTTGTCGC CGGCCTGTCG
AAGAAGGTCG CCTCCGGCGC CGGCAAGCCT GCCGAGGCGG CGAGCCGCGA CGATGCGCAT
TGGGCCGCGC GCTACGCGGT GCAGCCGTCG GCGCATCGCA ACATCGTCAA GCCCGCCGAC
TGGCAGGCGC AGCCCTACCC TTTCATCTTC GGGCAGTTCG ACCCCAGCCT GTTTCCGACC
AACGACTGGC GCGAATCCGC GCGGGCCGCG CTGAGCGTCC CCGAGATCAA CAACTGGGCG
CGTGACCTGA TCGACGGCGA CGACCCGGCG CTGATCGAGC AGTTGCAGCT TCAGGTGCTG
CCGCGCCGCG GCATTCATGC GCGCTCCGAC GAAATCATGA TGACGATCGG CGCGCAGCAC
GCGCTGTATC TGATCGCGAC GCTGTTCATC AACGACCGCA CCCGCGTCGG CATCGAGGAG
CCCGGCTATC CCGACGCGCG AAACATCTTC CGGATGCTGA CGCCCGACGT CGTCCCGCTC
GCCTCCGACG CGCAGGGCCT GTTGCCGGAC GAGCGCTTGC AGAGTTGCGC GCTGGCCTAT
GTCACCGCCA GCCATCAATG CCCGACCGCG ACGGTGATGC CACTGCAGCG GCGACTGGAG
TTGCTCAAGG CCGCCGAGAC CGGCGACGTC GTGCTGGTGG AGGACGACTA CGAGGGCGAA
CTGATGCCGG AGGCGGCGAC GCTGCCGCCG CTGAAGAGCC TCGATCACGC CAGCAACGTG
CTCTATGTCG GCAGCCTGTC GAAGGCGCTG GCACCCGGGC TGCGGCTCGG CTACGTGGTG
GCGCCCGCGC CGGTGATCCG CGAGCTGCGG GCGCTGCGGC GGCTGATGCT GCGCCACCCC
CCGCTCAACA ACCAGCGCGT CGCCGCATTG TTCATCGGGC TCGGGCACTA TCGCTCGCAT
CTGGCCCAGG TCGGCCGCGT GCTGCTGGAG CGCGCCAAGA TGCTCGACCG CCTGCTGCCG
AAACATCTGC CGGACTGCAG CTTCTCGCGC GGACCGGGCA GCACCAACTA CTGGATCGCC
TGCCCGCCCG GTACCGACAC CACGGCACTG GCGCGGGAAG CGCTGGCGCA GGGCGTGGTA
ATCGAGCCGG GCGCGGTGTT CTCGATGGAC GAGACCGCCA GCCGGCATTG CTTCCGGCTC
GGCTTCTCCT CGATCCGCAC CGACCGGATC GAAACAGGCA TCCAGCGCCT CGGCGAAGTG
ATTGCGGAGT ATTTGAAGAA GTAG
 
Protein sequence
MGKLAQLAYQ EAGRNGSLDR SANASANAAS LGNSDALFWG ALFRGMDRGG SFLQLQIRQI 
IVTAIEDGRL PLGMRMPSSR DLAAVLKVSR NTVVIAYEQL VDQNFLVSRQ RSGYFVAGLS
KKVASGAGKP AEAASRDDAH WAARYAVQPS AHRNIVKPAD WQAQPYPFIF GQFDPSLFPT
NDWRESARAA LSVPEINNWA RDLIDGDDPA LIEQLQLQVL PRRGIHARSD EIMMTIGAQH
ALYLIATLFI NDRTRVGIEE PGYPDARNIF RMLTPDVVPL ASDAQGLLPD ERLQSCALAY
VTASHQCPTA TVMPLQRRLE LLKAAETGDV VLVEDDYEGE LMPEAATLPP LKSLDHASNV
LYVGSLSKAL APGLRLGYVV APAPVIRELR ALRRLMLRHP PLNNQRVAAL FIGLGHYRSH
LAQVGRVLLE RAKMLDRLLP KHLPDCSFSR GPGSTNYWIA CPPGTDTTAL AREALAQGVV
IEPGAVFSMD ETASRHCFRL GFSSIRTDRI ETGIQRLGEV IAEYLKK
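
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG is an accepted initiation codon and the initiating codon is read as methionine. The sketch below illustrates this on the first and last blocks of the gene sequence. It is a hand-rolled illustration, not the tool that produced this record; the `translate` function and its `initiator` parameter are hypothetical names introduced here.

```python
from itertools import product

# Table 11 uses the standard codon-to-amino-acid assignments,
# listed here in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate a DNA fragment in frame 0, stopping at the first stop codon.

    If `initiator` is True, the first codon is treated as the initiation
    codon and emitted as M (assumption: the fragment starts at the CDS start).
    """
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if (i == 0 and initiator) else aa)
    return "".join(protein)


# First 30 bases of the gene sequence (starts with the TTG initiation codon).
head = translate("TTGGGAAAGC TCGCACAGCT CGCTTATCAG")
# Last 24 bases of the gene sequence (ends with the TAG stop codon).
tail = translate("ATTGCGGAGT ATTTGAAGAA GTAG", initiator=False)

print(head)  # MGKLAQLAYQ
print(tail)  # IAEYLKK
```

Both fragments agree with the corresponding ends of the listed protein sequence.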