Gene Rpal_4759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4759 
Symbol 
ID6412445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5125241 
End bp5126239 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content64% 
IMG OID642714638 
Producttranscriptional regulator, AraC family 
Protein accessionYP_001993725 
Protein GI192293120 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTG ATCTACTCGG GGAGCCGCTC GATCGGTTTC CCATGGTGCG CGGTTCCAAT 
CCGACCGATT TCGAATCCGC ACTCAAGTCT GTGTTCGGCG ATGCGTCGGT CGAGGTCTCG
GATCGAGACG GCTTCAGGGC GCGGCTCAAT TTCGTCCGGC TGAGCGATAT CGAGATGGCC
TATAGCTGGG CGACCGTGCC GTCGCGGCTG CGGCTGCCGC CCGACGATTT CGTCGGCCTG
CAGCTCGCGC TGAGCGGCAG CGCCATCACC ATGGTCGGAA ATCGGCGGGT CGCCACCAAT
GCCCGGCAGT CGTGCATTTG TCCGCCCGGT CAGGGGCGCG ACTATCAGTT CGATGCCGAG
TTCGAACAGC TGTTTCTCGG CGTCCGGCTC AGTGCGCTGG AACGGACGCT CGCCGGGCTG
CTTGGCGGCA AGCCGAATGC GCCGCTCGAA TTCGAGCCGG TGGCCGACAA CGACTATCCG
CACTCGGAAA ATCTGCGCCA GCTGACGCTG TTCTTCGGCG GCACGCTGAA CGCCACCAAG
GTGTCGTTGC CGTCGCAATA TCTGGCCGAG CTCGAGCAGG CGACCGCGGT CGCCTTCCTG
CACGCCTGTA AACACAATTT CAGCAGCTAT CTCGGCGTCG CTGAGAAGGA CGCCGCGTCG
CGCCACGTCA AATTGGTCGA GGAGTACATC GAGGCCAATT GGAACGAGTC GCTCACGATC
GAGAAGCTGG TGGAGCTGAC CGGCATGAGC GCCCGCACCG TGTTCAAGGC GTTTCAGCGC
ACCCGCGGTT ATTCGCCGAT GGCCTTTGCC AAGCGGGTCC GGATGGAGCG GGTGCGGCAG
CTGCTGCTGG AGGCCGGCGG CGACGCCTCG GTCGGCGCCA TCGCGGTGCA ATGCGGCTTT
CCGCATCTCG GTCATTTCGC CAAGGATTAT CGCAAGACGT TCGGCGAGAA TCCGTCCGAT
ACGCTGGCAA GGGGACGCCG TTTCCGCGGC GTTCGATAG
 
Protein sequence
MKIDLLGEPL DRFPMVRGSN PTDFESALKS VFGDASVEVS DRDGFRARLN FVRLSDIEMA 
YSWATVPSRL RLPPDDFVGL QLALSGSAIT MVGNRRVATN ARQSCICPPG QGRDYQFDAE
FEQLFLGVRL SALERTLAGL LGGKPNAPLE FEPVADNDYP HSENLRQLTL FFGGTLNATK
VSLPSQYLAE LEQATAVAFL HACKHNFSSY LGVAEKDAAS RHVKLVEEYI EANWNESLTI
EKLVELTGMS ARTVFKAFQR TRGYSPMAFA KRVRMERVRQ LLLEAGGDAS VGAIAVQCGF
PHLGHFAKDY RKTFGENPSD TLARGRRFRG VR