Gene Rpal_3177 details

Gene Information

Locus tag: Rpal_3177
Symbol:
ID: 6410847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3423035
End bp: 3424615
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 67%
IMG OID: 642713055
Product: transcriptional regulator, SARP family
Protein accession: YP_001992156
Protein GI: 192291551
COG category: [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
        [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
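
The start, end, gene length and protein length fields above agree with each other, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in one stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3423035, 3424615)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```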


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0715298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCGGGT TTTGGCTCCA GACCTTTGGA TTTCCTGAGA TCCGCAGCTT CGACGGTCCG 
GTCCGGCTCA ATCTGCGCAA AGGGCTGGCG CTGCTGGTCC AACTCGCGGA GGCGCGCGCA
GCGGTGTCCC GGGACGTCGT CGCCACCTTG CTGTGGCCGG ACAGCGCCAT CGACGTTGCC
CGCGCGCGGC TGCGGCGCTT GCTGCATCGC ATCGAGCAGG CGCTCGGTCA GCCTGTGTTC
GAGACCGACC GGACCAGCCT GCGTTGGTCA CCCTCGGTCG AACTCAGCGT CGATGCGCAG
CTGTTCGAAG CCGCCTGTGA AAGCGGGGAT TTCGCGCAAG CCTGCGGCTA TTATCACGGC
GATTTCCTCG CCGGCTTCGC GTTGCCGGAC TGTGCCGAGT TCGAGGAGTG GGCGTTCTTC
CGCCGCGAGG CGCTGCGAGG ACGCTACCTG CACGCACTGG AGCGCTTGGT GCAGGACAAC
AACGCCGCCG GCGATCATGC AGCGGCCGCA GTTCGTGCGA CGCGTCTGGT GGAATTGGAT
CCGCTCAGCG AGATCTACGG CCGCCATCTG ATCCGCAGTC TGCTGCTGGC AGGCGAACGG
CAGGCGGCGG AACGCCATCA CGCCGCACTG ACCCGGCGGC TGCACGATGA GCTCGGGGTG
GCGCCCGAGG CCGAGACCGA GGAGTTGATG CAAGAGGCCG CGATGCCGAC TGCTCTCGCT
GTCCCGGTGA CGCGATATGC CGAGGGCGAC AGCATCCACT TGGCGTATCA GACCTTCGGC
AGCGGCGAGT TCGACATCCT GGTCATGCCG GGCTTCGTTT CACATGTCGA ACGGGTGTGG
GAGCATCCGT CGTGCCGGGC GTTTCTCGTT TCGATGATGG CGCTCGGCCG GCTGATTATC
TTTGATCGCC GCGGCGTCGG GCTTTCGGAC CGGGTCGGCT CGGCTCCGAA TGTCGACGTC
ACCGCCGAGG ATATCGGCAC CGTGCTGCGG GCTGCGCAGT CGCGCCGCGT CGTGCTGTTT
GGCGCATCCG AATGCGGACC GGCCTGCATC AAATTCGCGG TCGATCATCC TCGCCTCGTC
GCGGGCCTGA TCTTGTTCGG CTCGCTTGCC AAAGGCTGTC GCACCCCTAA CTACCCGTAC
GCCTTGACGG TGGAGCAGTT CGAGATTTGG CGCCGGCAGC TGATCGGCGC ATGGGGCAGC
GCAGCCGGCA TCGAGACCTT CGGCCCCAGT CTAGCGCATG ACGCCAAATC AAAAGCGTGG
TGGGCCGGAC TGCTGCGCGC TGCATCCAGT CCCGGCGGCA TCTGGGCGGT GCTCGCAGCC
TTGCGCGACG CCGACGTCCT GGAGCTGCTG CCCAAACTAT CCGTGCCGAC CTTGGTGCTG
CACCGCCGCG GCGACCGCGC CGTCCGGATC GCCGCGGGAC GCGACATGGC CGCTCGGATC
GTCGGCGCCG AATTCGTCGA GCTCGACGGC GACGACCACT GGTTCTTCGC GGGGGACCAA
CGCCCCGTAC TCGACGCCAT CAGACAGTTC GTAAGCGACC TTCGCCGCCC AGGTCCGAAG
CGCACGCGGC CACGTACATA G
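
A GC content figure like the one in the Gene Information table can be computed directly from a sequence printed in this layout. The sketch below is a minimal illustration, assuming GC content means the percentage of G and C bases over all bases after the spaces and line breaks between the 10-base groups are removed. The function name is hypothetical, and the rounding used by the source database is not known.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between groups."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base group of the gene sequence above: 7 of 10 bases are G or C.
print(gc_content("GTGACCGGGT"))  # 70.0
```

Passing the whole gene sequence, pasted as a single string, gives the value for the full CDS.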
 
Protein sequence
MTGFWLQTFG FPEIRSFDGP VRLNLRKGLA LLVQLAEARA AVSRDVVATL LWPDSAIDVA 
RARLRRLLHR IEQALGQPVF ETDRTSLRWS PSVELSVDAQ LFEAACESGD FAQACGYYHG
DFLAGFALPD CAEFEEWAFF RREALRGRYL HALERLVQDN NAAGDHAAAA VRATRLVELD
PLSEIYGRHL IRSLLLAGER QAAERHHAAL TRRLHDELGV APEAETEELM QEAAMPTALA
VPVTRYAEGD SIHLAYQTFG SGEFDILVMP GFVSHVERVW EHPSCRAFLV SMMALGRLII
FDRRGVGLSD RVGSAPNVDV TAEDIGTVLR AAQSRRVVLF GASECGPACI KFAVDHPRLV
AGLILFGSLA KGCRTPNYPY ALTVEQFEIW RRQLIGAWGS AAGIETFGPS LAHDAKSKAW
WAGLLRAASS PGGIWAVLAA LRDADVLELL PKLSVPTLVL HRRGDRAVRI AAGRDMAARI
VGAEFVELDG DDHWFFAGDQ RPVLDAIRQF VSDLRRPGPK RTRPRT
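
The protein sequence follows from the gene sequence under translation table 11, where an alternative initiation codon such as GTG is read as methionine. The sketch below illustrates this. It assumes the standard codon assignments for elongation and the table 11 start codon set as listed by NCBI. The names `translate_cds` and `initiator` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons of translation table 11 (assumed from the NCBI listing).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str, initiator: bool = True) -> str:
    """Translate a DNA string; stop at the first stop codon.

    If `initiator` is true, a recognized start codon in the first
    position is translated as M.
    """
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First five codons of the gene: GTG is the initiator and becomes M.
print(translate_cds("GTGACCGGGT TTTGG"))                          # MTGFW
# Last seven codons of the gene, ending in the TAG stop codon.
print(translate_cds("CGCACGCGGC CACGTACATA G", initiator=False))  # RTRPRT
```

Both fragments match the corresponding ends of the protein sequence listed above.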