Gene RPD_1343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1343 
Symbol 
ID4021820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1511626 
End bp1513158 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content64% 
IMG OID637961536 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_568482 
Protein GI91975823 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGATC AAGCCCAATC GTCATCGGAG GCCCTGGCGC GCGATAGGCG GCTGCAGCAG 
GGCATCGATA GCTACGGCGT GGGAAGTTGG GAATTGAATC TCGCCACGCG GGAATTGGCG
TGGTCTGCGA CGACGCGACG ATTGCTCGGT GTCGCGCCCG ACGCACCGGT CGACTTCGAC
ACCTTCGTCT CTCTGGTCGA TCCGGAAGAC CGCGACCGCG TCACGCAGGC GGTGCAGCGC
ACGATCGAGC ACAGCGAGTA CCTCGACATC ATGTTTCGGG TCGCCGACAG GGGCCAGCCG
AGCCATTGGC TGCGCGCCCG CGGCGGTCTC GTCAGGGAGA ACGGCACGGC CGGCTACCTG
TGCGGCATCG TGCTCGACAT CGACCAGCAG AAGGTGCTCG AACAGGAACT TAGGCTGCAG
CAGAATCAGC TCCGCTCGAT TCTCGACACG GTGCCTGATG CGATGATCCT GATCGACGGC
CGCGGCATCA TGCGGTTCTT CTCCAGCGCC GCAGAGCGGA TGTTCGGACT GTCCGAGAGA
GAAGCGATCG GCCAGAACAT CAGCATCTTG ATGCCCCGGC CGGACGCGTC ACGCCACGAC
AATTACATTT CCCGCTACAG CAATACGGGC GAGCGCCACA TTATCGGCAT CGGCCGCATC
GTCACCGGCC AGCGCAAGGA CGGCACCACC TTCCCGATCC ATCTGTCGAT CGGCGAGATG
GTGTCCAGCG GCGAGCGATA TTTCACGGGG TTCATCCGTG ACCTCACCGA ATATCAGCAG
ACCCAGGCGC GGCTGCACGA ACTGCAGGCG GAGCTGGTGC ACGTCTCGCG GCTGACAGCG
ATGGGCGAGA TGGCGTCGGC GTTGGCGCAC GAGCTCAACC AGCCGCTGTC GGCAATCAGC
AACTACATGA AGGGCTCGCG CCGGCTGCTC GCGGGGAGCA GCGACCCGAA CATCTCCAAG
ATCGAAAACG CGATGGAGCG CGCCGCCGAG CAGGCACTGC GCGCAGGCCA GATCATCCGA
CGGCTGCGCG ACTTCGTCGC CCGCGGCGAA TCCGAAAAGC GGGTCGAGAG CCTCGCCAAG
CTGGTCGAGG AAGCCGGTGC GCTGGGCCTG ACCGGCGCGC GCGAGCAGGG CGTGGTGTTG
CGCTTCCACA TGGACCAGGC CAACGATCTG GTGCTGGTCG ACCGCGTGCA GATCCAGCAA
GTGCTCGTCA ATCTGTTCCG AAATGCGCTC GAGGCGATGG CGACTTCACG GCGCAAGGAG
TTGTCCGCGT CGAATGCACG GGTGGCGGAC GACATGATCG AGGTGACGGT GGCCGACAGC
GGCAGCGGCA TCCCCGACGA CGTCAAGGCC AAACTGTTCC AGACATTTTT CACTACCAAG
GACACCGGAA TGGGCGTTGG ACTATCCATC AGCCGCTCGA TCATCGAGGC CCATGGCGGC
CGGATGTGGG CTGAAACCAA CAGCGCTGGC GGCGCGACGT TTCGCTTCAC GCTGCCGATG
GCGCCGGGCG AGGACGTGAC CGATGCCGCA TAA
 
Protein sequence
MTDQAQSSSE ALARDRRLQQ GIDSYGVGSW ELNLATRELA WSATTRRLLG VAPDAPVDFD 
TFVSLVDPED RDRVTQAVQR TIEHSEYLDI MFRVADRGQP SHWLRARGGL VRENGTAGYL
CGIVLDIDQQ KVLEQELRLQ QNQLRSILDT VPDAMILIDG RGIMRFFSSA AERMFGLSER
EAIGQNISIL MPRPDASRHD NYISRYSNTG ERHIIGIGRI VTGQRKDGTT FPIHLSIGEM
VSSGERYFTG FIRDLTEYQQ TQARLHELQA ELVHVSRLTA MGEMASALAH ELNQPLSAIS
NYMKGSRRLL AGSSDPNISK IENAMERAAE QALRAGQIIR RLRDFVARGE SEKRVESLAK
LVEEAGALGL TGAREQGVVL RFHMDQANDL VLVDRVQIQQ VLVNLFRNAL EAMATSRRKE
LSASNARVAD DMIEVTVADS GSGIPDDVKA KLFQTFFTTK DTGMGVGLSI SRSIIEAHGG
RMWAETNSAG GATFRFTLPM APGEDVTDAA