Gene RPB_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3021 
Symbol 
ID3910820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3443780 
End bp3445312 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content64% 
IMG OID637884927 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_486634 
Protein GI86750138 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.309239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCCGG AATCCACGCG GCGACGCGCT TTCATCCAGA TCACGTTGCT GTCGCTGTCT 
TTTGCCGTGC TGGTTCTGAT CAGCGTCGCC TCGGTGTTTC TGTTGCAGAA AGCCCGGGAG
GACAGCGCCT GGGTCGCCCA CACGCTCGAA GTCGAGAACC AGATTTCCCT CGCCCAGCTG
CAGATGCGCC GTGCCGAAAG CGCGGAGCGA GGCTTTCTGC TGACGAGTCA GCCTGAATTC
GTCGAAGAGT TCGAGAATTC CTCCGGCCAG GTGGTGCCGA TGCTGCAACG CCTGCAGCAG
CGGGTAGCCG ACAATCCGGT GCAGCGCGGG TTGGTCGACA AGAGCATCGT CATCGCACAG
GCGAGGATCG AGCGGTTCAG GGAACTGATC GCGCTCGCGC AGGCGGACCG CGCCGAGGAA
GCTCGGCGCA TCGTGCGCGC CGAAAGCGGC CGTGAGGCCA TGAACCAGTT CGCCAACTTC
ATGGCGGCGA TGCGGGCCGA AGAAAACAAG CTATTCGTGA TGCGGACGGC GGCGGCCGAT
CGCAGCCAGT CTCTGGCGTC GATCGTCACC ACCACCGGCT CCGGCCTGGT GCTGGTGCTG
GCCGGGCTCT CGATCTTCCT GGTGCGGCGC TCCGCGCGCG CCCGTGACGA CGCCGAAGCG
CTGCTACGCG AGAACAATCT CAACCTCGAG AGCACCGTGT CCGAGCGCAC CGCCGACCTG
CGCGAGGCCA ACGAAGAGAT CCAGCGCTTC GCCTATATCG TGAGCCACGA TTTGCGCTCG
CCGCTGGTGA ACATCATGGG ATTCACCAGC GAGCTCGAAG AACTGCGCAG CGACATCTTT
CGCCGCATCG CCACGCTGAG CCGCGAAGTC GCCGACAAGC AGATGCCGGA GAAGGCCGAC
AGCGACGGCG AGCCGGAGCT GCAGCCGGCC GACCAACAAT TGTCGGAGGA CTTCACCGAG
GCACTCGCCT TCATCAAATC GTCGATCGGC AAGATGGACC GGCTGATCAG CGCGATCCTC
AACCTCACGC GCGAAGGCCG CCGTCAGTTC CAGCCGGTTT CGATCGACAC CCGGGAGCTG
ATCGAGAACA TCGTCTCGAC GGTGGCGCAT CAGGCCGCCG AGGCCAACGC CGAGATCCGC
ATCGAGCCGT TGCCGGAAAT CGTCAGCGAC CGCCTCGCGC TCGAACAGAT CTTTTCGAAT
CTCATCGACA ACGCTCTGAA ATATCTCAGG AACGGTGTCC CCGGAGACAT CCGGATCCGC
GGTCGCCAGA AGCTCGGCTT CGCGATCTTC GAGATCGCCG ACAACGGACG GGGGATCGAT
CCCAAGGACC ATCAGAGAAT CTTTGATCTT TTCCGCCGTG CCGGTACACA GGACAGGCCG
GGTCAGGGGA TCGGGCTCGC TCATGTTCGC GCGCTCGTGC GCCGCCTGGG TGGAACCATG
TCGGTGTCCT CGGCGCTCGG CGAGGGCTCG ACCTTCACCA TCACCTTGCC GATGAAATGG
ACCACGGCCA CCAAACGGGA ACCCCGCTCA TGA
 
Protein sequence
MTPESTRRRA FIQITLLSLS FAVLVLISVA SVFLLQKARE DSAWVAHTLE VENQISLAQL 
QMRRAESAER GFLLTSQPEF VEEFENSSGQ VVPMLQRLQQ RVADNPVQRG LVDKSIVIAQ
ARIERFRELI ALAQADRAEE ARRIVRAESG REAMNQFANF MAAMRAEENK LFVMRTAAAD
RSQSLASIVT TTGSGLVLVL AGLSIFLVRR SARARDDAEA LLRENNLNLE STVSERTADL
REANEEIQRF AYIVSHDLRS PLVNIMGFTS ELEELRSDIF RRIATLSREV ADKQMPEKAD
SDGEPELQPA DQQLSEDFTE ALAFIKSSIG KMDRLISAIL NLTREGRRQF QPVSIDTREL
IENIVSTVAH QAAEANAEIR IEPLPEIVSD RLALEQIFSN LIDNALKYLR NGVPGDIRIR
GRQKLGFAIF EIADNGRGID PKDHQRIFDL FRRAGTQDRP GQGIGLAHVR ALVRRLGGTM
SVSSALGEGS TFTITLPMKW TTATKREPRS