Gene RPD_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1944 
Symbol 
ID4022426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2181926 
End bp2183749 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content66% 
IMG OID637962137 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_569080 
Protein GI91976421 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.133284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0645132 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGTAT CCAACATCAT CAGCGAGTGC CTCGACGCAT TGCTGCATCC GTCGGCGCGA 
TACGACGCGC TGGCGAGCGC GCGTCATCGC GCGTTCATTG CGCCTCGCCT GCTCGGCAGC
CTCGTCGCCT TCGCTGCGTT CCCGATCTAC CTCGTGCTCC GGGGCGCGCC GAGTGCGCTC
GAAGCGCTGG CGTTCGCATG GCTGATCGCG CCGATTCTGC TGTCGTGGTT TCTGTCACGC
ACCGGCCGTT ATCGTAGTGC CCATGTGCTG TCCTCGCTGT CGCTGGCGAG CCTGATCGTG
ATCGTCGCGG CGCAGACCGG CGGAATTTCA TCTTTCGCCG CGATCTGGCT GGTGCTGGTG
CCGCTCGAGG CCGCGCTGTC GGCGTCGCGC CGTGTCGTGA GCTTCGCCGC GGCGCTCGCG
CTGGGTTGCG CCGCGGCGCT GATCGCGACC GGTCAGTTCG GCTGGCTTCC GACGCCCACC
GGCGCCGACG TGTCGCACGG CACGCTGATG GCGTTCAGTG TCGCATCCGC GACGCTGTAC
GCGGCCGGAC TCGCGTTCGG CGCGGAATCG CTGGCGCGCA CCAGCGACAC GCTGCTCAGC
GTCGAGGAGG AGCGCTACCG GCTGCTCGCC CTTCACATGA GCGATGTGAT CTCGCGCCAC
AGCCGCAACG GCGCGGTGCA GTTCATTTCA CCGGCCGTCG AAACCCTGCT CGGTCTGCCT
GCGGCTCGGC TCGCGGGGCA CGGCCTGTTC GATCGGGTCC ACGTCGCCGA TCGGCCAGCC
TATCTCAAGG CGCTGTCGGA CGCAGGCCTT GGCGGCGAGG GACGCAGCGT CGAGTTCCGC
GTTCGCCGCG AGGTTGCGCG CGATGATCGC GGCCGGCCGA CGGCACCGGA ATTCATCTGG
CTTGAAATGC GCTGCCGCCC GTTTGCACCC GGCACTCAAT CGTCGGCCGC CGGCGAGACC
GGCGTGGTCG CGGTGATGCG CGACGTCACC GATCGCAAGC TGCAGGAACA GGCCCTGGAA
CAGGCGCGCG CCGAGGCGAG CCGCGCCGAC GCCAGCAAGA GCCGCTTCCT TGCCACCATG
AGCCACGAGC TGCGCACGCC GCTGAATGCG ATCATCGGCT TCTCCGACAT GATCCTGCAC
GAAGACGAGC TGATGCTCGC GCCGGAGCGG CGCAAGGAAT ATGCGCAGCT CATCAACGAT
TCCGGTCAGC ACCTGTTGTC GGTCGTCAAC GGCATTCTCG ACATGTCGAA GATGGAATCG
GGAAATTTCG AGATCGTTCC GGAGCCGTTC GCGCCGCGAC CGGCGCTGCT GAATTGCTGC
AATCTGCTGG CGCTGAAGGC CCGCGAGAAT GGCATCGAGC TGGTGACGCG CGCGCCTGAA
GATCTTCCGG AGATAGTCGG CGATGCGCGC GCGTTCAAGC AGATCCTGCT CAATCTGGTG
TCGAACGCGA TCAAGTTCAC AGAGCGCGGC GGCACCGTCT CCGTCGAAGC CGCCGTTGAA
GCGGCGCGGC TGGTGCTGCG GGTCCGCGAC AACGGCGTCG GCATTGCGGC GGAAGACCTG
AAACGGATCG GCGATCCGTT CTTCCAGGCC GGAAAGACGT ATCAGCGCCG CCATGAGGGC
ACCGGGCTCG GCCTTTCGAT CGTGAAGAGC CTGGTCGGTC TTCACGGCGG CAAGATCGAC
GTGACGAGCG AGGTTGATCG GGGCACGATC GTGACGATAT CGCTGCCGTT GGCGCCTTCG
GCGCCGTGCA ACGTCACGAC GCTGCAGCCG CCACAGCCGG CCGAGCCTCA ATTGCAAGAC
TATCAGGTGA AGAAGAGTGC CTAA
 
Protein sequence
MAVSNIISEC LDALLHPSAR YDALASARHR AFIAPRLLGS LVAFAAFPIY LVLRGAPSAL 
EALAFAWLIA PILLSWFLSR TGRYRSAHVL SSLSLASLIV IVAAQTGGIS SFAAIWLVLV
PLEAALSASR RVVSFAAALA LGCAAALIAT GQFGWLPTPT GADVSHGTLM AFSVASATLY
AAGLAFGAES LARTSDTLLS VEEERYRLLA LHMSDVISRH SRNGAVQFIS PAVETLLGLP
AARLAGHGLF DRVHVADRPA YLKALSDAGL GGEGRSVEFR VRREVARDDR GRPTAPEFIW
LEMRCRPFAP GTQSSAAGET GVVAVMRDVT DRKLQEQALE QARAEASRAD ASKSRFLATM
SHELRTPLNA IIGFSDMILH EDELMLAPER RKEYAQLIND SGQHLLSVVN GILDMSKMES
GNFEIVPEPF APRPALLNCC NLLALKAREN GIELVTRAPE DLPEIVGDAR AFKQILLNLV
SNAIKFTERG GTVSVEAAVE AARLVLRVRD NGVGIAAEDL KRIGDPFFQA GKTYQRRHEG
TGLGLSIVKS LVGLHGGKID VTSEVDRGTI VTISLPLAPS APCNVTTLQP PQPAEPQLQD
YQVKKSA