Gene RPD_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1998 
Symbol 
ID4022480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2234685 
End bp2236082 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content67% 
IMG OID637962191 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_569134 
Protein GI91976475 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.374305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGCC CGGCCGTTCT CGTCAAACCG CCGCGCCGCA GTGGCAGCTC GCTCGCCACG 
CGACTGTTTC TGTCGGCGAC CGCCTGGGTG GTGGTGATCC TGGCGATCAC CGGCATCGTG
CTGTCCTCGG TGTATCGGCA GGCGTCCGAG CGCGCGTTCG ATCGCCGGCT CAATCTCTAT
CTGCGCACCA TCATCGCCGA AGTCGCGACG CCCGAAGCCG CGCCGGACCA GTTTCAGTCG
ATCGGCGAGC CGCTGTTCGA TCTGCCGCTG TCGGGCTGGT ACTGGCAGAT CGTCCGCACC
GACACCGACA AGATCGACCC GCGCGCCTCG CGTTCGCTAT GGGACCGCAA GCTGCCGAAG
CTCGAGGACC AGGGCGTCGA ACTCGGCGCG TCCGGTGTCC GCCAAGGCTA TGTCGAGGGA
CCGGAAGGCC AGACCCTGCG CATGGTCGAG CGCCCGGTCG ATCTCGGCGC CGACGGCAAA
TTCGTCGTGA CGGTGGCCGG CGACGGCAGC GAAATCTTCG AGGAAACAAG GACCTTCGAC
TATTACCTCG CCGGCACCTT CATCGCGCTG TCGATCGGGC TGGTGCTGAC CACGATCTTT
CAGGTCCGGT TCGGCCTCGC GCCGCTGAAA CGGATCTCCG ACTCGATCGC CGACATCCGC
TCCGGCCGCG CCGAGCGGCT CGAAGGCAAG TTCCCGGTCG AGATCGCGCC GCTGGCCCGC
GAGACCAACG CACTGATCGA GGCCAATCGC GAGATCGTCG AACGCTCGCG CACCCATGTC
GGCAATCTCG CCCATGCGAT CAAAACGCCG CTCTCGGTGC TCGTCAACGA AGCCGCCGCG
CGGAGTGGCG ATCCGTTCGC CGCCAAGGTG CTGGAGCAGG CCGAAATCAT GCGCAGCCAG
GTCACGCATC ATCTGGAGCG GGCGCGGATC GCAGCACGGC TGACCGTGGT CGGCACTGTC
ACCGAGGTCG AACCGGTGAT CGAGGCGCTG CGCCGGACGA TGGAGAAGAT CCATCGCGAC
CGCGACATCC TGGTCCGCTC CGAGGTCGCC AGCGGCCTCA AATTCCGCGG TGAAAAGCAG
GACCTCGAGG AGATGGTCGG CAATCTGGTC GACAATGCGT GCAAATGGGC GGTGAGCCGG
GTGTTCATCG ACGTGACCGC CGAGCGCGGC CCGACGCCGC TGGTCCGCAT CATCGTCGAC
GATGACGGTC GCGGCCTGTC GGCGGCGGAG CGGGCCCAGG CCGCCCGCCG TGGTCAGCGG
CTCGACGAGA GCAAGCCGGG CTCCGGCCTC GGGCTCGCGA TTGTCGTCGA TCTTGCAGCA
CTTTACGGCG GCGAGCTGAA GCTCGCTCAC GCCCCGATCG GCGGCCTGCG GGCCGAACTG
AGGTTGCCTG CGGCGTAA
 
Protein sequence
MASPAVLVKP PRRSGSSLAT RLFLSATAWV VVILAITGIV LSSVYRQASE RAFDRRLNLY 
LRTIIAEVAT PEAAPDQFQS IGEPLFDLPL SGWYWQIVRT DTDKIDPRAS RSLWDRKLPK
LEDQGVELGA SGVRQGYVEG PEGQTLRMVE RPVDLGADGK FVVTVAGDGS EIFEETRTFD
YYLAGTFIAL SIGLVLTTIF QVRFGLAPLK RISDSIADIR SGRAERLEGK FPVEIAPLAR
ETNALIEANR EIVERSRTHV GNLAHAIKTP LSVLVNEAAA RSGDPFAAKV LEQAEIMRSQ
VTHHLERARI AARLTVVGTV TEVEPVIEAL RRTMEKIHRD RDILVRSEVA SGLKFRGEKQ
DLEEMVGNLV DNACKWAVSR VFIDVTAERG PTPLVRIIVD DDGRGLSAAE RAQAARRGQR
LDESKPGSGL GLAIVVDLAA LYGGELKLAH APIGGLRAEL RLPAA