Gene RPC_4247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4247 
Symbol 
ID3971589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4726967 
End bp4729735 
Gene Length2769 bp 
Protein Length922 aa 
Translation table11 
GC content65% 
IMG OID637927350 
ProductCheA signal transduction histidine kinases 
Protein accessionYP_534090 
Protein GI90425720 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0643] Chemotaxis protein histidine kinase and related kinases
[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.63903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACT TGTTGCGCGA GTTTTTGACG GAGACGAATG AGAGCCTGGA CACGGTTGAC 
AACCAACTGG TCAGGTTCGA GCAGGACCCC AACAACGCCA AGATTTTGGA CAACATTTTC
CGGCTGGTCC ACACCATCAA GGGGACCTGC GGGTTTTTGG GTCTGCCGCG GTTGGAAGCT
TTGGCGCACG CCGCCGAGAC CCTGATGGGC AAATTCCGCG ACGGCATGCC GGTGACCGGC
GAGGCGGTGA CGCTGATCCT TTTGACCATC GACCGGATTA AAGAGATTCT AGGCGGGCTG
GAGGCCACCG AGGCCGAGCC TGAGGGCGTC GACCAGGATT TGATCGGCGA GTTGGAAGTG
TTGTCGCAAG CGCCGATGGC GGCGCCGGTG GTCGAAGTCG TCCCCGAAGT GGAGGCCGCG
CCGGTGGCGG CGGTCGCCGA GGGCACGCTG GTGCCGCAGA TTCTGGAGCG CGCGCTGCGC
CCCGGCGAAG TCTCGCTCGA CGAATTGGAG CGGGCGTTTC GCGAGACCGA AGTCGCGGTG
GAGATTGCGC AGCCGGTCGA AGCCAAAGCT GCCGAAGCCG ACCACGCCGA GCATAAGCCC
GCGGACGCCG CGCCGGTGGC CGCCGAGGCC AAATCCGACG CCAAATCGGA CGCCAAGCCA
GGCAAGCCCG TGACCAAGAA GAAGACCGCC GTGGAATTGG ATATGCCGCT GCACGATTCC
GACAAGATCG CCAACCAGTC GATCCGGGTC AATGTCGACA CCCTCGAGCA CCTGATGACC
ATGGTGTCCG AGCTGGTCTT GACCCGCAAT CAGCTCCTGG AGATCAGCCG CCGCCACGAG
GACACCGAGT TCAAGGTGCC GTTGCAGCGG CTCTCCACCG TCACCGCCGA ACTGCAGGAC
GGCGTGATGA AGACCCGCAT GCAGCCGATC GGCAACGCCT GGCAGAAGCT GCCGCGCATC
GTGCGCGATC TGGCCTCCGA ACTCGGCAAG CAGATCGAAC TGGAGATGCA CGGCGCCGAC
ACCGAGCTCG ACCGCCAGGT GCTCGACCTG ATCAAGGATC CGTTGACGCA TATGGTGCGC
AACTCCGCCG ATCACGGCCT CGAGACCCCG GCCGATCGCG CCGCCGCCGG CAAGCCCGAG
CAGGGCACGA TTCGCTTGTC CGCCTATCAC GAGGGCGGCC ACATCGTGCT GTCGATCGCC
GACAATGGTC GCGGCCTCGA CACCGCGCGG ATCAAGGCCA AGGTGATCGC CAACGGGCTG
GCCTCCGAAG CCGACGTCGA GAAGATGTCG GAGAGCCAGA TCCACAAATA CATCTTCGCG
CCGGGGTTCT CCACCGCGGC TGCGGTCACC AGCGTGTCCG GCCGCGGCGT CGGCATGGAC
GTGGTGCGCA CCAATATCGA TCAGATCGGC GGCACCATCG ACATCAAGTC GGTGGCCGGC
GAAGGCTCCA GCGTCACTAT CAAGATCCCG CTGACCTTGG CGATCGTCTC GGCGCTGATT
GTTGAGTCCG CCGGCAACCG CTTCGCCATC CCGCAATTGG CGGTGGTCGA GCTGGTGCGG
GCGCACGCCA ATTCCGAGCA CAAGATCGAG CGCATCAAGG ACACGCCCGT CCTGCGCTTG
CGCAACAAGC TGCTGCCGTT GATGCATCTG CGCCACCTGT TGCGGATCGA CGACGGCAAG
GTCACCGAGC CGGAGAACGG CTTCATCGTG GTGACCCAGG TCGGCAACCA GACCTTCGGC
ATCGTCGTCG ACGGCGTGTT CCACACCGAA GAAATCGTCG TCAAGCCGAT GTCGACCAAG
CTGCGGCACA TCGGCATGTT CTCCGGCAAC ACCATCCTGG GCGACGGCGC GGTGATCATG
ATCGTCGATC CCAACGGCAT CGCGCAGGCG CTCGGCACTT CGGTCGCAGC CCAGCACGAC
ATCGCCGAAG ACAACGCCGC GATCCGCGCC TCCTCGGCCG ATCAGTTGAC CTCGCTGCTG
GTGTTCCGCG CCGGTTCCGC GCAGCCCAAG GCGGTGCCGC TGTCGCTGGT GACGCGGCTC
GAAGAGATCG CCGCCGACAA GATCGAGTTC TCCAACGGCC GCCACATGGT GCAGTACCGC
GATCAGCTGA TGCCGCTGGT GACGATGGAC GGCGTCAGCG TCAAGACCTC CGGGGCGCAG
CCGATCCTGG TGTTCGCCGA CGAGGACCGC GCGATGGGCC TCGTGGTCGA CGAGATCGTC
GACATCGTCG AAGAACATCT GCAGATCGAG GTCGGCTCCA GCCACGAGGG CATTTTGGGC
TCCGCGGTGA TCAAGGGCGC CGCCACCGAA GTGATCGACG TCGGCCACTT CCTGCCGATG
GCGTTCGCCG ACTGGTTCAA GCGCAAGGAA ATGCGGCCCT CCACCACCGC ACAGTCGATC
CTGCTGGTCG ACGACAGTGC GTTCTTCCGC AACATGCTGG GCCCGGTGCT GAAGGCCGCG
GGCTACAAGG TGCGGCTCGC CACCAACGCC CAGGAAGGCT TGGGCGTGCT GCGCTCCGGC
CAGGAATTCG ACGCCATCCT CACCGACATC GAGATGCCGG ACATGAACGG CTTCGAATTC
GCCGAGACCA TCCGCGCCGA CGCCAAATTG GCGCAGACCC CGATCATCGC GCTGAGCTCA
ATGATCTCGC CGGCGGCGAT CGAGCGTGGC CGGCAGGCCG GCTTCCACGA CTACGTCGCC
AAGTTCGACC GGCCGGGCCT GATCGCCGCG CTGAAAGAAC AGACCACCCA TCTGAACCAG
GCGGCGTGA
 
Protein sequence
MDDLLREFLT ETNESLDTVD NQLVRFEQDP NNAKILDNIF RLVHTIKGTC GFLGLPRLEA 
LAHAAETLMG KFRDGMPVTG EAVTLILLTI DRIKEILGGL EATEAEPEGV DQDLIGELEV
LSQAPMAAPV VEVVPEVEAA PVAAVAEGTL VPQILERALR PGEVSLDELE RAFRETEVAV
EIAQPVEAKA AEADHAEHKP ADAAPVAAEA KSDAKSDAKP GKPVTKKKTA VELDMPLHDS
DKIANQSIRV NVDTLEHLMT MVSELVLTRN QLLEISRRHE DTEFKVPLQR LSTVTAELQD
GVMKTRMQPI GNAWQKLPRI VRDLASELGK QIELEMHGAD TELDRQVLDL IKDPLTHMVR
NSADHGLETP ADRAAAGKPE QGTIRLSAYH EGGHIVLSIA DNGRGLDTAR IKAKVIANGL
ASEADVEKMS ESQIHKYIFA PGFSTAAAVT SVSGRGVGMD VVRTNIDQIG GTIDIKSVAG
EGSSVTIKIP LTLAIVSALI VESAGNRFAI PQLAVVELVR AHANSEHKIE RIKDTPVLRL
RNKLLPLMHL RHLLRIDDGK VTEPENGFIV VTQVGNQTFG IVVDGVFHTE EIVVKPMSTK
LRHIGMFSGN TILGDGAVIM IVDPNGIAQA LGTSVAAQHD IAEDNAAIRA SSADQLTSLL
VFRAGSAQPK AVPLSLVTRL EEIAADKIEF SNGRHMVQYR DQLMPLVTMD GVSVKTSGAQ
PILVFADEDR AMGLVVDEIV DIVEEHLQIE VGSSHEGILG SAVIKGAATE VIDVGHFLPM
AFADWFKRKE MRPSTTAQSI LLVDDSAFFR NMLGPVLKAA GYKVRLATNA QEGLGVLRSG
QEFDAILTDI EMPDMNGFEF AETIRADAKL AQTPIIALSS MISPAAIERG RQAGFHDYVA
KFDRPGLIAA LKEQTTHLNQ AA