Gene RPB_3920 details


Gene Information

Locus tag: RPB_3920
Symbol:
ID: 3911725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4471129
End bp: 4473933
Gene Length: 2805 bp
Protein Length: 934 aa
Translation table: 11
GC content: 66%
IMG OID: 637885822
Product: CheA-like signal transduction histidine kinase
Protein accession: YP_487524
Protein GI: 86751028
COG category: [N] Cell motility
              [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases
        [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
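
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names gene_length and protein_length are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4471129, 4473933)
print(length_bp)                  # 2805
print(protein_length(length_bp))  # 934
```

Under these assumptions the computed values agree with the Gene Length (2805 bp) and Protein Length (934 aa) fields.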


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.690282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGATC TTCTCCGTGA GTTCTTGACG GAAACCTTCG AAAGCCTGGA TACGGTTGAC 
AATCAACTGG TCCGCTTCGA GCAGGAGCCG AACAACGCGA AGATACTGGA TAATATCTTC
CGGCTCGTTC ACACCATCAA GGGTACCTGC GGGTTCCTCG GGCTGCCGCG ACTGGAAGCG
CTGGCGCATG CGGCCGAGAC GTTGATGGGC AAATTCCGCG ACGGCATGCC GGTCACCGGC
GAGGCGGTGA CGCTGATCCT GACCACGATC GACCGCATCA AGGACATTCT CGGCGGACTG
GAAGCGACCG AGGCCGAGCC CGAGGGCGAC GACGGCGACC TGATCGGCGA ACTCGAACGG
CTGTCGATGC GCACGCCGGA GCAGATCGCC GCCGAGCTCG GCGGCAGCGC TGCGCCGGTC
GACGAAGCCG TTCCGGAGGA CGCTGCGCCG GTCGAAGCGG CCGTGGCCGC ACCGGCGGTG
GCAGAAGGTT CGCTGGTGCC GCAGACGCTG GAGCGTCCGT TGCGCCCGGG CGAAGTGTCG
CTCGACGAGC TGGAGCGCGC GTTCCAGGAA ACCGCGATCG AAGTGGCCTC GCCGCCGCTG
GCGCCGGTCG CGAGCGAGCC GGAGAAGCCG GCCGAGGCTC CGCCTGCGGT CGTCGAAGCA
GCCAAGCCGG CGCCGAAGCC GGCCGCCAAG GCCCCGAAGA AGAATGCCGA CGCCGAGGCT
CCGGCCGAGG GCGACCGGAT CGCCAACCAG TCGATCCGCG TCAACGTCGA CACGCTCGAA
CACCTGATGA CGATGGTGTC GGAGCTGGTG CTGACCCGCA ACCAGCTGCT CGAGATCAGC
CGTCGCCACG AGGACACCGA ATTCAAGGTG CCGCTGCAGC GGCTCTCGAC CGTCACCGCC
GAGTTGCAGG ACGGCGTGAT GAAGACCCGG ATGCAGCCGA TCGGCAACGC CTGGCAGAAG
CTGCCGCGGA TCGTGCGCGA TCTCGCCGCC GAACTCGGCA AGCAGATCGA ACTCGAGATG
CACGGCGCCG ATACCGAACT CGACCGCCAG GTCCTCGACC TGATCAAGGA TCCGCTCACC
CACATGGTGC GCAACTCCGC CGATCACGGT CTGGAGCGGC CGGAGGAGCG GGTGCGCAAC
GGCAAGCCCG AGCAGGGCAC GATCCGTCTG TCCGCCTATC ACGAAGGCGG CCATATCGTG
ATCTGCATCG CCGACAACGG CCGCGGCCTC AACACCGAGC GGATCAAGGC GAAGGCGATC
GCCAACGGCC TGGTCACCGA GGCCGAGGTC GAGAAGATGA CCGAGGCGCA GATCCACAAG
TTCATCTTCG CGCCCGGGTT CTCGACCGCC GCCGAAGTCA CCAGCGTGTC CGGCCGCGGC
GTCGGCATGG ACGTGGTGCG TACCAATATC GACCAGATCG GCGGCACCAT CGAGATCAAG
TCGGTGGCCG ACGAAGGCTC CAGCGTCACC ATCAAGATCC CGCTGACGCT GGCGATCGTC
TCGGCGCTGA TCGTCGAGGC CGGCGGCGAT CGCTTCGCGA TCCCGCAGCT CGCGGTGGTC
GAACTGGTGC GGGCGCGGGC CAATTCCGAG CACCGCATCG AGCGTATCAA GGACACTCCG
GTGCTGCGTC TGCGCGACAA GCTGCTGCCG CTGATGCACC TGAAGAAGCT GCTGCACATC
GACGACGGCA CCGCCTCGAC CGAGCCGGAG AACGGCTTCA TCGTGGTGAC CCAGGTCGGC
AGCCAGACCT TCGGCATCGT CGTCGACGGC GTGTTCCACA CCGAGGAAAT CGTCGTCAAG
CCGATGTCGA CCAAGCTGCG CCACATCGGC ATGTTCTCGG GCAACACCAT CCTGGGCGAC
GGCGCGGTGA TCATGATCGT CGATCCGAAC GGGATCGCGC AGGCGCTCGG CACCTCGGTG
TCGGCGCAGC ACGACCTGGC GGAGCAGAGC GCGGCGACCC GCGCGGCCAC GACCGAGCAG
CTCACCTCGC TGCTGGTGTT CCGCTCCGGC TCGCCGCAGC CGAAGGCGGT GCCGCTGTCG
CTGGTCACCC GGCTCGAAGA GATCGCCGCC GACAAGATCG AATCGTCGAA CGGCCGCTAC
ATGGTGCAGT ACCGCGATCA ACTGATGCCG CTGGTGCTGA TGGAGGGCGT CGAGGTCGCG
AGCACGGGTG TGCAGCCGAT CCTGGTGTTC GCCGACGAAG ATCGCAGCAT GGGTCTCGTG
GTCGACGAGA TCGTCGACAT CGTCGAGGAG CATCTGCAGA TTCAGGTCGG CTCCAGCCGC
GACGGCATTC TCGGCTCGGC GGTGATCAAG GGGCTCGCCA CCGAAGTCAT CGACGTCGCT
CACTTCCTGC CGATGGCGTT CTCCGACTGG CTCTCCCGCA AGGAGATGAA GCAGTCGATC
GCCAGCCGGT CGGTGCTCCT GGTCGACGAC TCCGCGTTCT TCCGCAACAT GCTCGGCCCG
GTGCTGAAGG CAGCCGGCTA CAAGGTGCGG CTGGCGACTT CGGCGGTCGA AGGACTCTCG
GTGCTGCGTA GCGGCGCCAG CTTCGACGTG ATCCTGACCG ACATCGAGAT GCCGGAAATG
AACGGCTTCG AGTTCGCCGA GGCGATCCGC GCCGACACGA AACTGGCGCC GACGCCGGTG
ATCGCGCTGT CCTCGCTGGT GTCGCCGGCG GCGATCGAGC GCGGACGCCA GGCCGGCCTC
ACCGACTACG TCGCCAAATT CGATCGTCCC GGCCTGATTG CGGCGCTGAA GGAGCAAACC
GCGAGCACCG AGCGGGTCGA GGCATTGCAG CAGCAGGCGG CATGA
 
Protein sequence
MDDLLREFLT ETFESLDTVD NQLVRFEQEP NNAKILDNIF RLVHTIKGTC GFLGLPRLEA 
LAHAAETLMG KFRDGMPVTG EAVTLILTTI DRIKDILGGL EATEAEPEGD DGDLIGELER
LSMRTPEQIA AELGGSAAPV DEAVPEDAAP VEAAVAAPAV AEGSLVPQTL ERPLRPGEVS
LDELERAFQE TAIEVASPPL APVASEPEKP AEAPPAVVEA AKPAPKPAAK APKKNADAEA
PAEGDRIANQ SIRVNVDTLE HLMTMVSELV LTRNQLLEIS RRHEDTEFKV PLQRLSTVTA
ELQDGVMKTR MQPIGNAWQK LPRIVRDLAA ELGKQIELEM HGADTELDRQ VLDLIKDPLT
HMVRNSADHG LERPEERVRN GKPEQGTIRL SAYHEGGHIV ICIADNGRGL NTERIKAKAI
ANGLVTEAEV EKMTEAQIHK FIFAPGFSTA AEVTSVSGRG VGMDVVRTNI DQIGGTIEIK
SVADEGSSVT IKIPLTLAIV SALIVEAGGD RFAIPQLAVV ELVRARANSE HRIERIKDTP
VLRLRDKLLP LMHLKKLLHI DDGTASTEPE NGFIVVTQVG SQTFGIVVDG VFHTEEIVVK
PMSTKLRHIG MFSGNTILGD GAVIMIVDPN GIAQALGTSV SAQHDLAEQS AATRAATTEQ
LTSLLVFRSG SPQPKAVPLS LVTRLEEIAA DKIESSNGRY MVQYRDQLMP LVLMEGVEVA
STGVQPILVF ADEDRSMGLV VDEIVDIVEE HLQIQVGSSR DGILGSAVIK GLATEVIDVA
HFLPMAFSDW LSRKEMKQSI ASRSVLLVDD SAFFRNMLGP VLKAAGYKVR LATSAVEGLS
VLRSGASFDV ILTDIEMPEM NGFEFAEAIR ADTKLAPTPV IALSSLVSPA AIERGRQAGL
TDYVAKFDRP GLIAALKEQT ASTERVEALQ QQAA