Gene RPB_0545 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0545 
Symbol 
ID3909584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp610543 
End bp613704 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content67% 
IMG OID637882433 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_484167 
Protein GI86747671 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0920041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCT CGACCCGCCT GACGCTGGCG ATGGTGGCGC TGGTCCTGGT GACCACGGCG 
GTCCTCGGCT TTTTGAACTA CCGCAGCATC GTCGAATTGG TGATGCCCCG CGCCTTGCTG
CAATTGCAGA CGCGCTCGCA GCTTCATGCG CTGCTGATGG ATGCGTATCT GCGCGGCACG
CGCGCGGACG CGGTCAGTGC GCAGGCCTCC ACGACCCTCC GCGATCTGCT CGCCAACCGC
GACCGCGCCG GCGAGGCGAC CCCCGACTGG CGTCGGCGAA TCGAGGATCG GTTCGCCGCG
GAAATGGCCG GGAAGCCGAA CTCCGCCATC CTGCGCATCC TCGGGCCCGA TGACGGCGGC
CGCGAACTGG TGCGGGTCGA CCGACTGGGC CCGAACGGCA CGATCCGCGC GACACCGGCG
GCGCAGCTGA CGCGACGCGG CGACCGCGGC TATTTCAAGC AAGGCATGGC GCTGCCACCC
GGCGAGGTGG CGATCTCCCG GATCGAACTG AACAAGACGG CGACCGGACT GGAAATTCCC
CACGTCCCCA CCGTGCGCAT CATGGCGCCG ATCGATGCCG GCGACGGCAC GCGGCTCGGC
GTGCTGGTGA TCAACACCAA TCTCGGCGCG CTGTTCGAGC GGGTCCGAAA CAGCGTCGTC
GTCGGCAACC TGACCTACAT CGTCAACGAC GCCGGCGACT ATCTGCTGCA TCCCGACCGC
TCGCGCGAAT TCGGCTTCGA TTTCGGCAGG CCCAGCCGCG TCCAGGATGA TTTCCCGGCA
TTCGCGGCAC TGCTCGACGG CAAGGACACC ACCCCGCGCG TGATCGAGGA TCGCACCGGG
CAACGCTTCG GCGCCGGCTG GGGCTGGGTG CGTCTCGCCG ACGGTCCGAC GATCGGCGTG
ATCGAGATGC GGCCGTATCA ATCCCTGACC TCGGTGCAGA CGGCGGTCCG CAATTCGACG
CTGACGGGCG GTGCGGTCGC GGTGCTGGTG GCGATGCTGA TGGCGTTTCC GCTGGCGAAG
TCGCTGACCC GGCCGCTGGT CCGCATCACC CGCTCGGTTC AGGCGTTCGC GCGCGGCGAA
AAATTGGAGC TCGCTCCCGG CGGCGGCCAG GAGATCAAGC TGCTGGCCGA AGCCTTTTCA
CAGATGACCC GCGAAGCCGA GCGCAAAGCG ATCGCCCTCG CCGAGGAGGC GGAAGAGCGC
AGCCGCATCG CCGGGGTGCT GCAGAACACC ATCGACAACA TGGTCGATCC GGTGCTGGTC
GCGGATGCGC GCGGCATGGT GATGCTCACC AATCCGGCGG CGCGCGACCT GTTCGGGACG
CTGTCCGGCG TCGGCCTGCT CAACACCACC CGCTCGTTCG ACCGCTACTA TCCGGACGGA
ACCCCGCTGC CGCCCGACCA ATCGCCGCTG CTGCGCGCCT ATCGCGGCGA AACCATCGTC
AATTTCGAAT TCATGGTGCA GCCGATCGGA TCGGACCGCA GATCCTATCT GATCGCGAAC
GGCCGGCCGC TGCGCAGCGA GACCGGCGAG AACCAGGGCG CGGTGATGGT GTATCACGAC
ATCACCCAGA CCAAGACGGC CGAGAAGGCG TTGCGCCGCA GCGAGCAGAT GGCGCGGGCG
ATCATCGACA CCGCGCTCGA CGCCTTCGTG CAGATCGACG CGCTCGCCAC GATCACCGAA
TGGAGCCCGC ACGCCGAGGA AATGTTCGGC TGGCGTCGCG CCGAAGCGAT CGGCCGCAAC
GTGTTCAAGT TGATCATCCC GCCCGACCAG TTCGAGCAGC GCACCGAGGA GTTCAAGCAG
TTCGCATCGA CGCTCGGGCG CGACAGCTCG GGCTTCCGCG TCGAGATCGA ATCGATCCGC
AACAACGGCA CGCCGATCCC GCTCGAAGTC GCGATGACCG CGCTGTATCG CGACGGCAGC
TTCGTCATCA ACGCGTTCCT GCGCGACCTG ACCGAGAAGA TCGCGTTCGA GGAGCAACTC
CGGCAATCGC AGAAGATGGA GTCGATCGGC CAGCTCACCG GCGGCATCGC GCACGACTTC
AACAACATGC TGACGGTGAT CACCGGCACC ATCGACATCA TCAGCGACGG CGTCGCCGAG
CAGCCGCATC TGGCGAACAT CGCCAAGCTG ATCAGCGAGG CGGCCGATCG CGGCGCCGAG
TTGACCCGGC TGCTGCTCGC CTTCGCCCGC AAGCAGCCGC TGCGGCCGGA CGACACCGAC
GTCAACGCCC TGGTGGCCGG GCTGCAGAGC CTGCTGCGGC CGACACTGGG CGAACAGATC
GAAGTCGAGA CCTCGCTCGC GGACAACGTC TGGCCGATTT ACGTCGATCG CGGCCAGCTC
GAATCCGCGC TGGTCAATCT CGCGGTCAAC GCCCGCGACG CGATGCCGAA CGGCGGCAAG
CTGACGCTGG AGACCTGCAA CATCGTGGTC GACCAGGAAT TCGCCAGGCG GCTCGGCAAT
GTCGAGGTCG GCTCCTATGT GATGATCGCG ATCACCGATT CGGGCTGCGG CATTCCCGAG
GCGATCCGCG GCAAGGTGTT CGATCCGTTC TTCACCACCA AGGACATCGG AAAAGGCACC
GGGCTCGGGC TGAGCATGGT GTACGGCTTC ATCAAGCAGT CCGGCGGACA CATCACGCTG
TACAGCGAGG AAGGCCTCGG CACCACGTTC CGGCTGTATC TGCCGCGCGC CACCGCCGAG
ATCGAACGCC AGGCGCCGGC GACGTCGGAG CAGGGCGCGA TCGGCGGCAC CGAGACCATC
CTGGTCGTCG AGGACGACGC CATGGTGCGC AGCTACGTCA ACGCTCAGCT CAAGAGCCTC
GGCTACACCA CCCTGTCGGT CGGCAACGCC ACGGCCGCGC TGTCGATCGG AGACAGCGAC
ACGCAATTCG ACCTGCTGTT CACCGACGTG GTGATGCCGG GCCCCTACAA CGGCGTGCAG
CTGGCCGCCG AAATGAGCCT GCGCCGGCCC GGGCTGAAGG TGCTGTTCAC CTCCGGCTAT
TCCGAGAACG CCCTGATCCA CAACGACCGG ATCGATTCCG ATATCCTGCT GTTGTCGAAG
CCGTATCGGC GCAGCGATCT CGCCCGCATG ATCCGCCTCG CGCTGAGTCC CGCCGCCGAC
ACCGTGACGG CAGCCGGCGA AGTGGACAAT GCGACGGTCT GA
 
Protein sequence
MKLSTRLTLA MVALVLVTTA VLGFLNYRSI VELVMPRALL QLQTRSQLHA LLMDAYLRGT 
RADAVSAQAS TTLRDLLANR DRAGEATPDW RRRIEDRFAA EMAGKPNSAI LRILGPDDGG
RELVRVDRLG PNGTIRATPA AQLTRRGDRG YFKQGMALPP GEVAISRIEL NKTATGLEIP
HVPTVRIMAP IDAGDGTRLG VLVINTNLGA LFERVRNSVV VGNLTYIVND AGDYLLHPDR
SREFGFDFGR PSRVQDDFPA FAALLDGKDT TPRVIEDRTG QRFGAGWGWV RLADGPTIGV
IEMRPYQSLT SVQTAVRNST LTGGAVAVLV AMLMAFPLAK SLTRPLVRIT RSVQAFARGE
KLELAPGGGQ EIKLLAEAFS QMTREAERKA IALAEEAEER SRIAGVLQNT IDNMVDPVLV
ADARGMVMLT NPAARDLFGT LSGVGLLNTT RSFDRYYPDG TPLPPDQSPL LRAYRGETIV
NFEFMVQPIG SDRRSYLIAN GRPLRSETGE NQGAVMVYHD ITQTKTAEKA LRRSEQMARA
IIDTALDAFV QIDALATITE WSPHAEEMFG WRRAEAIGRN VFKLIIPPDQ FEQRTEEFKQ
FASTLGRDSS GFRVEIESIR NNGTPIPLEV AMTALYRDGS FVINAFLRDL TEKIAFEEQL
RQSQKMESIG QLTGGIAHDF NNMLTVITGT IDIISDGVAE QPHLANIAKL ISEAADRGAE
LTRLLLAFAR KQPLRPDDTD VNALVAGLQS LLRPTLGEQI EVETSLADNV WPIYVDRGQL
ESALVNLAVN ARDAMPNGGK LTLETCNIVV DQEFARRLGN VEVGSYVMIA ITDSGCGIPE
AIRGKVFDPF FTTKDIGKGT GLGLSMVYGF IKQSGGHITL YSEEGLGTTF RLYLPRATAE
IERQAPATSE QGAIGGTETI LVVEDDAMVR SYVNAQLKSL GYTTLSVGNA TAALSIGDSD
TQFDLLFTDV VMPGPYNGVQ LAAEMSLRRP GLKVLFTSGY SENALIHNDR IDSDILLLSK
PYRRSDLARM IRLALSPAAD TVTAAGEVDN ATV