Gene RPC_4603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4603 
Symbol 
ID3972094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5137551 
End bp5139902 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content66% 
IMG OID637927714 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_534444 
Protein GI90426074 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAG GCAACTCCGA ATTCGACGTC GCGGCTGTGG TCGATGCCCT TGACGTCGGC 
GTTATCGTGC TGGACGAGCG CAGCGGCATC GTCGGTTGGA ACGATTGGAT CGCCCGGGTC
AGCAAGCTGC CGCGGTCCGA TGCGCTCGGA AGCAACCTGA TCGCGCTGTT TCCCAATCTC
GGCGGCACCC GGCTGCCCTC GGTGATCGCG GATTGCCTGC AGACCGGTAG CTCCAGCATT
CTCACCCATT CGCTCAACAA GTTGCTGCCG TTGCAGAACG AGAGCGGCGA AGAACTGCTG
CACAACATCG TGGTGCGACC GGTGTGGTCG AACGGCGCGC CGCGTTGCAT GTTGCAGATC
AACGATGTCA GCGTTCCGGT GGCCCGCGAG CGGGTGCTGC GCGAGCGGCA GAACGCGCGC
TACCACGCCA TCGTCGATTC CGCGCCGGAT GCGATCATCA CCATCGGGCT CGACCGCACC
ATCCAATGGG CCAACGGCGC GGCCGAACAA GTGTTCGGCT ACGAACTCTC CGAGCTGTTG
GACCACAAGC TCGATATGCT GCTGGAGCAC GATGATGATG GGCTGGTGCA GGCCTTCGTC
GACCTGCAGC ATTCGAACTC CTGCACGCTG CAGGTCAACG GCCGCCGCAA GTACGGCGAG
TTGTGCCAGT TCGAGGTGTC GCTCGGTCGC TGGAAGGCCG ACGAGCGGGT TTTCATCACC
ACGATCTGGC GCGACGTCAC CGAACGCACC ACCGCGGAAG CCGCCTTGCG CGACAGCGAA
GGCCGCCACC GCGCGCTGCT GGAGGCGTTG CCGCAGCTGG TGTGGACCTG CGACCAGCAC
GGCGAATGCG ATTACTTCAA TCCGCAATGG CAAGCCTACA CCGGCGCGCC TTTCGGCGAG
CATCTGGGCT CCGGCTGGCT CAAGGCCATC CACAAGGACG ACAGCGCCGC GTTCACCGAA
TCCTGGGCCG CGGCGCTGGC CGATGGCGGG GTATTCGACG TCGACGCCAG GCTTTGCCGC
AAGGACGGCA GCCACCGCTG GTTCAAGCTG CGCTCGATCC CGGTGCTGAC GCCGGACGGC
AGCATCAGCC GTTGGTTCGG CACCGCGACC GACATCACCG ACCACGTCGA GGCTCGCGAG
TCGCTCCGCC GCAGCAACGA AGAGCTCGAG GCGCTGGTGC AGGAACGCAC CCGCGAGCGC
GAATTGGCGC TGAACCAGCT GCACGAAGCG CAGAAGATGG AGACCATCGG CCAGCTCACC
GGCGGCGTCG CGCACGATTT CAACAATCTG CTGGCGGTGA TCCTCGGTAG CCTGTCGCTG
CTGAAGAAAT GGCTGCCGGA CGATCCGCGC ACCTCGCGGC TGCTCGACGG CGCGCTGCAG
GGCGCCGAAC GCGGCGCCAC GCTGACCAAG CGGCTATTGG CGTTCGCGCG GCGCCAGGAG
CTGAAGCTCG AAGCGGTGGA AATCCAGAAG CTGATCCCCG ACATGATGGA CTTCCTGCGC
CAATCGCTGG GACCGAACAT CAACATCAGC ATCGACATCC CCCCCGACGT CGAGCCGGTC
AAGATCGACG CCAACCAGCT CGAACTGGCG TTGATGAATC TGGCGGTGAA CGCCCGCGAC
GCGATGCCGT CCGGCGGCGC GCTGGTGATC ACCTGCCGCA ACGACGGCGC CGCCTCCAGC
GAACGGCCGA AGGGACTGCC GGACGGCGAC TACGTCTGCA TCAGCGTTGC CGATACCGGC
GAGGGGATGG ATCAGACCAC GCTGGCCAAG GCGATGGAGC CGTTCTTCAC CACCAAGGGA
TTGGGCAAGG GCACCGGGCT CGGGCTATCG ATGGTGCAAG GCCTGACCGC GCAGTCCGGC
GGCGCGATGA CGATGAGCAG CGAACTCGGC GACGGCACGG TGGTGAACCT GTGGCTGCCG
CGGGCGCGGC GCGAGGACAT GATTCATCCG GCGACCGCGC TGGCGCCGCT GGCCCGCGAT
GCCGCCAGCC AACAATTGCG CATCCTGCTG GTCGACGACG ACCCATTGGT GCGGATGAAC
ACCGCCTATC TGTTGATGGA TCTCGGCCAC AGCGTGATGG AGGCGCAGTC CGGCGCCCAG
GCGCTGCAGC TGCTCGGTTC CGACGCCCGG TTCGACGTGC TGCTCACCGA CTACGCGATG
CCGGGCATGA CCGGGCTCGA CCTGGCGACC AGGGTGAAGA TCGTCAAGCC GAAGCTCCCG
ATCGTGCTGG CCACCGGCTA TGCCGAATTG CCGCCGGACG CGCTGCTCGG TTTCCCGCGG
CTCGGCAAGC CCTACACCCA GGAGCAACTG GCGGAATCGC TGGAAGCCGC GATCCGCGAG
CGGGTGAACT AA
 
Protein sequence
MSEGNSEFDV AAVVDALDVG VIVLDERSGI VGWNDWIARV SKLPRSDALG SNLIALFPNL 
GGTRLPSVIA DCLQTGSSSI LTHSLNKLLP LQNESGEELL HNIVVRPVWS NGAPRCMLQI
NDVSVPVARE RVLRERQNAR YHAIVDSAPD AIITIGLDRT IQWANGAAEQ VFGYELSELL
DHKLDMLLEH DDDGLVQAFV DLQHSNSCTL QVNGRRKYGE LCQFEVSLGR WKADERVFIT
TIWRDVTERT TAEAALRDSE GRHRALLEAL PQLVWTCDQH GECDYFNPQW QAYTGAPFGE
HLGSGWLKAI HKDDSAAFTE SWAAALADGG VFDVDARLCR KDGSHRWFKL RSIPVLTPDG
SISRWFGTAT DITDHVEARE SLRRSNEELE ALVQERTRER ELALNQLHEA QKMETIGQLT
GGVAHDFNNL LAVILGSLSL LKKWLPDDPR TSRLLDGALQ GAERGATLTK RLLAFARRQE
LKLEAVEIQK LIPDMMDFLR QSLGPNINIS IDIPPDVEPV KIDANQLELA LMNLAVNARD
AMPSGGALVI TCRNDGAASS ERPKGLPDGD YVCISVADTG EGMDQTTLAK AMEPFFTTKG
LGKGTGLGLS MVQGLTAQSG GAMTMSSELG DGTVVNLWLP RARREDMIHP ATALAPLARD
AASQQLRILL VDDDPLVRMN TAYLLMDLGH SVMEAQSGAQ ALQLLGSDAR FDVLLTDYAM
PGMTGLDLAT RVKIVKPKLP IVLATGYAEL PPDALLGFPR LGKPYTQEQL AESLEAAIRE
RVN