Gene RPB_4665 details

Gene Information

Locus tag: RPB_4665
Symbol:
ID: 3912483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5276893
End bp: 5278905
Gene Length: 2013 bp
Protein Length: 670 aa
Translation table: 11
GC content: 67%
IMG OID: 637886570
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_488259
Protein GI: 86751763
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
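
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(5276893, 5278905)
print(length, protein_length_aa(length))  # 2013 670
```

Both results agree with the Gene Length (2013 bp) and Protein Length (670 aa) fields.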


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGGGC TGGTGCGCGC GGTCCCGATC CGCTGGCGCA TCCTGTCGAT CGCGGCGCTG 
AACTCGCTGG TGGTGCTGAT CCTCGCCAGC ATGATCTGGA GCGGCGCGCG GGTGCTGAGC
TCGGCCTGGG ACGACGTCCG CCAGGTCCGC GAATCCGACA AAGTCCTGGC GCTGCTGGAG
AGCGAGACCA GCCGGCTGCA GAACCTGATC CACCGCTACA TCAATCAGCC GAGCCCGGAC
CTGTTCGCCG AAATCCTGCT GCTGCGCGAG GCGGTGCTGG GCACGCTCAC CAATCGCGCA
TCGACCGACC CGATGCTGTC GGGATCGGTC GCCGAGCTCG AGCGCGTGAC CGAACGCTTC
CTCAACGGCT TCGGCGACCT GCGCGCGCTG CAGGGCAAGA TCAAGGACAC CTACGAAAAC
CAGGTGCAGG GGCCGGCGCG CGACATCGCC GGGCTGTATT CGATCATCGA GGGCGCGACC
GGGCGGCGCG ATGCGGCGAT CTGGCCGCCG CTCGGCCGCT CGCGCGAGGC CTTCACGGCG
ATGTTGGTGG CGGCCAATTC GTACTATCTG TCGCTGTCCA AACAGGCCGC CGAGGACGCC
CGTCGCAACA CCGAGACGAT CGAGCGCACC GTGCCGGTGA TGATCGAGTT CGCCGACAAC
GACCTGCAGA AGATGGCGCT GCAGCGGCTG AAAGCCCGCG TCGCCGCCCT CCGCGACGGA
CTGCAGACGC TGTCGGATCA GCTCGCGAGC CGCAGCGATC TGCTGCGCAA TGCGATCGAC
GCCAGCCAGG CCTCGACCAT CGGCGCGATC GATGCGCTCT CGGTGAAGAT GCGCCAGCGC
GAACAGAAAG CGCAGGAGAC GTTCGACCGG ACGCTCACCG GCATCTCGCG CAAGGTGCTG
TGGATCGCGG TGATCTTCAT CGGCGTGATC ATGATCGTCG GCACCCTGAT CGCGCTCAGC
ATTCGGCTGC CTCTGCAACA GATTTTGGCG GCGATGCACG CGATCACCTC CGGCGACTAC
GGCCGCCGCG TCGCCGGCAC CGAAGCCCGC GACGAAGTCG GCGCGATGGC CCGCGCGGTC
GAGGTGTTCC GCGAGAACGC CATCGACAAG CGCCGGGCCG AAGACGATCT GCGCGCCTCC
AAGGAGAAGG CCGAGAGCGC GCTGCTCGAA TTGAATGCCG CGCAGCAGAA TTTGATCGAC
GCCGAACGGC TGGCGGCGCT CGGCGGACTC GTGGCCGGCG TCGCCCACGA GGTCAACAAC
CCGATCGGAA TCAGCCTGAC GGTAGCGTCG AGCTTCAGCC GCAAGACGAC GATGTTCGAG
GCGGATCTGA AAGGCGATGC GCCGCTGCGG CGGTCGCAGC TCGACGAATT CGTCCGCTCC
TCCCGCGACG CCGCGCAGCA GTTGGTCGCC AACCTGCAAC GCGCCGCCGA ACTGATCCAG
TCGTTCAAAC AGGTCGCGGT CGACCGCTCC CACGCCGAAC GCCGGCAATT CAATCTGCAC
GAGGCCACCG ACCAGATCGT CGCCAGCCTC AAGCCGGTGC TGAAACGGGC GCCGATCGAG
CTCGCGATCG ACGTGCCCGA CGGCCTGGTG ATCGACGGCT ATCCGGGCGC TTACGGCCAG
ATCCTGACCA ACCTGTTCCT CAACGCCGCC AACCACGCCT TCGCCGACGG CCGCGCCGGC
CGGATCGCCA TCACCGCGCG ACCACGCGGC GACGACGTCG AGATCGTGTT CGCCGACAAC
GGCGCCGGCA TGACCCCCGA CGTGCAGCGC CAGGCGTTCG ACCCGTTCTT CACCACCCGC
CGCAACGAAG GCGGCACCGG GCTCGGCCTG CACATCGTTT ACAATCTGGT CACCCAGCAG
CTCGGCGGGC GGATGATGCT GGAATCAAGG CTGGGACAAG GCACGACTTT TCGCATTATC
ATGCCCCGCA CCGCCACCGG CGGGCCGACC GACGCCGACA CGCCCATCGA CGGAAATGTG
AAATGGCCGA CCAGGACGAT ATCCTCCAGC TGA
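
The GC content field can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the same 10-base, space-separated blocks; `gc_fraction` is a hypothetical helper name. Only the first line of the sequence is used here for brevity, so its result is not expected to equal the 67% reported for the whole gene.

```python
def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGGTCGGGC TGGTGCGCGC GGTCCCGATC CGCTGGCGCA TCCTGTCGAT CGCGGCGCTG"
print(f"GC fraction of first line: {gc_fraction(first_line):.2f}")
```

To check the record's value, pass the full gene sequence text instead of `first_line`.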
 
Protein sequence
MVGLVRAVPI RWRILSIAAL NSLVVLILAS MIWSGARVLS SAWDDVRQVR ESDKVLALLE 
SETSRLQNLI HRYINQPSPD LFAEILLLRE AVLGTLTNRA STDPMLSGSV AELERVTERF
LNGFGDLRAL QGKIKDTYEN QVQGPARDIA GLYSIIEGAT GRRDAAIWPP LGRSREAFTA
MLVAANSYYL SLSKQAAEDA RRNTETIERT VPVMIEFADN DLQKMALQRL KARVAALRDG
LQTLSDQLAS RSDLLRNAID ASQASTIGAI DALSVKMRQR EQKAQETFDR TLTGISRKVL
WIAVIFIGVI MIVGTLIALS IRLPLQQILA AMHAITSGDY GRRVAGTEAR DEVGAMARAV
EVFRENAIDK RRAEDDLRAS KEKAESALLE LNAAQQNLID AERLAALGGL VAGVAHEVNN
PIGISLTVAS SFSRKTTMFE ADLKGDAPLR RSQLDEFVRS SRDAAQQLVA NLQRAAELIQ
SFKQVAVDRS HAERRQFNLH EATDQIVASL KPVLKRAPIE LAIDVPDGLV IDGYPGAYGQ
ILTNLFLNAA NHAFADGRAG RIAITARPRG DDVEIVFADN GAGMTPDVQR QAFDPFFTTR
RNEGGTGLGL HIVYNLVTQQ LGGRMMLESR LGQGTTFRII MPRTATGGPT DADTPIDGNV
KWPTRTISSS