Gene RPD_3015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3015 
Symbol 
ID4023518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3359446 
End bp3360546 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content64% 
IMG OID637963214 
Producthistidine kinase, dimerisation/phosphoacceptor 
Protein accessionYP_570142 
Protein GI91977483 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCTG ACGAAGGGGC GCCGTCTCCG CCGAGCGTCG ATCGGATCTT GGGGTCTTCG 
AAGATTGCGG TGGCGATCGA GAATGACCGC TACAAGCATC TGCTCGATAA TGTTCCCGTG
GCCGTGGCGG TGTCGCGCGG CAGCGGTGAT CATCAGCGGG TCGTCTATAT CAACAAGGCG
TTCGAAGGTT TGGTGTCGCT GCCCCCGGCC GATGTCCAGG GGCAGGGCTG GGCTTGTCTC
GATGCGCTGG TGAACGAGGA TGATCCAGCC CAGACGCTCG GCGCGGCGAT CCGCGACGGC
GAGGATTTCA TCGGCGTATT TCGGCCTGCC GCTTCGTCCG CCCGATTGCT GATCGTGCAG
GCCTACGCCG CGGTGATCGA GAGCGATGAC GGAATCGAGA GCTTCCGGAT CGCGGCACTG
GTCGATGTCG GCGGGCGCGA GCGCGCGCAG ATCGAGCAGT TCGAAACCCA GATCCGCGAA
CGCGACACCC TGATGCGCGA GCTGCAGCAC CGGGTGAAGA ATAATCTGCA ACTGGTGACG
GCACTTGTGC GCCTCGAAGC CCGGTCGGCG GCTGAAGGCG AGAACGTCGC GCTGGCGCGG
CTGGCGAGCC GGATCGATGC GCTGACCGTG CTGTATCGAA TATTGTCGGC AGAGAATGCC
GCCGGCAGTG ATATCGATCT CGGTCAATAT CTCGCCGACA TCACCGAGGC GGTGATGCAG
GCCAATGGCA GCGAAGGGAT CACCTACGAG CTCAGCGTCG GCTATTGCCC GCTGTCGGTC
AATATCGCGA TGCCGGCCGG GCTTCTGGTC AACGAAATGC TGACCAATGC GCTGAAATAC
GCCTTCATCG GGCGCAGCGG CGGCCGCATC AAGGTAATCT GTACCGTCGA AGGTGGCCGC
GTCTCGGTGA TCGTGTCGGA CGACGGGGGC GGTCTGCCGG AAGGTCAGGA GTGGCCGTCG
CCGCGCAAGC TCGGCGCGCT GATCCTGCAG ACCCTGAAAG AGAACGCCCA CAACGTCACG
TTTCGAGCGG AGAGCATTCG CGGCCAGGGC ACGCTGTTTG CGCTCGGCTT CGAAGCGCCG
CCGCCTCCCG CGACGAATTG A
 
Protein sequence
MSSDEGAPSP PSVDRILGSS KIAVAIENDR YKHLLDNVPV AVAVSRGSGD HQRVVYINKA 
FEGLVSLPPA DVQGQGWACL DALVNEDDPA QTLGAAIRDG EDFIGVFRPA ASSARLLIVQ
AYAAVIESDD GIESFRIAAL VDVGGRERAQ IEQFETQIRE RDTLMRELQH RVKNNLQLVT
ALVRLEARSA AEGENVALAR LASRIDALTV LYRILSAENA AGSDIDLGQY LADITEAVMQ
ANGSEGITYE LSVGYCPLSV NIAMPAGLLV NEMLTNALKY AFIGRSGGRI KVICTVEGGR
VSVIVSDDGG GLPEGQEWPS PRKLGALILQ TLKENAHNVT FRAESIRGQG TLFALGFEAP
PPPATN