Gene RPD_3836 details


Gene Information

Locus tag: RPD_3836
Symbol:
ID: 4024352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4273282
End bp: 4275267
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 66%
IMG OID: 637964040
Product: Cache, type 2
Protein accession: YP_570958
Protein GI: 91978299
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein; [COG4564] Signal transduction histidine kinase
TIGRFAM ID:
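
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is an illustration only, not part of the database. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS
    ends in exactly one stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(4273282, 4275267)
print(gene_len, protein_len)
```

For this record the result is 1986 bp and 661 aa, which agrees with the Gene Length and Protein Length fields.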


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.337162
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTGC CCCGGCTTTC GCTCGGTTTT CAGATCTACT CGATTATTGC ACTCAGCTTC 
TGCGGCCTGA TCGGCCTCGC GATCATGCAG GCTAATACTC TTGACGACGC GCTGCGGGAG
CAACGCCACA ATGAGTTGGT GCACATGACT GAGCTCGCCC TCGGCATCGC CCGCGACGAG
CACGACGCCG CCACCCGCGG CCTGGTCACC CACCAGCAGG CACAAATCAC CGCGGCGGAA
CGGATCGCCA AGATGCGCTA CGGCAATGGC GACTATTTCT GGATCAATGA TCTCGGTCCG
AAGATGGTGA TGCATCCGGT CAAGCCTGAG CTGAACGGCA AAGACCTGTC CGACGAAAAG
GATCCGACCG GCAAGCGCCT GTTCGTGGCG TTCGTCGATA CCGTGAAAGC GAGCGGCGCC
GGCTTCGTCG ACTATCGATG GCCGAAGCCC GGCGCCGACA GGCCGCAGCC GAAGATCTCC
TATGTCGCCG GTTTTCAACC CTGGAACTGG GTGATCGGCA CCGGCGTCTA TGTTGATGAT
CTCGAGGCGC AGGTCTGGAG CAGCATTGAG CGGGTCATGG TCGCAGCCGG ACTGATCGTC
GTGTTGCTCG GCGCAGTGAC GCTGTTCATC GCGCGCCGGA TGTCGTCAGC GCTCGGCGGG
ATGTCGGCCG CGGTGACGCG GTTGGGCGAC GGCGATTTCG AGATCAGGCT GCCGGGGCTC
GATCGCTCCG ACGAACTCGG CGACATGGCG CGCGCGATCG AGGCATTCAA GGTCAAGGCG
ATCGAAAAGG CCCGCGCCGA CACCGCGCGC GACGCGGAGC TCCGCCGCAT CGCCGAAGAC
GCCAAGCGGC AGGCGCTGCG CGACATGGCC GATACGGTCG AGCGCGAGAC CGCGATTGCT
GTCGACCAGG TCGCCAGCCG CACCGACCTG ATGGCGCGTG GCGCAGTGCT GATGACCACC
GGCGCGGAAA TGCTCGGCCA GCAGAGCAGT GGCGTCGCCG CAGCCGCCGA GCAGGCGCTG
GCGACATCGC AGATGGTCGC CGCCAGTTCG TCCGAGCTGA GCGCGTCGAT CAACGAGATT
GCCGCCAAGG TCACGTCGTC GCGCAATCTG ACGCTCCGGG CCGTCAATGC GTCCAGCCAG
GCCGAGACGA TGATCGCCAA GCTGTCCGCA GCGGCGACGC GGGTCGGCGC GGTTACCAAT
CTGATCAGCG AGATCGCCGG TCAGACCAAT CTTCTGGCGC TCAACGCCAC CATCGAGGCG
GCTCGCGCCG GCGACGCCGG CCGCGGTTTC GCCGTGGTCG CTGCGGAAGT GAAGTCGCTG
GCTGAGCAGA CCGCCAAGGC CACAGGCGAG ATCTCGCAAC AGATCGACGA GATCCAGCAG
GCCACATCCG ACTCGGTCGA ATCGATCAAC GCGATCGGCG ATGCGATCCG CAACGTCGAC
CAGGTATCGG CGGTGATCGC GACCTCGATC GATCAGCAGA GCACGGTCAC GCGCGAGATC
GCTCGCGCCG TCGCCGAGAC CTCACAGGCG GCGCGCAACG TCGCGGCGCA GATCGTCACG
GTCTCGAGCG AAGCGGTGAA AACGGGACGG CGCGCTGTCG AGATTCAGGA CGGTTCGGTC
GAAATCGCCA GCCGGATTGA CAATCTGCGC GGCGTTCTCA CCCGCGTGAT CCGCACCTCG
ACCGCCGACG TCGATCGCCG CGCGCACGCC CGGATCGACA TCGAGCGCCC CGGAACGATC
GAGGCCGGAG GCAGAACTTA CGCCGTGCAA GTGCGGGATA TTTCCGAAGC CGGCGCGCGG
CTCGCCGACG CGATCGAAGC GCTTGGCCCG GATTGCGCCG TCACACTCGG CATCGACGGC
TTGCCCGGCA AGCTGCACGG TGTGATCGTC GCCAGCGACC CCGATCGGAC GTTGGTGAAG
TTCGACCTGT CGGAGCCGCA GCAACAGATC ATCCGCAGCT TCGTCGCCCA TCGCCGCGCC
GCGTAG
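
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any calculation such as the GC content reported in the Gene Information section. The following is a minimal sketch, assuming the sequence has been copied into a string. The helper name `gc_fraction` is hypothetical, and only the first row of the sequence is used as sample input.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks) is
    removed first; the input is treated case-insensitively.
    """
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, used here only as a sample.
first_row = (
    "ATGAAATTGC CCCGGCTTTC GCTCGGTTTT "
    "CAGATCTACT CGATTATTGC ACTCAGCTTC"
)
print(f"{gc_fraction(first_row):.2%}")
```

Passing the full 1986 bp sequence instead of the sample row gives the value to compare against the GC content field.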
 
Protein sequence
MKLPRLSLGF QIYSIIALSF CGLIGLAIMQ ANTLDDALRE QRHNELVHMT ELALGIARDE 
HDAATRGLVT HQQAQITAAE RIAKMRYGNG DYFWINDLGP KMVMHPVKPE LNGKDLSDEK
DPTGKRLFVA FVDTVKASGA GFVDYRWPKP GADRPQPKIS YVAGFQPWNW VIGTGVYVDD
LEAQVWSSIE RVMVAAGLIV VLLGAVTLFI ARRMSSALGG MSAAVTRLGD GDFEIRLPGL
DRSDELGDMA RAIEAFKVKA IEKARADTAR DAELRRIAED AKRQALRDMA DTVERETAIA
VDQVASRTDL MARGAVLMTT GAEMLGQQSS GVAAAAEQAL ATSQMVAASS SELSASINEI
AAKVTSSRNL TLRAVNASSQ AETMIAKLSA AATRVGAVTN LISEIAGQTN LLALNATIEA
ARAGDAGRGF AVVAAEVKSL AEQTAKATGE ISQQIDEIQQ ATSDSVESIN AIGDAIRNVD
QVSAVIATSI DQQSTVTREI ARAVAETSQA ARNVAAQIVT VSSEAVKTGR RAVEIQDGSV
EIASRIDNLR GVLTRVIRTS TADVDRRAHA RIDIERPGTI EAGGRTYAVQ VRDISEAGAR
LADAIEALGP DCAVTLGIDG LPGKLHGVIV ASDPDRTLVK FDLSEPQQQI IRSFVAHRRA
A