Gene RPD_3904 details


Gene Information

Locus tag: RPD_3904
Symbol:
ID: 4024420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4342248
End bp: 4343948
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 65%
IMG OID: 637964108
Product: chemotaxis sensory transducer
Protein accession: YP_571026
Protein GI: 91978367
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
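
The coordinates and lengths in this record are mutually consistent, and the arithmetic can be checked with a short sketch. The function names below are hypothetical helpers, not part of any database API, and the sketch assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_bp bases."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_bp // 3
    # The terminal stop codon is part of the gene but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4342248, 4343948)
print(length)                     # 1701
print(protein_length_aa(length))  # 566
```

The results match the Gene Length (1701 bp) and Protein Length (566 aa) fields: 1701 bases form 567 codons, and the last one is the stop codon.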


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAGGC AACAGGTGAA ATTCAGCAAT CTTCGGATTA CCCCAAAACT CGCTATTCTC 
GTCTCGGTCG CGATGATCGG CCTATGTGCG GCGGGCGTCT ATGCCGGCAA CATGATGAAG
CAGGAACTGC TCGCGGCGCG TATCGAACAG ACCAAGTACT TCGTCGACAT GGGCCGCAAC
ACCGCATTGG CGTTGCAGAA GGAAGTCGGT GCCGGCCAGC TCACCAAAGA GCAGGCGATG
GCCGAGCTCA CCCGTCGCTT GACGTCCATG ACGTTCGACA GCGGCAACGG TTACTTGTTT
GCCTATGGCT ATGACGGCAT TGCCATTGCT GCGCCGGACA AGAAGGCGAT CGGTCAGAAT
TTCCTCAACT ACGCCACCGG CACCCGCTAT TTGCTGCGCG AGATGCGCGA GGGCGTGACC
TCCAAGGGCG AGGTGACGCT GTACTATGAT TTCCAGCGCC CCGGTTCCGA GGAACTGCGC
CGCAAAGTGT CCTACGCTGT TGCCATACCG GGTTGGAATA TGTTGATCGG CTCCGGCGCT
TATCTCGACG ATCTCGACGC CCGGCTGATG CCCATCGTCT GGGCGCTGGT GGCGGCGATT
CTCGGCATCG GCGTGGTGTG CGGCGCGATC GCCTGGGTCA TCGGCCGTTC GATCACCCGG
CCGTTGGCGA TGCTCGGCGA CCGGATGCAG TCGCTCGCCG AAGGCCAGCT CGACGCCGAT
ATTCCGGGCA CCGATCGCGG CGATGAAGTG GGCGCGATGG CCAAGTCGGT GCAGGTGTTC
AAGGACAACG CGATCCGCAT CCGCGGCTTG GAGCAGATCG AAGCCGAGAC ACAGCAGCGC
GCCGCCGAAG AACGCCGCCG CACCATGCTG GAGATCGCCG GCGACTTCGA GCGCAGCGTC
AGCGGCATCG TCGGCTCGGT GGCCACCGCC GCCCGCAACA TGCAATCGAC GGCGCAATCG
ATGACGGCGA CCGCCAGCGA TGCTTCGGCG CGTGCTGCGA CGGTGGGGTC GGCGTCGCAA
TCGACCTCGA CCACCATCGG CACGGTGGCG GCGGCGGCCG AGGAACTGTC GAGTTCGGTC
GGCGAGATCG CGCGCCAGGT GGCGCAGTCG CGCGAAGTCG CCAGCAAGGC GGTGATTGAC
GCCGAGCAGA CCAATGCCAC GGTGCAGTTG CTGTCGACCG GCGCCGAGAA GATCGGCGAG
GTGGTGCAGC TCATTCACTC GATCGCGTCG CAGACCAATC TGCTGGCGCT GAACGCCACC
ATCGAGGCGG CGCGGGCCGG CGAATCCGGG CGCGGTTTTG CGGTGGTCGC CTCAGAAGTG
AAGGCGCTCG CCAGCCAGAC CGCCAAGGCC ACTGAGGAAA TCTCCTCCCA GGTCGCGGCG
ATGCAGACCT CGACCAACGA TGCGGTGCAG TCGATTTCCG GCATCGGAGC GACCATCGCG
AAGATGAGCG AGATCACGGT GGCGATCTCC GGCGCCGTGG AGGAGCAGGG CGCGGCGACC
CGCGAGATTG CCCGCAACAT CCAGTCGGTG GCGGCCGGGG CCAACGAGGT TCATGATCAC
ATCGGCGGCG TCGCTTCGGC GGCGGAAGCC ACCGGTCAGG CCGCGTCCGA AGTGTTGTCG
AACGCCCGCG ATCTGGACAG TCAGTCCGGC ATCCTGCGCG CGGCGGTCGA TCAGTTCCTC
GACAAGGTGC GCGCGGCGTA A
 
Protein sequence
MRRQQVKFSN LRITPKLAIL VSVAMIGLCA AGVYAGNMMK QELLAARIEQ TKYFVDMGRN 
TALALQKEVG AGQLTKEQAM AELTRRLTSM TFDSGNGYLF AYGYDGIAIA APDKKAIGQN
FLNYATGTRY LLREMREGVT SKGEVTLYYD FQRPGSEELR RKVSYAVAIP GWNMLIGSGA
YLDDLDARLM PIVWALVAAI LGIGVVCGAI AWVIGRSITR PLAMLGDRMQ SLAEGQLDAD
IPGTDRGDEV GAMAKSVQVF KDNAIRIRGL EQIEAETQQR AAEERRRTML EIAGDFERSV
SGIVGSVATA ARNMQSTAQS MTATASDASA RAATVGSASQ STSTTIGTVA AAAEELSSSV
GEIARQVAQS REVASKAVID AEQTNATVQL LSTGAEKIGE VVQLIHSIAS QTNLLALNAT
IEAARAGESG RGFAVVASEV KALASQTAKA TEEISSQVAA MQTSTNDAVQ SISGIGATIA
KMSEITVAIS GAVEEQGAAT REIARNIQSV AAGANEVHDH IGGVASAAEA TGQAASEVLS
NARDLDSQSG ILRAAVDQFL DKVRAA
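
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It rests on several assumptions: the helper names are hypothetical, the sequence blocks are separated only by whitespace, and the codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). Only the first 30 bases of the gene are used, so the GC value printed is for that prefix, not for the whole gene.

```python
# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format sequence blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCGGAGGC AACAGGTGAA ATTCAGCAAT"
print(translate(prefix))             # MRRQQVKFSN
print(round(gc_content(prefix), 2))  # 46.67
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would be the natural way to compare against the GC content and Protein Length fields.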