Gene RPD_3107 details


Gene Information

Locus tag: RPD_3107
Symbol:
ID: 4023612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3453404
End bp: 3455428
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 62%
IMG OID: 637963308
Product: chemotaxis sensory transducer
Protein accession: YP_570234
Protein GI: 91977575
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
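
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is an untranslated stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3453404, 3455428)
print(length)                     # 2025
print(protein_length_aa(length))  # 674
```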


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0664997
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCTGA ATCGGATTGG TAACAAACTT GGCCTCGTAG GACTCCTGGG AGTTGTCCTG 
AGCGGAGGAA TGCTGATCAA CCAGATGATC GCCGAGCGCG ATATCCAGGC GGCCAACACC
TTCGCTGACA ATCAGCAGTT CATCAGCGAC CGCACGCTGG AGGCGAACAT CGCGCTGCGG
CGGATGCAGA TCGCCTTGCG CGACGTCAGG CTGGCGCGCA CCGCTTCGGA GGTCGAGAAG
GCTGCAAGCG CCCTGTCGGA CATGCACACA CTCACGCTCA AGCAGCTCGG GCTGGCGACG
CCGCGCACCG TAAAGCCGGA AAACAGGGAA CGGCTGGTGA AAATTGCTGC GCTCACGGAT
GCCTACGACA AGAAGGCATC CGAACAGATC AAGACGGTTC TGGAAATCTT CGACACTGCC
GGCAAGCGGG ACAATGTGTC CTCCGAATGG AGCAAGGCCT TCGAGGCTCT GAAGACTTCG
TCTGCGCTAT CAACGGCGAG CAACCGGATC GAAATCGAAC GGGCGATGTA CGAAGTCGAC
ACGCACTTCA ATGCGATTCG CGCGGCGCGC TGGCGATACT ACTCTACGGG CGAGGAAGGC
CAGTTGAAGA CGATCGAGCG CCGGTCGGTC GATCTGAACG CCGCCTTGGC GCTGCTGCAC
AACGCCTCGA CAGATCGAAA TGTCCTGGCG ACGACCGAAA AGTTTCTGAC CATAGCCAAT
TCATTCCACG CTCTGTCCGC AGCATCTGTC AAGGCGGAGG CATTGAAGAC GGAGCAATCA
GCCCAGGCTC TGACGATCGC CACGCAATCC GCGCAATTGA TGCGAGAAGC GGTGGACGTT
GCGGGCAAGG CCGCTCACGA GGCGAACGCC AGCGCCGCCT TGGAGCTGTC TCAAGCCAAT
CGCATCAGCT TCATCGCCGC AGCCGCTGTG ATGCTCGCTC TGATCGGCAG CGTGATCTTC
TCTTTCGTCG GCGTCAGCCG TCCGCTGACC CGCCTGAACG GCGCGCTCGG CAGAATTGCG
GCCGGTCAGC TCAATACCGA GATTCCCGGC GCCAATCGTG GCGACGAGGT CGGCGACATC
GCCAAGACCG TGGTGGTGAT CAGCCGGAAT GCGGAGCAGA AGGCCCGCGA AGAAGCCGAG
ACACAGTCCA GGATGGAACA CGCCGCCGCC CAGCGCCGCA AAGCCGACAT GATGCAGCTT
GCCGACAGCT TCGAAGCCGC CGTCGGCGAT ATCGTCGAAA CGGTTTCGTC GGCTTCGACC
GAACTCGAAG CCTCCGCCAC GAAGCTGACG TCAACGGCTG AACGGTCGCA GGAACTCACC
ACCATGGTCG CGGCCGCTTC CGAGGAAGCG ACCACGAATG TGCAATCGGT GGCCTCGGCG
ACCGAAGAAC TGTCATCGTC GGTCAACGAG ATCAGCCGTC AGGTGCAGAG TTCCGCCCGA
ATGGCCAATG ACGCCGTCGA TCAGGCGCGC GCGACCAACG ACCGCGTCAG CGAACTGTCG
AAGGCGGCGG CCCGTATCGG CGATGTCGTC GAACTGATCA ACACCATCGC GGGCCAGACC
AATCTGCTGG CGCTCAACGC CACCATCGAG GCGGCGCGCG CCGGCGAGGC GGGGCGCGGC
TTCGCTGTGG TGGCGTCGGA AGTGAAAGCT CTGGCCGAAC AGACCGCCAA GGCGACCGGC
GACATCAGTC AGCAGATCTC CAGCATCCAG ACCGCGACCC AGGAATCGGT CGGCTCGATC
AGGGACATCA GCGGCACCAT CGAGAAACTG TCCGAGATCG CCTCGACCAT CGCTTCGGCG
GTCGAAGAAC AGGGCGCGGC GACCCAGGAG ATTTCCCGCA ACGTGCAGCA GGCCGCGATA
GGAACGCAGC AGGTCTCCTC CAATATCAAC GACGTCCAGC GCGGCGCCAA CGAGACCGGC
TCGGCCTCGA CGCAGGTTCT GTCGGCGGCA AAGTCGCTGT CGAGCGACAG CCGACGACTG
AAGCTCGAAG TCGGCAAGTT CCTCAGCACC GTCCGCGCCG CGTAA
 
Protein sequence
MSLNRIGNKL GLVGLLGVVL SGGMLINQMI AERDIQAANT FADNQQFISD RTLEANIALR 
RMQIALRDVR LARTASEVEK AASALSDMHT LTLKQLGLAT PRTVKPENRE RLVKIAALTD
AYDKKASEQI KTVLEIFDTA GKRDNVSSEW SKAFEALKTS SALSTASNRI EIERAMYEVD
THFNAIRAAR WRYYSTGEEG QLKTIERRSV DLNAALALLH NASTDRNVLA TTEKFLTIAN
SFHALSAASV KAEALKTEQS AQALTIATQS AQLMREAVDV AGKAAHEANA SAALELSQAN
RISFIAAAAV MLALIGSVIF SFVGVSRPLT RLNGALGRIA AGQLNTEIPG ANRGDEVGDI
AKTVVVISRN AEQKAREEAE TQSRMEHAAA QRRKADMMQL ADSFEAAVGD IVETVSSAST
ELEASATKLT STAERSQELT TMVAAASEEA TTNVQSVASA TEELSSSVNE ISRQVQSSAR
MANDAVDQAR ATNDRVSELS KAAARIGDVV ELINTIAGQT NLLALNATIE AARAGEAGRG
FAVVASEVKA LAEQTAKATG DISQQISSIQ TATQESVGSI RDISGTIEKL SEIASTIASA
VEEQGAATQE ISRNVQQAAI GTQQVSSNIN DVQRGANETG SASTQVLSAA KSLSSDSRRL
KLEVGKFLST VRAA
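
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It assumes the standard amino acid assignments, which NCBI translation table 11 shares with table 1 (the two differ in permitted start codons, not in the codon-to-residue mapping). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGTCTGA ATCGGATTGG TAACAAACTT GGCCTCGTAG GACTCCTGGG AGTTGTCCTG"
print(translate(clean(first_line)))  # MSLNRIGNKLGLVGLLGVVL
```

Applying `clean` to the full gene sequence and passing the result to `gc_fraction` and `translate` would allow a comparison against the GC content and protein sequence listed in this record.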