Gene RPD_0336 details

Gene Information

Locus tag: RPD_0336
Symbol:
ID: 4020796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 388007
End bp: 389698
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 64%
IMG OID: 637960515
Product: chemotaxis sensory transducer
Protein accession: YP_567475
Protein GI: 91974816
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
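
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; both are assumptions about the database's conventions, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 388007
end_bp = 389698
gene_length_bp = 1692
protein_length_aa = 563

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the start, end, gene length, and protein length fields agree with one another.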


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTCGCAA CGCTCTCCAT TCGCGCCAAG ATCACGACGC TTGTCGCGGC TCTTCTGATT 
GCGATGACCT GCCTCGGCGG GCTCGGGATC CTCAAGGTGC GGTCGATGAA CTTTGCGGCG
CTCGACCTCG CCACCAACTG GCTGCCGACC ATCAAGATGC TCGGTAATTT GAGGTACAAC
GTCCTCAACT ACCGGACCGC GATCCGCGAT CACCTCATCA CGACGACGGC CGAAGGTATG
GCCACCGTCG AAAAAAGGTT GGAGGCCATC GAAGGGACTA TCCAGAAGGA CCTGCAAACC
TATGAGAAGA TGATAACAAC GCCGGAGGAA AGGCAGCTTT ACGACGCTTG GCTCAGCGAG
TGGGCCAAAT ACAAGCCCCT GGCGGCGAAA ATGATCGACG CGTCGCGTAA GAGCGTCGGG
CAGAGCTCCG AGGAAGCCAC AGACCTGCTT TACAAGGTGG CGGGTCCGGT CGCCTCGCGG
ATGGACGAGA TCCTGCAGAA GGATATCGAT CTGAACGACA GGGGCGCGGA CGGTTCGACC
GCCTTGGCGG CCTCGACCTA CTCCTCCGCC ATCTACCTGA TCGTGACGAT TCTCGGCGTT
GCAATCGTCG TCGGCATCGT CGCAAGCGTG ATGGTGATCC GCGACGTGGC GCAGGGAATT
GCGTCGATCG TCAAGCCGAT GCAGTCGCTC GGTCAGGGCG ACCTCTCGGC GGACGTGCCG
CATCGCGGCG AAAAGACCGA AATCGGCTCG ATGGCGGATG CGCTGCAGAT CTTCAAGGAC
GCGCTGATTT CCAAGAAGGC TGCTGACGAA GATGCGGCGC GTGAAGCGGA GGCGAAGATC
GCGCGCGGCC AACGCGTCGA TTCAGCGACC CGCCAGTTCG AAACCTCGAT CGGCGAAATC
GTCGAGACGG TGTCGTCGGC GTCGACCGAA CTGGAGGCGT CGGCCGCCAC GCTGACCGCC
ACGGCCGGAC ACGCCCAGGA ACTGACCACC GCGGTCGCCG CCGCCTCGGA AGAAGCCTCG
ACCAATGTGC AGTCGGTGGC CTCGGCGACC GAAGAGATGT CGTCCTCGAT CACCGAGATC
AGCCGTCAGG TTCAGGAATC GGCGCGGATC GCCAGTGAGG CGGTCGATCA GGCGCGCAAG
ACCAATGACA GCGTCGGGAT GCTGTCAGCC GCCGCGGCGC GGATCGGCGA CGTCGTCGAA
CTGATCAACA CCATCGCTGG CCAGACCAAT CTGCTGGCGC TGAACGCTAC GATCGAGGCC
GCCCGCGCCG GCGAAGCGGG GCGCGGCTTC GCGGTGGTGG CGAGCGAGGT CAAGGCGCTC
GCCGAGCAGA CCGCCAAGGC GACCGGCGAA ATCGGTCAGC AGATCACCGG CATTCAGAGC
GCGACCGTTC AGTCGGTCTC GGCGATCAAG GAGATCGGCC AGACCATCGA CCGGATGTCG
GAAATCGCCT CGACCATCGC CTCGGCGGTG GAAGAGCAGG GTGCGGCGAC GCAGGAGATC
TCCCGCAACG TGCAGCAGGC CGCACAGGGC ACGCAGCAGG TTTCCGCCAA CATCACCGAC
GTCCAGCGCG GCGCGACCGA AACCGGCTCG GCGTCGACGC AGGTTCTGTC CGCGGCGAAA
TCGCTGTCGC AGGACAGCAA CCGGCTGAAG CAAGAAGTCG CCAGGTTCCT CGAGACCGTT
CGGGCCGCCT GA
 
Protein sequence
MFATLSIRAK ITTLVAALLI AMTCLGGLGI LKVRSMNFAA LDLATNWLPT IKMLGNLRYN 
VLNYRTAIRD HLITTTAEGM ATVEKRLEAI EGTIQKDLQT YEKMITTPEE RQLYDAWLSE
WAKYKPLAAK MIDASRKSVG QSSEEATDLL YKVAGPVASR MDEILQKDID LNDRGADGST
ALAASTYSSA IYLIVTILGV AIVVGIVASV MVIRDVAQGI ASIVKPMQSL GQGDLSADVP
HRGEKTEIGS MADALQIFKD ALISKKAADE DAAREAEAKI ARGQRVDSAT RQFETSIGEI
VETVSSASTE LEASAATLTA TAGHAQELTT AVAAASEEAS TNVQSVASAT EEMSSSITEI
SRQVQESARI ASEAVDQARK TNDSVGMLSA AAARIGDVVE LINTIAGQTN LLALNATIEA
ARAGEAGRGF AVVASEVKAL AEQTAKATGE IGQQITGIQS ATVQSVSAIK EIGQTIDRMS
EIASTIASAV EEQGAATQEI SRNVQQAAQG TQQVSANITD VQRGATETGS ASTQVLSAAK
SLSQDSNRLK QEVARFLETV RAA
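
As a rough check on the record, the gene sequence can be stripped of its display spacing, translated, and scored for GC content. The following is a minimal sketch in plain Python, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares. Only the first and last fragments of the gene sequence are used here for brevity.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 21 bases and last 12 bases of the gene sequence above.
first = "ATGTTCGCAA CGCTCTCCAT T"
last = "CGGGCCGCCT GA"

print(translate(first))  # MFATLSI
print(translate(last))   # RAA*
```

Both fragments agree with the start (`MFATLSI`) and end (`RAA`) of the protein sequence listed above. Applied to the full gene sequence, `gc_fraction` gives a value that can be compared with the GC content field.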