Gene RPD_0337 details

Gene Information

Locus tag: RPD_0337
Symbol:
ID: 4020797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 390010
End bp: 391701
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 65%
IMG OID: 637960516
Product: chemotaxis sensory transducer
Protein accession: YP_567476
Protein GI: 91974817
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
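
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about how the record was built, not something the page states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 390010
end_bp = 391701
gene_length_bp = 1692
protein_length_aa = 563

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = span_bp // 3
residues = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:", residues == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.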


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.760858
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.789667
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGCAA AGCTCTCCAT TCGCGCCAAG GTCACGGCGC TTGTCGCGGC CCTTCTGATC 
GCCATGACCG GCCTCGGCGG GCTCGGGATT CTCAAGGTGC GGTCGATGAA CTTTGCGGCG
CTCGACCTCG CCACCAACTG GCTGCCGAGC ATCAAGGTTC TCGGCGAATT GAGGTTCAAC
GTCCTCAACT ACCGGACCAT GATCCGCAAT CACATGCTCG ACGTCACCCC CGAAGGCAAA
GCACGCTTCG AACAGCGTCT GGCGTCGATC GACGCGACCA TCAAGAAGGA CCAAGAGACC
TATGCGGCGA TGATTGCCTC GCCGGAAGAA CGGCAGCTCT ACGACAGCTG GGTCGTGCAG
TGGAACGACT ACAAGTCCGT GACCACGAAA ATGCTCGAGA TGTCACGCAA GGACATCGGA
AAGGTCTCGG CGGAGTCGAC CGACTTCCTG TTCAAGAACC TGAATCCGAT CGGCGTCCGC
ATGGACGAGA TCCTGCAGAA GGATATCGAC ATGAACGACA AGGGCGCGGA CGGTGCGACC
GCCTTGGCGG CCTCGACCTA CTCCTCCGCC ATCTACCTGG TCCTGACCAT TCTGGGCGTT
GCGATGGTCG TCGGCATCGT TGCGAGCGTG ATGGTGATCC GCGACGTGGC GCAGGGCATT
GCGTCGATCG TCAAGCCGAT GCAGTCGCTC GGTCAGGGCG ACCTCTCGGC GGACGTGCCG
CATCGCGGCG AGAAGACCGA AGTCGGCTCG ATGGCGGATG CGCTGCAGAT CTTCAAGGAC
GCGCTGATCG CCAAGAAGGC CGCTGACGAG GACGCGGCGC GTGAAGCTGA GGCGAAGATC
GCGCGCGGCC AACGCATCGA TGCAGCGACC CGCCAGTTCG AAACCTCGAT CGGCGAAATC
GTCGAGACGG TGTCTTCGGC GTCGACCGAA CTGGAGGCGT CGGCCGGCAC GCTGACCGCC
ACGGCGGGAC ACGCCCAGGA ACTGACCACC GCGGTCGCGG CGGCCTCGGA AGAAGCCTCG
ACCAATGTGC AGTCGGTGGC CTCGGCGACC GAAGAGATGT CGTCCTCGAT CACCGAGATC
AGCCGTCAGG TTCAGGAATC GGCGCGGATC GCCACCGAGG CGGTCGACCA GGCGCGCAAG
ACCAACGACA GCGTCGGGAT GCTGTCAGCC GCCGCGGCGC GGATCGGCGA CGTCGTCGAA
CTGATCAACA CCATCGCCGG CCAGACCAAT CTGCTGGCGC TGAACGCCAC GATCGAGGCG
GCTCGCGCCG GCGAAGCGGG GCGCGGCTTC GCGGTGGTGG CGAGCGAGGT CAAGGCGCTC
GCCGAGCAGA CCGCCAAGGC GACCGGCGAG ATCGGCCAGC AGATCACCGG CATTCAGGCG
GCGACCGATC AGTCGGTCTC GGCGATCAAG GAGATCGGCC AGACCATCGG CCGGATGTCG
GAAATCGCCT CGACCATCGC CTCGGCGGTG GAAGAGCAGG GCGCGGCGAC GCAGGAGATT
TCGCGCAACG TGCAGCAGGC CGCGCAGGGC ACGCAGCAGG TTTCCGCCAA CATCACCGAC
GTCCAGCGCG GCGCGACCGA AACCGGCTCG GCGTCGACGC AGGTTCTGTC CGCCGCGAAA
TCGCTGTCAC AGGACAGCAA CCGGCTGAAG GAAGAGGTCG CTAGGTTCCT CGAAACCGTT
CGCGCCGCCT GA
 
Protein sequence
MFAKLSIRAK VTALVAALLI AMTGLGGLGI LKVRSMNFAA LDLATNWLPS IKVLGELRFN 
VLNYRTMIRN HMLDVTPEGK ARFEQRLASI DATIKKDQET YAAMIASPEE RQLYDSWVVQ
WNDYKSVTTK MLEMSRKDIG KVSAESTDFL FKNLNPIGVR MDEILQKDID MNDKGADGAT
ALAASTYSSA IYLVLTILGV AMVVGIVASV MVIRDVAQGI ASIVKPMQSL GQGDLSADVP
HRGEKTEVGS MADALQIFKD ALIAKKAADE DAAREAEAKI ARGQRIDAAT RQFETSIGEI
VETVSSASTE LEASAGTLTA TAGHAQELTT AVAAASEEAS TNVQSVASAT EEMSSSITEI
SRQVQESARI ATEAVDQARK TNDSVGMLSA AAARIGDVVE LINTIAGQTN LLALNATIEA
ARAGEAGRGF AVVASEVKAL AEQTAKATGE IGQQITGIQA ATDQSVSAIK EIGQTIGRMS
EIASTIASAV EEQGAATQEI SRNVQQAAQG TQQVSANITD VQRGATETGS ASTQVLSAAK
SLSQDSNRLK EEVARFLETV RAA
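
The gene and protein sequences above are related by translation table 11, and the GC content field is a base-composition summary of the gene sequence. The following is a minimal sketch of both calculations in plain Python, not the database's own pipeline. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard one, on the assumption that table 11 assigns the same amino acids to sense codons as the standard code and differs only in permitted start codons. Only the first 60 nt row and the final 12 nt of the gene sequence are used as input here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row and last 12 bases of the gene sequence shown above.
first_row = clean("ATGTTCGCAA AGCTCTCCAT TCGCGCCAAG GTCACGGCGC TTGTCGCGGC CCTTCTGATC")
last_bases = clean("CGCGCCGCCT GA")

print(translate(first_row))   # MFAKLSIRAKVTALVAALLI
print(translate(last_bases))  # RAA
print(round(gc_fraction(first_row), 2))
```

The two translated fragments match the start and end of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give the value to compare against the GC content field.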