Gene RPD_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0226 
Symbol 
ID4020684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp260510 
End bp262195 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content63% 
IMG OID637960405 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_567367 
Protein GI91974708 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.716563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCGT TCTTCAGGCG CGCCGGCAGT TCTCCTTCTT CCACGGCGAT CATGGAGGCG 
ATGAACCGCT CGCAGGCGAT GATCGAATTC AATCTCGACG GCACGGTGCT GGGCGCGAAC
GAGAAGTTTC TCGAAGCGCT GGGCTACTCC CTCGCCGAGA TCAAGGGCAA GCATCACCGC
ATGTTCGTCG ATCCGGAAGA GACCCGGACC GATGCCTATC GCCGGTTTTG GGCCGATCTC
CAGGCCGGGA CGTATCAGGC TGGAGAGTTC AAGCGGATTG CCAAAGGCGC CCGCGAGATC
TGGATTCAGG CATCGTATAA TCCGGTGCTC GGTAACGACG GAAAGCCCCT CTGCGTGGTG
AAGTATGCCA CCGACATCAC CGCAGCGAAG CGCCGCAGCA TGGAGGATTT CGGCAAGATG
CAGGCGATCG GCCGTTCCCA GGCGGTGATC GAATTCGCGA TGGACGGCAC GATCCTGACG
GCGAACAAAA ATTTCCTGCA GGCGCTCGGC TACGAGCTCG ACGAGATCAA GGGCAAGCAT
CACAGCATGT TCGTCGAGCC GGCCTTCTGC GACAGTCCCG CCTATCGCGA ATTCTGGGCG
AGCCTCAATC GCGGTGACTA CCTCGCGGCC GAATACAAGC GCATCGGGAA AGGCGGACGC
GAAATCTGGA TTCTCGCCTC CTACAACCCG ATCCTCGACG ACAAGGGCAA GCCGTTCAAG
GTCGTCAAAT TCGCCACCGA CGTCACCGGG CAGAAGCTCA AGACCGCGGA CCTCGCCGGC
CAGATCGACG CGATCGGCAA GTCTCAGGCG GTGATCGAGT TCGGGATCGA CGGCACCATT
CTGGACGCCA ACGCCAATTT CCTGAAAGTG CTGGGTTACA ACCTCGCCGA CATCAAGGGC
AAACATCACA GCATGTTCGT CGAGCCCGCC GAGCGCGACG CCGCGGCCTA TCGCACGTTC
TGGGCAGAGC TTGCGGCCGG CAAGTATCAG GCCGCGGAGT ACAAGCGGAT CGGAAAGGGC
GGCCGGGAAG TCTGGATCCA GGCCTCCTAC AACCCGATCC TGGATCTCAA CGGCAAGCCA
TTCAAGGTGG TGAAATACGC CACCGACACC ACGCGTCAGG TGCTGGTGCG GATCGGCAAC
GAACGAGTCC GCGCGATGAT GGAATCGGTC GCAGCCGGCG CCGAGGAATT GAATGCGTCG
GTGCGGGAGA TTTCCGAGGC GATGACGAAG TCGCGGGAGA CGGCGCTGAC CGCCGTCAGC
GAGGTCGATG CCGCCGACGG TCAGGCCCAC CGGCTGAATG AAGCCGCCCA GGCGATGAGC
GGGATCGTCG AACTGATCAG CAACATCACC GGGCAGATCA ATCTGCTGGC GCTCAACGCC
ACCATCGAAT CGGCCCGCGC CGGAGAAGCC GGCCGCGGCT TCGCGGTGGT CGCCGGCGAA
GTGAAGAATC TCGCCAATCA GGCCAAGCAG GCGACCGACC GGATCGGCGC GGAGATCGAG
AACCTCAACG GCATTTCCGG CGACGTGGTC GGCGCGCTCG GCAAGATCAA GGCAGCGATC
CAGAACGTCA GCGAATACGT CACCTCGACG GCCGCGGCGG TCGAGGAGCA GAGCACGGTG
ACCGGCGAGA TGTCGTCGGG GATGCAGCGC GCCGCCACCG AGGCCGCGGC GATCGCGGCG
GCCTGA
 
Protein sequence
MFSFFRRAGS SPSSTAIMEA MNRSQAMIEF NLDGTVLGAN EKFLEALGYS LAEIKGKHHR 
MFVDPEETRT DAYRRFWADL QAGTYQAGEF KRIAKGAREI WIQASYNPVL GNDGKPLCVV
KYATDITAAK RRSMEDFGKM QAIGRSQAVI EFAMDGTILT ANKNFLQALG YELDEIKGKH
HSMFVEPAFC DSPAYREFWA SLNRGDYLAA EYKRIGKGGR EIWILASYNP ILDDKGKPFK
VVKFATDVTG QKLKTADLAG QIDAIGKSQA VIEFGIDGTI LDANANFLKV LGYNLADIKG
KHHSMFVEPA ERDAAAYRTF WAELAAGKYQ AAEYKRIGKG GREVWIQASY NPILDLNGKP
FKVVKYATDT TRQVLVRIGN ERVRAMMESV AAGAEELNAS VREISEAMTK SRETALTAVS
EVDAADGQAH RLNEAAQAMS GIVELISNIT GQINLLALNA TIESARAGEA GRGFAVVAGE
VKNLANQAKQ ATDRIGAEIE NLNGISGDVV GALGKIKAAI QNVSEYVTST AAAVEEQSTV
TGEMSSGMQR AATEAAAIAA A