Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0226 |
Symbol | |
ID | 4020684 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 260510 |
End bp | 262195 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637960405 |
Product | methyl-accepting chemotaxis protein |
Protein accession | YP_567367 |
Protein GI | 91974708 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.716563 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTCGT TCTTCAGGCG CGCCGGCAGT TCTCCTTCTT CCACGGCGAT CATGGAGGCG ATGAACCGCT CGCAGGCGAT GATCGAATTC AATCTCGACG GCACGGTGCT GGGCGCGAAC GAGAAGTTTC TCGAAGCGCT GGGCTACTCC CTCGCCGAGA TCAAGGGCAA GCATCACCGC ATGTTCGTCG ATCCGGAAGA GACCCGGACC GATGCCTATC GCCGGTTTTG GGCCGATCTC CAGGCCGGGA CGTATCAGGC TGGAGAGTTC AAGCGGATTG CCAAAGGCGC CCGCGAGATC TGGATTCAGG CATCGTATAA TCCGGTGCTC GGTAACGACG GAAAGCCCCT CTGCGTGGTG AAGTATGCCA CCGACATCAC CGCAGCGAAG CGCCGCAGCA TGGAGGATTT CGGCAAGATG CAGGCGATCG GCCGTTCCCA GGCGGTGATC GAATTCGCGA TGGACGGCAC GATCCTGACG GCGAACAAAA ATTTCCTGCA GGCGCTCGGC TACGAGCTCG ACGAGATCAA GGGCAAGCAT CACAGCATGT TCGTCGAGCC GGCCTTCTGC GACAGTCCCG CCTATCGCGA ATTCTGGGCG AGCCTCAATC GCGGTGACTA CCTCGCGGCC GAATACAAGC GCATCGGGAA AGGCGGACGC GAAATCTGGA TTCTCGCCTC CTACAACCCG ATCCTCGACG ACAAGGGCAA GCCGTTCAAG GTCGTCAAAT TCGCCACCGA CGTCACCGGG CAGAAGCTCA AGACCGCGGA CCTCGCCGGC CAGATCGACG CGATCGGCAA GTCTCAGGCG GTGATCGAGT TCGGGATCGA CGGCACCATT CTGGACGCCA ACGCCAATTT CCTGAAAGTG CTGGGTTACA ACCTCGCCGA CATCAAGGGC AAACATCACA GCATGTTCGT CGAGCCCGCC GAGCGCGACG CCGCGGCCTA TCGCACGTTC TGGGCAGAGC TTGCGGCCGG CAAGTATCAG GCCGCGGAGT ACAAGCGGAT CGGAAAGGGC GGCCGGGAAG TCTGGATCCA GGCCTCCTAC AACCCGATCC TGGATCTCAA CGGCAAGCCA TTCAAGGTGG TGAAATACGC CACCGACACC ACGCGTCAGG TGCTGGTGCG GATCGGCAAC GAACGAGTCC GCGCGATGAT GGAATCGGTC GCAGCCGGCG CCGAGGAATT GAATGCGTCG GTGCGGGAGA TTTCCGAGGC GATGACGAAG TCGCGGGAGA CGGCGCTGAC CGCCGTCAGC GAGGTCGATG CCGCCGACGG TCAGGCCCAC CGGCTGAATG AAGCCGCCCA GGCGATGAGC GGGATCGTCG AACTGATCAG CAACATCACC GGGCAGATCA ATCTGCTGGC GCTCAACGCC ACCATCGAAT CGGCCCGCGC CGGAGAAGCC GGCCGCGGCT TCGCGGTGGT CGCCGGCGAA GTGAAGAATC TCGCCAATCA GGCCAAGCAG GCGACCGACC GGATCGGCGC GGAGATCGAG AACCTCAACG GCATTTCCGG CGACGTGGTC GGCGCGCTCG GCAAGATCAA GGCAGCGATC CAGAACGTCA GCGAATACGT CACCTCGACG GCCGCGGCGG TCGAGGAGCA GAGCACGGTG ACCGGCGAGA TGTCGTCGGG GATGCAGCGC GCCGCCACCG AGGCCGCGGC GATCGCGGCG GCCTGA
|
Protein sequence | MFSFFRRAGS SPSSTAIMEA MNRSQAMIEF NLDGTVLGAN EKFLEALGYS LAEIKGKHHR MFVDPEETRT DAYRRFWADL QAGTYQAGEF KRIAKGAREI WIQASYNPVL GNDGKPLCVV KYATDITAAK RRSMEDFGKM QAIGRSQAVI EFAMDGTILT ANKNFLQALG YELDEIKGKH HSMFVEPAFC DSPAYREFWA SLNRGDYLAA EYKRIGKGGR EIWILASYNP ILDDKGKPFK VVKFATDVTG QKLKTADLAG QIDAIGKSQA VIEFGIDGTI LDANANFLKV LGYNLADIKG KHHSMFVEPA ERDAAAYRTF WAELAAGKYQ AAEYKRIGKG GREVWIQASY NPILDLNGKP FKVVKYATDT TRQVLVRIGN ERVRAMMESV AAGAEELNAS VREISEAMTK SRETALTAVS EVDAADGQAH RLNEAAQAMS GIVELISNIT GQINLLALNA TIESARAGEA GRGFAVVAGE VKNLANQAKQ ATDRIGAEIE NLNGISGDVV GALGKIKAAI QNVSEYVTST AAAVEEQSTV TGEMSSGMQR AATEAAAIAA A
|
| |