Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0995 |
Symbol | |
ID | 4021470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1123124 |
End bp | 1125094 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637961186 |
Product | chemotaxis sensory transducer |
Protein accession | YP_568134 |
Protein GI | 91975475 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.880627 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTTCC GCCTTCGCCT CGGCCACAAG ATCAATTCCA TCGCCGCAGT CGGCATCGCC GGCGTTCTCG CCCTCGGCGC ATTGTTCGTG CTGGGTAACG CCTCGCAGGA CGCGGCGCGG AGCATCGACG AGCAGGCGCG AGACCTCGGC CGCAGCAACG ACAGGCTGTT CGTCACCTTT CTCGAGCAAC GCCGCACCGA AAAGGACTTC CTGCTGCGCA AGGACGACAA ATACATCAAG CAGCATCAAC AGCTCAGCAC CACTGCGATT CGCTACCTCG ACGAACTCGC GCGGCAGAGC CGAGCCTTGG GCCGCACCGA CCTTACCGAC AGCCTGAAAG CCATCCAGTC CGGCTACAGC GACTACACGC GGCATTTCGC GGCACTGACG GCGGCGCGGA TCCGGCTCGG GCTGAAGGAA GATCTGGGCG TCGAGGGCAG CCTACGCAAT TCGGTGCGGG CGATCGAAAC GGCGCTGAAG AACTTCGACG CGCCGAAGCT GACGATCACC ATGCTGATGA TGCGGCGGCA CGAGAAGGAT TTCATGCTGC GAGGCAATCC GACCTACGGC GACGACATGA AGAAGCGCGC CGCGGAGTTT TCGCAGCAAC TGGCGGCAGC CGACCTGCCC GAGTCCGCAA AGGCCGAACT CGGCCGGAAG CTCGCGGACT ATCAGCGCGA CTTCATCGCC TGGATGGAGA CCGCGCTTTC GTTCGAACAG GAACAGAAGG CGGCCTCGGC CGCCTTCGCC GCGATCGAAC CGGTGATCGG CGAGGTCGCG AAATCTGTCG AACGACTGGT TCAGGAGGCG CAGGACGCCA ACACCGTGGC GCGCGAGACC ACCACGAAGA CTCTCGAGAT CGCGATCGCA CTGATCATTC TCAGCGTCGG CCTGCTCGGC CTCCTGATCG GCCGTTCGGT GTCCCGTCCG CTGAAGGGCC TGACCTCCGG GCTGAAGGAA CTCGGGGCCG GCAATTTCAA TGTCGTGCTG CCCGGGCTCG ATCGTCACGA CGAGATCGGC GACATGGCGC GAGCCGTGGA ATCGTTCAAG GTAGTGGCGG AGGAAAAAGC CCGCGCCGAA GCCGAGGCCA AGGCGCAGCA GGACCGCATC GCCTCAGAGC AGCGCAGGCG CGACATGCAC AGGCTCGCCG ATCATTTCGA GGAAGCCGTC GGCGAGATCG TCGAGACCGT CTCGTCGGCC TCGACCGAGC TGGAAGCATC GGCGACGACG CTGACGTCCA CAGCGCAGCG AGCGCAGCAG TTCACCGCGC GCGTCGCGGA GGCGTCGGAG GAGGCCTCGA CCAACGTCGA GTCGGTGGCG TCCGCGAGCG AGGAGATGGC GTCGTCGGTC AACGAGATCA GCCGCCAGGT GCAGGAATCC GCGCGGATCG CCAGCGAAGC GGTGACGCAG GCGCAGGAGA CCAATGATCG CGTCAGCAAT CTGTCGGAGG CCGCGGCGCG GATCGGTGAC GTCGTCGATC TGATCAACAC CATCGCCTCG CAGACCAACC TGCTGGCGCT GAATGCGACC ATCGAGGCGG CGCGCGCCGG CGACGCCGGG CGCGGCTTCG CCGTGGTGGC GAGCGAGGTC AAGGCACTGG CCGAGCAGAC CGCGAAGGCG ACCGAACAGA TCAGCCAGCA GGTCGGCGGC ATCCAGTCCG CGACCGGCCA GTCGGTGGCG TCGATCCGCG AGATCAGCGG CACGATCGCG CGGATGTCGG AGATCGCCGC GACGATCGCC TCTGCGGTCG AGGAGCAAGG CGCCGCGACC CAGGAAATCT CGCGCAACGT TCACCAGGCC GCCGCAGGCA CGCAGCAGGT CTCGGCCAAC ATCGTCGAAG TGCAGCGCGG CGCGAGCGAG ACCGGTTCGG CGTCTGCGCA GGTGCTGACG GCGGCGCAGT TGCTGGCGCA CGACAGCACC CGCCTGAAGG ACGAAGTCAG CCAGTTCCTG CGCACGGTTC GCGCGGGTTG A
|
Protein sequence | MPFRLRLGHK INSIAAVGIA GVLALGALFV LGNASQDAAR SIDEQARDLG RSNDRLFVTF LEQRRTEKDF LLRKDDKYIK QHQQLSTTAI RYLDELARQS RALGRTDLTD SLKAIQSGYS DYTRHFAALT AARIRLGLKE DLGVEGSLRN SVRAIETALK NFDAPKLTIT MLMMRRHEKD FMLRGNPTYG DDMKKRAAEF SQQLAAADLP ESAKAELGRK LADYQRDFIA WMETALSFEQ EQKAASAAFA AIEPVIGEVA KSVERLVQEA QDANTVARET TTKTLEIAIA LIILSVGLLG LLIGRSVSRP LKGLTSGLKE LGAGNFNVVL PGLDRHDEIG DMARAVESFK VVAEEKARAE AEAKAQQDRI ASEQRRRDMH RLADHFEEAV GEIVETVSSA STELEASATT LTSTAQRAQQ FTARVAEASE EASTNVESVA SASEEMASSV NEISRQVQES ARIASEAVTQ AQETNDRVSN LSEAAARIGD VVDLINTIAS QTNLLALNAT IEAARAGDAG RGFAVVASEV KALAEQTAKA TEQISQQVGG IQSATGQSVA SIREISGTIA RMSEIAATIA SAVEEQGAAT QEISRNVHQA AAGTQQVSAN IVEVQRGASE TGSASAQVLT AAQLLAHDST RLKDEVSQFL RTVRAG
|
| |