Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_4382 |
Symbol | |
ID | 4024907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4847347 |
End bp | 4848372 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637964592 |
Product | extracellular solute-binding protein |
Protein accession | YP_571500 |
Protein GI | 91978841 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.98504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTTC GAAAGGGCCT GCTGATCGGC CTCGGCTTCG CTGCCGTGAT TGCCGCCGCC GCCGTGACCT ATGAGCGCTA CGACACCAAG ACGCTGAAGC GCACGATCCG GCGCGACGCC GTGCTGTGCG GCGTAAACAA GGGCCTGCCC GGCTTCTCGA CGCCGGACGA CAAGGGCAAT TGGAGCGGCT TCGACGTCGA CTTCTGTCGC GCTGTGGCGG CTGCAATCTT CAACGATCCG AACAAGGTCA AGTTCGTGCC GCTCGACGCC AACGAGCGCT TCAAGGAATT GCAGAGCCGC AAGGTCGACA TCCTGTCGCG CAACTCGACC TGGAGCATGT CGCGCGAGAC CGGTTACGAA CTCTATTTCC CGGCGGTCGC CTATTACGAC GGTCAGGGCT TCATGGCGCC GGCGGCGCGC AAGGTCGAGA CCGCGCTCGA ACTCGACGGC AGCAAGGTCT GCGTCCAGGA GGGCACCACC ACGCTGCTCA ACCTCGCCGA CTTCTTCCGC ACCAACAACA TGAAGTATCA GGAGGTCAAG TTCGGCAAGC TCGACGAGGT GGTGAGCGCC TACAAGAACG GCCAGTGCGA CACCTTCACC GCCGACGCCT CCCAGCTCTA TGCGCTGCGG CAGACGCTCG ACAAGCCGGG CGATCACGTC ATCCTGCCGG ACCTGATCTC CAAGGAGCCG CTCGCGCCGG TGGTCCGCCA GCGCGACGAC GACTGGATGA TGATCGTGAA ATGGACGCTG TACGCGATGA TCAACGCGGA AGAGCTCGGC ATCACCTCGA TCAACATCGA CGAGGCGCTG AAGTCCAAGA AGCCCGACGT GATGCGGCTG GTCGGCACCG AGGGCACCTA TGGCGAAGAA CTCGGCCTGC CCAAAGACTG GGCGGCGCGG ATCATCCGCC ACGTCGGCAA TTACGGCGAG ATCTACGATC GCAATGTCGG CAAGCTCGGC ATCCCGCGCG GCCTGAACCA GCTCTGGAAC GCCGGCGGCA TCCAATACGC GCCGCCGATC AGGTAG
|
Protein sequence | MSFRKGLLIG LGFAAVIAAA AVTYERYDTK TLKRTIRRDA VLCGVNKGLP GFSTPDDKGN WSGFDVDFCR AVAAAIFNDP NKVKFVPLDA NERFKELQSR KVDILSRNST WSMSRETGYE LYFPAVAYYD GQGFMAPAAR KVETALELDG SKVCVQEGTT TLLNLADFFR TNNMKYQEVK FGKLDEVVSA YKNGQCDTFT ADASQLYALR QTLDKPGDHV ILPDLISKEP LAPVVRQRDD DWMMIVKWTL YAMINAEELG ITSINIDEAL KSKKPDVMRL VGTEGTYGEE LGLPKDWAAR IIRHVGNYGE IYDRNVGKLG IPRGLNQLWN AGGIQYAPPI R
|
| |