Gene Information |
Locus tag | Rpal_2832 |
Symbol | |
ID | 6410499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3083934 |
End bp | 3084995 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642712710 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_001991815 |
Protein GI | 192291210 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.169864 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGCG TCTCTATTCT TGTCGCCCTC GCTATCGCTG TCGGTCTGTC GCCTCAGGTC GCCGGCGCGC AAAATCCCAA GGCCGGATCG ACGGCGGGCG CAGAGAAAAA GAAGACTACG CTTGAGATTG TTCGGGAAAA CGGCGCTCTG TCCTGCGGCG TCAGCCAGGG ACTGCCCGGT TTTTCGGCGC CCGACGACAA AGGCAATTGG GCCGGCCTCG ACGTCGATGT CTGCCGCGCG ATTGCGGCCG CGGTGTTCGA CGACCCCAGC AAGGTGAAGT TCGTGCCGCT GTCGGCGAAG GATCGCTTCA CCGCGCTGCA GTCCGGTGAG ATCGATGTGC TGTCGCGCAA TACCACCTGG ACGCTGTCGC GCGACACCTC GCTCGGCGTC AGCTTCGCTG GCGTCAGCTA TTATGACGGC CAGGGCTTCA TGGCCAAGAA GTCGCTCAAG GTGAATTCGG CGCTGGAACT GAACGGCGCC TCGATCTGCG TGCAGACCGG CACCACCAAC GAGCAGAACG TTGCCGACTA TTTCAAAGGC AACAACATGA AGTACGAGGT GATCGCGTTC GCCAATGCCG ACGAAGCGGT CAAGGCCTAT GAGTCCGGTC GTTGCGACGT GTTCACCTCC GACGTCTCGC AGCTCTACGC CCAGCGCCTA AAGCTGGCGA CCCCGGCTGA TCATGCTGTG CTGCCGGAGG TGATCTCCAA GGAGCCGCTC GGCCCGCTGG TTCGCCACGG CGACGATCAG TGGTTCGACA TCGTCAAGTG GACGTTGTTC GCGCTGGTCA ATGCCGAAGA GCTCGGCGTC ACCCAGAAGA ACGTCGACGA AATGGCCAAG TCGGACAAGC CCGAGCTGAA GCGGGTGTTC GGCACCGACG GCAATCTTGG CGAACAGCTC GGCCTGACCA AGGATTGGGT CGCCCGCATC GTCAAGGCGA CCGGCAATTA CGGCGAATCG TTCGAACGCA ATGTCGGCTC CGGCTCGAAG CTGGAGATCG CGCGCGGCCT CAACAAGCTC TGGAACAAGG GGGGCATCAT GTACGCCCCG CCGATCCGCT GA
Protein sequence | MKRVSILVAL AIAVGLSPQV AGAQNPKAGS TAGAEKKKTT LEIVRENGAL SCGVSQGLPG FSAPDDKGNW AGLDVDVCRA IAAAVFDDPS KVKFVPLSAK DRFTALQSGE IDVLSRNTTW TLSRDTSLGV SFAGVSYYDG QGFMAKKSLK VNSALELNGA SICVQTGTTN EQNVADYFKG NNMKYEVIAF ANADEAVKAY ESGRCDVFTS DVSQLYAQRL KLATPADHAV LPEVISKEPL GPLVRHGDDQ WFDIVKWTLF ALVNAEELGV TQKNVDEMAK SDKPELKRVF GTDGNLGEQL GLTKDWVARI VKATGNYGES FERNVGSGSK LEIARGLNKL WNKGGIMYAP PIR
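The coordinates and lengths listed under Gene Information can be cross-checked against each other. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Rpal_2832.
length = gene_length(3083934, 3084995)
print(length)                  # 1062, matching "Gene Length"
print(protein_length(length))  # 353, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length (1062 bp) and Protein Length (353 aa) fields of the record.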