Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2912 |
Symbol | |
ID | 3910708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3320098 |
End bp | 3321111 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637884815 |
Product | extracellular solute-binding protein |
Protein accession | YP_486525 |
Protein GI | 86750029 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.392971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGCG TATCCCTCGT CGCCGCCGCC GTCGTCGCCT GCTTCGCGGC CCAGAGCGCG GGCGCGCAGA CGCTGAAGAC GGTCAAGGAC CGCGGCATTC TGTCCTGCGG CGTCAGCCAG GGGCTTCCGG GATTCTCGGC GCCCGACGAC AAGGGCAACT GGACCGGCCT CGACGTCGAC GTCTGCCGCG GCATCGCGGC GGCGATCTTC GACGACCCGA GCAAGGTGAA ATTCGTGCCG CTGTCGGCCA AGGACCGCTT CACCGCGCTG CAATCCGGCG AGATCGACGT GCTGTCGCGC AACACCACCT GGACGCTGTC GCGCGACACC TCGCTCGGCG TCAACTTCGC CGGCGTCAGC TATTACGACG GGCAGGGCTT CCTGGTGAAG AAGGCGCTCA AGGTCAATTC CGCGCTCGAG CTCAACAGCG CCTCGGTCTG CGTGCAGACC GGCACCACCA ACGAGCAGAA CGTCGCCGAC TACTTCAAGG GCAACAACAT GAAGTACGAG GTGATCGCTT TCGCCAACGC CGACGAGGCG ATCAAGGCCT ATGAGTCCGG CCGCTGCGAC GTGTTCACCT CGGACGTGTC CCAGCTCTAC GCGCAGCGGC TGAAGCTGGC GACGCCGGCC GACCACGTCG TGCTGCCGGA AGTGATCTCC AAGGAGCCGC TCGGTCCGCT GGTCCGGCAC GGCGACGACC AGTGGTTCGA CGTGGTCAAG TGGACGCTGT TCGCGATGAT CAATGCCGAG GAACTCGGCG TCACCCAGAG CAACGTCGCC GAGATGGCCA AGTCCGACAA GCCGGAACTG CGGCGCGTGT TCGGCACCGA CGGCAATCTC GGCGAGCAGC TCGGCCTGAC CAAGGACTGG GTCGCGCGGA TCATCAAGGC GACCGGCAAT TACGGCGAAT CGTTCGAGCG CAACGTCGGC GCCGGCTCCA AGCTCGAAAT CGCCCGCGGC CTGAACAAGC CCTGGAACAA GGGCGGCATC ATGTACGCGC CGCCGATCCG CTGA
|
Protein sequence | MKRVSLVAAA VVACFAAQSA GAQTLKTVKD RGILSCGVSQ GLPGFSAPDD KGNWTGLDVD VCRGIAAAIF DDPSKVKFVP LSAKDRFTAL QSGEIDVLSR NTTWTLSRDT SLGVNFAGVS YYDGQGFLVK KALKVNSALE LNSASVCVQT GTTNEQNVAD YFKGNNMKYE VIAFANADEA IKAYESGRCD VFTSDVSQLY AQRLKLATPA DHVVLPEVIS KEPLGPLVRH GDDQWFDVVK WTLFAMINAE ELGVTQSNVA EMAKSDKPEL RRVFGTDGNL GEQLGLTKDW VARIIKATGN YGESFERNVG AGSKLEIARG LNKPWNKGGI MYAPPIR
|
| |