Gene Information |
Locus tag | RPB_4673 |
Symbol | |
ID | 3912491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5286022 |
End bp | 5287047 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637886578 |
Product | extracellular solute-binding protein |
Protein accession | YP_488267 |
Protein GI | 86751771 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
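The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon (neither is stated in the record). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for RPB_4673.
length_bp = gene_length(5286022, 5287047)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1026 341
```

Under those assumptions the sketch reproduces the listed 1026 bp gene length and 341 aa protein length.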
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.562067 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTCC GCAAGGGCCT GCTGATCGGC CTCGGCGTCG CCGCCGTGGT CGCCGCCGCC GCCGTCACCT ACGAGCGCTA CGACACCAAG ACGCTGAAGC GCACGATCCG CCGCGACGCC GTGCTGTGCG GCGTCAACAA GGGCCTGCCG GGCTTCTCGA CGCCGGACGA AAAAGGCAAC TGGACCGGTT TCGACGTCGA TTTCTGCCGC GCGGTGGCGG CGGCGATCTT CAACGATCCG AACAAGGTGA AATTCGTGCC GCTCGACGCC AACGAGCGCT TCAAGGAATT GCAGAGCCGC AAGGTCGACA TCCTGTCGCG CAATTCGACC TGGAGCATGT CGCGCGAGGC CGGCTACGAA TTGTACTTCC CGGCGGTCGC CTATTACGAC GGCGAGGGCT TCATGGTGCC GTCGTCGCGC AAGATCGAGA CCGCGCTCGA ACTCGACGGC AGCAAGGTCT GCGTCCAGGC CGGCACCACC ACTTTGCTCA ACCTTGCCGA CTACTTCCGC GCCAACAACA TGAAGTATCA GGAGGTCAAG TTCCCCAAGC TCGACGAGGT GGTCGCCGCC TACAACAGCG GACAATGCGA CACCTTCTCC GCCGACGCCT CGCAGCTCTA TGCGCTGCGC CAGACGCTCG CCAAACCGGG CGATCACGTC ATCCTGCCGG ACCTGATCTC CAAGGAGCCG CTGGCGCCGG TGGTCCGCCA GCGCGACGAC GAATGGATGA TGATCGTGAA ATGGTCGCTC TACGCGATGA TCAACGCCGA GGAACTCGGC GTCACCTCGG CGAATATCGA CGAGGCGCTG AAGTCGAAGA AGCCGGACGT GATGCGGCTG GTCGGCACCG AGGGCGCCTA TGGCGAGGAG CTCGGCCTCA GCAAGGACTG GGCGGCGCGG ATCATCCGCC ACGTCGGCAA TTACGGCGAG GTCTACGACC GTAATGTCGG CAAGCTCGGC ATCCCGCGTG GCCTGAACCA GCTCTGGAGC GCCGGCGGCA TCCAATACGC GCCGCCGATC AGGTAG
|
Protein sequence | MSFRKGLLIG LGVAAVVAAA AVTYERYDTK TLKRTIRRDA VLCGVNKGLP GFSTPDEKGN WTGFDVDFCR AVAAAIFNDP NKVKFVPLDA NERFKELQSR KVDILSRNST WSMSREAGYE LYFPAVAYYD GEGFMVPSSR KIETALELDG SKVCVQAGTT TLLNLADYFR ANNMKYQEVK FPKLDEVVAA YNSGQCDTFS ADASQLYALR QTLAKPGDHV ILPDLISKEP LAPVVRQRDD EWMMIVKWSL YAMINAEELG VTSANIDEAL KSKKPDVMRL VGTEGAYGEE LGLSKDWAAR IIRHVGNYGE VYDRNVGKLG IPRGLNQLWS AGGIQYAPPI R
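The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal Python sketch, not the database's own method, that strips the spacing, computes a GC fraction, and translates with the codon assignments of translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the sketch assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon assignments as the standard code; it differs in permitted starts.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence shown in the record.
fragment = "ATGTCATTCC GCAAGGGCCT"
print(translate(fragment))              # MSFRKG
print(round(gc_fraction(fragment), 2))  # 0.55
```

The translated fragment matches the first six residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the listed GC content.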