Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4051 |
Symbol | |
ID | 3911858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4621428 |
End bp | 4622348 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885955 |
Product | binding-protein dependent transport system inner membrane protein |
Protein accession | YP_487655 |
Protein GI | 86751159 |
COG category | [E] Amino acid transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.722195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCG TCGCGCCCCC GATCACCGAG CCCGCCAAGC CCTCAACGGC CGCAAAGCCC GTGCGGCGCT CCGGCCTGGT CGAAATGATC GCGCACACCC GCTACGTCCT CGGCGACAAC CGCGTCACCG CCTTCGCCTT CGGGCTGCTG GTGGTGATCG TTTTCGCCGC ATTGTTCGGC CCGTATATCG TTCCCTACGA CCCGCTCGCC AGCAATACCG CGCAGGCGCT GAAGCCGCCG TCCGCCGCCA ACTGGTTCGG CACCGACCAG CTCGGCCGCG ATATCTTCAG CCGCGTCGTC GTCGCCACCC GGCTCGATCT GTTCATCGCC GTCGCCTCGG TGGTGCTGGT GTTCCTGATG GGCGGCCTCG CCGGCATCGC AGCCGGTTAT TTCGGCGGAT GGACCGACCG CATCGTCGGC CGCATCGCCG ACACCATCAT GGCGTTTCCG CTGTTCGTGC TGGCGATGGG CATCGTCGCG GCGCTCGGCA ACACCGTGCA GAACATCATC ATCGCCACCG CGATCGTCAA CTTCCCGCTT TACGCCCGGG TCGCCCGCGC CGAGGCCAAT GTCCGGCGCG AGGCCGGCTT CGTCATGGCA GCAAGGCTTT CGGGCAACAG CGAGATGCGC ATCCTGCTGG TGCACATCCT GCCGAACATC ATGCCGATCA TGATCGTGCA GATGTCGCTG ACGATGGGCT ACGCCATCCT CAACGCCGCC GGGCTGTCGT TCATCGGCCT CGGCGTCCGC CCGCCCACCG CCGAATGGGG CATCATGGTC GCCGAGGGCG CCTCGTTCAT GGTCTCGGGC GAGTGGTGGA TCGCGCTGTT CCCCGGCCTC GCGCTGATGA CCGCCGTGTT CTGCTTCAAC CTGCTCGGCG ACGGCCTGCG CGACATCTTC GACCCGCAGC GGAGGACGTG A
|
Protein sequence | MSSVAPPITE PAKPSTAAKP VRRSGLVEMI AHTRYVLGDN RVTAFAFGLL VVIVFAALFG PYIVPYDPLA SNTAQALKPP SAANWFGTDQ LGRDIFSRVV VATRLDLFIA VASVVLVFLM GGLAGIAAGY FGGWTDRIVG RIADTIMAFP LFVLAMGIVA ALGNTVQNII IATAIVNFPL YARVARAEAN VRREAGFVMA ARLSGNSEMR ILLVHILPNI MPIMIVQMSL TMGYAILNAA GLSFIGLGVR PPTAEWGIMV AEGASFMVSG EWWIALFPGL ALMTAVFCFN LLGDGLRDIF DPQRRT
|
| |