Gene Information |
Locus tag | Rpal_2889 |
Symbol | |
ID | 6410558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3151874 |
End bp | 3152860 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642712769 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_001991872 |
Protein GI | 192291267 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTG GCATGTCGCG TCGCCGTTTG TTGCAAGCCG GCGCAGGTCT TGGATTGTAC GGCTTGCTGG GGCCGGGAAG CGCACGCGCC GCCGCTCAAG CCGATCTCTC CGGGGTCACG CTGCGGATCG CGTTCTACAA GGGGCTGCAC AAGACCTTGC TGGAAACGGC GGGCCTCGCC GCAACGCCCT ACAAGATCGA CTGGAAGGAG TTCAACTCCG GCGTCCAGCA CATCGAAGGC ATCAACGTCG ACGCGCTCGA TCTTGGTTCG GGCAGCGAAA CCTCGGCAGC GTTCGGGGTG AGATCCAAAG CGCGCGTGAA ATTTATCGCC GTGTATCGCG AGGACCTCAA CAATCAGGGC ACGTTCGTGC AAAAGGACTC GAACATCAAC CGCGTGGCCG ACCTCAAGGG GAAACGGGTC GGCTATGTTC GGGGAACGAC GTCGCACTAT TATCTCTACA AGCAACTGGC CGAAGCCGGC CTTTCCTTTA ACGACATCAA GGCGACCCAC CTTGCGCCCA CCGACGGCCT GTCTGCCTTC GCCCGTGGTG ATCTCGATGC GTGGGCGATC TGGGGTTACA ATGGCCAGCT CGCCCGCTCC AAATACGGGG CCCGCACCCT GAAGACAGGC GTCGGCTATC TTTCAGGAAA CTTCCTGATC TCGGCCAATC CGTCCGCGAT CGACGACCCG CTGCGGCACG CTGCGCTCGC CGACTTTCTG CTGCGGTTGC AGAAGGCCTA TGCCTGGAGC AACGCAAACT ATCCGACCTA TGCGGAAGCC CAGTCGCGCG ATACAGGCGT GCCGGTGGAG GCGATTCTCG ACCTGTTCAA CAATCGGAAC CAGGACTACA GCCTGATCGC CAACAGCGAT GCAGCGGTTC AAAGCCATCA GGAAGTGGCT GACGTATTCA CCAAGATTGG CGTGTTCGAC GCGCCGGTGG ACGTCAAGCC TTTCTGGGAC CGCAGCTTCG ACCAGGCGTT GGCCTGA
|
Protein sequence | MSIGMSRRRL LQAGAGLGLY GLLGPGSARA AAQADLSGVT LRIAFYKGLH KTLLETAGLA ATPYKIDWKE FNSGVQHIEG INVDALDLGS GSETSAAFGV RSKARVKFIA VYREDLNNQG TFVQKDSNIN RVADLKGKRV GYVRGTTSHY YLYKQLAEAG LSFNDIKATH LAPTDGLSAF ARGDLDAWAI WGYNGQLARS KYGARTLKTG VGYLSGNFLI SANPSAIDDP LRHAALADFL LRLQKAYAWS NANYPTYAEA QSRDTGVPVE AILDLFNNRN QDYSLIANSD AAVQSHQEVA DVFTKIGVFD APVDVKPFWD RSFDQALA
|
| |
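The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the source database: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon (the gene sequence shown ends in TGA). The function names are hypothetical helpers introduced only for this example.

```python
# Values copied from the record above.
START_BP = 3151874
END_BP = 3152860


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


def strip_grouping(seq: str) -> str:
    """Remove the 10-character grouping spaces used in the Sequence section."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = strip_grouping(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    print(length, protein_length_aa(length))  # prints "987 328"
```

Under these assumptions the coordinates reproduce the listed Gene Length (987 bp) and Protein Length (328 aa). Passing the full Gene sequence string to `gc_percent` gives a value to compare with the listed GC content.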