Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2686 |
Symbol | |
ID | 6410349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2915401 |
End bp | 2916489 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642712562 |
Product | ABC transporter substrate-binding protein |
Protein accession | YP_001991671 |
Protein GI | 192291066 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACC AAGAAAAAGG CCGGTGGATC AGCCGGCGTT CCGTGCTGCT GGGCGCGGCG TCTGGCGTCA TCGCGGCGCC GGCTGGCGCT TTCGGCCTTC AGGCATTTCC CGCTCGGCCG ATCGGCTACG ATCTGTCGGA TGCGCCGATC TGCCGGACCT CCGGTGAAGC AACCCCGCTG ACTGGCGCGC CGCGCAAGGT GAAGCTGTCG TGGAACGCCA CTTCGGTGTG CTCGGTGCAG GTGCCGGTCG CGGTCGATTA CGACTTCTTC AAAAAGCAAA ATCTCGACGT CGAGCTGGTG AATTTCTCCG GCTCGACCGA TCAGCTGCTC GAAGCGATCG CCACCGGCAA GAGCGACGCC GGCGTCGGCA TGGCGCTGCG CTGGCTCAAG CCGCTGGAGC AGGGGTTCGA CGTCAAGATA GTCGCCGGCA CCCATGGCGG CTGCCTGCGT GCGATCGCGC CGACCAAGTC GGAGATCAGC AAGGTCACCG ATCTCAAGGG CAAGGTCGTG GCGATTGACG ATCAGGCCGG CCCGGGCAAG AACTTCTTCT CGATCCAGCT CGCCAAGGCC GGTATCGATC CGACCAAGGA TCTCGAGTGG AAGCAGTATC CAGCCAACCT CGTTCGGCTC GCAGTCGAGA AGGGCGAGGC GCAAGCGGCG CTGGCGTCTG ATCCGCTCGC GCATGCGTTC CTCAAGGACG GCGAGTTCAA GGAGATCGGC TCCAATCTCG ACGGCATCTA TCGCAATGTG AGCTGCTGCA TCGTCGGTGT CCGCGGCAGC CTGATCCGCG AGGAGCCGCA GGTGGCGCGC GCGCTGACCC AGGCGCTGCT CGACGCGGCC GAGTTCTCGT CGAAGAACCC GGAGAAGGCG GCGAAGTCGT TCCTGCCTTA CGCACCGAAG ATTGTCACCG AAGAGGATAT CCAGGCGCTG ATCAAGTATC ACACCCACGA TCACCACCCG ATCGGCGCGC AGCTCAAGAA CGAGCTCAAG CTTTACGCCG ACGACCTCAA GGCCGTTTCG GTGATCAAAC CCTCGACCGA TACCACCAAA TTTGCGGAAA AGATCTATGC CGACGTATTC CGCGTCTGA
|
Protein sequence | MSNQEKGRWI SRRSVLLGAA SGVIAAPAGA FGLQAFPARP IGYDLSDAPI CRTSGEATPL TGAPRKVKLS WNATSVCSVQ VPVAVDYDFF KKQNLDVELV NFSGSTDQLL EAIATGKSDA GVGMALRWLK PLEQGFDVKI VAGTHGGCLR AIAPTKSEIS KVTDLKGKVV AIDDQAGPGK NFFSIQLAKA GIDPTKDLEW KQYPANLVRL AVEKGEAQAA LASDPLAHAF LKDGEFKEIG SNLDGIYRNV SCCIVGVRGS LIREEPQVAR ALTQALLDAA EFSSKNPEKA AKSFLPYAPK IVTEEDIQAL IKYHTHDHHP IGAQLKNELK LYADDLKAVS VIKPSTDTTK FAEKIYADVF RV
|
| |