Gene Information |
Locus tag | Rpal_5003 |
Symbol | |
ID | 6412695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5387398 |
End bp | 5388663 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 642714886 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001993967 |
Protein GI | 192293362 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
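The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the Gene Information table above.
start_bp = 5387398
end_bp = 5388663

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1266 0 421
```

Under these assumptions the result agrees with the listed Gene Length (1266 bp) and Protein Length (421 aa).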
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.408823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCGCCCA GCCCTCCCCA GACCCTGGCG CGGCCGCTCG ACGCGCTGAA CTTCTTCCTC GCCGACGTGC GCGACGGCCT CGGGCCATAT CTGGCGATCT ACCTGCTGGC CGTGCAGAAC TGGAACGAGG CGTCGATCGG GCTGGTGATG TCGATCGCCG CGGCGGCCGG CATCGCAGCT CAGACCCCAG CCGGGGCACT GATCGACCGC TCCACCGCCA AGCGCGCGCT GATCATCGCG GCCGCTCTGG TCGTGACCGC CGCATCGGTG GTGCTGCCGT GGCTGGACAG TTTCGTGCTG GTGGCGGCGA CCCAGGCGCT GGCAGCGGCT GCTGGCGCGA TCTTTGCGCC CGCGGTGGCG GCGCTAACGC TGGGCATCGT CGGGCCGCGC GCCTTCGCCC GTCGGACCGG CCGCAACGAA GCCTTCAACC ACGCCGGCAA CGCGGTGGCG GCGATGCTCA CCGGCGCGTT CGCCTATGGG TTCGGCCCCG GCGTGGTGTT CTGGCTGATG GCCGCGATGG CGCTCGCCAG CATCTTCGCC ACGCTGGCGA TCCCAGCCGC GGCGATCGAC GATCACGTCG CCCGCGGGCT CGGCGACGAT CACGAGCGCG GCGCGCATCA CGACCAGCCG TCCGGCTTTA AGGTGCTGCT GACGTGCCGG CCGCTCTTGA TCTTCGCCGG CGCCACCGTG CTGTTTCACT TCGCCAATGC GGCGATGCTG CCGCTGGTCG GACAGAAGCT GGCGCTGGTG AACAAGAACC TCGGCACCAC GCTGATGTCG GTGTGTATCG TCGCCGCGCA GCTCGTGATG GTGCCGGTGG CGGCGCTGGT CGGGCACAAG GCCGACGTCT GGGGCCGCAA ACCGATCTTC GCCGTCGCGC TCGGCGTGCT GGCGCTGCGC GGCGCGCTAT ACCCCCTGTC CGACAATCCG TATTGGCTGG TCGGCGTGCA ACTGCTCGAC GGCGTCGGCG CCGGCATTTT CGGCGCGCTG TTTCCGCTGG TGGTGGCCGA CCTCACCCAC GGCACCGGGC ATTTCAACAT CAGCCAGGGC GCGATCGCGA CGGCCGCAGG CCTCGGCGCC GCGCTGTCGA CCGGCTTCGC CGGACTGATC GTGGTCAGCG CGGGCTACAG CGCCGCGTTC CTGGCGCTTG CCGGCATCGC TGCTGCGGCG CTGGTGTTGT TCCTGGTGCT GATGCCGGAG ACCCGACAGC AGCAATCGGC GGCACCACCT TCCGCAGCGC CGGAGGCGGT CTCATCCACC GTCTAA
|
Protein sequence | MPPSPPQTLA RPLDALNFFL ADVRDGLGPY LAIYLLAVQN WNEASIGLVM SIAAAAGIAA QTPAGALIDR STAKRALIIA AALVVTAASV VLPWLDSFVL VAATQALAAA AGAIFAPAVA ALTLGIVGPR AFARRTGRNE AFNHAGNAVA AMLTGAFAYG FGPGVVFWLM AAMALASIFA TLAIPAAAID DHVARGLGDD HERGAHHDQP SGFKVLLTCR PLLIFAGATV LFHFANAAML PLVGQKLALV NKNLGTTLMS VCIVAAQLVM VPVAALVGHK ADVWGRKPIF AVALGVLALR GALYPLSDNP YWLVGVQLLD GVGAGIFGAL FPLVVADLTH GTGHFNISQG AIATAAGLGA ALSTGFAGLI VVSAGYSAAF LALAGIAAAA LVLFLVLMPE TRQQQSAAPP SAAPEAVSST V