Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3539 |
Symbol | |
ID | 6411213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3789172 |
End bp | 3790419 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 642713417 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001992514 |
Protein GI | 192291909 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTCA GCGAGCCGGC CCTGCGCGTC GCGGAAGAAT CTCGACCCGA ACCGCCGCCG AAGCCCGGCC CGGGCGGCGC CTCGGTGCCG GGGCCCGCAA TGGGCGCCGG GCTGACGCTG GCGATGGCGG CGGCCGCGGG CATCAGCGTT GCCAATATCT ATTACAACCA GCCTATGCTG GGGGTGATCG AGCGTGACCT CGGCAATCCG GCGCTGACCG GGATGATCCC GACGGCGACT CAACTCGGCT ACGCGGTCGG TCTATTCCTG CTGGTGCCGC TCGGCGACCT GACCGACCGG CGGCGGCTGA TTGCCGGCCA GTTCGTGCTG CTGGCGGTCG CGGCTGCGCT GGTGGCGCTG GCGCCGTCGG CCTGGTTGAT CATTGCCGCG TCGCTGGCGC TCGGCGCCTG CGCGACCGTG GCGCAGCAGG TGGTGCCGTT CGCCGCGGCG CTGGCGGCAC CGGAGCGGCG CGGCAAGACC ATCGGCCTGG TGATGGCCGG GCTGTTGTGC GGCATCCTGC TCAGCCGGAC GGTGGCCGGC TTTGTTGCCG GCCATCTCGG CTGGCGCGAG ATGTTCTGGC TGGCGGTGCC GGCGGCGCTC GCGGCCGCCG CGCTGATGGC GTGGCTGCTG CCGCGCCATC ACGGTCACCT CGATATCAGC TATGGCGCCG CGCTGAAGTC GCTCGCGTCG CTGTGGCGCG AGCAGCGGGA TCTCCGGCGG GGGACCGCGG TGCAGGCGGC GCTGTTCGCC TCGTTCAGCG TGTTCTGGAC GGTGCTGGCG CTGCATCTGC AGGAGCCGAA GTTCGGGCTC GGGGCCGAGG CGGCGGGCCT GTTCGGCCTG GTTGGCGTGG TTGGCGTGTT GGCGGCGCCG ATCTCCGGCC GGATCGCCGA CCGAAGTGGA CCGGGACCGG TGATCGCGAT CGGCGCGGCT CTGGTGCTGG CGTCGTGGGT GTTGTTCGGT CTGTGGGGCA GCGTCGTTGG ACTGCTGATC GGCGTCGTGG TGCTGGATTT CGGTCTGCAG AGCGCGCTGA TCTCCAACCA GCACATCGTC TACGCGTTGG TGCCGGAAGC GCGAAACCGC CTCAACACCG TGTTCATGAC CGGGATGTTC ATCGGCGGAT CGGTCGGTTC TGCCGGCGCG GCCTTCGCCT GGGCGCACGG CGGCTGGACG GTGGTCAGCC TCTATGGCGG CGCGCTGGCG GCAATCGCCT TGCTACTCGA ACTGACGGCG CGTTGGTCCC GCCGTTAG
|
Protein sequence | MSVSEPALRV AEESRPEPPP KPGPGGASVP GPAMGAGLTL AMAAAAGISV ANIYYNQPML GVIERDLGNP ALTGMIPTAT QLGYAVGLFL LVPLGDLTDR RRLIAGQFVL LAVAAALVAL APSAWLIIAA SLALGACATV AQQVVPFAAA LAAPERRGKT IGLVMAGLLC GILLSRTVAG FVAGHLGWRE MFWLAVPAAL AAAALMAWLL PRHHGHLDIS YGAALKSLAS LWREQRDLRR GTAVQAALFA SFSVFWTVLA LHLQEPKFGL GAEAAGLFGL VGVVGVLAAP ISGRIADRSG PGPVIAIGAA LVLASWVLFG LWGSVVGLLI GVVVLDFGLQ SALISNQHIV YALVPEARNR LNTVFMTGMF IGGSVGSAGA AFAWAHGGWT VVSLYGGALA AIALLLELTA RWSRR
|
| |