Gene Information |
Locus tag | Rpal_1889 |
Symbol | |
ID | 6409549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 2037168 |
End bp | 2038403 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642711778 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001990890 |
Protein GI | 192290285 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
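The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length includes the terminal stop codon. Both assumptions fit the values in this record, but the record does not state them. The function names are illustrative only.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information fields of Rpal_1889.
start_bp, end_bp = 2037168, 2038403
gene_len = cds_length(start_bp, end_bp)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)
```

Under these assumptions the computed values agree with the reported Gene Length (1236 bp) and Protein Length (411 aa).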
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCAG CTCAGCTCAG CGCGTTTCAG CGCTGGTCGA TCCTGATCGG CGCGTCGGTG CTGCTCAGCC TCGCCATGGG CATGCGGCAA AGTTTCGGGC TGTTTCAGCC CTCGGTGATC CGCGATGTCG GTATCACCAG CGCTGACTTC TCCTTTGCCA CAGCGCTGCA GAACATCATC TGGGGTGTCA CGCAGCCGAT GGTGGGACTG ATCGCCGACC GCTACGGCAC CCGCTGGGTG ATGGCGGGCG GTGTGGTCGT CTATGCGGCC GGTCTGGTTC TCATGATGGT TGCGGACTCG GCGCTGATGT TCACGTTGGG CTGTGGTGTC TGTGTCGGCA TTGCGCTGTC CTGCACCGCC TCCAGCATGA CCATGACGGT GACCTCGCGC ACGGTATCGC CGGCCAAGCG CAGCGTCGCG ATGGGGGCGG TCTCGGCGGC CGGGTCGCTC GGCCTGGTGC TGGCCTCGCC GCTGGCGCAG ACGCTGATCT CGACCGCCGG CTGGCAGATG GCGCTGATCG GCTTTCTCGG CCTTGCCGCG GCGATGTTGC CGTCGGCGCT GTTCGCCGGT CGTGCCGACA AGCTCGACAT CGACAAGTCG GACGACGTGC AGCAGTCGGC CGGCGAGGTG GTACAGACCG CGCTCGGCCA TTCCGGCTTC CTGGTGATGG CGATCGCGTT CTTCGTCTGC GGTCTGCAGC TCGTGTTCAT CACCACGCAT CTGCCGAACT ATCTGGCGAT CTGCGGCCTC GATCCGTCGC TCGGCGCGTC CGCTCTGGCG GTGATCGGGC TGTTCAACGT GTTCGGCTCG TATGCGTTCG GCTGGCTCGG CGGTAAGTTT CCGAAGCAGT ATTTGCTCGG CGGCATCTAC ATCGTGCGGT CGTTGACGGT CGCGGCGTAT TTCTATTTCC CGGCGTCAGC GACTTCGACG ATCGTGTTCG CCGCGATCAT GGGATCGTTG TGGCTCGGGG TGATTCCGCT GGTGAACGGC CTGGTCGCCC AGTTGTTCGG CCTGCGCTAC ATGGCGACGC TAACCGGCAT CGCCTTCCTC AGCCATCAGG TCGGCTCGTT CCTCGGGGCC TGGGGCGGCG GCGTGATCTA CGACCATCTC GGCAGCTATG ATCGCGCTTG GCAGGCTGCG GTGCTGATCG GCCTGATCGC CGGTTGTGCC CAGATGCTGA TGAACGTTCG GCCGCCACGC CGCCGGGACG AATTGGCTGT GCCTGCCACC GCCTGA
Protein sequence | MKAAQLSAFQ RWSILIGASV LLSLAMGMRQ SFGLFQPSVI RDVGITSADF SFATALQNII WGVTQPMVGL IADRYGTRWV MAGGVVVYAA GLVLMMVADS ALMFTLGCGV CVGIALSCTA SSMTMTVTSR TVSPAKRSVA MGAVSAAGSL GLVLASPLAQ TLISTAGWQM ALIGFLGLAA AMLPSALFAG RADKLDIDKS DDVQQSAGEV VQTALGHSGF LVMAIAFFVC GLQLVFITTH LPNYLAICGL DPSLGASALA VIGLFNVFGS YAFGWLGGKF PKQYLLGGIY IVRSLTVAAY FYFPASATST IVFAAIMGSL WLGVIPLVNG LVAQLFGLRY MATLTGIAFL SHQVGSFLGA WGGGVIYDHL GSYDRAWQAA VLIGLIAGCA QMLMNVRPPR RRDELAVPAT A
|
| |
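The GC content field can be recomputed from a sequence given in the blocked layout used above (groups of 10 bases separated by spaces). This is a minimal sketch, not the database's own method: the helper name `gc_percent` is hypothetical, and the example input is only the first 10-base block of the gene sequence, so its result is not the whole-gene figure.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Percentage of G and C bases in a sequence; whitespace between blocks is ignored."""
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above, used only as a small illustration.
first_block = "ATGAAGGCAG"
print(gc_percent(first_block))
```

Passing the full Gene sequence value to the same function would give a figure to compare against the reported 65%, subject to whatever rounding the source applies.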