Gene Information |
Locus tag | Rpal_0856 |
Symbol | |
ID | 6408510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 907723 |
End bp | 909138 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 642710770 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001989889 |
Protein GI | 192289284 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
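The length fields above follow from the coordinates by simple arithmetic. A minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 907723, End bp 909138
length = gene_length(907723, 909138)
print(length, protein_length(length))  # prints "1416 471"
```

Both results agree with the Gene Length (1416 bp) and Protein Length (471 aa) rows of the record.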
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCAGA CCGTTGCCGC AACGATCGAT AACTCCCGCC GCTGGCGGGT GCTGGCCATC GTGGTCGCCG CGCAGTTTAT GTTCGGGGTC GATGCCTTCA TCGTCAACGT CGCGCTGCCG ACGATCTCGA GCGAACTCGG CGCGTCATCG TCGCAGCTCG AGGCGGTGAT CGCGATCTAC CTGATCGGCT ACGCAACGCT GATCGTCGCC GGCGGCCGGC TCGGCGACAT CTTCGGCACC AAGACGGTGT TCCTGCTCGG CGTCGCCGGC TTCACGCTGA CCTCGCTGTG GTGCGGGCTG GCGCGCTCCG GTCCCGAACT GATCCTGGCG CGGCTCGCCC AGGGCACTAC GGCGGCGTTC ATGGTGCCGC AAGTGCTGGC GACGCTGCAC GTGCTGTTTC CGGACGCTGC GCGCGCCAAG GCGTTTGCGA TCTACGGCAC CGTACTCGGG CTCGCTGGCG CCACCGGCTT CGCGCTCGGC GGTCTGTTGG TGACGCTCGA TCTCGGCGGC TTCGGCTGGC GCTCGATCTT CTACGTCAAT GGTCCGGTCG GGCTGATCAT CATCGCGGCC GCCGCCCGGG TGATGCCGCA GACCCCGCGA CGGCCGGGCA CGCGGCTCGA TCTCGGCGGC GCCGTGATCC TGTTCGCCGG CCTCGTCTGC GTGATCGGTC CGTTGCTGTT CGGCCGCGAT TTCGGCTGGG CCGGATGGGT GTGGGCCGTG ATGGCCGGCG GCGGCGCGAT GCTGGCGCTG TTCCTGCGCT ACGAGCGCCG CGTCGCTGCG CGCGGCGGCA TGCCGGTGGT TGACCTGACG CTGCTCGGCG ATCGCGCTTT CGTCCGCGGT CTCGGCGCGG TGTTCTGCTT CTTCTTCGCC AACCAGTCGT TCTATCTGGT GATGACGCTG TACATGCAGT TCGAGCTGAA CATCCCGCCG CTGCAGGCCG GCCTGGTGTT CCTGCCGCTG GCGCTGGCCT TCGTGATCGC GTCGCGGCAT TCCGGCGCGC GCGCCCGGCG CCGCGGCACG CTGGTGCTGA TCGAAGGCTG CCTGCTGCAG ATCGCCGGCC TCGGCTTGAT CGCCGCCACG GTCACGGTTA TCGCATCACC GACGCCATTC GTGTTGGCGC TGGCGCTGTT AGTCGCCGGC TACGGCCAGG GACTGGTGAT GGCCCCGCTG TCGGGCGTGG TACTGTCGAG CGTGCAGGCG ACCAGCGCGG GCTCGGGCTC CGGCCTCTAC GGCACTACCA CGCAGATCGC CAGCGCGGTC GGCGTCGCGG CGCTCGGTTC GGTGTACTTC ACGCTGGCGC AAAACGGCTC TGGCCGTGTT GCCCTGCTCG GCGCGCTGGC GCTGCTGGGG CTCGCGATCG CCGGCTGCAT CGGGCTGCTG CGCTGGATGC GCCGCGCCGT GGCGGTGGCA GCTTAA
Protein sequence | MHQTVAATID NSRRWRVLAI VVAAQFMFGV DAFIVNVALP TISSELGASS SQLEAVIAIY LIGYATLIVA GGRLGDIFGT KTVFLLGVAG FTLTSLWCGL ARSGPELILA RLAQGTTAAF MVPQVLATLH VLFPDAARAK AFAIYGTVLG LAGATGFALG GLLVTLDLGG FGWRSIFYVN GPVGLIIIAA AARVMPQTPR RPGTRLDLGG AVILFAGLVC VIGPLLFGRD FGWAGWVWAV MAGGGAMLAL FLRYERRVAA RGGMPVVDLT LLGDRAFVRG LGAVFCFFFA NQSFYLVMTL YMQFELNIPP LQAGLVFLPL ALAFVIASRH SGARARRRGT LVLIEGCLLQ IAGLGLIAAT VTVIASPTPF VLALALLVAG YGQGLVMAPL SGVVLSSVQA TSAGSGSGLY GTTTQIASAV GVAALGSVYF TLAQNGSGRV ALLGALALLG LAIAGCIGLL RWMRRAVAVA A