Gene Rpal_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2086 
Symbol 
ID6409746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2258799 
End bp2260442 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content66% 
IMG OID642711971 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001991083 
Protein GI192290478 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.389714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTGGC GAGTTTCCCC GCATCGTTTC AGCCCGACGA GAATCCGCGC CCGTATGAGC 
ATGTCTGAAC GCCAGAGCCG CACGTCTGCG GGGGCGACCC ATCATCTGCT GTCCGACGAA
TCCGCCGCAG AGCTCTCCAG CATTCCGACC GAAGTGATCA ATCTCGGCGA CGCGCCGCCG
CTGGCGCCCG CCGGTGCGCT GACGCAGGGC GAAGTTCGCG CGATCTTGTT GAGCCTTCTG
CTGGCGATGT TCCTCGCAGC GCTCGACCAG ACGATCGTCG CCACCGCGCT GCCGACGATC
GGGCGGCAAT TCGGCGATGT CGAAAATTTG TCCTGGGTGA TCACGGCCTA TCTGCTGTCG
TCGACGGCGG TGGCTCCGGT GTTCGGCAGC CTCGCCGACA TCTACGGCCG CCGGGTAATG
ATTATCGTCT CGCTCAGCCT ATTCGTGGCC GGTTCGGTGA TGTGCGCGCT GGCACCGAGT
CTGCTCGTCC TGATCCTCGG TCGCGCTCTG CAGGGGCTGG GTGGCGGCGG CATCATGCCG
ATCGTGCAGA CGGTGATCTC CGACGTCGTC AGTCCGCGCG AACGGGGCCA GTATCAGGCT
TATTTCTCCG GCGTGTGGGT GTCGGCCGGA ATCGGCGGTC CGATCCTCGG CGGCTTCTTC
GCCGAGCATC TGCATTGGTC GATGATCTTC TGGATCAACC TGCCGCTCGC GATCGGGGCG
CTGGCGCTGC TGCTGCCGAA GATGGCGAAG ATCCCGGTGT ATCACCGCCG CCGGAAGGTG
GATTGGCTCG GCGGCGTGCT GCTGATGGCC GCTGCGATGG CGGTGATGCT GGTGCTGACC
TGGGGAGGCA ATCGCTTCGC CTGGCTGTCG CCGACGATCC TGGCGCTGTC GGGCGCGGCG
GTGCTGCTGG CGGCGAGCTT CATCTGGCAC GCGCTACGTG CGCGCGAGCC ATTCCTGCCG
CTGCAATTGA TGAGCGGGAC GGTGGTGCCG TGGGCGATGG CGGGCGGCGC GTTCACCATG
GGGGCGATGA TCGCGCTGAC CGTGCACATG CCGCTGTACT ATGAAGCTGT GTACCATCTG
ACCGCCAGCC AGTCCGGCCT CGCGCTGATT CCGATCGCGG CGATCTCGGT GTTCGGCGCA
GCGTTCACAG GCCGGGCGAT GGTGCATGTC GAGCGCTACA AGCGGATCGC GATTCTTGGT
ACCGGCTTTG CGGCGCTGAT GGCATTGGCC ATCGCGCTGT TGACGCCGCT GCCGCTGTGG
CTGTTTTTGA CCCTGCTATC GCTGTGTTCG CTTGGCCTCG GCACGGTGTT TCCGGTCAGC
GTGGTGTCGA TCCAGAACGC GGTGCCGCGG CCGCAGATCG GCACCGCGAC CGGCGCGATG
AACTTCTTCC GGGCGCTGAT GGCCTCGTTC ACGGTGGCGG CGTTCACGGC GATCCTGCTG
ATCGCGCTCG GTGGCAACGT GCAGCTTGGC GGCGGCGAAC ATCGTCATCT CGTCGGCAGC
GTAGCGGCCG CCGAGATGGT GGCGGGGTTC CGCTGGGTGT TCGGTGCGGC GGCTCTGATG
CTGGCGGCCT CGGCGCTGTG CATCATGGTG ATGGAAGAGC GCAAGCTGGC CGGACCCGAG
CCGACGCTCG CGCTCTCGGA ATAG
 
Protein sequence
MHWRVSPHRF SPTRIRARMS MSERQSRTSA GATHHLLSDE SAAELSSIPT EVINLGDAPP 
LAPAGALTQG EVRAILLSLL LAMFLAALDQ TIVATALPTI GRQFGDVENL SWVITAYLLS
STAVAPVFGS LADIYGRRVM IIVSLSLFVA GSVMCALAPS LLVLILGRAL QGLGGGGIMP
IVQTVISDVV SPRERGQYQA YFSGVWVSAG IGGPILGGFF AEHLHWSMIF WINLPLAIGA
LALLLPKMAK IPVYHRRRKV DWLGGVLLMA AAMAVMLVLT WGGNRFAWLS PTILALSGAA
VLLAASFIWH ALRAREPFLP LQLMSGTVVP WAMAGGAFTM GAMIALTVHM PLYYEAVYHL
TASQSGLALI PIAAISVFGA AFTGRAMVHV ERYKRIAILG TGFAALMALA IALLTPLPLW
LFLTLLSLCS LGLGTVFPVS VVSIQNAVPR PQIGTATGAM NFFRALMASF TVAAFTAILL
IALGGNVQLG GGEHRHLVGS VAAAEMVAGF RWVFGAAALM LAASALCIMV MEERKLAGPE
PTLALSE