Gene Rpal_0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_0856 
Symbol 
ID6408510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp907723 
End bp909138 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content69% 
IMG OID642710770 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001989889 
Protein GI192289284 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCAGA CCGTTGCCGC AACGATCGAT AACTCCCGCC GCTGGCGGGT GCTGGCCATC 
GTGGTCGCCG CGCAGTTTAT GTTCGGGGTC GATGCCTTCA TCGTCAACGT CGCGCTGCCG
ACGATCTCGA GCGAACTCGG CGCGTCATCG TCGCAGCTCG AGGCGGTGAT CGCGATCTAC
CTGATCGGCT ACGCAACGCT GATCGTCGCC GGCGGCCGGC TCGGCGACAT CTTCGGCACC
AAGACGGTGT TCCTGCTCGG CGTCGCCGGC TTCACGCTGA CCTCGCTGTG GTGCGGGCTG
GCGCGCTCCG GTCCCGAACT GATCCTGGCG CGGCTCGCCC AGGGCACTAC GGCGGCGTTC
ATGGTGCCGC AAGTGCTGGC GACGCTGCAC GTGCTGTTTC CGGACGCTGC GCGCGCCAAG
GCGTTTGCGA TCTACGGCAC CGTACTCGGG CTCGCTGGCG CCACCGGCTT CGCGCTCGGC
GGTCTGTTGG TGACGCTCGA TCTCGGCGGC TTCGGCTGGC GCTCGATCTT CTACGTCAAT
GGTCCGGTCG GGCTGATCAT CATCGCGGCC GCCGCCCGGG TGATGCCGCA GACCCCGCGA
CGGCCGGGCA CGCGGCTCGA TCTCGGCGGC GCCGTGATCC TGTTCGCCGG CCTCGTCTGC
GTGATCGGTC CGTTGCTGTT CGGCCGCGAT TTCGGCTGGG CCGGATGGGT GTGGGCCGTG
ATGGCCGGCG GCGGCGCGAT GCTGGCGCTG TTCCTGCGCT ACGAGCGCCG CGTCGCTGCG
CGCGGCGGCA TGCCGGTGGT TGACCTGACG CTGCTCGGCG ATCGCGCTTT CGTCCGCGGT
CTCGGCGCGG TGTTCTGCTT CTTCTTCGCC AACCAGTCGT TCTATCTGGT GATGACGCTG
TACATGCAGT TCGAGCTGAA CATCCCGCCG CTGCAGGCCG GCCTGGTGTT CCTGCCGCTG
GCGCTGGCCT TCGTGATCGC GTCGCGGCAT TCCGGCGCGC GCGCCCGGCG CCGCGGCACG
CTGGTGCTGA TCGAAGGCTG CCTGCTGCAG ATCGCCGGCC TCGGCTTGAT CGCCGCCACG
GTCACGGTTA TCGCATCACC GACGCCATTC GTGTTGGCGC TGGCGCTGTT AGTCGCCGGC
TACGGCCAGG GACTGGTGAT GGCCCCGCTG TCGGGCGTGG TACTGTCGAG CGTGCAGGCG
ACCAGCGCGG GCTCGGGCTC CGGCCTCTAC GGCACTACCA CGCAGATCGC CAGCGCGGTC
GGCGTCGCGG CGCTCGGTTC GGTGTACTTC ACGCTGGCGC AAAACGGCTC TGGCCGTGTT
GCCCTGCTCG GCGCGCTGGC GCTGCTGGGG CTCGCGATCG CCGGCTGCAT CGGGCTGCTG
CGCTGGATGC GCCGCGCCGT GGCGGTGGCA GCTTAA
 
Protein sequence
MHQTVAATID NSRRWRVLAI VVAAQFMFGV DAFIVNVALP TISSELGASS SQLEAVIAIY 
LIGYATLIVA GGRLGDIFGT KTVFLLGVAG FTLTSLWCGL ARSGPELILA RLAQGTTAAF
MVPQVLATLH VLFPDAARAK AFAIYGTVLG LAGATGFALG GLLVTLDLGG FGWRSIFYVN
GPVGLIIIAA AARVMPQTPR RPGTRLDLGG AVILFAGLVC VIGPLLFGRD FGWAGWVWAV
MAGGGAMLAL FLRYERRVAA RGGMPVVDLT LLGDRAFVRG LGAVFCFFFA NQSFYLVMTL
YMQFELNIPP LQAGLVFLPL ALAFVIASRH SGARARRRGT LVLIEGCLLQ IAGLGLIAAT
VTVIASPTPF VLALALLVAG YGQGLVMAPL SGVVLSSVQA TSAGSGSGLY GTTTQIASAV
GVAALGSVYF TLAQNGSGRV ALLGALALLG LAIAGCIGLL RWMRRAVAVA A