Gene Rpal_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1889 
Symbol 
ID6409549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2037168 
End bp2038403 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content65% 
IMG OID642711778 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001990890 
Protein GI192290285 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCAG CTCAGCTCAG CGCGTTTCAG CGCTGGTCGA TCCTGATCGG CGCGTCGGTG 
CTGCTCAGCC TCGCCATGGG CATGCGGCAA AGTTTCGGGC TGTTTCAGCC CTCGGTGATC
CGCGATGTCG GTATCACCAG CGCTGACTTC TCCTTTGCCA CAGCGCTGCA GAACATCATC
TGGGGTGTCA CGCAGCCGAT GGTGGGACTG ATCGCCGACC GCTACGGCAC CCGCTGGGTG
ATGGCGGGCG GTGTGGTCGT CTATGCGGCC GGTCTGGTTC TCATGATGGT TGCGGACTCG
GCGCTGATGT TCACGTTGGG CTGTGGTGTC TGTGTCGGCA TTGCGCTGTC CTGCACCGCC
TCCAGCATGA CCATGACGGT GACCTCGCGC ACGGTATCGC CGGCCAAGCG CAGCGTCGCG
ATGGGGGCGG TCTCGGCGGC CGGGTCGCTC GGCCTGGTGC TGGCCTCGCC GCTGGCGCAG
ACGCTGATCT CGACCGCCGG CTGGCAGATG GCGCTGATCG GCTTTCTCGG CCTTGCCGCG
GCGATGTTGC CGTCGGCGCT GTTCGCCGGT CGTGCCGACA AGCTCGACAT CGACAAGTCG
GACGACGTGC AGCAGTCGGC CGGCGAGGTG GTACAGACCG CGCTCGGCCA TTCCGGCTTC
CTGGTGATGG CGATCGCGTT CTTCGTCTGC GGTCTGCAGC TCGTGTTCAT CACCACGCAT
CTGCCGAACT ATCTGGCGAT CTGCGGCCTC GATCCGTCGC TCGGCGCGTC CGCTCTGGCG
GTGATCGGGC TGTTCAACGT GTTCGGCTCG TATGCGTTCG GCTGGCTCGG CGGTAAGTTT
CCGAAGCAGT ATTTGCTCGG CGGCATCTAC ATCGTGCGGT CGTTGACGGT CGCGGCGTAT
TTCTATTTCC CGGCGTCAGC GACTTCGACG ATCGTGTTCG CCGCGATCAT GGGATCGTTG
TGGCTCGGGG TGATTCCGCT GGTGAACGGC CTGGTCGCCC AGTTGTTCGG CCTGCGCTAC
ATGGCGACGC TAACCGGCAT CGCCTTCCTC AGCCATCAGG TCGGCTCGTT CCTCGGGGCC
TGGGGCGGCG GCGTGATCTA CGACCATCTC GGCAGCTATG ATCGCGCTTG GCAGGCTGCG
GTGCTGATCG GCCTGATCGC CGGTTGTGCC CAGATGCTGA TGAACGTTCG GCCGCCACGC
CGCCGGGACG AATTGGCTGT GCCTGCCACC GCCTGA
 
Protein sequence
MKAAQLSAFQ RWSILIGASV LLSLAMGMRQ SFGLFQPSVI RDVGITSADF SFATALQNII 
WGVTQPMVGL IADRYGTRWV MAGGVVVYAA GLVLMMVADS ALMFTLGCGV CVGIALSCTA
SSMTMTVTSR TVSPAKRSVA MGAVSAAGSL GLVLASPLAQ TLISTAGWQM ALIGFLGLAA
AMLPSALFAG RADKLDIDKS DDVQQSAGEV VQTALGHSGF LVMAIAFFVC GLQLVFITTH
LPNYLAICGL DPSLGASALA VIGLFNVFGS YAFGWLGGKF PKQYLLGGIY IVRSLTVAAY
FYFPASATST IVFAAIMGSL WLGVIPLVNG LVAQLFGLRY MATLTGIAFL SHQVGSFLGA
WGGGVIYDHL GSYDRAWQAA VLIGLIAGCA QMLMNVRPPR RRDELAVPAT A