Gene Rpal_5288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5288 
Symbol 
ID6412989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5704721 
End bp5705968 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content68% 
IMG OID642715178 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001994250 
Protein GI192293645 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.120613 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGATG CGACGGCCAA CGACATCGTC GAAGACGATG CCCGTGCCCG TTCCAACGTG 
GTGCGGCTGG TCGCGGCGCA GGCGCTGACC GGCGCAAACG CCGCAGTGAT TTTCGCGACC
GGTTCGATCA TCGGTGCGCA GCTCGCGCCG GAGATGTCGC TCGCGACTGT GCCGCTGTCG
ATGTACGTGC TCGGCCTTGC TGCCGGCACG CTGCCGACTG GGTGGATCGC ACGGGCTTAC
GGCCGCCGCG TCTCGTTCAT GATCGGCACC GGCTGCGGCA CGCTCACCGG CGTGCTCGGC
GCCGTCGCAA TCCTGTACGG CTCGTTCCTG CTGTTCTGCG TCGCGACGTT CATCGGCGGG
CTTTACGCCG CCGTGTCGCA GTCCTACCGG TTCGCCGCCG CCGACGGCGC CAGCGCCTCC
TATCGGCCGA AGGCGGTGTC GTGGGTGATG GCAGGCGGCG TGTTCGCCGG CGTGCTCGGT
CCGCAGCTCG TGCAGTGGAC CATGGACGTC TGGCAGCCTT ATCTGTTTGC GTTCTCCTAT
GTGGTGCAGG CGGCGATTGC GCTGATCGCC ATGGCGGTGC TGTGGGGCGT CGATGCACCG
AAGCCGAAGC CGGCGGAGCG GGCTGGCGGT CGGCCGCTGC TGGAGATCGC GCGACAGCCG
CGCTTCATCG CCGCGGCGCT GTGCGGCGCG ATCGCCTATC CGATGATGAA TTTGGTGATG
ACCTCGGCGC CGCTGGCGAT GCAGATGTGC GGGCTGAGCG TCGGCGATTC CAATTTCGGC
CTGCAATGGC ACATCGTGGC GATGTACGCG CCGAGTTTCG TCACCGGCTC GCTGATCGCC
AAGTTCGGCG CGCCGCGCGT GGTCGCGGCC GGCCTGGTCC TGGAAGCGCT CGGCGCGTCG
ATCGGCCTGC TCGGCGTCAC CGCCCCGCAC TTCTGGGCGA CGCTGTTCGT GATCGGCGTC
GGATGGAATC TGGCGTTCGT CGGCGCCTCG GCGCTGGTGC TGGAAACGCA CCAGCCGAAC
GAGAAGAACA AGGTCCAGGC GTTCAACGAC TTCATCATCT TCGGCCTGAT GGCGCTGGGG
TCGTTCTCGT CGGGCCAGCT GCTGGCGAAC TACGGCTGGA CCACCGTGAA CCTCGCGGTG
TTCCCGCCAG TGCTGCTCGG CCTGATCGTG CTGGCGATCA CCGGCTGGTC GAAAGTCCGG
AAACGGGTGG CCGAGGCCGC CACCGAACTA TCCGATCGCG GCGTCTGA
 
Protein sequence
MVDATANDIV EDDARARSNV VRLVAAQALT GANAAVIFAT GSIIGAQLAP EMSLATVPLS 
MYVLGLAAGT LPTGWIARAY GRRVSFMIGT GCGTLTGVLG AVAILYGSFL LFCVATFIGG
LYAAVSQSYR FAAADGASAS YRPKAVSWVM AGGVFAGVLG PQLVQWTMDV WQPYLFAFSY
VVQAAIALIA MAVLWGVDAP KPKPAERAGG RPLLEIARQP RFIAAALCGA IAYPMMNLVM
TSAPLAMQMC GLSVGDSNFG LQWHIVAMYA PSFVTGSLIA KFGAPRVVAA GLVLEALGAS
IGLLGVTAPH FWATLFVIGV GWNLAFVGAS ALVLETHQPN EKNKVQAFND FIIFGLMALG
SFSSGQLLAN YGWTTVNLAV FPPVLLGLIV LAITGWSKVR KRVAEAATEL SDRGV