Gene Rpal_4797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4797 
Symbol 
ID6412483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5163959 
End bp5165224 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID642714675 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001993762 
Protein GI192293157 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.292232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACAAGC CTGTCACCGT CGATGCGGAT ATCGCTGCTC CGGTCAATCC GGCCGAGCTG 
CCGCTACCCG AACTTACCAA GCCGGCGGCG CCTGCCGCGG CCGGGCCGGC CTACATCGTG
CTCGGTGGCA TCAGCTTCTC GCATTTCCTC AACGATACCA TGCAGTCGCT GATCCCTTCG
GTGTATCCGA TCCTGAAGGC GAACTACGCG CTCGATTTCG GCCAGATCGG CATGATCACG
CTGGCGTTCC AGTTCACCGC GTCACTGCTG CAGCCGGTGG TCGGGCACAT CACCGACAAG
AAGGCGCAGC CGTTCTCGCT GGCGATAGGC ATGGGCTCGA CCTTCCTGGG GTTGCTGCTG
CTCAGCGTCG CGCATTCCTA TGCGGTGATC CTGATCGCCG CGGCGATGAT CGGCCTCGGT
TCGGCGGTGT TTCATCCTGA GTCGGCGCGG ATCGCGCGGC TTGCCTCGGG CGGTCGCCAC
GGCATGGCAC AATCGGTGTT TCAGGTCGGC GGCAATGCCG GCACCGCGCT CGGTCCGGTG
CTGGCGGCGC TGATCGTGGT GCCGTTCGGC CAGCCGTCGA TCGCCTGGTT CTCGTCGATC
GCGTTCCTCG CCATTATCGT GCTGTGGCGG ATCGGCGTCT GGTACAAGCC GCAGGTCGCC
GGCAAGAAGA AGATGGCGGT GCAGCCGCAT CCGCACGCGC CGAGCCGGCG CCGGGTGATG
GTCGCGCTGG TGGTGCTGGT GGCGCTGCTG TTCTCCAAGC AGCTCTATCT GTCGAGCCTG
TCGAGCTACT ACACGTTCTA TCTGATCGAG AAGTTCCACG TCTCGACCCA GACCGCGCAG
ATGTTCCTGT TCATCTTCCT GGCTGCAACC GCAGCCGGCG TGTTCTTCGG TGGCCCGCTC
GGCGACCGGT TCGGCCGCCG CTATGTGATC TGGTTCTCGA TCCTGGGTAT CCTGCCGTTC
ACCCTGGCGC TGCCCTATGC GGGGCTCACC GCCAGCGCGG TGCTCACGGT GTTCATCGGC
TTCATCCTGG CGTCGGCGAC GCCGGCGATC ATCGTGTTCG CCCAGGAACT GATGCCGCAT
CGCTTCGGCA TGATCTCCGG CGTGTTCTTC GGCTTCGCGT TCGGCATCGG CGGTCTCGGC
GCGGCGGCGC TTGGCCAAGT CGCTGACATC CGTGGCATCG ACTTCGTCTA TCAGGTCTGT
TCGTTCCTGC CGGCGCTCGG CCTGCTGGCG GTGCTGCTGC CGAAGATGCC CCGGCACGCC
CACTAG
 
Protein sequence
MNKPVTVDAD IAAPVNPAEL PLPELTKPAA PAAAGPAYIV LGGISFSHFL NDTMQSLIPS 
VYPILKANYA LDFGQIGMIT LAFQFTASLL QPVVGHITDK KAQPFSLAIG MGSTFLGLLL
LSVAHSYAVI LIAAAMIGLG SAVFHPESAR IARLASGGRH GMAQSVFQVG GNAGTALGPV
LAALIVVPFG QPSIAWFSSI AFLAIIVLWR IGVWYKPQVA GKKKMAVQPH PHAPSRRRVM
VALVVLVALL FSKQLYLSSL SSYYTFYLIE KFHVSTQTAQ MFLFIFLAAT AAGVFFGGPL
GDRFGRRYVI WFSILGILPF TLALPYAGLT ASAVLTVFIG FILASATPAI IVFAQELMPH
RFGMISGVFF GFAFGIGGLG AAALGQVADI RGIDFVYQVC SFLPALGLLA VLLPKMPRHA
H