Gene Rpal_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_3504 
Symbol 
ID6411178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3752286 
End bp3753572 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content67% 
IMG OID642713383 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001992480 
Protein GI192291875 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0374275 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGG AAGCCGCAGC GCAGCAGACG CCGGTGCCCG GCATGACCGG GCCGCGCCGC 
GCCGCCGTCG GCTTCATCTT CATCACCATC GCACTCGATA TGCTCAGCCT CGGCATGATC
CTGCCGATCC TGCCGAAGCT GATCGAGAGT TTCTCCGACG ACAACACCGC TAACGCGGCG
CGGATCTACG GGCTGTTCGG CACCGCCTGG GCTCTGATGC AGCTGTTCGC CTCGCCGATC
CTCGGCGGGC TGTCGGACCG GTTCGGGCGG CGCCCGGTGA TCCTGCTGTC CAATCTGGGG
CTCGGCCTTG ACTACATCCT GATGGCGCTG GCGCCGTCGC TGTGGTGGCT GTTTGTCGGC
CGGGTGATCT CGGGCATCAC TTCGGCCAGC ATCTCGACCT CGTTCGCCTA TATCGCCGAC
GTCACACCGG CGGAAAAACG TGCGGCGGTG TTCGGCATGG TCGGCGCCGC GTTCGGACTC
GGCTTCACCT TTGGGCCGGC GATCGGCGGT CTGCTCGGCG GTGTGGATCC GCGGTTGCCG
TTCTGGGTGG CTGCGGCGCT GAGCCTGGCG AATACGTTGT ACGGCCTGTT CGTATTGCCG
GAGTCGCTAC CGCGAGACCG ACGCTCGCCG TTCCGCTGGA AGTCCGCCAA TCCGATCGGG
GCGGTGCGGC TGCTGAGTTC GAACGCGCGA TTGGCTGCGC TTGCGGTGGT CGAGTTCTGT
GCCGAGGTGG CGCATGTCGC GCTGCCGGCA ACCTTCGTGC TCTACACCGG CTATCGCTAT
GCCTGGGATC AGACCACGAT CGGTCTGGCG CTGGCGTTCG TCGGCGTCTG CACCACCATC
GTGCAGGGCG GGCTGGTCGG CCCCGCGGTG AAGCTGCTCG GCGAGCGCAA CGCCCAGATC
ATCGGCTATG GCGGCGGTGC GCTCGGCTTC CTGATCTATG CGCTGGCGCC GAGCGGGACG
CTGTTCTGGA TCGGCATTCC GGTGATGACG CTGTGGGGCA TCGCCGGGCC GGCAACCTCG
GGGATGATGA CCCGGCTGGT CTCGCCGTCG CAGCAGGGCC AGTTGCAGGG CGCCACCACC
AGCGTCAAGA GCGTCGCCGA ACTGATTGGC CCGTTCCTGT TCACGATGAT CTTCGCGTAC
TTCATCGATG CCGGCGCACC GCTGCAGCTG CCCGGGGCGC CGTTCCTGCT CGCCGGTGCG
CTTCTGGTCG TGTCCGTCGT GATCGTGGCC TTCGCATCGC CCGCCGCCGA CAGCAAACAG
GCGCCGCAAG CGGCGCCTGA AAGCTGA
 
Protein sequence
MTEEAAAQQT PVPGMTGPRR AAVGFIFITI ALDMLSLGMI LPILPKLIES FSDDNTANAA 
RIYGLFGTAW ALMQLFASPI LGGLSDRFGR RPVILLSNLG LGLDYILMAL APSLWWLFVG
RVISGITSAS ISTSFAYIAD VTPAEKRAAV FGMVGAAFGL GFTFGPAIGG LLGGVDPRLP
FWVAAALSLA NTLYGLFVLP ESLPRDRRSP FRWKSANPIG AVRLLSSNAR LAALAVVEFC
AEVAHVALPA TFVLYTGYRY AWDQTTIGLA LAFVGVCTTI VQGGLVGPAV KLLGERNAQI
IGYGGGALGF LIYALAPSGT LFWIGIPVMT LWGIAGPATS GMMTRLVSPS QQGQLQGATT
SVKSVAELIG PFLFTMIFAY FIDAGAPLQL PGAPFLLAGA LLVVSVVIVA FASPAADSKQ
APQAAPES