Gene Rpal_3681 details

Gene Information

Locus tag: Rpal_3681
Symbol:
ID: 6411357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3931485
End bp: 3932693
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 69%
IMG OID: 642713561
Product: major facilitator superfamily MFS_1
Protein accession: YP_001992656
Protein GI: 192292051
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
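
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the 1209 bp CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 3931485
END_BP = 3932693
GENE_LENGTH_BP = 1209
PROTEIN_LENGTH_AA = 402


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```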


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCATC CCTACCGTTG GGTGATAGTT GCGGCTGGTG GGGTGCTCGG CTGCGTCGCG 
GCCGGCGCGA TGTTCTCGCT GCCGGTGTTG ATCAGGCCGA TGTCGCAGGA CACCGGCTGG
TCGGTCACCG GCATCTCGAC CGCGATGACG ATCGGCTTCC TGGCGATGGC CGCCGCCAGC
ATGGTGTGGG GCAATCTGTC GGACAGGTTC GGCCCGCGCC CCGTGGTGCT CACTGGATCG
GTGGTGCTGG CGGCCAGCCT CGCCCTCGCC AGCCGCGCAT CATCGTTGAT CGAATTCCAG
CTGCTGTTCG GCGCTTTGGT CGGCGCGGCC ACCGCCGCCG TGTTCGCCCC GATGATGGCC
TGCGTCACCG GCTGGTTCGA CACCGGGCGG GGTCTTGCGG TGTCGCTGGT CTCCGCCGGG
ATGGGCATGG CGCCGCTGAC GATGGCGCCG CTGGCGGCAT GGTTGGTGAC GATCCACGAC
TGGCGCGGCG CGATGCTGAT CATCGCGGCG ATCACGGCTG CGCTGATGAT TCCTGCGGCG
CTGCTGGTGC GGCGGCCGCC GGCGCTCGAA CCCGGCGGCG CGGAGGGCGC CTGGGCCGAT
GCGCAGGACG ACATGACGCT CAGGCAGGCG GTGCGGTCGC CGCAATTCGT CACGCTGTTG
CTGGCCAACT TCTTCTGCTG CGCGACCCAT TCCGGCCCGA TCTTCCACAC CGTCAGCTAC
GCGGTGACCT GCGGCATCCC GATGATCGCT GCCACCTCGA TCTATAGCGT CGAGGGACTA
TCCGGCATGT TCGGCCGGCT CGGCTTCGGC CTCGCCGGCG ATCGCTTCGG CGCGCAGCGC
GTGCTCGTGA TCGGCTTGCT CGCGCAGGCG TTCGGCGTGC TCGCTTACGC CTTCGTCGGC
GGACTCGGCG GCTTCTATGC CGTCGCTGTC GCGGTCGGCT TCATCTACGC GGGCACCATG
CCGCTCTACG CCGTGATCAT CCGCGAGAAC TTTCCGCTGC GGATGATGGG CACGATCGTC
GGCGGCACCG CGATGGCCGG CAGCCTCGGC ATGTCGACCG GCCCGGTGCT TGGCGGGCTG
ATCTACGATG CCTACGACAG CTACGCACCG ATGTATGTCG CCTCCTGCGG CATGGGCCTC
GCCGCGATGC TGATCCTGGC GACGTTCCGG CCGTTCCCGC AGCGGCGGGG CGAGTTAGCG
GTGGCGTAG
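
The gene sequence above is printed in space-separated blocks of 10 bases, so the grouping has to be stripped before any computation such as GC content. A minimal sketch, assuming the sequence has been pasted into a string: the helper names are hypothetical, and the sample string is only the first line of the sequence, so its result will not necessarily match the 69% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the bases, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence only; paste all lines to check the full gene.
    sample = "ATGAACCATC CCTACCGTTG GGTGATAGTT GCGGCTGGTG GGGTGCTCGG CTGCGTCGCG"
    seq = clean_sequence(sample)
    print(len(seq), round(gc_percent(seq), 1))
```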
 
Protein sequence
MNHPYRWVIV AAGGVLGCVA AGAMFSLPVL IRPMSQDTGW SVTGISTAMT IGFLAMAAAS 
MVWGNLSDRF GPRPVVLTGS VVLAASLALA SRASSLIEFQ LLFGALVGAA TAAVFAPMMA
CVTGWFDTGR GLAVSLVSAG MGMAPLTMAP LAAWLVTIHD WRGAMLIIAA ITAALMIPAA
LLVRRPPALE PGGAEGAWAD AQDDMTLRQA VRSPQFVTLL LANFFCCATH SGPIFHTVSY
AVTCGIPMIA ATSIYSVEGL SGMFGRLGFG LAGDRFGAQR VLVIGLLAQA FGVLAYAFVG
GLGGFYAVAV AVGFIYAGTM PLYAVIIREN FPLRMMGTIV GGTAMAGSLG MSTGPVLGGL
IYDAYDSYAP MYVASCGMGL AAMLILATFR PFPQRRGELA VA