Gene Rpal_2737 details

Gene Information

Locus tag: Rpal_2737
Symbol:
ID: 6410401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2976983
End bp: 2978200
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 67%
IMG OID: 642712613
Product: major facilitator superfamily MFS_1
Protein accession: YP_001991721
Protein GI: 192291116
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
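
The length fields above can be cross-checked against the coordinates with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the helper `gc_fraction` and all variable names are illustrative and not part of the record.

```python
# Cross-check of the reported lengths, assuming 1-based inclusive coordinates.
start_bp = 2976983
end_bp = 2978200

# An inclusive interval [start, end] contains end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1218 405


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace (hypothetical helper)."""
    bases = "".join(dna.split()).upper()
    if not bases:
        return 0.0
    return sum(base in "GC" for base in bases) / len(bases)


# Example on the first 10-base block of the gene sequence only.
print(gc_fraction("TTGGATCACC"))  # 0.5
```

Applying `gc_fraction` to the full gene sequence should give a value comparable to the reported GC content, although the rounding convention used by the source database is not stated.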


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.710897
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGATCACC GTGCCTCGCG CACCGGCCTG TTCATTCTCG GCCTGTGCTT CGTGCTGTCG 
CTGCTCGGCC GCGGGCTCGG CGAGAGCTTC ACCGTGTTCC TGCTGCCGAT CTCGGCCTCG
TTCGGCTGGG ACCGCGCCGA GGTGGTGTCG ATCTATTCGC TGACGGCGCT GTCCACCGGC
CTCGCCTCAC CTCTGGTCGG CCGGCTGTTC GATCGTTCCG GCCCGCGCGT GGTCTACACC
CTCGGCATGA CGCTGCTGGG CGGCGCGCTG CTGATCGCCG CTTATGCCGA GCGACTGTGG
CAATTGCAGC TCACCGTCGG GCTGTGCGTC GGCCTCGGCA TCGCCTTCAC CGGTAACGTT
CCGAACTCGA TCCTGCTCGG CCGCTGGTTC GGCGCGCGGC TGCCGACCGC GATGGCTGTG
GTATATTCGG CGATCGGCGC CGGCGTGCTG GTGATGCTGC CGATCGCACA GCTGCTGATC
GACCGGCACG GCTGGCGCGA GGCGTATCTG ATCCTGGGCG GCGCGATGCT GATCCTGCTG
GTCCCGCTAT CGCTGATGCC GTGGCGCCGG TTCGCCGCCG GGGCGGACGC ACACGCACAG
CGGCCCGCAC AGGATGCTGA CGATGACGGC TGGACGCTGG GAAGCGCGAT GCGCCATCAC
GCGTTCTGGG CGCTGTTCGC GACCTTCTTC TTCACCGCAA TCGGGATGTA CTCGATCTCG
GCGCAGGTGG TCGCCTATCT GGTCGACGCA GGTTTCACGC CCCTACAGGC AGCGACCGCC
TGGGGCTTCT CCGGCGTCGT GCTGGTAGTC GGCATGCTCG GCGTGAGCTG GCTGGATGGC
GTGATCGGAC GGCGGCCGTC GATCCTGTTC TCTTACGCGA TTTCGATCAC CGGCCTCGTG
CTGCTGTGGC TGCTGCAGTG GTATCCGAAC CTGTTGCTGC TCGGCGGCTT CGTGATCTGC
TTCGGCTCGA TGATCGGCTC GCGCGGCCCG CTGATCACCG CCACCGCGAT GAGCATCTTC
CGCGGCAAGC GCGTCGGCAC GATCTACGGC ACGATCTCGA TCGGCAGCGG CCTCGGCTCG
GCGTTCGGCT CGTGGTGCGG CGGCCTGCTG CACGACCTCA CCCAGAGCTA CAATCCGGTG
CTCGGCTTCG CGCTGGTCAG CGTGGTGCTC GGCATGATCC CGTTCCTGGT GGTCCCGGCA
CTGCGGGAGC GACAATAA
 
Protein sequence
MDHRASRTGL FILGLCFVLS LLGRGLGESF TVFLLPISAS FGWDRAEVVS IYSLTALSTG 
LASPLVGRLF DRSGPRVVYT LGMTLLGGAL LIAAYAERLW QLQLTVGLCV GLGIAFTGNV
PNSILLGRWF GARLPTAMAV VYSAIGAGVL VMLPIAQLLI DRHGWREAYL ILGGAMLILL
VPLSLMPWRR FAAGADAHAQ RPAQDADDDG WTLGSAMRHH AFWALFATFF FTAIGMYSIS
AQVVAYLVDA GFTPLQAATA WGFSGVVLVV GMLGVSWLDG VIGRRPSILF SYAISITGLV
LLWLLQWYPN LLLLGGFVIC FGSMIGSRGP LITATAMSIF RGKRVGTIYG TISIGSGLGS
AFGSWCGGLL HDLTQSYNPV LGFALVSVVL GMIPFLVVPA LRERQ
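
The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted start codons, and an initiating codon is read as methionine. The following is a minimal sketch of that rule in plain Python; `translate_cds` is a hypothetical helper, and it assumes its input starts at the first codon of the CDS on the coding strand.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    first = codons[0]
    protein = ["M" if first in START_CODONS else CODON_TABLE[first]]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGATCACC GTGCCTCGCG CACCGGCCTG"))  # MDHRASRTGL
```

The output matches the first ten residues of the protein sequence. Because the first codon is always treated as an initiator, the function should only be given sequence that begins at the start of the CDS.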