Gene Rpal_2154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2154 
Symbol 
ID6409814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2332511 
End bp2333803 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content64% 
IMG OID642712038 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001991150 
Protein GI192290545 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGACA CGGTGCACGG GGTGGCCGCC GCGCAGACGC GGCGCGTGAG TGAGAGCTAT 
CGCTGGACGC AACTCGCCAT CGGCGTCGGC GCGATGGTGA TGATCGCCAA CTACCAATAC
GGCTGGACGT TCTTCGTCCC CGACATTCAG AAAACCTTCG GCTGGGACCG CGCCTCGATC
CAATGGGCAT TCACCCTCTT CGTATTGTTC GAGACCTGGC TGGTGCCGAT CGAGGGCTGG
TTCGTGGACA AATACGGCCC GCGTCTGGTG GTGCTGATCG GCGGCATCCT GTGTGCGCTC
GGCTGGGCGA TCAACGCGCA GGCGACGACG CTGTCGGGTT TCTACCTCGG CATGATCGTC
GCGGGCATCG GCGCTGGTGC GGTCTATGGC ACCTGCGTGG GCAACGCGCT GAAATGGTTT
CCGGACAAGC GCGGATTAGC GGCCGGATTG ACGGCGGCGG GCTTCGGGGC CGGCTCGGCG
CTCACGGTGG CGCCGATCCA GGCGATGATC CGCGACTCCG GCTTTCAGAC CACCTTCATG
TATTTCGGTC TCGGCCAGGG CATCGTGATC GTGTTTCTGT CGCTGCTGCT GCTCGCGCCG
AAGCCGGGAC AGGTGCCGTC CCCGACCCGC AACGCCAACG TCTTCCAGAC GCGTCGCGAC
TACCGTCCGA CCGAAGTGCT GCGCCAGCCG GTGTTCTGGC TGATGTACTT CATGTTCGTG
ATCGTCGGTG CCGGCGGGTT GATGGTCACC GCCAATCTGA AGCCGATCGC CGCCGACTGG
AAGATCGCCG ACACGCCCGT CACGCTGATG GCGATGACCA TGACGGCAGT GACCTTCGCA
GCCACCTTCG ATCGCATCCT CAACGGCCTG ACGCGGCCGT TCTTCGGCTG GATCTCGGAC
AAGATCGGCC GCGAGAATAC GATGTTCATC GCGTTCGGGC TCGAAGGCAT CGGCATCTAC
GCGCTGTATG CGCTGGGTCA GGATCCGGTG TGGTTCGTGC TGCTGTCGGG GCTGGTGTTC
TTCGCCTGGG GCGAGATCTA CTCGCTGTTT CCCTCAACCT GCACCGACAC CTTCGGGTCG
AAGTTCGCCG CCACCAATGC GGGCCTGCTG TACACCGCGA AGGGCACCGC GGCGCTGCTG
GTGCCATTCG CCAATTCGCT GCAGCAATCC AGCGGCAGTT GGGATCTGGT GTTCCTGATC
GCTGCCGCCG CCAACATCCT GGCGTCGCTG TTGGCGCTGC TGGTGCTCAA GCCATGGCGG
CGCAGCGTCG TCGCCAAAAG CGAAATGGTC TGA
 
Protein sequence
MTDTVHGVAA AQTRRVSESY RWTQLAIGVG AMVMIANYQY GWTFFVPDIQ KTFGWDRASI 
QWAFTLFVLF ETWLVPIEGW FVDKYGPRLV VLIGGILCAL GWAINAQATT LSGFYLGMIV
AGIGAGAVYG TCVGNALKWF PDKRGLAAGL TAAGFGAGSA LTVAPIQAMI RDSGFQTTFM
YFGLGQGIVI VFLSLLLLAP KPGQVPSPTR NANVFQTRRD YRPTEVLRQP VFWLMYFMFV
IVGAGGLMVT ANLKPIAADW KIADTPVTLM AMTMTAVTFA ATFDRILNGL TRPFFGWISD
KIGRENTMFI AFGLEGIGIY ALYALGQDPV WFVLLSGLVF FAWGEIYSLF PSTCTDTFGS
KFAATNAGLL YTAKGTAALL VPFANSLQQS SGSWDLVFLI AAAANILASL LALLVLKPWR
RSVVAKSEMV