Gene RPD_1026 details

Gene Information

Locus tag: RPD_1026
Symbol:
ID: 4021501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1167625
End bp: 1169109
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 65%
IMG OID: 637961217
Product: major facilitator transporter
Protein accession: YP_568165
Protein GI: 91975506
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
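
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 1167625
end_bp = 1169109

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# One codon per 3 bp; the final codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # prints: 1485 494
```

Under these assumptions the result agrees with the Gene Length (1485 bp) and Protein Length (494 aa) fields.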


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.76492
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGTC CGAACAGCGC CGTGATGAGA GAGCCGCCGC CGCAGGCCGT CGGTGGCCGT 
GTCATCGAAA CCGATATTCC GGCGCGATTG GACGGCCTGC TGTGGAGCGG CTTCCACACC
CGCGTGGTGT TCGCGCTCGG CGTCACGTGG ATTCTCGACG GCCTCGAAGT CACGCTGGCA
GGTTCGCTGT CGGGCGCGCT GAAAGCCAGC CCGCAGCTTC AGTTCTCCAA TCTCGACATC
GGCTTCGCCA CCAGCGCCTA TCTGGCGGGC GCTGTGCTGG GCGCGATCGG GTTCGGCTGG
CTGACCGACC GGATCGGCCG CAAGAAATTG TTCTTCATCA CGCTCGCGCT GTATCTCACC
GCCACCGCGG CGACGGCGCT GTCGTGGGAT CTCTGGAGCT ACGCGCTGTT TCGTTTTCTC
ACCGGGGCGG GAATCGGTGG CGAATACACG GCGATCAACT CGACGATCCA GGAGCTGATG
CCGGCGCGCT ATCGCGGCTG GACCGATCTG GTGATCAACG GCAGCTTCTG GATCGGTGCG
GCGATCGGTG CAATCAGCGC CATCGTGCTG CTCGATCCGG CTGTGATCGA TCCCGAACGC
GGCTGGCGTC TGGCGTATCT GATCGGAGCG GCGCTCGGAC TGATCGTATT CGCGATGCGG
TTCTGGATTC CCGAAAGTCC GCGCTGGCTG ATGATCCATG GCCGTCCGGA GGAAGCCGAA
GCGATCGTCG CCGACATCGA GAAGACGGCG CGCGCAGCGC CGGAGGCCGA GCATCGCAAC
CCGTCGAAGA TCAGGTTGCA GATGCGCAGC CACACGCCGC TGCGTGAGGT CGCCCATACG
CTGTTCACGA CATACCGGCA GCGCTCGATC GTCGGGCTGA CGCTGATGGC GGCGCAGGCG
TTCTTCTACA ACGCGATCTT CTTCACCTAC GCGTTGGTGC TGACCGATTT CTTCGGCATC
CCGTCCGGCG ACGTCGGCTG GTACATCCTG CCGTTCGCGG CCGGAAACTT CCTCGGACCG
CTGCTGCTCG GCCGGCTGTT CGACACGCTC GGACGCCGCA AGATGATCGC CTTCACCTAC
GGCGCTTCTG GAATCCTGCT CGCCGTGTCC GGTTATCTGT TCTCGATCGG CGCCCTGAGC
GCGCAGGGAC AGACGATCGC CTGGATGGTG ATCTTCTTCT TCGCGTCGCC GGCGGCGAGT
GCGGCCTATC TCACCGTCAG CGAGACCTTC CCGCTGGAGG TCCGGGCGCT GGCGATCGCA
TTGTTCTACG CATTCGGCAC CGGAATCGGC GGCGTCGCCG GCCCGGCGCT GTTCGGGGCG
CTGATCGACA CCGGTTCGCG CACGAGCGTG TTTGCCGGCT ATCTGCTCGG CGCGAGTCTG
ATGATGATCG CCGCTGTGGT CGGTTGGCGT TATGGTATTG CGGCTGAACG CCGGTCGCTT
GAACACATTG CGCGGCCGCT GGCCGCCGTA GAGGAAAGCC GATGA
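
A GC content figure like the one in the Gene Information section can be computed from a sequence printed in this layout (blocks of 10 bases separated by spaces). This is a rough sketch only: the function name `gc_fraction` is hypothetical, and the record does not state how the database itself computes or rounds the value.

```python
def gc_fraction(sequence_text: str) -> float:
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    # Drop the spaces and line breaks used for display, and normalise case.
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first 10-base block of the gene sequence only,
# not on the whole gene.
first_block = "ATGGCCAGTC"
print(f"{gc_fraction(first_block):.0%}")
```

Passing the full gene sequence text, spaces and line breaks included, gives the fraction for the whole gene.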
 
Protein sequence
MASPNSAVMR EPPPQAVGGR VIETDIPARL DGLLWSGFHT RVVFALGVTW ILDGLEVTLA 
GSLSGALKAS PQLQFSNLDI GFATSAYLAG AVLGAIGFGW LTDRIGRKKL FFITLALYLT
ATAATALSWD LWSYALFRFL TGAGIGGEYT AINSTIQELM PARYRGWTDL VINGSFWIGA
AIGAISAIVL LDPAVIDPER GWRLAYLIGA ALGLIVFAMR FWIPESPRWL MIHGRPEEAE
AIVADIEKTA RAAPEAEHRN PSKIRLQMRS HTPLREVAHT LFTTYRQRSI VGLTLMAAQA
FFYNAIFFTY ALVLTDFFGI PSGDVGWYIL PFAAGNFLGP LLLGRLFDTL GRRKMIAFTY
GASGILLAVS GYLFSIGALS AQGQTIAWMV IFFFASPAAS AAYLTVSETF PLEVRALAIA
LFYAFGTGIG GVAGPALFGA LIDTGSRTSV FAGYLLGASL MMIAAVVGWR YGIAAERRSL
EHIARPLAAV EESR
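
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below is an illustration, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence_text: str) -> str:
    """Translate a whitespace-formatted DNA sequence up to the first stop codon."""
    bases = "".join(sequence_text.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break  # stop codon: end of the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCCAGTC CGAACAGCGC CGTGATGAGA"))  # prints: MASPNSAVMR
```

The output matches the first ten residues of the protein sequence. The last 15 bases of the gene, `GAGGAAAGCC GATGA`, translate the same way to `EESR` followed by a stop codon.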