Gene RPD_3004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3004 
Symbol 
ID4023507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3348891 
End bp3350156 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content67% 
IMG OID637963203 
Productmajor facilitator transporter 
Protein accessionYP_570131 
Protein GI91977472 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.42057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG AGGAGCAACC TCCCGGGACG ATCGCGCCGC TCCCGGGCGC GCGCCCCGCC 
GCGGTCGGCT TCATCTTCGT CACCATCCTG CTCGATATGC TGAGCGTCGG CATGATCCTG
CCGATCCTGC CGAAGCTGAT CGAGAGTTTT TCCGACAACA ACACCGCGGA CGCGGCGCGA
ATCTACGGCG TGTTCGGCAC AGCGTGGGCG CTGATGCAGT TCGTCGCGTC GCCGGTGCTG
GGCGGGCTGT CCGATCGTTT CGGCCGCCGC CCGGTGATCC TGCTGTCCAA TCTCGGTCTC
GGTCTCGACT ACATCCTGAT GGCGCTGGCG CCGACGCTGA GCTGGCTGTT CATCGGCCGG
GTGATCTCCG GCATCACGTC GGCGAGTATT TCGACCTCGT TCGCCTATAT CGCCGACGTC
ACGCCGGCGG AGAAGCGCGC GGCCGTGTTC GGCAAGGTCG GCGCCGCGTT CGGTCTCGGC
TTCATCTTCG GCCCGGCGAT CGGCGGTTTG CTCGGTGGTA TCGATCCGCG ACTGCCGTTC
TGGGTGGCGG CTGGGCTCAG CCTGTGCAAC GCGCTGTACG GTCTGTTCGT GCTGCCGGAA
TCGCTGCCGC CGGAGCGGCG CTCGCCGTTT CGCTGGAGGT CCGCCAATCC GGTCGGCGCT
GTGCGGCTGC TGGGCTCGAA TGCCCGGCTG GCGGCGATGG CTCTGGTCGA GTTCTGCGCC
GAGGTGGCGC ATGTCGCGCT GCCGGCGATC TTCGTGTTGT ACAGCACCTA CCGTTACGGC
TGGGACCAGA CCACGGTCGG GCTCGCGCTC GCTTTCGTCG GGGTCTGCAC CGCGATCGTG
CAGGGCGGCT TGGTGGGGCC TGCCGTGAAG CGACTCGGCG AACAAAGGGC CCAGATCATC
GGCTATGGCG GCGGCGCGCT AGGCTTTCTG ATCTACGCGC TGGCGCCGAC CGGAGCGCTG
TTCTGGATCG GCATCCCGGT GATGACGCTG TGGGGCATCG CAGGGCCGGC GACCTCCGGC
ATGATGACGC GGCTGGTGTC GCCGGACCAG CAGGGCCAGT TGCAGGGCGC CATCACCAGC
CTCAAGAGCA TCGCCGAACT GATCGGGCCG TTCCTGTTCA CGCTGATCTT CGCGTATTTC
ATTGGAGGCA ACGCGCCGCT GGCTCTTCCC GGGGCGCCGT TCCTGCTCGC AGGCCTGCTG
CTGATGGTCT CGGCGCTGAT CGCCGCGTCC ACCAATGAAG CGACCAAACA GGCCGGCACC
GGCTAG
 
Protein sequence
MTDEEQPPGT IAPLPGARPA AVGFIFVTIL LDMLSVGMIL PILPKLIESF SDNNTADAAR 
IYGVFGTAWA LMQFVASPVL GGLSDRFGRR PVILLSNLGL GLDYILMALA PTLSWLFIGR
VISGITSASI STSFAYIADV TPAEKRAAVF GKVGAAFGLG FIFGPAIGGL LGGIDPRLPF
WVAAGLSLCN ALYGLFVLPE SLPPERRSPF RWRSANPVGA VRLLGSNARL AAMALVEFCA
EVAHVALPAI FVLYSTYRYG WDQTTVGLAL AFVGVCTAIV QGGLVGPAVK RLGEQRAQII
GYGGGALGFL IYALAPTGAL FWIGIPVMTL WGIAGPATSG MMTRLVSPDQ QGQLQGAITS
LKSIAELIGP FLFTLIFAYF IGGNAPLALP GAPFLLAGLL LMVSALIAAS TNEATKQAGT
G