Gene RPD_4387 details


Gene Information

Locus tag: RPD_4387
Symbol:
ID: 4024912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4853425
End bp: 4854645
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID: 637964597
Product: major facilitator transporter
Protein accession: YP_571505
Protein GI: 91978846
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
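
The gene and protein lengths above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(4853425, 4854645)
print(length)                     # 1221
print(protein_length_aa(length))  # 406
```

Both values agree with the Gene Length and Protein Length fields of this record.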


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.310839
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGATG CGACCGCGAT CGACGGTTCA GCGGACGATG CGCGCGCGCG GTCGAACGTG 
GTGCGACTCG CGGCGGCGCA GGCGCTGACC GGCGCCAATG CCGCGGTGAT CTTCGCCACC
GGCTCGATCA TCGGCGCGCA GCTCGCGCCG GATCTGTCGC TCGCGACCGT GCCGATCTCG
ATGTATGTGG TCGGCCTCGC TGCCGGCACG CTGCCGACCG GCGCGATCGC GCGCCGCTAC
GGGCGCCGCG CCTCCTTCAT GATCGGCGCC GGCTGCGGCG CGTTCACCGG CCTGCTCGGC
GCGCTGGCGA TCCTGTACGG CTCGTTCGAG CTGTTCTGCG TCGCCACGTT CCTCGGCGGG
CTTTACGGCG CGGTGTCGCA ATCCTATCGC TTCGCCGCCG CCGACGGCGC CAGCGTCGCG
TATCGCCCCA AGGCGGTGTC CTGGGTGATG GCCGGCGGCG TGTTCGCCGG CGTGCTCGGT
CCGCAGTTGG TGCAGTGGAC CATGGACATC TGGCAGCCTT ATCTGTTCGC CTTCAGCTAT
CTGGTGCAGG CCGCGGTCGC GTTGATCGCG ATGGCGGTGC TGTGGAGCGT CGATGCGCCG
AAGCCGCAGC CGGCCGACTT CGCCGGCGGC CGGCCGCTGC TCGAAATCGT GCGGCAGCCG
CGCTTCATCG CCGCGGCGAT GTGCGGCGCG ATCGCCTATC CGATGATGAA TCTGGTGATG
ACCTCGGCGC CGCTGGCGAT GCAGATGTGC GGACTCCCGC TCAGCGATTC CAATTTCGGC
CTGCAATGGC ACATCGTCGC GATGTACGCG CCGAGCTTCT TCACCGGCTC GCTGATCGTG
CGGTTCGGTG CGCCGCGGGT GGTCGCGTTC GGGCTCGTGC TCGAGGCGCT GGGCGCTGCG
ATCGGCCTGA CCGGGATCAC CGCGCCGCAC TTCTGGGCAA CGCTGTTCGT GATCGGGGTC
GGCTGGAATT TCGCGTTCGT CGGCGCCTCG GCGCTGGTGC TGGAGACGCA CATGCCGAAC
GAGAAGAACA AGGTGCAGGC GTTCAACGAT TTCGTGGTGT TCGGGATGAT GGCGCTGGGA
TCGTTCTTGT CCGGCCAGTT GCTGGCGAAT TACGGCTGGG CGACCGTCAA CATGACGGTG
TTCCCGCCGG TGCTGCTGGG CCTCGTGGTG CTCGCGATCA CCGGCTGGTC CAGAAAACGG
GTGGCGGCGC CGCGTCGGTG A
 
Protein sequence
MVDATAIDGS ADDARARSNV VRLAAAQALT GANAAVIFAT GSIIGAQLAP DLSLATVPIS 
MYVVGLAAGT LPTGAIARRY GRRASFMIGA GCGAFTGLLG ALAILYGSFE LFCVATFLGG
LYGAVSQSYR FAAADGASVA YRPKAVSWVM AGGVFAGVLG PQLVQWTMDI WQPYLFAFSY
LVQAAVALIA MAVLWSVDAP KPQPADFAGG RPLLEIVRQP RFIAAAMCGA IAYPMMNLVM
TSAPLAMQMC GLPLSDSNFG LQWHIVAMYA PSFFTGSLIV RFGAPRVVAF GLVLEALGAA
IGLTGITAPH FWATLFVIGV GWNFAFVGAS ALVLETHMPN EKNKVQAFND FVVFGMMALG
SFLSGQLLAN YGWATVNMTV FPPVLLGLVV LAITGWSRKR VAAPRR
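
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content is the fraction of G and C bases in the gene sequence. The following is a minimal sketch of both calculations in plain Python, assuming the sequence is pasted in the space-grouped layout used above. Table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the set of permitted start codons), so the sketch uses the standard assignments. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
# These are the standard amino-acid assignments, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGTTGATG CGACCGCGAT CGACGGTTCA"))  # MVDATAIDGS
# Last 21 bases, ending in the TGA stop codon.
print(translate("GTGGCGGCGC CGCGTCGGTG A"))           # VAAPRR
```

Passing the full gene sequence to `translate` should yield the protein sequence, and `gc_fraction` on the same input can be compared with the GC content field. Alternative start codons (such as GTG or TTG at the first position) are not handled specially here, which does not matter for this gene because it starts with ATG.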