Gene RPB_4679 details

Gene Information

Locus tag: RPB_4679
Symbol:
ID: 3912497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5292705
End bp: 5293946
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 68%
IMG OID: 637886584
Product: major facilitator transporter
Protein accession: YP_488273
Protein GI: 86751777
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
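
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(5292705, 5293946)
    print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 1242 bp and 413 aa, matching the Gene Length and Protein Length fields.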


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.206752
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGACG CGACCGCGAT CGATGGATTC GCTGACGATG CGCGCGCACG CTCGAACGTG 
ATGCGGCTGG CGGCGGCGCA GGCGCTGACC GGCGCCAATG CGGCGGTGAT CTTCGCCACC
GGCTCGATCA TCGGCGCGCA GCTCGCGCCC AGCGTGGCGC TCGCGACCGT GCCGATCTCG
ATGTATGTGG TCGGCCTCGC CGCCGGCACG CTGCCGACCG GCGCGATCGC GCGGCGCTAT
GGCCGCCGCG TCTCCTTCAT GATCGGCGCC GGCTGCGGCG CGTTCACCGG CCTGCTCGGC
GCGCTGGCGA TCCTGTACGG CTCGTTCGAG CTGTTCTGCG TGGCCACCTT TCTCGGCGGG
CTGTACGGCG CGGTGTCGCA ATCCTATCGC TTCGCCGCCG CCGACGGCGC CAGCGTCGCG
TATCGCCCCA AGGCGGTATC CTGGGTGATG GCCGGCGGCG TGTTCGCCGG CGTGCTCGGT
CCGCAGCTGG TGCAGTGGAC CATGGACATT TGGCAGCCTT ATCTGTTCGC CTTCAGCTAT
CTGGTGCAAG CCGCGGTCGC GCTGGTCGCG ATGGCGGTGC TGTGGAGCGT CGACGCGCCG
AAGCCGCAGC CCGCCGATTT CGCCGGCGGC CGGCCGCTGC TCGAAATCGT GCGGCAGCCG
CGCTTCATCG CCGCGGCGAT GTGCGGCGCG ATCGCCTATC CGATGATGAA TCTGGTGATG
ACGTCGGCGC CGCTCGCGAT GCAGATGTGC GGGCTGCCGA TCAGCGATTC CAATTTCGGC
CTGCAATGGC ACATCGTCGC GATGTATGCG CCGAGCTTCT TCACCGGCTC GCTGATCGCG
AAATTCGGCG CGCCGCGCGT GGTCGCGCTC GGGCTGGCGC TGGAAGCTGC AGGCGCGTCG
ATCGGCCTGA TGGGGATCAC CGCCCCGCAT TTCTGGGCGA CGCTGTTCGT GATCGGGGTA
GGCTGGAATT TCGCCTTCGT CGGCGCTTCG GCGCTGGTGC TGGAGACCCA CCAGCCGAGC
GAAAAGAACA AGGTGCAGGC GTTCAACGAT TTCGTGGTGT TCGGCATGAT GGCGCTGGGC
TCGTTCGGGT CCGGGCAATT GCTGGCGAAT TACGGCTGGG CGACCGTCAA CCTGACGGTG
TTTCCGCCGG TTCTGCTCGG CCTCGTCGTG CTCGCGATCA CCGGCTGGTC GCGAAAACGG
GTGGCGGCGG CCGCAGCCGC CGTGCCAGAA CGCGGCATCT GA
 
Protein sequence
MVDATAIDGF ADDARARSNV MRLAAAQALT GANAAVIFAT GSIIGAQLAP SVALATVPIS 
MYVVGLAAGT LPTGAIARRY GRRVSFMIGA GCGAFTGLLG ALAILYGSFE LFCVATFLGG
LYGAVSQSYR FAAADGASVA YRPKAVSWVM AGGVFAGVLG PQLVQWTMDI WQPYLFAFSY
LVQAAVALVA MAVLWSVDAP KPQPADFAGG RPLLEIVRQP RFIAAAMCGA IAYPMMNLVM
TSAPLAMQMC GLPISDSNFG LQWHIVAMYA PSFFTGSLIA KFGAPRVVAL GLALEAAGAS
IGLMGITAPH FWATLFVIGV GWNFAFVGAS ALVLETHQPS EKNKVQAFND FVVFGMMALG
SFGSGQLLAN YGWATVNLTV FPPVLLGLVV LAITGWSRKR VAAAAAAVPE RGI