Gene RPC_0089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0089 
Symbol 
ID3971347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp100508 
End bp101728 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content70% 
IMG OID637923205 
Productmajor facilitator transporter 
Protein accessionYP_529987 
Protein GI90421617 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGACG CAATCATGAT CGACGAAGCC GCCGGCGATG CGCGGGCGCG GTCGAACCTG 
TTGCGGCTCG GCATCGCCCA GGCGCTGACC GGCGCCAACG CCGCGGTGAT CTTCGCCACC
GGCGCGATCG TCGGCGCCAC GCTGGCGCCC GACGCCACCT TCGCCACACT GCCGGTGTCG
GTCTATGTGG TCGGCATGGC GGCCGGCACG CTGCCGACCG GGGCGATCTC GCGGGCGTTC
GGCCGCCGTG TCGCCTTCCT GCTCGGCGGC GGCTGCGGCG CGCTGTGCGG AGCGCTGGCC
TGCCTCGCCA TCCTGCACGG CTCGTTCGCG CTGTTTTGCG TCGCCACCTT CCTCGGCGGG
CTCTACGGCG CGGTGGCGCA ATCCTATCGC TTCGCCGCCG CCGACGGCGC CAGCGTGGCG
TTTCGGTCGC GCGCGATTGG CTGGGTGATG ACCGGCGGCA TCTTTGCCGG CGTGATCGGT
CCGCAGCTGG TGCAATGGAC CATGGATATC TGGCCACCCT ATCTGTTCGC CTTCAGCTTC
GCCACCCAGG CCGTGGTGGC GCTGATCGCG ATGGCGGTGT TGGCCGGGGT CGACGCGCCG
CGGCCAAAGC CCGCCGAACT CGCCGGCGGC CGGCCGCTGT GGCAGATCGC CCGGCAGCCG
CGCTTCGTCA TCGCGGTGGT GTGCGGCGTG GTGTCCTACG CGATGATGAA CCTGGTGATG
ACTTCGGCGC CGCTGGCGAT GCAGATGTGC GGCTTGTCGA TCAGCGATTC CAACACCGGG
ATTCAGTGGC ACATGGTGGC GATGTATGGC GCGAGTCTGC TGGCCGGGCC GATGATCGCC
CGGTTCGGCG CCGCGCGCAC CGCGGCGCTC GGGCTCGTGC TGGAAGCGCT CGCCGCCTGC
ATCGACCTGT CCGGCGTCAC CGCGCTGCAT TTCTGGGCCG GGCTGATCGC GCTCGGCATC
GGCTGGAATT TCGGCTTCGT CGGCGCCTCG GCGCTGGTGC TGGAAACCCA CCTGCCGGCG
GAGCGCAACA AGGTGCAGGC GTTGAACGAT TTCCTGGTGT TCGGGGTGAT GGCGCTGGGC
TCGTTCGCCT CCGGCGGCGT GCTGGCGCTG TACGGCTGGT CGACCATCAA CTGGGTGGTG
TTTCCGCCGG TGCTGCTGGC GCTGGCGGTG CTGGCGTTCG CGACCTGGGG CCAACGGCGA
GCGGTGCCGC GCGGTTCGTG A
 
Protein sequence
MVDAIMIDEA AGDARARSNL LRLGIAQALT GANAAVIFAT GAIVGATLAP DATFATLPVS 
VYVVGMAAGT LPTGAISRAF GRRVAFLLGG GCGALCGALA CLAILHGSFA LFCVATFLGG
LYGAVAQSYR FAAADGASVA FRSRAIGWVM TGGIFAGVIG PQLVQWTMDI WPPYLFAFSF
ATQAVVALIA MAVLAGVDAP RPKPAELAGG RPLWQIARQP RFVIAVVCGV VSYAMMNLVM
TSAPLAMQMC GLSISDSNTG IQWHMVAMYG ASLLAGPMIA RFGAARTAAL GLVLEALAAC
IDLSGVTALH FWAGLIALGI GWNFGFVGAS ALVLETHLPA ERNKVQALND FLVFGVMALG
SFASGGVLAL YGWSTINWVV FPPVLLALAV LAFATWGQRR AVPRGS