Gene RPD_0489 details


Gene Information

Locus tag: RPD_0489
Symbol:
ID: 4020957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 563283
End bp: 564467
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 69%
IMG OID: 637960676
Product: major facilitator transporter
Protein accession: YP_567628
Protein GI: 91974969
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
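
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention explicitly. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for RPD_0489.
length = gene_length_bp(563283, 564467)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Both results agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields: 1185 bp is 395 codons, one of which is the stop codon.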


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCCTGC TCGACCGCAC CGAGGCGCCG GTTCATCCCG CGCGCCTGAT CCTGATCCTG 
TCGCTCGCCC CCACAGTGGG ACTTGGGATC GGCCGCTTCG CCTATTCGCT GCTGCTGCCC
GACATGCGGG ACAGCCTGCA ATGGTCGTAT TCGGCCGCCG GCTTCATGAA CACCATCAAT
GCCGCCGGCT ATCTCGCCGG CGCGCTGATC ACCTCGCAGC TGGTTCGGCG TTACGGATTG
TCGGCGATCG TGCGGGTCGG AACGCTCGGC TGCGTGCTGT CGCTGGCGCT GTGTGCGCTG
TCGGGCAATT TCGTGCTGCT GTCGGCGGCG CGGCTGATCG CCGGGATCGG CGCGGCGCTG
GCTTTCGTCG CCGGCGGAGC GCTGGCGACC ACGATCGCGC AGTCGCAGCC ACAGCGCTCG
GCGTTTCTGC TCAGCCTGTT CTATGCCGGC CCCGGCCTCG GCATCCTGTC GTCGGGGCTG
ATCACCCCGT TTCTGTTGCA GGCGGCGGGC CCCGGCTCGT GGTGGATCGG CTGGCTGGTG
ATGGCGGCGC TGTCGGCCGT GATGACGCTG CCGCTTCTGC TCGCGCCGCT CGACAGCCAT
GCCAGCATGA GCGGCGGACC GGCGACGACA TTCTCGATCC GGCCGGTGCT GATCTATCTG
GTCGGCTATT TCATGTTCGG CGCCGGCTAC ATCGCCTACA TGACCTTCAT GATCGCCTAT
GTGCGCGACG CTGGCGGCGG ACCGGCGGCG CAGAGCGCGT TCTGGTGCCT GATCGGGGCG
AGCGCCTTCG TCACCCCGTG GGTGTGGCGC CGGATCATGG CGCTCGACCG CGGCGGGGTG
TCGACCACGA TCATCCTCGC CGTCAACGCG CTCGGCGCGG CGCTGCCGCT GTTCGGACTG
TCGCCGCTGA TCCTGGCGAT CTCGGCGCTG GTGTTCGGCG TGTCGTTCTT CGCCGTGGTG
GCGTCGACCA CCGCCTTCGT CCGCTTCAAT TATGCGCAGG CGGCGTGGCC GGGCGCGATC
GCCGCGATGA CGATTGCGTT CGGGATCGGC CAGACGCTGG GCCCCCTTGC GGTCGGCGCC
ATCACCGACG CAGTCGGCAG CCTGTCCTCG GCGCTCGCGG TCTCCGCCGC CACACTGGCG
CTCGGCGCGG TGTTCTCGGC ATTTCAGCGG CCGTTGAAAC GGTAG
 
Protein sequence
MTLLDRTEAP VHPARLILIL SLAPTVGLGI GRFAYSLLLP DMRDSLQWSY SAAGFMNTIN 
AAGYLAGALI TSQLVRRYGL SAIVRVGTLG CVLSLALCAL SGNFVLLSAA RLIAGIGAAL
AFVAGGALAT TIAQSQPQRS AFLLSLFYAG PGLGILSSGL ITPFLLQAAG PGSWWIGWLV
MAALSAVMTL PLLLAPLDSH ASMSGGPATT FSIRPVLIYL VGYFMFGAGY IAYMTFMIAY
VRDAGGGPAA QSAFWCLIGA SAFVTPWVWR RIMALDRGGV STTIILAVNA LGAALPLFGL
SPLILAISAL VFGVSFFAVV ASTTAFVRFN YAQAAWPGAI AAMTIAFGIG QTLGPLAVGA
ITDAVGSLSS ALAVSAATLA LGAVFSAFQR PLKR
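
The gene sequence begins with TTG while the protein sequence begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where TTG is an accepted alternative start codon that is read as methionine at the initiation position. The following is a minimal sketch of how the two sequences and the GC content field relate. It assumes the GC content is simply the fraction of G and C bases over the full gene length (the record does not say how the 69% figure was computed), and the helper names `clean`, `gc_fraction` and `translate_cds` are hypothetical.

```python
# Standard codon layout used by the NCBI genetic code tables:
# first base varies slowest, third base fastest, in the order T, C, A, G.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if pos == 0:
            aa = "M"  # initiation codon (for example TTG) is read as methionine
        elif aa == "*":
            break     # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate_cds("TTGACCCTGC TCGACCGCAC CGAGGCGCCG"))  # MTLLDRTEAP
```

The ten residues produced match the start of the protein sequence listed above. Passing the complete gene sequence to `gc_fraction` and `translate_cds` would allow the GC content and Protein Length fields to be cross-checked in the same way.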