Gene RPD_4227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4227 
Symbol 
ID4024748 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4694026 
End bp4695303 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content69% 
IMG OID637964433 
Productmajor facilitator transporter 
Protein accessionYP_571345 
Protein GI91978686 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0992586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.574471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCCCTGC AGCAGACGCC CGGCCAGCAA CGGTCGCTCG ACGCCCTGAA CTTCTTCCTG 
GCCGACGTCC GCGACGGGCT CGGGCCCTAT CTGGCGATCT ATCTGCTGTC GGTTCAGCAC
TGGAATGAGG CCTCGATCGG ACTGGTGATG ACGGTTGCCG CGATGGCTGG GATCGCCGCT
CAGACGCCGG CCGGGGCGCT GATCGACCGC TCTACGGCCA AGCGCGGCCT GCTGATCGCC
GCCGCGATCG CGGTGACGCT GGCGTCAGTA ACGCTGCCGC TGTTGCAGAG CTTCGAAGCC
GTCGCGGCGA CCCAGGCACT TGCAGGCGCC GCCGGCGCGA TCTTCGCGCC CGCGGTCGCG
GCGGTGACAC TCGGGATCGT CGGGCCCCGC GCCTTCGCCC GCCGCACCGG GCGCAACGAG
GCGTTCAATC ACGCAGGCAA TGCGGTGGCG GCGACGCTGG CGGGGGTATC TGCCTATTTT
TTCGGTCCGG TGGTGGTGTT CTGGCTGATG TCGGCGATGG CCGTCGCCAG CATTTTCGCG
ACGCTGTCGA TCCCGGCGAA AGCGATCGAC GATCAGGTCG CGCGCGGTCT CGCCTCGATC
GGCGGACTGG ACGCAGGCCC GCAAGTTCCC GACCAGCGCC ACGACCAGCC CTCGGGTTTC
AAAGTGCTGA TCACCTGCCG TCCGCTGCTG ATCTTCGCGG CGGCGACCGT GCTGTTTCAC
TTCGCCAATG CCGCGATGCT GCCGCTGGTC GGGCAGAAGC TCACGCTGGT GAACAGGGAG
ATCGGCACCA CCCTGATGTC GGTGTGCATC GTCGCGGCGC AGATCGTGAT GGTGCCGGTG
GCGATGCTGG TCGGGCACAA GGCCGATGTC TGGGGCCGCA AGCCGATCTT TGCGGTGGCG
CTGGGCGTGC TGGCGCTGCG CGGCGCGCTG TATCCGTTGT CCGACAATCC GTTCTGGCTG
GTCGGGGTGC AGATGCTCGA CGGCGTCGGG GCCGGCATCT TCGGCGCGCT GTTTCCGCTG
GTGGTGGCCG ACCTCACCCG CGGCACCGGT CATTTCAATA TCAGCCAGGG CGCGATCGCC
ACCGCTACCG GGATCGGCGG CGCGCTGTCG ACCGGCGTCG CGGGGCTGAT CGTGGTCACG
GCCGGCTACA GCGCTGCATT CCTCACCCTC GCTGCGATCG CGGCGCTCGG GCTGGTGCTA
TTCGTCGTCC TGATGCCCGA GACCCGCCAG ACCGGGCTGC CTGCGATCGG ACTGGCCCCG
GGCATGCCGG CTGAGTAG
 
Protein sequence
MPLQQTPGQQ RSLDALNFFL ADVRDGLGPY LAIYLLSVQH WNEASIGLVM TVAAMAGIAA 
QTPAGALIDR STAKRGLLIA AAIAVTLASV TLPLLQSFEA VAATQALAGA AGAIFAPAVA
AVTLGIVGPR AFARRTGRNE AFNHAGNAVA ATLAGVSAYF FGPVVVFWLM SAMAVASIFA
TLSIPAKAID DQVARGLASI GGLDAGPQVP DQRHDQPSGF KVLITCRPLL IFAAATVLFH
FANAAMLPLV GQKLTLVNRE IGTTLMSVCI VAAQIVMVPV AMLVGHKADV WGRKPIFAVA
LGVLALRGAL YPLSDNPFWL VGVQMLDGVG AGIFGALFPL VVADLTRGTG HFNISQGAIA
TATGIGGALS TGVAGLIVVT AGYSAAFLTL AAIAALGLVL FVVLMPETRQ TGLPAIGLAP
GMPAE