Gene RPC_3739 details


Gene Information

Locus tag: RPC_3739
Symbol:
ID: 3970334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4164063
End bp: 4165304
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 66%
IMG OID: 637926849
Product: major facilitator transporter
Protein accession: YP_533593
Protein GI: 90425223
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
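
The length fields in this record are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the stop codon is counted in the gene length but not in the protein length. Both readings are assumptions inferred from the numbers, not stated by the record, and the function names below are hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4164063, 4165304)
print(length, protein_length_aa(length))  # prints: 1242 413
```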


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.101854
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGCAG CCCAACTCGG CGCATGGCAG CGCTGGTCGA TCCTGGCGGG CGCCGCGATC 
CTGCTGAGCC TGGCGATGGG GATGCGGCAA AGCTTCGGGC TGTTTCAACC CTCGGTCATT
CGCGACATCG GCATCACCTC GGCGGATTTC TCGCTCGCCA CCGCGCTGCA GAACGTGGTC
TGGGGCGTCA CCCAACCCTT CGTCGGCATG TTCGCCGATC GCTACGGCAC CCGCTATGTG
ATGCTCGGCG GCGTGCTGAT CTATGCCGCG GGTCTGGTGT TGATGATGGT CGCGACCTCG
GCGTTGGTGT TCACGCTGGG CGCCGGGTTC TGCGTCGGGC TGGCGCTGTC CTGCACCGCG
TCGAGCCTGA CCATGACGGT GACCTCGCGC ACGGTGTCGG CGGCCAAGCG CAGCGTCGCG
ATGGGCGCGG TGTCGGCGGT CGGATCGCTC GGGCTGGTGA TCGCCTCGCC ATTGGCGCAG
ACGCTGATCT CGACCTCGGG CTGGAAGATG GCGCTGATCG GCTTTCTCGG TCTCGCCGCG
GTGATGCTGC CATCGGCATT GTTCGCCGGA CGCTCCGACA AAATCGAGAT CGACAAGGCC
GACGACAGCG AGCAATCGCT CGGCTCCGTG ATGCAATCCG CGCTCGGGCA TTCCGGTTTC
GTGGTGATGT CGCTGGCGTT CTTCGTCTGC GGGTTGCAAT TGGTGTTCAT TACCACGCAT
CTGCCGAACT ATCTGGATAT TTGCGGGCTC GATCCGTCGC TCGGCGCCAC TGCGCTCGCC
ATCATCGGGC TGTTCAACGT GATCGGCTCC TATGCCTGCG GCTGGCTCGG CGGCCGCTAT
CCGAAACAGC TGCTGCTCGG CGCGATCTAC ATCATCCGCT CGGTGGCGCT CGCCGCCTAT
TTCTATTTTC CGGCGTCGGC CGCCTCCACC ATGGTGTTCG CCGCGGTGAT GGGATCGCTG
TGGCTCGGCG TGGTGCCGCT GGTCAACGGG TTGGTGGCGC AACTGTTCGG GCTGCGCTTC
ATGGCGACGC TGGCCGGCAT CGCCTTCCTC AGCCATCAGG CCGGCTCGTT CCTCGGCGCC
TGGGGCGGCG GGATGATCTA CGACCGGCTC GGCAGCTATG ACGCTGCCTG GCAAGCCGCG
GTGCTGATCG GATTGATCGC CGGCGCTTTT CAGATGTTGA TGAACGTACG TCCACCGCAG
CGTCGCGATG CGCTGGGCGG TGCGGTGGCC AATGCGGCGT GA
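
The gene sequence is displayed in space-separated blocks of 10 bases, so the spacing has to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the database's own rounding convention is not given in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the display spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGAGGGCAG CCCAACTCGG CGCATGGCAG CGCTGGTCGA TCCTGGCGGG CGCCGCGATC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full 21-line sequence to `clean_sequence` instead of the first line would give a value to compare against the reported 66%.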
 
Protein sequence
MRAAQLGAWQ RWSILAGAAI LLSLAMGMRQ SFGLFQPSVI RDIGITSADF SLATALQNVV 
WGVTQPFVGM FADRYGTRYV MLGGVLIYAA GLVLMMVATS ALVFTLGAGF CVGLALSCTA
SSLTMTVTSR TVSAAKRSVA MGAVSAVGSL GLVIASPLAQ TLISTSGWKM ALIGFLGLAA
VMLPSALFAG RSDKIEIDKA DDSEQSLGSV MQSALGHSGF VVMSLAFFVC GLQLVFITTH
LPNYLDICGL DPSLGATALA IIGLFNVIGS YACGWLGGRY PKQLLLGAIY IIRSVALAAY
FYFPASAAST MVFAAVMGSL WLGVVPLVNG LVAQLFGLRF MATLAGIAFL SHQAGSFLGA
WGGGMIYDRL GSYDAAWQAA VLIGLIAGAF QMLMNVRPPQ RRDALGGAVA NAA
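
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The `translate` helper is hypothetical and pure Python; it is not the tool the database used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGAGGGCAG CCCAACTCGG CGCATGGCAG"))  # prints: MRAAQLGAWQ
print(translate("AATGCGGCGT GA"))  # prints: NAA
```

The two outputs match the first block and the last three residues of the protein sequence shown above.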