Gene RPC_3634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3634 
Symbol 
ID3970649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4040942 
End bp4042639 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content69% 
IMG OID637926742 
Productmajor facilitator transporter 
Protein accessionYP_533488 
Protein GI90425118 
COG category[G] Carbohydrate transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0589] Universal stress protein UspA and related nucleotide-binding proteins
[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.836817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0365069 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATG CCCCCGCCGG CGACGTCGGT TCGAGTTTCT TTCGCCAGCC CCGCGCGGTC 
TGGGCCACCG CCTTCGCCGC GGTGGTCGGC TTCATGGGGA TCGGCCTGGT CGATCCGATC
CTGACCTCGA TCGCCGAGGG GCTGCAGGCC ACGCCGAGCC AAGTGTCGCT GCTGTTCACC
AGCTATTTCG CGGTGACCTC GGTGATGATG CTGGCGACTG GCTTCGTCTC CAGCCGGATC
GGCGGGCGGC GGACGCTGCT GCTCGGCGCG GCGCTGATCG CCTGCTTCGC TGCACTCGCC
GGCACGTCGC ATTCGGTCAC CGAACTGGTG CTGTATCGGG CCGGCTGGGG GCTCGGCAAC
GCGTTCTTCG TCGCCACCGC GCTGTCGGTG ATCGTGGCGG CGGCGAGCGG CGGCACCGCC
ACCGCGATTC TGTTGTATGA GGCGGCGCTC GGGCTCGGCA TTTCGGTCGG CCCGCTGCTC
GGCGCAGCCC TTGGTAACCT GTCGTGGCGC TATCCGTTTT TCGGCACCGC GGCGCTGATG
ACCATCGGCT TCGTAGCGAT TGCGTTGTTT CTCGAGGTGC AGCCCAAGGC GGCGCGGAAA
ACCAGTCTGG TCGATCCGAT CCGCGCGCTC GGCCATCGCG GGCTGTTGTC GGTGGCGGGC
AGCGCGTTCT TCTACAATTA CGCGTTCTTC ACCGTGCTGG CGTTCGTGCC CTTCGTGCTG
CAGGCCTCGG CGCACACCGT CGGGCTGATC TTCTTCGGCT GGGGACTGGC GCTGGCGGTG
TTCTCGGTGC TGGTGGCGCC GCGGCTGCAG ATCCTGTTCA GCGCGCTGAC GCTCGGCGTC
GGCAATCTGC TGGCGCTGGC GGTGCTGCTC CTCGGCATGG CCTTCGGCTC GGTGCCGATC
ATCGTCGCCG CGGTGGTGCT GTCCGGCGCG GTGATGGGCA TCAACAACAC CGTGTTCACC
GAAATGGCGC TGGAGATTTC GCCGTTTCCG CGCCCGGTGG CCTCGGCGGC CTATAATTTC
GTGCGCTGGT TCGCCGGCGT GATCGCGCCC TTTGCGGCGC CGAAGATCGC CGAGCATTTC
GGCGCCTCGG CGTCGTTCGT GGTCGCGGCG GTGTCGGCGC TGGCCGCCGC AGGCGTGCTG
CTGGCGATGC GCGGCAATCT CGGTCGGTTC GCCTCAAAGC ATCCCGCCGC GGCGCCCGCG
GAAGCGCCCT CGGCGGCAAC CGGGCCGATC CTGGTCGCGG TCGACGGCAC CGCCAACGAC
CGGGCGATCC TTGCCCGTGC GGCCAAGGTA GCGCTGGCGC TGGGCGCGCC GATCGAGGTG
CTGCATGTCC GCCCGCTGGA ACTGGTCGAG GGTGAAGCCG CCGAGGCGGA AAGTTCCACC
GGCTCCGCCG CCATTCTCGA CCTCGCCTTG GCGCAACTGC GCGACGCCGG CCTGCAGGCC
GCCGGTGCGG TGCGGGAAGA GGTTGCCGCA AGAACGCCAC AAGCGATTCT CGACCATGCC
GCGGATCTCG ACGCCCGGCT GATTGTGCTC GGCGCCCGTC ACCACGACGA CCCGACCGAC
ATCATTCATG GCAGCGTCGC CGATATCATC GGCCGCAGGG CCACTCGCCC AGTGATGTTG
GTGCCGGAGC CGGGCGCGAA TGAGCGGCGA TGCGAGTCCG CGTCGCCCGC CGCAGATCGC
TCGAAAATCG GTATCTGA
 
Protein sequence
MSNAPAGDVG SSFFRQPRAV WATAFAAVVG FMGIGLVDPI LTSIAEGLQA TPSQVSLLFT 
SYFAVTSVMM LATGFVSSRI GGRRTLLLGA ALIACFAALA GTSHSVTELV LYRAGWGLGN
AFFVATALSV IVAAASGGTA TAILLYEAAL GLGISVGPLL GAALGNLSWR YPFFGTAALM
TIGFVAIALF LEVQPKAARK TSLVDPIRAL GHRGLLSVAG SAFFYNYAFF TVLAFVPFVL
QASAHTVGLI FFGWGLALAV FSVLVAPRLQ ILFSALTLGV GNLLALAVLL LGMAFGSVPI
IVAAVVLSGA VMGINNTVFT EMALEISPFP RPVASAAYNF VRWFAGVIAP FAAPKIAEHF
GASASFVVAA VSALAAAGVL LAMRGNLGRF ASKHPAAAPA EAPSAATGPI LVAVDGTAND
RAILARAAKV ALALGAPIEV LHVRPLELVE GEAAEAESST GSAAILDLAL AQLRDAGLQA
AGAVREEVAA RTPQAILDHA ADLDARLIVL GARHHDDPTD IIHGSVADII GRRATRPVML
VPEPGANERR CESASPAADR SKIGI