Gene RPC_4821 details

Gene Information

Locus tag: RPC_4821
Symbol:
ID: 3973525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 5378560
End bp: 5380224
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 62%
IMG OID: 637927933
Product: general substrate transporter
Protein accession: YP_534662
Protein GI: 90426292
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
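
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the record lists the gene as unspliced). The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(5378560, 5380224)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```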


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACGA TTGCTGTTGC ACCAGCCCGT ACGGGAGGGA TGACGAAGGA CGAACGTTTC 
GTCATTCTCG CGTCATCACT CGGCACGGTA TTCGAGTGGT ATGATTTTTA CCTCTACGGT
TCACTTGCCA GCATTATCGG CGCGCAGTTC TTCAGCGCCT ATCCGCCGGC CACCCGCGAC
ATCTTCGCGC TGCTCGCCTT CGCCGCGGGC TTCCTGGTCC GCCCGTTCGG TGCCATCGTG
TTCGGCCGCA TCGGCGATAT CGTCGGCCGC AAATATACCT TCCTGGTCAC CATCCTGATC
ATGGGTCTGT CGACCTTCAT CGTCGGTCTG TTGCCCAACG CGGCGACCAT CGGCATCGCG
GCGCCGATCA TCCTGATCGG TCTGCGGCTG TTGCAGGGCC TGGCGCTCGG CGGCGAATAC
GGCGGGGCAG CGACCTATGT CGCCGAGCAC TCGCCGCCCG GCAAACGCGG TTACTACACC
TCATTCATCC AGACCACGGC GACGCTCGGA CTCTTTCTCT CGCTGATCGT GATCCTGTTT
ACCCGCACCA TTCTCGGCGA GCCGGAATTC GCAGCCTGGG GCTGGCGTAT TCCGTTCCTG
GTGTCGGTGC TGCTGCTCGG CGTTTCGGTC TGGATCCGGC TGAAGCTGAA TGAGTCGCCG
GTGTTCCAGA AGATGAAGGA CGAAGGCAAG AGCTCGAAAG CGCCCTTGAC CGAAGCCTTT
GCCAACTGGG GCAACGCCAA GATCGTGCTG ATCGCCTTGA TCGGCGGCAC CATGGGCCAG
GGCGTGGTGT GGTACACCGG CCAGTTCTAC GCGCTGTTCT TCCTGCAATC GATCCTCAAG
GTTGACGGCT ATACCTCCAA CCTGTTGATC GCCTGGTCGC TGTTGTTCGG GACCGGCTTC
TTCATCTTCT TCGGCTGGCT GTCGGACAAA ATCGGCCGTA AGCCGATCAT TCTGACCGGC
TGCTTGATCG CGGCGCTGAG CTTCTTCCCG ATCTTCCGGA TGATCACCTC CAACGCCAAC
CCGGCGTTGG AAAAGGCCAT CGAGACCGTG AAGGTCGAGG TTGTGTCCGA TCCTGCGCTG
TGCGGCGATC TGTTCAACCC GGTCGGCACC CGCGTGTTCA CCGCGCCTTG CGACACCGCG
CGGGCCTATC TGGCGCAGTC CTCGGTGAAG TACTCGACCG CCTACGGTCC GGCCGGCTCC
GGCGTCAAGG TCGTCGTCAA CGGCACCGAG GTACCTTACG TCGACGCCAA GACCTCCAAT
CCGGCGGTGC TGGCGGCGGT TCAGGGCGCC GGCTATCCGA AGGCGGGTAA CGCCGACATC
GTCAAGATGT CGAACCCGTT CGACATCTTC AAGCCGCAGG CCGCGGCGGT GATCGGGCTG
CTGTTCATCT TGGTGCTGTT CGTCACCATG GTGTACGGGC CGATCGCGGC GATGCTGGTC
GAACTGTTCC CGACCAGGAT CCGCTACACC TCGATGTCGC TGCCCTATCA CATCGGCAAC
GGCTGGTTCG GCGGCTTGCT GCCGGCGACC GCCTTCGCCA TCGTGGCCTC GACCGGCGAT
ATCTATGCCG GCCTGTGGTA CCCGATCATC TTCGCCTTGA TCACCTTCGT CGTCGGTCTG
ATCTTCATGC CGGAGACCAA GAACGTCGAT ATCGGTCGCA GCTAA
 
Protein sequence
MSTIAVAPAR TGGMTKDERF VILASSLGTV FEWYDFYLYG SLASIIGAQF FSAYPPATRD 
IFALLAFAAG FLVRPFGAIV FGRIGDIVGR KYTFLVTILI MGLSTFIVGL LPNAATIGIA
APIILIGLRL LQGLALGGEY GGAATYVAEH SPPGKRGYYT SFIQTTATLG LFLSLIVILF
TRTILGEPEF AAWGWRIPFL VSVLLLGVSV WIRLKLNESP VFQKMKDEGK SSKAPLTEAF
ANWGNAKIVL IALIGGTMGQ GVVWYTGQFY ALFFLQSILK VDGYTSNLLI AWSLLFGTGF
FIFFGWLSDK IGRKPIILTG CLIAALSFFP IFRMITSNAN PALEKAIETV KVEVVSDPAL
CGDLFNPVGT RVFTAPCDTA RAYLAQSSVK YSTAYGPAGS GVKVVVNGTE VPYVDAKTSN
PAVLAAVQGA GYPKAGNADI VKMSNPFDIF KPQAAAVIGL LFILVLFVTM VYGPIAAMLV
ELFPTRIRYT SMSLPYHIGN GWFGGLLPAT AFAIVASTGD IYAGLWYPII FALITFVVGL
IFMPETKNVD IGRS
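
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC content. The following is a minimal sketch, not the tool that produced this record. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in its permitted start codons, and this sketch does not treat alternative start codons specially). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and its final 15 bases.
print(translate("ATGTCGACGA TTGCTGTTGC ACCAGCCCGT"))  # MSTIAVAPAR
print(translate("ATCGGTCGCA GCTAA"))                  # IGRS
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the whole record.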