Gene RPD_0835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0835 
Symbol 
ID4021309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp935891 
End bp937555 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content62% 
IMG OID637961025 
Productgeneral substrate transporter 
Protein accessionYP_567974 
Protein GI91975315 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0740201 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCT ATGCCGCGAC GCAGGTGCGG TCCAGCGGAA TGACGAAGGA CGAACGTTTC 
GTCATTGTCG CATCGTCGCT CGGTACCGTT TTTGAGTGGT ACGATTTCTA TCTGTACGGA
TCGCTCGCAG CCATTATCGG CGCGCAATTC TTCAGTGCCT ATCCGCCTGC GACGCGCGAC
ATCTTCGCGC TTCTGGCCTT CGCCGCCGGC TTTCTCGTGC GCCCGTTCGG CGCCATCGTG
TTCGGTCGTG TCGGCGACAT CGTCGGCCGT AAATACACCT TCCTCGTCAC CATTCTGATC
ATGGGCCTGT CGACGTTCAT CGTCGGCCTG CTGCCCAATG CGGCGACGAT CGGCATCGCG
GCGCCGATTA TCCTGATCGG TCTGCGCCTG CTGCAGGGCC TCGCGCTCGG CGGTGAATAT
GGCGGCGCGG CGACCTATGT GGCCGAGCAC GCACCTCCCG GCAAACGCGG CTACTACACG
TCGTTCATCC AGACCACGGC AACACTCGGA CTATTCCTGT CGCTGATGGT GATCCTGTTC
ACCCGCACGA TTCTCGGCGA AGCCGCGTTC GCGGATTGGG GCTGGCGTGT TCCGTTCCTG
GTGTCGGTCG TGCTGCTCGG CGTTTCGGTC TGGATCCGGC TGCGGCTGAA CGAATCTCCC
GTGTTCCAGA AGATGAAGGA CGAGGGCAAG AGCTCGAAGG CGCCGTTGAC CGAAGCCTTT
GCGAACTGGG GCAACGCCAA GATCGTGCTG ATCGCACTGT TCGGCGCCGT GATGGGTCAG
GGCGTGGTCT GGTACACCGG CCAGTTCTAC GCGCTGTTCT TCCTGCAATC GATCCTGAAG
GTCGACGGCT ACACCTCGAA CCTGCTGATC GCCTGGTCGC TGCTGCTCGG CACCTTCTTC
TTCATCGTGT TCGGTTGGCT GTCCGACAAG ATCGGTCGCA AGCCGATCAT CCTCACCGGC
TGCGCGATCG CTGCGCTGTC GTTCTTCCCG ATCTTCAAGG CGATCACCTC CAACGCCAAC
CCGGCGCTGG AAAGGGCCAT CGAGACCGTC AAGGTCGAGG TGGTGTCGGA TCCGGCGCTG
TGCGGCGACC TGTTCAACCC GGTCGGCACC CGCGTCTTCA CCGCCCCGTG CGACACCGCC
CGCGCCTACC TGTCGCAATC CTCGGTCAAG TACTCGACCA CCAACGGCCC GGCCGGCTCC
GGCGTCAAGG TGCTCGTGAA CGGCACGGAA GTGCCCTACA CCGACGCCAA AACGTCCAAT
CCGCAGGTGC TGGCGACGAT CCAGGCGGCC GGCTATCCGA AGGCGGGAAA TTCTGAAATC
ATCAAGATGT CGAACCCATT CGACATCTTC CGCCCGCAAG TGATGGCGGT GATCGGGCTG
CTGTTCGTCC TGGTGCTGTT CGTCACCATG GTTTACGGGC CGATCGCGGC GATGCTGGTC
GAACTATTCC CGACCCGCAT CCGCTACACC TCGATGTCGC TGCCCTACCA CATCGGCAAC
GGCTGGTTCG GTGGCCTGCT GCCCGCGACC GCCTTCGCGA TCGTGGCCTC GACCGGCGAT
ATCTACGCCG GCCTCTGGTA CCCGATCATC TTCGCGTCGA TCACCGTCGT GATCGGCCTG
ATCTTCCTGC CAGAGACCAA ACACGTCGAT ATCAGCAAAA CCTGA
 
Protein sequence
MSTYAATQVR SSGMTKDERF VIVASSLGTV FEWYDFYLYG SLAAIIGAQF FSAYPPATRD 
IFALLAFAAG FLVRPFGAIV FGRVGDIVGR KYTFLVTILI MGLSTFIVGL LPNAATIGIA
APIILIGLRL LQGLALGGEY GGAATYVAEH APPGKRGYYT SFIQTTATLG LFLSLMVILF
TRTILGEAAF ADWGWRVPFL VSVVLLGVSV WIRLRLNESP VFQKMKDEGK SSKAPLTEAF
ANWGNAKIVL IALFGAVMGQ GVVWYTGQFY ALFFLQSILK VDGYTSNLLI AWSLLLGTFF
FIVFGWLSDK IGRKPIILTG CAIAALSFFP IFKAITSNAN PALERAIETV KVEVVSDPAL
CGDLFNPVGT RVFTAPCDTA RAYLSQSSVK YSTTNGPAGS GVKVLVNGTE VPYTDAKTSN
PQVLATIQAA GYPKAGNSEI IKMSNPFDIF RPQVMAVIGL LFVLVLFVTM VYGPIAAMLV
ELFPTRIRYT SMSLPYHIGN GWFGGLLPAT AFAIVASTGD IYAGLWYPII FASITVVIGL
IFLPETKHVD ISKT