Gene RPD_0594 details


Gene Information

Locus tag: RPD_0594
Symbol:
ID: 4021063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 671337
End bp: 672662
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 65%
IMG OID: 637960782
Product: general substrate transporter
Protein accession: YP_567733
Protein GI: 91975074
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
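
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 671337
end_bp = 672662

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1326
print(protein_length)  # 441
```

Under these assumptions the computed values agree with the listed Gene Length (1326 bp) and Protein Length (441 aa).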


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGATA CGACGACGGC GGGACTCGGC TCTGATCCGG CGCGCAAGCA CGTCAATCCC 
GGCGAAATCG CCATCGGCGT CATCATCGGC CGCACCGCCG AGTTCTTCGA CTTCTTCGTC
TATGCGATCG CGTCGGTGAT CGTGTTTCCA CGGCTGGTGT TCCCGTTCGC CAGCGAGCTG
ACCGGCACGC TGTATTCGTT CGCGATCTTC GCCATCGCCT TCATCGCCCG GCCGCTTGGC
ACCGTGATCT TCATGGCGAT CGACCGCCGC TACGGCAAAA GCGCGAAGCT GATCCTCGCG
ATGCTGTTGC TCGGCACCGC GACGGTCGCG ATCGCGTTCC TGCCCGGCTA TTACGAGATC
GGCGCGGCGG CGATCTGGCT GCTGGCGCTG GCGCGCGCGG CCCAGGGCCT CGCCTGGGGC
GGCGCCTGGG ACGGTCTGGC GTCGCTGCTG GCGCTCAACG CGCCGAAACA GCAGCGCGGC
TGGTATGCGA TGGTCCCGCA GCTCGGCGCG CCGCTCGGCC TGATCGTGGC GAGTGCGCTG
TTCGCCTACT TCGCCGGCAA TTTGTCTTCG GAAGACTTCT TCGGCTGGGG CTGGCGCTAT
CCGTTCTTCG TCGCCTTCGC GATTAACGTC GTCGCGCTGT TCGCGCGGCT GCGTATGGTG
GTAACGCCGG AATATTCGTC GCTGTTCGAA AGCCGCGAAT TGCAGCCCAG CCGCGTCCTC
GACACGCTCC GCCACGACGG CCGCAACATT ATCATCGGCG CTTTCGCGCC GCTGGCGAGC
TTCGCGCTGT TCCATATGGT CACGGTGTTC CCGCTGTCCT GGGTGTTCCT GTTCACGCGC
GAAAGCCCGG TGCGGTTCCT GATCATCGAG ACCATCGGCG CAATGTTTGG CGTGGTGGCG
ATCATCGCGT CCGGCATGCT CGCCGATCGC ATCGGCCGCA AGCCGCTGCT GATGGGCTCG
GCGATCGCAA TTGCGGTGTT CAGCGGCTTC GCCCCGCAGA TGCTCGATGC CGGCCCGGTC
GGCGAGACCG CCTATATGAT CCTCGGCTTC ATCCTGCTGG GATTGTCGTT CGGCCAGTCG
TCCGGCGTGA TCGCATCGAG CTTCCGGACT ACCTATCGCT ACACCGCGTC GGCGCTGACC
GCCGACCTCG CCTGGCTGTT CGGCGCCGGC TTCGCCCCGC TGGTCGCACT GCTGCTCGCC
ACCGATTTCG GCCTGATCTC GTCGGGCGCA TACCTGCTGT CGGGCGCAGT GGTGACCATC
CTGGCGCTGT GGATCAGCGG CCTGCGCGAA TCCAACGAGT ACCGCTCACC ATCGGAATCG
GAATAG
 
Protein sequence
MTDTTTAGLG SDPARKHVNP GEIAIGVIIG RTAEFFDFFV YAIASVIVFP RLVFPFASEL 
TGTLYSFAIF AIAFIARPLG TVIFMAIDRR YGKSAKLILA MLLLGTATVA IAFLPGYYEI
GAAAIWLLAL ARAAQGLAWG GAWDGLASLL ALNAPKQQRG WYAMVPQLGA PLGLIVASAL
FAYFAGNLSS EDFFGWGWRY PFFVAFAINV VALFARLRMV VTPEYSSLFE SRELQPSRVL
DTLRHDGRNI IIGAFAPLAS FALFHMVTVF PLSWVFLFTR ESPVRFLIIE TIGAMFGVVA
IIASGMLADR IGRKPLLMGS AIAIAVFSGF APQMLDAGPV GETAYMILGF ILLGLSFGQS
SGVIASSFRT TYRYTASALT ADLAWLFGAG FAPLVALLLA TDFGLISSGA YLLSGAVVTI
LALWISGLRE SNEYRSPSES E
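
The relationship between the two sequences above, and the GC content field, can be reproduced with a short script. This is a minimal sketch rather than the method used to build the record: it assumes the sequence is pasted as text with whitespace between 10-base blocks, and it uses the standard codon assignments, which translation table 11 shares for every codon except in its handling of alternative start codons. The function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses these same assignments; it differs from the
# standard code only in which codons may act as starts, which does not
# matter here because this gene begins with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGACAGATA CGACGACGGC GGGACTCGGC"
print(translate(first_blocks))  # MTDTTTAGLG
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the result to be compared with the protein sequence and the GC content field listed in the record.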