Gene RPD_4124 details


Gene Information

Locus tag: RPD_4124
Symbol:
ID: 4024646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4589038
End bp: 4590246
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 70%
IMG OID: 637964332
Product: major facilitator transporter
Protein accession: YP_571244
Protein GI: 91978585
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
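
The coordinate, Gene Length, and Protein Length fields above can be cross-checked against one another. The following is a minimal Python sketch, assuming inclusive 1-based coordinates and a coding sequence that ends in a single stop codon counted in the gene length but not in the protein length. The function names are illustrative and are not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate precedes start coordinate")
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between blocks is ignored."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


gene_length = span_length(4589038, 4590246)
print(gene_length)                           # 1209
print(expected_protein_length(gene_length))  # 402
```

Passing the full Gene sequence from the Sequence section to `gc_percent` should give a value that rounds to the reported GC content, provided the record rounds to the nearest percent.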


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.278491
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCGC TGCGCAGCCT CGCGTTTGGA CCCGGACGCG CGGTCGTCGT GCTGTCGTTC 
ACCCAGATCC TGTGCTGGGG CATCCTGATC TACCCTCCTG TGCTGACCAT GCCGCATCTG
ACGGCGGATC GGGGCTGGTC GCTGGCGTTC GGGATGGCGG GATTTTCGCT GGCGCTGGTG
ATGTCCGGGA TCATGTCGCC GATCGTCGGC GGCCTGATCG ATCGCCGGGG CGGCAATCTC
GTGATGGCAC CCGGCGCATT GGCTGGGGCG CTCGGACTGG CGCTGCTCGC CAGCACCGAC
GCATGGCCGC TCTATTTCGC GAGCTGGGCG CTGATCGGCG TTTCGATGGC CTCGAGCCTG
TACGATCCGG CCTTCGCCAC GCTGGCGCGA TTGTTCGGCA GCTCGGCGCG GCGGCAGATC
ACCTTCGTCA CTTTCGCAGG GGGCTTCGCC TCGACGGTCG GCTGGCCGGC GACGCATCTG
TTGCTGGAAC ATGTCGGCTG GCGCGGCACC TATCTGGTGT TCGCCGCCGT GCTGGCCTTC
GTCGTCGCAC CGCTGCACGC CTTTGCGTTG CCGAGAACGC CGTCGCCGTC CCAGGGGGCG
GTCCCGCCCA GCCCGCATCT GGTGCCGGAG CAACCGCTGC GGCCCGAAGG GCGGGTTTTC
ATCCTGCTGG CGATGGGGTT CGCGCTGCAT GCGCTGATCC TGTCCGGCGT CACCTCGAAC
CTGCTGTCGA TGCTCGAACG CGGCGGGCTG AGCGCCGCCA CGGTGGTGAC GTTGGGGGCG
CTGTTCGGTC CCGCGCAGGT CGCGGCGCGC CTGGTCGATT TCCTGCTGGC GGGCCGCACC
CATCCGTTGT GGATCGCGCG CGGGGCGATC GCGCTGATGG CGGTCGCGTT CGCGATGCTG
GCGTTCGTCG GGGTATCGGT CGTCGTCGCA GGGCTGTTCT GCATCGCCTT CGGCGCGGCC
AACGGCGTGA TGACGATCGC GCGTGGCAGC CTGCCGCTGC TGATGTTCGG GCCGCAAGGT
TATGGCCGGG TGATCGGGCG CATCGCGCGG CCGGCGCTGT TCGTCCAGGC ATCGGCGCCG
TTCGTGGTCG CGGCGGCGGT CGAACGATTT TCCGACGCCG TGGTGATCGA GGTCGGGATG
GCGGCGGCGC TGGTCGGCGT CGGCTGCTTC CTGCTGATCC GGACGCCGCG CGCGGCGCCG
CGGCAATAG
 
Protein sequence
MTALRSLAFG PGRAVVVLSF TQILCWGILI YPPVLTMPHL TADRGWSLAF GMAGFSLALV 
MSGIMSPIVG GLIDRRGGNL VMAPGALAGA LGLALLASTD AWPLYFASWA LIGVSMASSL
YDPAFATLAR LFGSSARRQI TFVTFAGGFA STVGWPATHL LLEHVGWRGT YLVFAAVLAF
VVAPLHAFAL PRTPSPSQGA VPPSPHLVPE QPLRPEGRVF ILLAMGFALH ALILSGVTSN
LLSMLERGGL SAATVVTLGA LFGPAQVAAR LVDFLLAGRT HPLWIARGAI ALMAVAFAML
AFVGVSVVVA GLFCIAFGAA NGVMTIARGS LPLLMFGPQG YGRVIGRIAR PALFVQASAP
FVVAAAVERF SDAVVIEVGM AAALVGVGCF LLIRTPRAAP RQ
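
The protein sequence above is the translation of the gene sequence under translation table 11. As a rough illustration, the Python sketch below translates a stretch of DNA codon by codon. It assumes that table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the start codons it permits), and it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Compact codon table: codons enumerated in TCAG order, one letter per codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence listed in this record.
first_line = "ATGACTGCGC TGCGCAGCCT CGCGTTTGGA CCCGGACGCG CGGTCGTCGT GCTGTCGTTC"
last_line = "CGGCAATAG"

print(translate(first_line))  # MTALRSLAFGPGRAVVVLSF
print(translate(last_line))   # RQ
```

The two outputs match the first 20 and last 2 residues of the protein sequence, so the listed start (ATG) and stop (TAG) codons are consistent with the reported protein.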