Gene RPD_4068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4068 
Symbol 
ID4024585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4520382 
End bp4521461 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content67% 
IMG OID637964271 
ProductABC transporter related 
Protein accessionYP_571188 
Protein GI91978529 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.412838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCA GCCTGGAGAA TGTCACCAGG ATGATCGACG GCGTGCCGGC GATCCGCGAC 
GTGTCGCTGA CGCTGGAGCG CGGCACGCTG AGCGTGCTGC TCGGGCCGAC GCTGTCGGGC
AAGACCTCGA TCATGCGGCT GCTCGCCGGC CTCGACAAGC CGACCACAGG CCGCGTGCTG
GTCGACGGAA AGGACGTCAC CGGGTTCGAC GTGCGCAAGC GTTCGGTGGC GATGGTGTAT
CAACAATTCA TCAATTACCC GTCGCTGACG GTCTATGAGA ACATCGCCTC GCCGCTGCGG
GTGCAGGGCA AGTCGCGCGA CGAGATCGAG CAGCGCGTGC AGGAGGCGGC CAAGCTGCTG
AAGCTCGAGC CGTATCTGAA GCGCACGCCG CTGCAACTCT CCGGCGGCCA GCAGCAGCGC
ACCGCGATCG CCCGCGCGCT GGTCAAGGGC GCCGATCTCG TGCTGCTCGA CGAGCCGCTC
GCCAATCTCG ACTACAAGCT GCGCGAAGAA CTGCGCACCG AACTGCCGCG GATCTTCGAG
GCGTCGGGTG CGATCTTCGT CTACGCCACC ACCGAGCCCT CCGAGGCGCT GCTGCTCGGT
GGTCGCACCG TCTGCATGTG GGAAGGACAG GTGCTGCAGA CCGGCCCGAC GCCCTACGTC
TATCGGCAGC CCGACACCAT GCGGGTCGCG CAGGTGTTCT CCGATCCGCC GCTCAATATT
GTCGGCGCGG AGAAGAAGGC CGGCACCGTG CATTATGCCG GCGGCGTTAC GGCGCCCGCC
ACTGGCGTCT TCGCCGGGCT CGGCGACGGC GCCTATCGGG TCGGCTTCCG CGCCCATCAG
ATCGAGGTCG CGCGCGTCAA TCCGGATCGC CACGCGTTCC AGGCCACCGT CGCGGTGACC
GAGATCACCG GCTCGGAGAG CTTCGTGCAT CTCAAGCGCG GCGACGACAA TTGGGTCGCG
GTGCTGCACG GCGTCCACGA GTTCGAACCG GGCCAAACCC TGGACGCGAT CCTCGACCCC
GCCAATCTGT TCGTGTTCGA CGCGGCCGAC CGCCTCGTCG CCGCGCCGAA GCCGATGTGA
 
Protein sequence
MSVSLENVTR MIDGVPAIRD VSLTLERGTL SVLLGPTLSG KTSIMRLLAG LDKPTTGRVL 
VDGKDVTGFD VRKRSVAMVY QQFINYPSLT VYENIASPLR VQGKSRDEIE QRVQEAAKLL
KLEPYLKRTP LQLSGGQQQR TAIARALVKG ADLVLLDEPL ANLDYKLREE LRTELPRIFE
ASGAIFVYAT TEPSEALLLG GRTVCMWEGQ VLQTGPTPYV YRQPDTMRVA QVFSDPPLNI
VGAEKKAGTV HYAGGVTAPA TGVFAGLGDG AYRVGFRAHQ IEVARVNPDR HAFQATVAVT
EITGSESFVH LKRGDDNWVA VLHGVHEFEP GQTLDAILDP ANLFVFDAAD RLVAAPKPM