Gene RPD_3322 details


Gene Information

Locus tag: RPD_3322
Symbol:
ID: 4023832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 3680943
End bp: 3682268
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 63%
IMG OID: 637963526
Product: twin-arginine translocation pathway signal
Protein accession: YP_570447
Protein GI: 91977788
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
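
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3680943
END_BP = 3682268


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the listed Gene Length (1326 bp) and Protein Length (441 aa).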


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACT TCACATTCGA TCGCCGCTCG TTGCTGAAGG GTGGCGCGCT GACTCTGGCC 
GCGGCGGCGA CCATGTCCGC GGATCAATTG CTGGGTTATG CGAAGGCCTG GGCGCAGACC
TCGCCGTGGA AGCCGGAAGC CGGCGCCAAG ATCAATCTGC TGCGCTGGAA GCGCTTCGTC
GAGGCCGAAG ACGTCGCCTT CATGAAGATC GTCGATGCGT TCCAGAAGGC CAACAACGTC
ACCATCAACG TTTCCAACGA ATCCTACGAC GACATCCAGC CGAAGGCTTC GGTCGCCGCC
AATACCGGGC AGGGTCTCGA TATGGTGTGG GGCCTATATT CGCTGCCGTT CCTGTTCCCG
AGCAAGTGCA CCGACGTCTC CGACGTCGCC GACCACCTCG CCAAAAAGTG CGGCGGCTGG
ACCGATTCCG GCAAGGCCTA TGGCATGCAC AACGGCAAGT GGATCGGCAT TCCGGTCGCG
GCGACCGGCG GCCTCGTCAA CTACCGCATC AGCGCGGCGG AGAAAGCGGG CCACAAGGAG
TTTCCGAAGG ACCTCGCCGG CTTCTCGGAT CTGATCAAGG GCCTGAACAA GAACGGCACG
CCGGCCGGAA TGGCGCTCGG CCACGCCTCG GGCGACGCCA ACAGCTGGCT GCACTGGGCG
CTGTGGGCGC ATGGCGGAAG GCTGATCGAC AAGGACAACA AGGTCGTCGT CAATTCACCC
GAGACCGCAA AGGCGCTGGA GTACACCAAG GGTCTGTACG ACAGTTTTAT TCCCGGCACG
GCGTCGTGGA ACGACGCGTC CAACAACAAG GCGTTTCTGG CCGGCCAGCT CTATCTCACC
GTCAACGGCA TCTCGATTTA CGTGACGGCC AAGAAGGACA ACAAGGAGAT GGCGGCGGAC
ATCAACCACG CGCATCTGCC CGCCGGCGTC AGCGGCAAGA CCCGCGAAAT GCATCTCGGC
TTTCCGATCC TGATCTACAA CTTCACCAAG TTCCCGAACA CCTGCAAGGC GTTCACCGCC
TTCATGATGG AGCCGGAGCA GTTCAACCCG TGGGTCGAGG CGGCGCAGGG CTATCTGTCG
CCGTTCCTGC TCGACTACGA GAAGAATCCG ATGTGGACGG CGGACCCGAA GAACACCCCA
TATCGCGACG TCGGACGCAC GGCGTCGACG CCGGCCGGCG ACGGTCAGAT GGGCGAGAAC
GCCGCCGCCG CGATCGCCGA CTTCGTCATC GTGGATATGT TTGCGAACTA CTGCACCGGT
CGCGAGGACG TGAAGACCGC GATGAGCAGC GCCGAACGCG CGGCGAAGCG GATCTTCCGG
GCGTGA
 
Protein sequence
MTDFTFDRRS LLKGGALTLA AAATMSADQL LGYAKAWAQT SPWKPEAGAK INLLRWKRFV 
EAEDVAFMKI VDAFQKANNV TINVSNESYD DIQPKASVAA NTGQGLDMVW GLYSLPFLFP
SKCTDVSDVA DHLAKKCGGW TDSGKAYGMH NGKWIGIPVA ATGGLVNYRI SAAEKAGHKE
FPKDLAGFSD LIKGLNKNGT PAGMALGHAS GDANSWLHWA LWAHGGRLID KDNKVVVNSP
ETAKALEYTK GLYDSFIPGT ASWNDASNNK AFLAGQLYLT VNGISIYVTA KKDNKEMAAD
INHAHLPAGV SGKTREMHLG FPILIYNFTK FPNTCKAFTA FMMEPEQFNP WVEAAQGYLS
PFLLDYEKNP MWTADPKNTP YRDVGRTAST PAGDGQMGEN AAAAIADFVI VDMFANYCTG
REDVKTAMSS AERAAKRIFR A
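
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be normalized and its length and GC fraction computed; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used only as sample input.
first_line = (
    "ATGACTGACT TCACATTCGA TCGCCGCTCG TTGCTGAAGG GTGGCGCGCT GACTCTGGCC"
)

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same helpers gives a way to compare the result against the listed Gene Length and GC content fields.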