Gene RPD_0685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0685 
Symbol 
ID4021156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp768592 
End bp770280 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content70% 
IMG OID637960873 
Productprotein of unknown function DUF894, DitE 
Protein accessionYP_567824 
Protein GI91975165 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.667795 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGGGG GAGCGAAGCG AGGCGTGTTC TCCGCCGGCG GCGTCGCGGC GCCGCTGCGC 
TACGCGCTGT TCCGGCGGAT CTGGCTGGCG AGCCTTCTGT CCAATCTCGG CCTGATGATC
AACGGCGTCG GCGCCGCCTG GGCGATGACG CAGATGGCGT CGTCCGCCGA CAAGGTCGCG
CTGGTGCAGA CCGCCTTGAT GCTGCCGATC ATGCTGGTGG CGATGCCGGC CGGCGCGATC
GCCGATATGT ATGATCGCCG AATCGTGGCG CTGGCGTCGC TGACGCTCGG CCTCGGCGGC
TCGACGGTGC TGGCGGTGCT GGCGCATCTC GGCCTGGTGA CTCCGGAGAT CCTGCTCGCC
TTCTGCTTCG TCATCGGCAC CGGCATGGCG CTGTTCGGCC CCGCCTGGCA GGCCTCGGTC
AGCGAACAGG TGCCGGGCGA GGCGCTGCCG GCGGCGGTGG CGCTGAACGG CATCAGCTAC
AACATCGCCC GCAGCTTCGG CCCCGCGGTC GGCGGCATCG TGGTGGCGAC GGCCGGCGCG
GTTGCGGCGT TCGCGGCCAA TGCGGCGCTG TATCTCCCGC TGCTGATGGT GCTGTTCCTG
TGGCGGCGCG TCAGCGAGCC GCCTCGGTTG CCGCCGGAAC GGATGAATCG CGCGATCGTC
TCCGGCGTGC GCTACATCGC CAACTCACCC TCGATCCGGA TCGTGCTGAC GCGGACGCTG
GTCACCGGGA TCGCCGGCAG CTCGGTGCTG GCGCTGATGC CGCTGGTGGC GCGGGACCTG
CTGAAGAGCG GCGCAGAGAC CTACGGCATT CTGCTCGGCG CGTTCGGCGT CGGCGCGGTG
ATCGGCGCGC TCAATGTCGG GCTGGCACGC GAACGGCTGA GCAGCGAAGC CGCGGTGCGC
TCCTGCGCGA TCATCATGGG CCTGGCGATG GCGGCGGTCG CGCTGAGCCG CTCGTCGCTG
ATCAGCGCCG CGGCGCTGGT GGTCGCGGGC GCGGTGTGGA TGCTGGCGAT CGCGCTGTTC
AATATCGGCG TGCAGCTATC CGCGCCGCGC TGGGTGGCGG GCCGCTCGCT GGCGGCGTTT
CAGGCGTCGA TCGCCGGCGG CATCGCAATC GGAAGCTGGG TCTGGGGCCA TGTCGCCGAT
CTGGCGGGCG TTGCGCCGTC GATGCTGATC TCGGCCGGGG TGATGTTGGT CTCGCCGCTG
GTCGGACTGC TTCTGCGAAT GCCGTCGGTC GGCACCCAGA CCGAGGATGC CGAACTCCTC
GCCGACCCGG AAGTGCGGCT GGCCTTGACG CCGCGCAGCG GACCGGTGGT GATCGAGATC
GATTACCGCG TCGATCAGGA CGACGCCCGC GCGTTTCACG GCGTGATGCA GCAGGTCCAG
CTCAGCCGCC AGCGTAACGG CGCCTATGGC TGGTCGATCG CCCGCGACAT CGCCGACCCG
GAACTGTGGA CCGAGCGCTA TCACTGCCCG ACCTGGCTGG ACTATCTGCG ACAGCGAAGC
CGTTCGACTC AGCACGATCG CGCCATGCAC CAGCGCGCGA TGGCGTTTCA CCGCGGGCCG
GCCCCGATCC GGGTGCGCCG GATGCTGGAG CGGCCGTTCG GATCGGTGCG CTGGAAGGAC
GAGTCGCCCG ATCGCCCCAC CGGGACCGAA GTGCTGCCGG TCGCAGGCGT CAGCGGCGGT
TCGACCTGA
 
Protein sequence
MTGGAKRGVF SAGGVAAPLR YALFRRIWLA SLLSNLGLMI NGVGAAWAMT QMASSADKVA 
LVQTALMLPI MLVAMPAGAI ADMYDRRIVA LASLTLGLGG STVLAVLAHL GLVTPEILLA
FCFVIGTGMA LFGPAWQASV SEQVPGEALP AAVALNGISY NIARSFGPAV GGIVVATAGA
VAAFAANAAL YLPLLMVLFL WRRVSEPPRL PPERMNRAIV SGVRYIANSP SIRIVLTRTL
VTGIAGSSVL ALMPLVARDL LKSGAETYGI LLGAFGVGAV IGALNVGLAR ERLSSEAAVR
SCAIIMGLAM AAVALSRSSL ISAAALVVAG AVWMLAIALF NIGVQLSAPR WVAGRSLAAF
QASIAGGIAI GSWVWGHVAD LAGVAPSMLI SAGVMLVSPL VGLLLRMPSV GTQTEDAELL
ADPEVRLALT PRSGPVVIEI DYRVDQDDAR AFHGVMQQVQ LSRQRNGAYG WSIARDIADP
ELWTERYHCP TWLDYLRQRS RSTQHDRAMH QRAMAFHRGP APIRVRRMLE RPFGSVRWKD
ESPDRPTGTE VLPVAGVSGG ST