Gene Rpal_4889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4889 
Symbol 
ID6412575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5256234 
End bp5258024 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content63% 
IMG OID642714766 
Productputative periplasmic binding ABC transporter protein, putative sugar binding precursor 
Protein accessionYP_001993853 
Protein GI192293248 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGGAC GAAAGCCCCA CCTCAGCCGT TTGCGGCTGA TGACGATGGC GAGCGCCGCG 
GCGCTTGTCG CCGGGTCGAT GATGCTGGCG GCGCCGGCAT GGTCGGCCGA CGATGCTGTG
CTGAAGAAGT GGATCGACGA GGAATTCCAA CCCTCGACGC TGTCCAAGGA AGAACAGCTC
AAGGAATTGC AGTGGTTCGC CAAGGCGGCC GAGCCGTTCA AGGGCATGGA CATCAACGTC
GTCTCCGAGA CGATCACCAC GCACGAATAC GAAGCCAAGA CGCTCGCCAA GGCGTTCTCG
GAAATCACCG GCATCAAGCT GAAGCACGAT TTGATCCAGG AGGGCGACGT CGTCGAGAAG
CTGCAGACCC AGATGCAGTC CGGCAAGAAC GTCTATGACG GCTGGATCAA CGATAGCGAC
CTGATCGGTA CCCATTTCCG TTACGGCCAG ACCATCGCGC TGTCGGACTA CATGACCGGC
GAGGGCAAGG ACGTCACCGA TCCGATGCTC GACATCAACG ACTTCATCGG CAAGTCGTTC
ACCACCGCGC CCGACAAGAA GATGTATCAG CTGCCGGACC AGCAGTTCGC CAATCTGTAC
TGGTTCCGCT ACGACTGGTT CACCAATCCG GACTACAAGG CCAAGTTCAA GGCGAAGTAC
GGCTACGACC TCGGCGTCCC GGTGAACTGG TCGGCCTATG AGGACATCGC CGAGTTCTTC
ACCAACGACA TCAAGGAAAT CAACGGCGTC AAAGTCTATG GCCACATGGA CTACGGCAAG
AAGGATCCGT CGCTCGGCTG GCGCTTCACC GACGCCTGGC TGTCGATGGC CGGCAACGGC
GACAAGGGCC TGCCGAACGG TCTGCCGGTC GACGAATGGG GCATCCGCAT GGAAGGCTGC
CGTCCGGTCG GCTCGTCGAT CGAGCGCGGC GGCGACACCA ACGGTCCGGC CGCGGTGTAC
TCGATCGTCA AATATCTCGA CTGGATGAAG AAGTACGCCC CGCCGCAGGC CCAGGGCATG
ACGTTCTCGG AGTCGGGGCC GGTGCCGGCG CAGGGCAACG TCGCCCAGCA GATGTTCTGG
TACACCGCCT TCACCGCCGA CATGGTGAAG CCGGGCCTGC CGGTGATGAA CGCCGACGGC
ACGCCGAAGT GGCGGATGGC GCCGTCGCCG CACGGCGCGT ACTGGAAAGA AGGCATGAAG
CTCGGCTACC AGGACGTCGG CTCGGGCACG CTGCTGAAGT CGACCCCGCC GGATCGCCGC
AAGGCCGCCT GGCTGTATCT GCAGTTCATC ACCTCCAAGA CGGTGTCGCT GAAGAAGAGC
CATGTCGGTC TCACCTTCAT CCGTGAGAGC GATATCTGGG ACAAGTCGTT CACCGAACGT
GCGCCGAAGC TCGGCGGCCT GATCGAGTTC TATCGCTCGC CGGCCCGCGT GCAGTGGTCG
CCCACCGGCA ACAACATCCC GGACTATCCG AAGCTGGCGC AGCTGTGGTG GCAGAACATC
GGCGACGCGT CGTCCGGTGC GAAGACTCCG CAGGCCGCGA TGGACTCGCT GGCCGCGGCG
CAGGACTCGG TGCTGGAGCG CCTCGAAAAG TCGAAGGTGC AGGGCGATTG CGGTCCGAAG
CTGAACAAGA AGGAGACCGC CGAGTACTGG TACGCCAAGG CCGAGAAGGA CGGCAACATC
GCGCCGCAGC GCAAGCTGGC GAACGAGAAG CCGAAGGGTG AAACCGTCGA CTACGACACC
CTGATCAAGT CCTGGCCGGC GACCCCGCCG AAGCGCGCCG AAGCGAAGTA A
 
Protein sequence
MIGRKPHLSR LRLMTMASAA ALVAGSMMLA APAWSADDAV LKKWIDEEFQ PSTLSKEEQL 
KELQWFAKAA EPFKGMDINV VSETITTHEY EAKTLAKAFS EITGIKLKHD LIQEGDVVEK
LQTQMQSGKN VYDGWINDSD LIGTHFRYGQ TIALSDYMTG EGKDVTDPML DINDFIGKSF
TTAPDKKMYQ LPDQQFANLY WFRYDWFTNP DYKAKFKAKY GYDLGVPVNW SAYEDIAEFF
TNDIKEINGV KVYGHMDYGK KDPSLGWRFT DAWLSMAGNG DKGLPNGLPV DEWGIRMEGC
RPVGSSIERG GDTNGPAAVY SIVKYLDWMK KYAPPQAQGM TFSESGPVPA QGNVAQQMFW
YTAFTADMVK PGLPVMNADG TPKWRMAPSP HGAYWKEGMK LGYQDVGSGT LLKSTPPDRR
KAAWLYLQFI TSKTVSLKKS HVGLTFIRES DIWDKSFTER APKLGGLIEF YRSPARVQWS
PTGNNIPDYP KLAQLWWQNI GDASSGAKTP QAAMDSLAAA QDSVLERLEK SKVQGDCGPK
LNKKETAEYW YAKAEKDGNI APQRKLANEK PKGETVDYDT LIKSWPATPP KRAEAK