Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4889 |
Symbol | |
ID | 6412575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5256234 |
End bp | 5258024 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642714766 |
Product | putative periplasmic binding ABC transporter protein, putative sugar binding precursor |
Protein accession | YP_001993853 |
Protein GI | 192293248 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGGAC GAAAGCCCCA CCTCAGCCGT TTGCGGCTGA TGACGATGGC GAGCGCCGCG GCGCTTGTCG CCGGGTCGAT GATGCTGGCG GCGCCGGCAT GGTCGGCCGA CGATGCTGTG CTGAAGAAGT GGATCGACGA GGAATTCCAA CCCTCGACGC TGTCCAAGGA AGAACAGCTC AAGGAATTGC AGTGGTTCGC CAAGGCGGCC GAGCCGTTCA AGGGCATGGA CATCAACGTC GTCTCCGAGA CGATCACCAC GCACGAATAC GAAGCCAAGA CGCTCGCCAA GGCGTTCTCG GAAATCACCG GCATCAAGCT GAAGCACGAT TTGATCCAGG AGGGCGACGT CGTCGAGAAG CTGCAGACCC AGATGCAGTC CGGCAAGAAC GTCTATGACG GCTGGATCAA CGATAGCGAC CTGATCGGTA CCCATTTCCG TTACGGCCAG ACCATCGCGC TGTCGGACTA CATGACCGGC GAGGGCAAGG ACGTCACCGA TCCGATGCTC GACATCAACG ACTTCATCGG CAAGTCGTTC ACCACCGCGC CCGACAAGAA GATGTATCAG CTGCCGGACC AGCAGTTCGC CAATCTGTAC TGGTTCCGCT ACGACTGGTT CACCAATCCG GACTACAAGG CCAAGTTCAA GGCGAAGTAC GGCTACGACC TCGGCGTCCC GGTGAACTGG TCGGCCTATG AGGACATCGC CGAGTTCTTC ACCAACGACA TCAAGGAAAT CAACGGCGTC AAAGTCTATG GCCACATGGA CTACGGCAAG AAGGATCCGT CGCTCGGCTG GCGCTTCACC GACGCCTGGC TGTCGATGGC CGGCAACGGC GACAAGGGCC TGCCGAACGG TCTGCCGGTC GACGAATGGG GCATCCGCAT GGAAGGCTGC CGTCCGGTCG GCTCGTCGAT CGAGCGCGGC GGCGACACCA ACGGTCCGGC CGCGGTGTAC TCGATCGTCA AATATCTCGA CTGGATGAAG AAGTACGCCC CGCCGCAGGC CCAGGGCATG ACGTTCTCGG AGTCGGGGCC GGTGCCGGCG CAGGGCAACG TCGCCCAGCA GATGTTCTGG TACACCGCCT TCACCGCCGA CATGGTGAAG CCGGGCCTGC CGGTGATGAA CGCCGACGGC ACGCCGAAGT GGCGGATGGC GCCGTCGCCG CACGGCGCGT ACTGGAAAGA AGGCATGAAG CTCGGCTACC AGGACGTCGG CTCGGGCACG CTGCTGAAGT CGACCCCGCC GGATCGCCGC AAGGCCGCCT GGCTGTATCT GCAGTTCATC ACCTCCAAGA CGGTGTCGCT GAAGAAGAGC CATGTCGGTC TCACCTTCAT CCGTGAGAGC GATATCTGGG ACAAGTCGTT CACCGAACGT GCGCCGAAGC TCGGCGGCCT GATCGAGTTC TATCGCTCGC CGGCCCGCGT GCAGTGGTCG CCCACCGGCA ACAACATCCC GGACTATCCG AAGCTGGCGC AGCTGTGGTG GCAGAACATC GGCGACGCGT CGTCCGGTGC GAAGACTCCG CAGGCCGCGA TGGACTCGCT GGCCGCGGCG CAGGACTCGG TGCTGGAGCG CCTCGAAAAG TCGAAGGTGC AGGGCGATTG CGGTCCGAAG CTGAACAAGA AGGAGACCGC CGAGTACTGG TACGCCAAGG CCGAGAAGGA CGGCAACATC GCGCCGCAGC GCAAGCTGGC GAACGAGAAG CCGAAGGGTG AAACCGTCGA CTACGACACC CTGATCAAGT CCTGGCCGGC GACCCCGCCG AAGCGCGCCG AAGCGAAGTA A
|
Protein sequence | MIGRKPHLSR LRLMTMASAA ALVAGSMMLA APAWSADDAV LKKWIDEEFQ PSTLSKEEQL KELQWFAKAA EPFKGMDINV VSETITTHEY EAKTLAKAFS EITGIKLKHD LIQEGDVVEK LQTQMQSGKN VYDGWINDSD LIGTHFRYGQ TIALSDYMTG EGKDVTDPML DINDFIGKSF TTAPDKKMYQ LPDQQFANLY WFRYDWFTNP DYKAKFKAKY GYDLGVPVNW SAYEDIAEFF TNDIKEINGV KVYGHMDYGK KDPSLGWRFT DAWLSMAGNG DKGLPNGLPV DEWGIRMEGC RPVGSSIERG GDTNGPAAVY SIVKYLDWMK KYAPPQAQGM TFSESGPVPA QGNVAQQMFW YTAFTADMVK PGLPVMNADG TPKWRMAPSP HGAYWKEGMK LGYQDVGSGT LLKSTPPDRR KAAWLYLQFI TSKTVSLKKS HVGLTFIRES DIWDKSFTER APKLGGLIEF YRSPARVQWS PTGNNIPDYP KLAQLWWQNI GDASSGAKTP QAAMDSLAAA QDSVLERLEK SKVQGDCGPK LNKKETAEYW YAKAEKDGNI APQRKLANEK PKGETVDYDT LIKSWPATPP KRAEAK
|
| |