Gene Information |
Locus tag | RPD_0685 |
Symbol | |
ID | 4021156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 768592 |
End bp | 770280 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637960873 |
Product | protein of unknown function DUF894, DitE |
Protein accession | YP_567824 |
Protein GI | 91975165 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
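
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are made up here, and it assumes that the start and end coordinates are inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 768592
end_bp = 770280
gene_length_bp = 1689
protein_length_aa = 562

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one terminal stop codon that does not code for a residue.
codons = span // 3
coded_residues = codons - 1

print(span, span % 3, coded_residues)  # 1689 0 562
```

Under those assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.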
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.667795 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGGG GAGCGAAGCG AGGCGTGTTC TCCGCCGGCG GCGTCGCGGC GCCGCTGCGC TACGCGCTGT TCCGGCGGAT CTGGCTGGCG AGCCTTCTGT CCAATCTCGG CCTGATGATC AACGGCGTCG GCGCCGCCTG GGCGATGACG CAGATGGCGT CGTCCGCCGA CAAGGTCGCG CTGGTGCAGA CCGCCTTGAT GCTGCCGATC ATGCTGGTGG CGATGCCGGC CGGCGCGATC GCCGATATGT ATGATCGCCG AATCGTGGCG CTGGCGTCGC TGACGCTCGG CCTCGGCGGC TCGACGGTGC TGGCGGTGCT GGCGCATCTC GGCCTGGTGA CTCCGGAGAT CCTGCTCGCC TTCTGCTTCG TCATCGGCAC CGGCATGGCG CTGTTCGGCC CCGCCTGGCA GGCCTCGGTC AGCGAACAGG TGCCGGGCGA GGCGCTGCCG GCGGCGGTGG CGCTGAACGG CATCAGCTAC AACATCGCCC GCAGCTTCGG CCCCGCGGTC GGCGGCATCG TGGTGGCGAC GGCCGGCGCG GTTGCGGCGT TCGCGGCCAA TGCGGCGCTG TATCTCCCGC TGCTGATGGT GCTGTTCCTG TGGCGGCGCG TCAGCGAGCC GCCTCGGTTG CCGCCGGAAC GGATGAATCG CGCGATCGTC TCCGGCGTGC GCTACATCGC CAACTCACCC TCGATCCGGA TCGTGCTGAC GCGGACGCTG GTCACCGGGA TCGCCGGCAG CTCGGTGCTG GCGCTGATGC CGCTGGTGGC GCGGGACCTG CTGAAGAGCG GCGCAGAGAC CTACGGCATT CTGCTCGGCG CGTTCGGCGT CGGCGCGGTG ATCGGCGCGC TCAATGTCGG GCTGGCACGC GAACGGCTGA GCAGCGAAGC CGCGGTGCGC TCCTGCGCGA TCATCATGGG CCTGGCGATG GCGGCGGTCG CGCTGAGCCG CTCGTCGCTG ATCAGCGCCG CGGCGCTGGT GGTCGCGGGC GCGGTGTGGA TGCTGGCGAT CGCGCTGTTC AATATCGGCG TGCAGCTATC CGCGCCGCGC TGGGTGGCGG GCCGCTCGCT GGCGGCGTTT CAGGCGTCGA TCGCCGGCGG CATCGCAATC GGAAGCTGGG TCTGGGGCCA TGTCGCCGAT CTGGCGGGCG TTGCGCCGTC GATGCTGATC TCGGCCGGGG TGATGTTGGT CTCGCCGCTG GTCGGACTGC TTCTGCGAAT GCCGTCGGTC GGCACCCAGA CCGAGGATGC CGAACTCCTC GCCGACCCGG AAGTGCGGCT GGCCTTGACG CCGCGCAGCG GACCGGTGGT GATCGAGATC GATTACCGCG TCGATCAGGA CGACGCCCGC GCGTTTCACG GCGTGATGCA GCAGGTCCAG CTCAGCCGCC AGCGTAACGG CGCCTATGGC TGGTCGATCG CCCGCGACAT CGCCGACCCG GAACTGTGGA CCGAGCGCTA TCACTGCCCG ACCTGGCTGG ACTATCTGCG ACAGCGAAGC CGTTCGACTC AGCACGATCG CGCCATGCAC CAGCGCGCGA TGGCGTTTCA CCGCGGGCCG GCCCCGATCC GGGTGCGCCG GATGCTGGAG CGGCCGTTCG GATCGGTGCG CTGGAAGGAC GAGTCGCCCG ATCGCCCCAC CGGGACCGAA GTGCTGCCGG TCGCAGGCGT CAGCGGCGGT TCGACCTGA
|
Protein sequence | MTGGAKRGVF SAGGVAAPLR YALFRRIWLA SLLSNLGLMI NGVGAAWAMT QMASSADKVA LVQTALMLPI MLVAMPAGAI ADMYDRRIVA LASLTLGLGG STVLAVLAHL GLVTPEILLA FCFVIGTGMA LFGPAWQASV SEQVPGEALP AAVALNGISY NIARSFGPAV GGIVVATAGA VAAFAANAAL YLPLLMVLFL WRRVSEPPRL PPERMNRAIV SGVRYIANSP SIRIVLTRTL VTGIAGSSVL ALMPLVARDL LKSGAETYGI LLGAFGVGAV IGALNVGLAR ERLSSEAAVR SCAIIMGLAM AAVALSRSSL ISAAALVVAG AVWMLAIALF NIGVQLSAPR WVAGRSLAAF QASIAGGIAI GSWVWGHVAD LAGVAPSMLI SAGVMLVSPL VGLLLRMPSV GTQTEDAELL ADPEVRLALT PRSGPVVIEI DYRVDQDDAR AFHGVMQQVQ LSRQRNGAYG WSIARDIADP ELWTERYHCP TWLDYLRQRS RSTQHDRAMH QRAMAFHRGP APIRVRRMLE RPFGSVRWKD ESPDRPTGTE VLPVAGVSGG ST
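
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the Python below strips the spacing, computes a GC fraction, and translates the first 30 nucleotides of the gene sequence. The function names are hypothetical, and the codon table is deliberately partial: it covers only the codons present in the excerpt, not the full translation table 11.

```python
# Partial codon table: only the codons that occur in the excerpt below.
CODONS = {
    "ATG": "M", "ACG": "T", "GGG": "G", "GGA": "G", "GCG": "A",
    "AAG": "K", "CGA": "R", "GGC": "G", "GTG": "V", "TTC": "F",
}

def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons using the partial table above."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))

# First three blocks of the gene sequence field.
excerpt = "ATGACGGGGG GAGCGAAGCG AGGCGTGTTC"

print(translate(excerpt))              # MTGGAKRGVF
print(round(gc_fraction(excerpt), 3))  # 0.667
```

The translated excerpt matches the first block of the protein sequence field. The GC fraction printed here describes only the 30-base excerpt, not the whole gene.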