Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_0062 |
Symbol | |
ID | 4117906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 64030 |
End bp | 65340 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638034856 |
Product | extracellular solute-binding protein |
Protein accession | YP_642855 |
Protein GI | 108802918 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAGC GGAGGATCAG CCGTAGGGGG TTTCTCGGGC TCGGCGGCGG CGCGCTGGCC GGGGCCGTGC TGCTGGGGGG GTGTGGTGGC GGAGGAGGAG GCAGCGGGGA GGTGCTCTTC TCCTGGGGGC CGGATGACAC GGGGGTTCTG CCGAGGCTCA TAGAGCGGTT CAACCGGGAG AACGGCTCCG GGATCACCGT CCGCTACCGG GAGATGCCCT CCGACACCGG CCAGTACTTC GACCAGCTCA GGACCCAGTT CCAGGCCGGC GGGGGGGACA TCGACGTCAT CGGGGGGGAT GTGATCTGGC CGGCCCAGTT CGCGGCGAAC GGCTGGATCG TGGACCTCTC CGACCGGTTC GAGGACGCCG GCCGGTTCCT CGAAGGGCCC ATGCAGGCGA TGACCTACGA GGGCAAGGTC TACGGCGTCC CCTGGTACAC CGACGCCGGG CTGCTCTACT ACCGCAAAGA CCTCCTGGAG AAGAGCGGGT ACTCGGAGCC GCCCAGAACC TGGGACGAGC TCAGGGAGAT GGCCCTGCGC GTCAAGCAGG ACTCCGGGGT TCCCGCGGGC TTCGTCTTCC AGGGCGCCGA GTACGAGGGT GGGGTGTGCG ACGGGCTCGA GTACATCTGG ACGCACGGGG GGGACGTGCT GGACCCGGAG GACCCCACGA AGGTCCTCAT AGACAGCCCC GAGTCGGTGG CGGGGCTGAA GACCGAGCGG AGCATGGTGG AGGAAGGGGT GGCGCCCGAG GCCGTCACCA CCTACAAGGA GGACGAGTCG CACGGGGCCT TTCTCAGGGG CGACGCCGTC TTTCTGCGCA ACTGGCCCTA CGTCTACGCC CTCGTGGGCG ACCCCGAGCA GTCCCGGATA GAGCCGGGCC AGGTTGGGAT CTCCGAGCTC CCCGTGGGCG GCGAGGGGCA GCAGAGCTAC AGCTGCCTCG GGGGCTGGAA CTTCTTCATC AACGCCTCCT CGGGGCGGCA GGAGGAGGCC TGGGAGTTCA TCCGGTGGAT GACGGAGCCC GAGCAGCTCA AGGTCAACGC CCTGCAGGGC TCCCGGCTCC CGACGCGGCG CGGCCTCTAC GAGGACCGGG AGGTGCTGGA GAAGGTCCCG GTCGCCAGGC TCGGCAAGGA GGCCATCATC CAGAACTCCC GCCCGCGCCC GGTCTCGCCG TACTACTCGG ACATGTCGCT CAGAATGGCC GAGCAGTTCA GCGCCTCCCT CAAGGGCGAG GTCTCCCCCG AGCAGGCCGT AAAGACCCTG CAGGGCGAGC TGCAGCGCCT CATCGAGGAG GGCGAGGCGG CCACCGGCTA G
|
Protein sequence | MGERRISRRG FLGLGGGALA GAVLLGGCGG GGGGSGEVLF SWGPDDTGVL PRLIERFNRE NGSGITVRYR EMPSDTGQYF DQLRTQFQAG GGDIDVIGGD VIWPAQFAAN GWIVDLSDRF EDAGRFLEGP MQAMTYEGKV YGVPWYTDAG LLYYRKDLLE KSGYSEPPRT WDELREMALR VKQDSGVPAG FVFQGAEYEG GVCDGLEYIW THGGDVLDPE DPTKVLIDSP ESVAGLKTER SMVEEGVAPE AVTTYKEDES HGAFLRGDAV FLRNWPYVYA LVGDPEQSRI EPGQVGISEL PVGGEGQQSY SCLGGWNFFI NASSGRQEEA WEFIRWMTEP EQLKVNALQG SRLPTRRGLY EDREVLEKVP VARLGKEAII QNSRPRPVSP YYSDMSLRMA EQFSASLKGE VSPEQAVKTL QGELQRLIEE GEAATG
|
| |