Gene Information |
Locus tag | Rxyl_1991 |
Symbol | |
ID | 4117579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 2010769 |
End bp | 2012136 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638036778 |
Product | extracellular solute-binding protein |
Protein accession | YP_644750 |
Protein GI | 108804813 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
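
The coordinate and length fields above are consistent with each other, and the following minimal sketch shows the arithmetic. It assumes 1-based, inclusive coordinates and a protein length that excludes the terminal stop codon. Both conventions are assumptions that match the listed figures, not something the record states. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length = gene_length(2010769, 2012136)
print(length)                  # 1368
print(protein_length(length))  # 455
```

Because the gene is on the minus strand, the listed sequence would be the reverse complement of the replicon between these coordinates. The length arithmetic is the same on either strand.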
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.603432 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGAGGAGA GGAGAAGGGC GGAGAAGACC GGCGGGGTTC TCCGCCGCAG ACTAGGCCGG AGGGAGTTCC TCAGGCTCGG CGGGGCCTCG CTCGCGGGCG CGGCGCTGCT CGGGAGCGCG GCCTGCGGCG GGGACGGGGG AGGCCCGCAG CGGGCGGAGG ACGGCAGCAT CCTATTCAAC TTCTCCTTCG GCCCCGACCC CTCGGGGACC CTGCAGGAGC TGGTCAGGAG GTTCAACGAG CGGTACAAGG GCGAGTACAA GGCCAACTGG CGGGAGATGC CGGCCCAGAC CGAGCAGTAC TTCGACCGCC TCAGAACCCA GTTCCAGGCC GGCGGGGGGG ACATAGCCCT CATCGGGGGG GATGTGATCT GGCCGGCCCA GTTCGCGGCG AACGGCTGGA TCGTGGACCT CTCCGACCGC TTCCCCGAGT CCGAGAGAGA GAAGTTCCTC GACGGCCCCA TCCAGGCCAA CACCTACGAG GGCAAGGTCT ACGGGGTCCC CTGGTTCACC GACGCGGGCA TGCTCTACTA CCGCAAAGAC CTCCTGCAGA AGAGCGGCTT CTCCGAAGCC CCAAAGACCT GGGACGAGCT CAAGGAGATG GCACTGCGTG TCAAGCAGGA CTCCGGGACC AGGGACGGCT TCGTCTTCCA GGGCGCCGAC TACGAGGGGG GCGTCGTCGA CGGTCTCGAG TACATCTGGA CGCACGGGGG GGACGTGCTG GACCCGGAGG ACCCCACGAA GGTCATCATA GACAGCCCCG AGTCGGTGGC GGGGCTGAAG ACCGAGCGGA GCATGGTGGA GGAAGGGGTG GCGCCAGAGG CGGTGGTCAA CTACGCCGAG ATGGAGTCGC ACACCGCCTT TCTGAACGGG GATGCCGTCT TCATGCGCAA CTGGCCCTAC GTCTACGCCC TCTCCAGCGA CCCCAAGCAG TCCAAGATAA AGCCCGAGCA GATAGACATA GCCCGGCTTC CCGCCGCCGA GGGGCAGGAG AGCGTGAGCG GGCTCGGGGG CTGGAACTTC TACATCAACG CCGCCATGGA CGAGGAGACC CAGAACGCGG CCTGGGAGTT CATCCAGTTC GCCACCGCCC CCGAGCAGCA GAAGTTCCGG GCGCTCGAGG GCTCCTTCCT CCCCACGCTG AAGGAGCTCT ACGAGGACCA GGAGATCCTG GACAAGGTGC CGGTCATAGC GCTCGGCAAG GAGGCCATCC TCAGCACCAG GCCGCGCCCG GTCTCGCCGT ACTACTCGGA CATGTCGCTC AGGATGGCCG AGCAGTTCAA CGCCTCCCTC AAGGGCGAGG TCTCCCCCGA GCAGGCCATA AAGACCCTGC AGGAGGAGCT GCAGAACATC GTGGAGCAGG GAAGCTAG
Protein sequence | MEERRRAEKT GGVLRRRLGR REFLRLGGAS LAGAALLGSA ACGGDGGGPQ RAEDGSILFN FSFGPDPSGT LQELVRRFNE RYKGEYKANW REMPAQTEQY FDRLRTQFQA GGGDIALIGG DVIWPAQFAA NGWIVDLSDR FPESEREKFL DGPIQANTYE GKVYGVPWFT DAGMLYYRKD LLQKSGFSEA PKTWDELKEM ALRVKQDSGT RDGFVFQGAD YEGGVVDGLE YIWTHGGDVL DPEDPTKVII DSPESVAGLK TERSMVEEGV APEAVVNYAE MESHTAFLNG DAVFMRNWPY VYALSSDPKQ SKIKPEQIDI ARLPAAEGQE SVSGLGGWNF YINAAMDEET QNAAWEFIQF ATAPEQQKFR ALEGSFLPTL KELYEDQEIL DKVPVIALGK EAILSTRPRP VSPYYSDMSL RMAEQFNASL KGEVSPEQAI KTLQEELQNI VEQGS
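
The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a sequence might be cleaned and sanity-checked against translation table 11 follows. The helper names `clean`, `check_cds`, and `gc_fraction` are hypothetical. The start and stop codon sets are those of NCBI translation table 11. The toy input is built from the first three codons and the final codon of the gene sequence in this record, not the full sequence.

```python
# Start and stop codons for NCBI translation table 11 (bacterial, archaeal, plant plastid).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOPS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing (any whitespace) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def check_cds(seq: str) -> dict:
    """Basic structural checks on a coding sequence."""
    s = clean(seq)
    return {
        "length": len(s),
        "whole_codons": len(s) % 3 == 0,
        "start_ok": s[:3] in TABLE_11_STARTS,
        "stop_ok": s[-3:] in STOPS,
    }


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# Toy input: first three codons (GTG GAG GAG) plus the final stop codon (TAG).
toy = "GTGGAGGAG TAG"
print(check_cds(toy))
print(round(gc_fraction(toy), 2))
```

GTG is an alternative start codon in table 11 and is translated as methionine when it initiates translation, which is why the protein sequence begins with M even though the gene sequence begins with GTG.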