Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_3005 |
Symbol | |
ID | 4115804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 3013191 |
End bp | 3014351 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 638037775 |
Product | extracellular solute-binding protein |
Protein accession | YP_645727 |
Protein GI | 108805790 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0509574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGGCA CTCTGCGGGT CTTGCTGGTG GGCGGGCCGA TGTACGACCC GCTGTACGGG AGGATCGGGG AGTTCGAGGA GCGGACCGGG GTGCGGGTGG AGCGGGTCCT CTCCCGCGAC CACCCCGACC TCAACGCCCG CATCGAGCGG GAGTTCGGCT CCGGGGAGGC GGACTACGAT CTCGTCTCCA CCCACACCAA GTACGCGCCG GGGCAGCGCC GGTGGCTCAC CCCGCTGGAC GGCGACCTGG CGCCGGAGGA GCTTGCGCCG TTCGCGGAGA GGACGCTGGA GCTCGCCCGG ATAGGCGGCG AGCTCTACGG CCTCCCCCGC AACCTCGACG TCAAGCTCCT CCACTACCGG ACCGACCTGG TAGGGTCCCC GCCCGCGACC TGGGAGGAGC TGCTCGAGGT CGCTGCGCGG CTCAGGTCCG GGGGGCTCTA CGGCTTCGTC TTCCCCGGCA AGGAGAGCGG GCTCTTCGGC CACTTCTTCG AGCTGCACGC CATGTACGGC GGCAGGATGT TCCGGGGGGA GGGGCCGCCC GCGCCGCGCA TAAACGACGA GGCCGGGCGG CGGGCGCTCG GGCTCCTCGT CGAGCTCTAT CGGCGGGCCG CCCCGGAGGA GACCCCGGAC TGGCACTACG ACGAGGTGGC CGCCTGCTTC AGGGAGGGGC GGGCCGCCAT GAGCACCGAC TGGCCCGGAG GGTTCCACCT CTACGAGGGG GAGGGCAGCA GGGTCAGGGG CCGCTACGGG CTCGCCCTCT ACCCGGAGGG CCCGGCCGGG AGGTTCGTCT ACGCCGGCTG CCACTCCTTC GCGATTCCCC GCACCGTGCG GGACAGGGGA GCGGCGGTGG AGCTGCTGCG CTTTCTCGCC TCCCGAGAGT CGCAGGCCCA CGAGGCCCGC TTCGGAACCC TGCCGGCGCG GGAGGACGCC CTGGCGGAGG CGCGGGCGCA GGCCGGGCCC GGCTCGCTCG CCGCCCGGAG GTGGGAGCTG CTGGAGGCGG CGCGCGAGGC CGCCATCATC CCGCCCAAGC ACGAGAACTA CCCGGCGGTG GAGGAGGCCA TCTGGCGCGG CGTCCGGGAG GCGCTGCTGG GCCGAAGTGG CGTCGAGGAG GCGCTGGCCC GCACCGAGGA GGCCGCCCGG CGGGCGGCGG AGGGTTCGTG A
|
Protein sequence | MGGTLRVLLV GGPMYDPLYG RIGEFEERTG VRVERVLSRD HPDLNARIER EFGSGEADYD LVSTHTKYAP GQRRWLTPLD GDLAPEELAP FAERTLELAR IGGELYGLPR NLDVKLLHYR TDLVGSPPAT WEELLEVAAR LRSGGLYGFV FPGKESGLFG HFFELHAMYG GRMFRGEGPP APRINDEAGR RALGLLVELY RRAAPEETPD WHYDEVAACF REGRAAMSTD WPGGFHLYEG EGSRVRGRYG LALYPEGPAG RFVYAGCHSF AIPRTVRDRG AAVELLRFLA SRESQAHEAR FGTLPAREDA LAEARAQAGP GSLAARRWEL LEAAREAAII PPKHENYPAV EEAIWRGVRE ALLGRSGVEE ALARTEEAAR RAAEGS
|
| |