Gene Rxyl_3005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_3005 
Symbol 
ID4115804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp3013191 
End bp3014351 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content74% 
IMG OID638037775 
Productextracellular solute-binding protein 
Protein accessionYP_645727 
Protein GI108805790 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0509574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGGCA CTCTGCGGGT CTTGCTGGTG GGCGGGCCGA TGTACGACCC GCTGTACGGG 
AGGATCGGGG AGTTCGAGGA GCGGACCGGG GTGCGGGTGG AGCGGGTCCT CTCCCGCGAC
CACCCCGACC TCAACGCCCG CATCGAGCGG GAGTTCGGCT CCGGGGAGGC GGACTACGAT
CTCGTCTCCA CCCACACCAA GTACGCGCCG GGGCAGCGCC GGTGGCTCAC CCCGCTGGAC
GGCGACCTGG CGCCGGAGGA GCTTGCGCCG TTCGCGGAGA GGACGCTGGA GCTCGCCCGG
ATAGGCGGCG AGCTCTACGG CCTCCCCCGC AACCTCGACG TCAAGCTCCT CCACTACCGG
ACCGACCTGG TAGGGTCCCC GCCCGCGACC TGGGAGGAGC TGCTCGAGGT CGCTGCGCGG
CTCAGGTCCG GGGGGCTCTA CGGCTTCGTC TTCCCCGGCA AGGAGAGCGG GCTCTTCGGC
CACTTCTTCG AGCTGCACGC CATGTACGGC GGCAGGATGT TCCGGGGGGA GGGGCCGCCC
GCGCCGCGCA TAAACGACGA GGCCGGGCGG CGGGCGCTCG GGCTCCTCGT CGAGCTCTAT
CGGCGGGCCG CCCCGGAGGA GACCCCGGAC TGGCACTACG ACGAGGTGGC CGCCTGCTTC
AGGGAGGGGC GGGCCGCCAT GAGCACCGAC TGGCCCGGAG GGTTCCACCT CTACGAGGGG
GAGGGCAGCA GGGTCAGGGG CCGCTACGGG CTCGCCCTCT ACCCGGAGGG CCCGGCCGGG
AGGTTCGTCT ACGCCGGCTG CCACTCCTTC GCGATTCCCC GCACCGTGCG GGACAGGGGA
GCGGCGGTGG AGCTGCTGCG CTTTCTCGCC TCCCGAGAGT CGCAGGCCCA CGAGGCCCGC
TTCGGAACCC TGCCGGCGCG GGAGGACGCC CTGGCGGAGG CGCGGGCGCA GGCCGGGCCC
GGCTCGCTCG CCGCCCGGAG GTGGGAGCTG CTGGAGGCGG CGCGCGAGGC CGCCATCATC
CCGCCCAAGC ACGAGAACTA CCCGGCGGTG GAGGAGGCCA TCTGGCGCGG CGTCCGGGAG
GCGCTGCTGG GCCGAAGTGG CGTCGAGGAG GCGCTGGCCC GCACCGAGGA GGCCGCCCGG
CGGGCGGCGG AGGGTTCGTG A
 
Protein sequence
MGGTLRVLLV GGPMYDPLYG RIGEFEERTG VRVERVLSRD HPDLNARIER EFGSGEADYD 
LVSTHTKYAP GQRRWLTPLD GDLAPEELAP FAERTLELAR IGGELYGLPR NLDVKLLHYR
TDLVGSPPAT WEELLEVAAR LRSGGLYGFV FPGKESGLFG HFFELHAMYG GRMFRGEGPP
APRINDEAGR RALGLLVELY RRAAPEETPD WHYDEVAACF REGRAAMSTD WPGGFHLYEG
EGSRVRGRYG LALYPEGPAG RFVYAGCHSF AIPRTVRDRG AAVELLRFLA SRESQAHEAR
FGTLPAREDA LAEARAQAGP GSLAARRWEL LEAAREAAII PPKHENYPAV EEAIWRGVRE
ALLGRSGVEE ALARTEEAAR RAAEGS