Gene Rxyl_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_1991 
Symbol 
ID4117579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp2010769 
End bp2012136 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID638036778 
Productextracellular solute-binding protein 
Protein accessionYP_644750 
Protein GI108804813 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.603432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGGAGA GGAGAAGGGC GGAGAAGACC GGCGGGGTTC TCCGCCGCAG ACTAGGCCGG 
AGGGAGTTCC TCAGGCTCGG CGGGGCCTCG CTCGCGGGCG CGGCGCTGCT CGGGAGCGCG
GCCTGCGGCG GGGACGGGGG AGGCCCGCAG CGGGCGGAGG ACGGCAGCAT CCTATTCAAC
TTCTCCTTCG GCCCCGACCC CTCGGGGACC CTGCAGGAGC TGGTCAGGAG GTTCAACGAG
CGGTACAAGG GCGAGTACAA GGCCAACTGG CGGGAGATGC CGGCCCAGAC CGAGCAGTAC
TTCGACCGCC TCAGAACCCA GTTCCAGGCC GGCGGGGGGG ACATAGCCCT CATCGGGGGG
GATGTGATCT GGCCGGCCCA GTTCGCGGCG AACGGCTGGA TCGTGGACCT CTCCGACCGC
TTCCCCGAGT CCGAGAGAGA GAAGTTCCTC GACGGCCCCA TCCAGGCCAA CACCTACGAG
GGCAAGGTCT ACGGGGTCCC CTGGTTCACC GACGCGGGCA TGCTCTACTA CCGCAAAGAC
CTCCTGCAGA AGAGCGGCTT CTCCGAAGCC CCAAAGACCT GGGACGAGCT CAAGGAGATG
GCACTGCGTG TCAAGCAGGA CTCCGGGACC AGGGACGGCT TCGTCTTCCA GGGCGCCGAC
TACGAGGGGG GCGTCGTCGA CGGTCTCGAG TACATCTGGA CGCACGGGGG GGACGTGCTG
GACCCGGAGG ACCCCACGAA GGTCATCATA GACAGCCCCG AGTCGGTGGC GGGGCTGAAG
ACCGAGCGGA GCATGGTGGA GGAAGGGGTG GCGCCAGAGG CGGTGGTCAA CTACGCCGAG
ATGGAGTCGC ACACCGCCTT TCTGAACGGG GATGCCGTCT TCATGCGCAA CTGGCCCTAC
GTCTACGCCC TCTCCAGCGA CCCCAAGCAG TCCAAGATAA AGCCCGAGCA GATAGACATA
GCCCGGCTTC CCGCCGCCGA GGGGCAGGAG AGCGTGAGCG GGCTCGGGGG CTGGAACTTC
TACATCAACG CCGCCATGGA CGAGGAGACC CAGAACGCGG CCTGGGAGTT CATCCAGTTC
GCCACCGCCC CCGAGCAGCA GAAGTTCCGG GCGCTCGAGG GCTCCTTCCT CCCCACGCTG
AAGGAGCTCT ACGAGGACCA GGAGATCCTG GACAAGGTGC CGGTCATAGC GCTCGGCAAG
GAGGCCATCC TCAGCACCAG GCCGCGCCCG GTCTCGCCGT ACTACTCGGA CATGTCGCTC
AGGATGGCCG AGCAGTTCAA CGCCTCCCTC AAGGGCGAGG TCTCCCCCGA GCAGGCCATA
AAGACCCTGC AGGAGGAGCT GCAGAACATC GTGGAGCAGG GAAGCTAG
 
Protein sequence
MEERRRAEKT GGVLRRRLGR REFLRLGGAS LAGAALLGSA ACGGDGGGPQ RAEDGSILFN 
FSFGPDPSGT LQELVRRFNE RYKGEYKANW REMPAQTEQY FDRLRTQFQA GGGDIALIGG
DVIWPAQFAA NGWIVDLSDR FPESEREKFL DGPIQANTYE GKVYGVPWFT DAGMLYYRKD
LLQKSGFSEA PKTWDELKEM ALRVKQDSGT RDGFVFQGAD YEGGVVDGLE YIWTHGGDVL
DPEDPTKVII DSPESVAGLK TERSMVEEGV APEAVVNYAE MESHTAFLNG DAVFMRNWPY
VYALSSDPKQ SKIKPEQIDI ARLPAAEGQE SVSGLGGWNF YINAAMDEET QNAAWEFIQF
ATAPEQQKFR ALEGSFLPTL KELYEDQEIL DKVPVIALGK EAILSTRPRP VSPYYSDMSL
RMAEQFNASL KGEVSPEQAI KTLQEELQNI VEQGS