Gene Rxyl_0062 details


Gene Information

Locus tag: Rxyl_0062
Symbol:
ID: 4117906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 64030
End bp: 65340
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 69%
IMG OID: 638034856
Product: extracellular solute-binding protein
Protein accession: YP_642855
Protein GI: 108802918
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
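
The coordinate and length fields above are consistent with each other, which a short sketch can confirm. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(64030, 65340)
print(length, protein_length(length))  # 1311 436
```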


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGAGC GGAGGATCAG CCGTAGGGGG TTTCTCGGGC TCGGCGGCGG CGCGCTGGCC 
GGGGCCGTGC TGCTGGGGGG GTGTGGTGGC GGAGGAGGAG GCAGCGGGGA GGTGCTCTTC
TCCTGGGGGC CGGATGACAC GGGGGTTCTG CCGAGGCTCA TAGAGCGGTT CAACCGGGAG
AACGGCTCCG GGATCACCGT CCGCTACCGG GAGATGCCCT CCGACACCGG CCAGTACTTC
GACCAGCTCA GGACCCAGTT CCAGGCCGGC GGGGGGGACA TCGACGTCAT CGGGGGGGAT
GTGATCTGGC CGGCCCAGTT CGCGGCGAAC GGCTGGATCG TGGACCTCTC CGACCGGTTC
GAGGACGCCG GCCGGTTCCT CGAAGGGCCC ATGCAGGCGA TGACCTACGA GGGCAAGGTC
TACGGCGTCC CCTGGTACAC CGACGCCGGG CTGCTCTACT ACCGCAAAGA CCTCCTGGAG
AAGAGCGGGT ACTCGGAGCC GCCCAGAACC TGGGACGAGC TCAGGGAGAT GGCCCTGCGC
GTCAAGCAGG ACTCCGGGGT TCCCGCGGGC TTCGTCTTCC AGGGCGCCGA GTACGAGGGT
GGGGTGTGCG ACGGGCTCGA GTACATCTGG ACGCACGGGG GGGACGTGCT GGACCCGGAG
GACCCCACGA AGGTCCTCAT AGACAGCCCC GAGTCGGTGG CGGGGCTGAA GACCGAGCGG
AGCATGGTGG AGGAAGGGGT GGCGCCCGAG GCCGTCACCA CCTACAAGGA GGACGAGTCG
CACGGGGCCT TTCTCAGGGG CGACGCCGTC TTTCTGCGCA ACTGGCCCTA CGTCTACGCC
CTCGTGGGCG ACCCCGAGCA GTCCCGGATA GAGCCGGGCC AGGTTGGGAT CTCCGAGCTC
CCCGTGGGCG GCGAGGGGCA GCAGAGCTAC AGCTGCCTCG GGGGCTGGAA CTTCTTCATC
AACGCCTCCT CGGGGCGGCA GGAGGAGGCC TGGGAGTTCA TCCGGTGGAT GACGGAGCCC
GAGCAGCTCA AGGTCAACGC CCTGCAGGGC TCCCGGCTCC CGACGCGGCG CGGCCTCTAC
GAGGACCGGG AGGTGCTGGA GAAGGTCCCG GTCGCCAGGC TCGGCAAGGA GGCCATCATC
CAGAACTCCC GCCCGCGCCC GGTCTCGCCG TACTACTCGG ACATGTCGCT CAGAATGGCC
GAGCAGTTCA GCGCCTCCCT CAAGGGCGAG GTCTCCCCCG AGCAGGCCGT AAAGACCCTG
CAGGGCGAGC TGCAGCGCCT CATCGAGGAG GGCGAGGCGG CCACCGGCTA G
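
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the space-separated 10-base blocks used on this page; `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Small illustrative input, not the full gene: 6 of 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence to this function should give a value close to the reported 69%, provided the record rounds the percentage to a whole number.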
 
Protein sequence
MGERRISRRG FLGLGGGALA GAVLLGGCGG GGGGSGEVLF SWGPDDTGVL PRLIERFNRE 
NGSGITVRYR EMPSDTGQYF DQLRTQFQAG GGDIDVIGGD VIWPAQFAAN GWIVDLSDRF
EDAGRFLEGP MQAMTYEGKV YGVPWYTDAG LLYYRKDLLE KSGYSEPPRT WDELREMALR
VKQDSGVPAG FVFQGAEYEG GVCDGLEYIW THGGDVLDPE DPTKVLIDSP ESVAGLKTER
SMVEEGVAPE AVTTYKEDES HGAFLRGDAV FLRNWPYVYA LVGDPEQSRI EPGQVGISEL
PVGGEGQQSY SCLGGWNFFI NASSGRQEEA WEFIRWMTEP EQLKVNALQG SRLPTRRGLY
EDREVLEKVP VARLGKEAII QNSRPRPVSP YYSDMSLRMA EQFSASLKGE VSPEQAVKTL
QGELQRLIEE GEAATG
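
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the compact amino acid string used for the standard genetic code, whose codon assignments translation table 11 shares (the two tables differ in which codons may act as start codons). The `translate` function is an illustrative helper, it stops at the first stop codon, and it assumes the input begins in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated as TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGGCGAGC GGAGGATCAG CCGTAGGGGG"))  # MGERRISRRG
```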