Gene Rxyl_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0234 
Symbol 
ID4117725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp239690 
End bp241042 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content51% 
IMG OID638035025 
Productmajor facilitator transporter 
Protein accessionYP_643024 
Protein GI108803087 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGATC AACGAACAGT TGCTATAGCC TCTACAGAGC CTCGGGAAAT CACAGCTAGA 
CGAGCGGTTA GAGCAGCCTG CTTTGGCTTC TTTGTCGATA TGTTCGACGT ATACCTGCCC
ATTGCAGCGC TCGCTCCCGC CATCGTCTAT TTCGTTCCCC CGGGACTTTC CCCAGAGGTA
GAAAGCACCA TCTTTTTCCT CATTTTTGCT GTTACTCTTA TAGGCCGTCC TGTAGGCGCG
ATAATTTTTG GCCACTTCGG TGACACCATC GGCCGCCGAC GTACCACCCT GATCTCGGTT
GCAGGATTCA CCATAATCAC CCTATTAATC GCTTTATTGC CAGGTTACGC CACTTGGGGC
ATGGGTGCCG TAGCGCTCCT GATCTTGTTG AGATTTCTTG ATGGACTGTG TATAGGAGGG
GAATATACCG CAGCCAACCC ACTTGCCATG GAATACTCTC CAAAGGAGCG CCGGGGTCTA
TATGGCTCGC TCATACATGT CGGATATCCT GCTGCGCTAC TCGCAATAAG TGCTCTTACA
GCGCTGCTGA CTTCGATCAT GCCCTCAGGA AGTCCGAGCT CTGCCTACTC AGTTTGGGGG
TGGAGAATAC CCTTCTTCAT AGGTGTCATA CTCTCCGCCG CGCTCTTTGT CTATTATCTG
CGCAAGGTAC CAGAATCAGA TCTGTGGCAG TCAGCACCCA AAACCGGCGC ACCGCTAAAG
GAGTTGCTCA AAGGCTCCAA CTTGCGCCGG ATGGCCCAAC TTTTTGTTGT GATGTCGGGC
GCCTGGCTCA CGTTGAACGC TACTGTAGGG GCATTACCAG GCGTTGGCAA TGTACTAGAA
GCTAACTCCG GAGCCGTTAA TACCGGGACT CTTATCGCTG CAGCTATAGC TGTATTTCTG
TACCCTATGC TTGGTCTGCT CAGCCAAAGG TGGGGAAGGC GCCCAACGAT CACCCTTATC
GGTGTACTTA ACGCGTTTCC TGCAGCGATT CTTTACATAA TGTTAGTGAG TTTTGGCACT
CTATCGAACA CCATATTCGT TGTCCTAGTG GCCGCGGTAT CTCTCCTTGG ATTACTTATC
TGGGCTGTAC ATACACCCTA CCTTGTGGAG AGCTTCAAAA CCAGCGTGCG TTCGGCCGGA
TATGGCATTG CTTACAGCTT GGCTACCATT ATCCCTGGCT TCTATTCTTT TTATCTTCTT
GGGCTTAGCA AGCTTATGCC CTATGCTTAC ACGCCCATAG TGCTGCTCGT CTTAGGCGGT
TTATTTCTCA GCGTAGGGGC ATTGCTTGGA CCAGAGACCA AAGATGTTGA GTTCTCACCT
ATAGAGGGAG ACGAAACCCG CTCCCAGACA TGA
 
Protein sequence
MADQRTVAIA STEPREITAR RAVRAACFGF FVDMFDVYLP IAALAPAIVY FVPPGLSPEV 
ESTIFFLIFA VTLIGRPVGA IIFGHFGDTI GRRRTTLISV AGFTIITLLI ALLPGYATWG
MGAVALLILL RFLDGLCIGG EYTAANPLAM EYSPKERRGL YGSLIHVGYP AALLAISALT
ALLTSIMPSG SPSSAYSVWG WRIPFFIGVI LSAALFVYYL RKVPESDLWQ SAPKTGAPLK
ELLKGSNLRR MAQLFVVMSG AWLTLNATVG ALPGVGNVLE ANSGAVNTGT LIAAAIAVFL
YPMLGLLSQR WGRRPTITLI GVLNAFPAAI LYIMLVSFGT LSNTIFVVLV AAVSLLGLLI
WAVHTPYLVE SFKTSVRSAG YGIAYSLATI IPGFYSFYLL GLSKLMPYAY TPIVLLVLGG
LFLSVGALLG PETKDVEFSP IEGDETRSQT