Gene Rxyl_1907 details

Gene Information

Locus tag: Rxyl_1907
Symbol:
ID: 4115582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 1930754
End bp: 1932118
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 56%
IMG OID: 638036692
Product: general substrate transporter
Protein accession: YP_644666
Protein GI: 108804729
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
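
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1930754, 1932118)
    print(length, protein_length(length))  # 1365 454
```

With the Start bp and End bp listed for Rxyl_1907, this reproduces the reported Gene Length (1365 bp) and Protein Length (454 aa).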


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0522136
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAATCAC GAGAACAAGT AGTGCCGCAG GTAGATCCAG CGCTTATACG CAGGAGCATT 
ATCGGCGGCG CAGTAGGCGT TTGGGTGCAC TGGTTCGACT GGGCGGTCTA CGCCTACTTG
GCGACTACAC TCGCAGCTGT TTTCTTCCCC AATGAAAACC CCACAGCTGG CCTGCTCTCC
GTCTTCGCGA TCTTTGCTGT GTCGTTTGTC GTGCGGCCCT TGGGGGGATT CTTCTTCGGA
CCTTTGGGTG ACAAGATCGG AAGGCGGACC ACTTTAGCTG TGGTTATCAT TACCATGGGG
GCCGCCACCA CTGCAGTTGG TCTCTTGCCG ACCTACTCCT CGGTGGGCAT CCTGGCACCG
ATCCTACTTG TCACTGTGCG GCTCGTGCAG GGATTTGCTG CGGGAGGCGA GTTTGGAGGC
GCTGCTGCGT TTCTCGCGGA GTATTCGCCG AGAAGGCACA GGGGATTCGG GGTTAGTTGG
CTAGAGAGCT CAAGCCTCCT TGGATTCCTG ACGGCTTCCC TGGCAGTGTT TCTGCTAAAC
TCCGCCCTTA CGGAGGAGGC TGTGACTGCC TGGGGCTGGC GCATCCCCTT CTTGATCGCG
GGCCCTATGG CCGTTGTCGG GCTTTACATA CGACTAAAAC TCGAAGACAC CCCCAATTTC
AGGGTCCTGG AACAAACTAA CGAGGTTTCC CAAGCTCCCC TCCGAGAGCT GTTAAGGCAG
GACTGGAAAC AACTTCTCCA GATGACAGGG ATCGAGATTC TGCAGCACGT CAGCTTCTAC
ATTGTCTTAG TTTATCTACT TACCTACCAA ACGCAAGAGT TGGGTCTCTC GTCTGGATCC
GCTGCCATGC TCTCCACGAT CACCTCAATA GTAGCAATGG TTCTCGTCCC ACTCTTTGGT
GCTCTCTCCG ACCGTGTCGG TCGAAAGCCC CTATTGATAG CGTCAGGCTT GGGGTTTTTG
CTACTTTCCT ACCCTGCCTT TCTTCTCATG AGAACAGGCG ACTTGGGGGC CATTATCCTA
GTGCAGACGG GGCTTGGCAT TCTGCTGGCG CTCATCCTAA GTACGCATGC TGTCGCCATG
AGCGAGATTT TCCCCACGCG GGTGCGTCAA GCAGGTCTCT CACTCGGCTA TCAGGTGACC
GCCGCGATTT TCGCAGGAAC CGTACCGTAC CTGATGACGT ACTTGATCTC TGCGACTGGG
AATCCTTATG TACCGGCCTT TTACCTAATG TTTGTGGGCT TGGTGGGTGT CGGCACCACT
CTCACGCTGA GAGAAACCGC AGGCCTTCCC TTACCACAGA GAGAGCCTGT TACACCAGTA
CAGGAAACTG CCGGAGGTGC AGTTTCGGGT TCAGAGTCCG AGTAG
 
Protein sequence
MESREQVVPQ VDPALIRRSI IGGAVGVWVH WFDWAVYAYL ATTLAAVFFP NENPTAGLLS 
VFAIFAVSFV VRPLGGFFFG PLGDKIGRRT TLAVVIITMG AATTAVGLLP TYSSVGILAP
ILLVTVRLVQ GFAAGGEFGG AAAFLAEYSP RRHRGFGVSW LESSSLLGFL TASLAVFLLN
SALTEEAVTA WGWRIPFLIA GPMAVVGLYI RLKLEDTPNF RVLEQTNEVS QAPLRELLRQ
DWKQLLQMTG IEILQHVSFY IVLVYLLTYQ TQELGLSSGS AAMLSTITSI VAMVLVPLFG
ALSDRVGRKP LLIASGLGFL LLSYPAFLLM RTGDLGAIIL VQTGLGILLA LILSTHAVAM
SEIFPTRVRQ AGLSLGYQVT AAIFAGTVPY LMTYLISATG NPYVPAFYLM FVGLVGVGTT
LTLRETAGLP LPQREPVTPV QETAGGAVSG SESE
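
The gene and protein sequences above are related through translation table 11, and the relationship can be checked with a short script. The sketch below assumes the sequence is pasted in the 10-character block layout used on this page. It builds the codon table from the NCBI amino-acid string shared by tables 1 and 11, and reads the first codon as Met when it is one of the table 11 start codons. The helper names (`clean_sequence`, `gc_fraction`, `translate_cds`) are hypothetical, not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); tables 1 and 11 share these amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(text: str) -> str:
    """Translate a CDS up to the first stop codon; a start codon is read as M."""
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("TTGGAATCAC GAGAACAAGT AGTGCCGCAG"))  # MESREQVVPQ
```

The first three blocks of the gene sequence begin with the alternative start codon TTG and translate to MESREQVVPQ, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.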