Gene Rxyl_0103 details

Gene Information

Locus tag: Rxyl_0103
Symbol:
ID: 4117770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rubrobacter xylanophilus DSM 9941
Kingdom: Bacteria
Replicon accession: NC_008148
Strand:
Start bp: 107722
End bp: 109641
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 75%
IMG OID: 638034895
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_642894
Protein GI: 108802957
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
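
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 107722
END_BP = 109641
GENE_LENGTH_BP = 1920
PROTEIN_LENGTH_AA = 639


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_len, include_stop=True):
    """Nucleotides needed to encode protein_len residues.

    Assumes an unspliced CDS with an optional single stop codon.
    """
    codons = protein_len + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)
```

Under these assumptions both checks agree with the reported 1920 bp: the span covers 1920 positions, and 639 residues plus one stop codon make 640 codons.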


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCCGA GCAGCTTGCA GGGCTACCAG CCGCTCGTCT TCTACGCGGC GGCGGCCGTC 
GCGGGCGCCC TGCTGGCCGG GCTTGCGGCC CTCCTCCTGT GGCGCCGCGG GCGCCCGTCT
CGAAGGAGGG ACGCCCTGCG CGCGGCCGCC TTCGAGCAGG CGGCGGAGGG GATGCTGCTC
GTCTCGGGGG ACCGGGTGCT GGAGGCGAAC GCCGCCTGCG CCGCCCTGCT CGGCAGGGAG
CGCGGGGAGC TTGCGGGGAT GCCCCTCGGG GAGCTCCTCG CCGGCGGGGC CGGGGGTGTG
CCTCTGCCGC CCGGCAAGCG CAGCTGGTCG GGGACGCTCG CCCTGAGGTC CGGCGGGGGC
GAGCCCGTGG AGTGCGAGGC GGCGGCGAGC CGGGTGCTCG CCGGCGGCCG GGAGGCGGTG
TGCCTCGCGC TGCGGGACGT CACGGAGCGG CGGCGCGAGG AGGAGAAGCT GCGGGAGACC
GAGTCCCGCT ACCGGACGCT CGTCGAGCAG GTGCCCGCCA TCGTCTACAT CGAGGACGTG
CAGACGCAGG CCACCCTCTA CGACAGCCCG CGGATAGAGG AGATCCTGGG CTACCCGCGG
GACCTCTGCG AGCGCGAGCC CCTCTACTGG CACCGCATCC TCTACCCCGA GGACCGCGAG
CGGGTGCTGG AGGCCGAGCG GGAGGCGGTC GAGCGGGGCA GCTTTGTCCT GGAGTACCGG
GTCTTTGCCG CCGACGGCCG GGTGGTGTGG GTCCGGGACG AGGCCCGCCT CCTGCGCGAC
GAGTCGGGCG AGCCCCGCTT CTGGCAGGGT GTGATCTCGG ACATCACCGA GCAGCGGCGG
GCGGAGGACG CGCTGCGGGA GAGCGAGGAG CGCTACCGCT CGCTGGTGCA GCTCTCCCCG
GACGGGGTGG CGGTGGAGAG CGAGGGGCGC TTTGTCTACC TGAACGAGGC GGGGGCCCGC
CTGCTGGGGG CCTCCTCCCC GCAGGAGGTG CTCGGCCAGC CGGTCATGGA GCGGGTGCAC
CCGGACTGCC GGGAGAACGC CCGCCGCCGC GCCCGGCGCC TCCGGCGGGG CGAGCGCGTG
GAGCTGCAGG AGGAGCGCTG GCTCCGCCTG GACGGTCGGG AGATGGACGT GGAGGTCTCC
GCGGCGCCGG TGCAGTACGG GGGGCGGCCC TCCGCCCAGC TGGTGCTCCG GGACGTCACC
GGCAGGAAGC GCGCCGAGCG GGAGATCGTC CGCCAGAAGG CGGAGCTTGC CCGCTCCAAC
GCGGAGCTGG AGCAGTTCGC CTACCTGATC GCCCACGACC TCCGCGCCCC GCTCCGGAGC
ATGGACGGCT TCGCCCAGAT CCTGCTGGAG GACTGCGCCC CCCGGCTCGG CCCCGACGGC
CGGGAGTACC TCGCGCGCAT CCAGCGCGCC ACCCGCAAGA TGGCCCGGAT GATCGACGAG
CTTCTCGGCC TCTCCCGGCT CGCCCGCGCG GACATCCGGC GCGAGCCGGT GGACCTCTCG
GCCATGGCCC GCTCCATCGG CGAGGAGCTG CGGCAGGGGG AGCCCGACCG CCGGGTGGAG
TTCATCGTCG CGGGCGGGCT CGCCGCCGAG GGCGACCGCC GCCTGCTGCG GGTCGCCCTC
GCCAACCTCC TGGAGAACGC CTGGAAGTTC ACCCGCCGCA CCCCCCGCCC CCGCATAGTC
TTCGGCCGCA TAGAGCGCGG CGGAGAGCGC GTCTTCTTCG TCCGGGACAA CGGGGTCGGC
TTCGACATGG CCTACGCCGG CAAGCTCTTC GGCCCCTTCC AGCGGCTGCA CGCCGAGGAG
GAGTTCGAGG GGACCGGCAT CGGGCTCGCC GCCGTCGCCC GCGTCATAGA GCGCCACGGC
GGCCGGGTCT GGGCCGAGGG CGCCGAAGGG GAGGGGGCGA CCTTCTACTT CACCCTATAA
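
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C among unambiguous A/C/G/T bases; the spaces and line breaks used to group the sequence into blocks of ten are ignored. The function name `gc_content` is hypothetical.

```python
def gc_content(seq):
    """Fraction of G or C among the A/C/G/T characters in seq.

    Whitespace and any other characters are skipped, so the sequence
    can be pasted in the blocked layout shown above.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence above, as a small example.
    print(gc_content("ATGGACCCGA GCAGCTTGCA"))
```

Applying the same function to the full 1920 bp sequence gives a value that can be compared with the 75% reported in the record.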
 
Protein sequence
MDPSSLQGYQ PLVFYAAAAV AGALLAGLAA LLLWRRGRPS RRRDALRAAA FEQAAEGMLL 
VSGDRVLEAN AACAALLGRE RGELAGMPLG ELLAGGAGGV PLPPGKRSWS GTLALRSGGG
EPVECEAAAS RVLAGGREAV CLALRDVTER RREEEKLRET ESRYRTLVEQ VPAIVYIEDV
QTQATLYDSP RIEEILGYPR DLCEREPLYW HRILYPEDRE RVLEAEREAV ERGSFVLEYR
VFAADGRVVW VRDEARLLRD ESGEPRFWQG VISDITEQRR AEDALRESEE RYRSLVQLSP
DGVAVESEGR FVYLNEAGAR LLGASSPQEV LGQPVMERVH PDCRENARRR ARRLRRGERV
ELQEERWLRL DGREMDVEVS AAPVQYGGRP SAQLVLRDVT GRKRAEREIV RQKAELARSN
AELEQFAYLI AHDLRAPLRS MDGFAQILLE DCAPRLGPDG REYLARIQRA TRKMARMIDE
LLGLSRLARA DIRREPVDLS AMARSIGEEL RQGEPDRRVE FIVAGGLAAE GDRRLLRVAL
ANLLENAWKF TRRTPRPRIV FGRIERGGER VFFVRDNGVG FDMAYAGKLF GPFQRLHAEE
EFEGTGIGLA AVARVIERHG GRVWAEGAEG EGATFYFTL
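
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a simplified illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code and differs in which start codons are permitted. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop.

    Whitespace is removed first; codons with unexpected characters
    are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGACCCGA GCAGCTTGCA GGGCTACCAG"))
```

The first thirty nucleotides translate to MDPSSLQGYQ, matching the first block of the protein sequence. The final codons TTC ACC CTA TAA give FTL followed by a stop, matching its end.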