Gene Rxyl_0716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRxyl_0716 
Symbol 
ID4116524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRubrobacter xylanophilus DSM 9941 
KingdomBacteria 
Replicon accessionNC_008148 
Strand
Start bp748683 
End bp750188 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content80% 
IMG OID638035501 
ProductHAD family hydrolase 
Protein accessionYP_643498 
Protein GI108803561 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0241] Histidinol phosphatase and related phosphatases
[COG1216] Predicted glycosyltransferases 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
[TIGR01656] histidinol-phosphate phosphatase family domain
[TIGR01662] HAD-superfamily hydrolase, subfamily IIIA 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.358199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGCC CGGCGGTGGA CGTGGTGGTC CCCACCGCGG GGCGCCCCTC GCTCCGCGCC 
CTCCTGGAGG CCCTCGCCCG GCAGCCGCCG CCGGGCCGGC TGCTGCTGGT GGACGACCGC
AGGAACCCCG GCGGCCCCCT GCTGCCCGAG GGCCCGCCGG CGGGGCTCGC CGGGAGGGTG
GAGGTGCTGC GCGGCCCCGC CCGCGGCCCC GCCGCGGCCC GCAACGCCGG CTGGCGGGCC
TCCGGGGCCG GGTGGGTGGC CTTCCTGGAC GACGACGTCG TCCCCGAACC CGGCTGGACC
CGGGATCTCC TGCGCGACCT CTCCGGGCTC GGCCCCCGGG TCGCCGCGAG CCAGGGGCAG
CTGCGCGTCC CCCTCCCCGC CGGGAGGAGG CCCACGGACG CCGAGAGGGG CGTCAAGGGG
CTCGAGACCG CCCGCTGGAT CACGGCCAAC ATCGCCTACC GGCGGGAGGT TCTGGAGCGC
CTCGGCGGCT TCGACGAGCG CTTCCCCCGC GCCTACCGGG AGGACGTGGA GCTCGGGCTG
CGGGCCGTGA GATCGGGGCT GCGCATCGTG GGCGGCGAGA GGACCGTGCT GCACCCCGCG
CGCCCGGCCG GCCCCCTCGA GAGCGTGCGC CGGCAGGCCG GGAACGCCGA CGACGCGCTG
ATGCTCCTGC TCCACGGGCG CGCCTGGCGG CGGGAGGCCG GGGTTCCTCC CGGACGCCGG
CCGCGGCACC TGCTCGTCAC CGCCGCCGGG GCCGCCGCGC TCGCCGCCCT GGCGGCCGGG
CGCCGCCGGG ACGCCGCGCT CGGGGCGCTC CTCTGGCTGC TCGGGACCAC CGAGTTCGCC
GCCTCCCGCA TCCTGCCCGG GCCCCGCACC CTCCCCGAGG TCTCCTCGCT GCTGCTGAGC
AGCGCGCTCA TCCCCCCGGC GGCCTCCGCC TGCTTCGCGC TCGGGCTGCT GCGCGCCCGC
CGGCTCGCCC CCGGCGGGGC ACGCCGCCTC CCGGAGGCCG TCCTCTTCGA CCGGGACGGC
ACCCTCGTGC ACGACGTGCC CTACAACGGC GACCCGGAGA AGGTGGTGCC CGTCCCCGGC
GCCCGGCGGG CGCTGCGGAG GCTCAGGGCG GCCGGCGTCC CCGCGGGCGT GATCTCCAAC
CAGAGCGGCG TCGGGCGCGG CCTGATCTCG GCGGAGCAGG TGGCGAGGGT CAACCGCCGG
GTGGAGGAGC TCCTCGGCCC CTTCGGGGCC TGGGCGCTCT GCTGCCACCG CCCGGAGGAC
GGCTGCGGAT GCCGCAAGCC CGCCCCCGGC CTCGTCCTGC GGGCGGCGGC GCAGCTCGGG
GCCGACCCCC GCCGCTGCGT CGTGATCGGC GACATCGGCG CGGACGTGGA GGCCGCCCGG
GCCGCCGGGG CCCGGGGCAT ACTCGTCCCC ACCCCCGCCA CTCGCCCGGA GGAGGTGGAG
GCCGCCCCCG AGGTCGCCCC GGACCTGGAG TCGGCCGTGG AGCTCGCGCT GGGGGGCGGC
GCGTGA
 
Protein sequence
MSRPAVDVVV PTAGRPSLRA LLEALARQPP PGRLLLVDDR RNPGGPLLPE GPPAGLAGRV 
EVLRGPARGP AAARNAGWRA SGAGWVAFLD DDVVPEPGWT RDLLRDLSGL GPRVAASQGQ
LRVPLPAGRR PTDAERGVKG LETARWITAN IAYRREVLER LGGFDERFPR AYREDVELGL
RAVRSGLRIV GGERTVLHPA RPAGPLESVR RQAGNADDAL MLLLHGRAWR REAGVPPGRR
PRHLLVTAAG AAALAALAAG RRRDAALGAL LWLLGTTEFA ASRILPGPRT LPEVSSLLLS
SALIPPAASA CFALGLLRAR RLAPGGARRL PEAVLFDRDG TLVHDVPYNG DPEKVVPVPG
ARRALRRLRA AGVPAGVISN QSGVGRGLIS AEQVARVNRR VEELLGPFGA WALCCHRPED
GCGCRKPAPG LVLRAAAQLG ADPRRCVVIG DIGADVEAAR AAGARGILVP TPATRPEEVE
AAPEVAPDLE SAVELALGGG A