Gene Information |
Locus tag | Rxyl_0716 |
Symbol | |
ID | 4116524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | - |
Start bp | 748683 |
End bp | 750188 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | 638035501 |
Product | HAD family hydrolase |
Protein accession | YP_643498 |
Protein GI | 108803561 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG0241] Histidinol phosphatase and related phosphatases; [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E; [TIGR01656] histidinol-phosphate phosphatase family domain; [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
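
The coordinate and length fields in the record above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the annotated span ends with a single untranslated stop codon. The record states neither assumption explicitly.

```python
# Values copied from the Gene Information fields above.
start_bp = 748683
end_bp = 750188

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the span includes one terminal stop codon, which is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1506
print(protein_length)  # 501
```

Under those assumptions the computed values agree with the "Gene Length" (1506 bp) and "Protein Length" (501 aa) fields.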
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.358199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCCGCC CGGCGGTGGA CGTGGTGGTC CCCACCGCGG GGCGCCCCTC GCTCCGCGCC CTCCTGGAGG CCCTCGCCCG GCAGCCGCCG CCGGGCCGGC TGCTGCTGGT GGACGACCGC AGGAACCCCG GCGGCCCCCT GCTGCCCGAG GGCCCGCCGG CGGGGCTCGC CGGGAGGGTG GAGGTGCTGC GCGGCCCCGC CCGCGGCCCC GCCGCGGCCC GCAACGCCGG CTGGCGGGCC TCCGGGGCCG GGTGGGTGGC CTTCCTGGAC GACGACGTCG TCCCCGAACC CGGCTGGACC CGGGATCTCC TGCGCGACCT CTCCGGGCTC GGCCCCCGGG TCGCCGCGAG CCAGGGGCAG CTGCGCGTCC CCCTCCCCGC CGGGAGGAGG CCCACGGACG CCGAGAGGGG CGTCAAGGGG CTCGAGACCG CCCGCTGGAT CACGGCCAAC ATCGCCTACC GGCGGGAGGT TCTGGAGCGC CTCGGCGGCT TCGACGAGCG CTTCCCCCGC GCCTACCGGG AGGACGTGGA GCTCGGGCTG CGGGCCGTGA GATCGGGGCT GCGCATCGTG GGCGGCGAGA GGACCGTGCT GCACCCCGCG CGCCCGGCCG GCCCCCTCGA GAGCGTGCGC CGGCAGGCCG GGAACGCCGA CGACGCGCTG ATGCTCCTGC TCCACGGGCG CGCCTGGCGG CGGGAGGCCG GGGTTCCTCC CGGACGCCGG CCGCGGCACC TGCTCGTCAC CGCCGCCGGG GCCGCCGCGC TCGCCGCCCT GGCGGCCGGG CGCCGCCGGG ACGCCGCGCT CGGGGCGCTC CTCTGGCTGC TCGGGACCAC CGAGTTCGCC GCCTCCCGCA TCCTGCCCGG GCCCCGCACC CTCCCCGAGG TCTCCTCGCT GCTGCTGAGC AGCGCGCTCA TCCCCCCGGC GGCCTCCGCC TGCTTCGCGC TCGGGCTGCT GCGCGCCCGC CGGCTCGCCC CCGGCGGGGC ACGCCGCCTC CCGGAGGCCG TCCTCTTCGA CCGGGACGGC ACCCTCGTGC ACGACGTGCC CTACAACGGC GACCCGGAGA AGGTGGTGCC CGTCCCCGGC GCCCGGCGGG CGCTGCGGAG GCTCAGGGCG GCCGGCGTCC CCGCGGGCGT GATCTCCAAC CAGAGCGGCG TCGGGCGCGG CCTGATCTCG GCGGAGCAGG TGGCGAGGGT CAACCGCCGG GTGGAGGAGC TCCTCGGCCC CTTCGGGGCC TGGGCGCTCT GCTGCCACCG CCCGGAGGAC GGCTGCGGAT GCCGCAAGCC CGCCCCCGGC CTCGTCCTGC GGGCGGCGGC GCAGCTCGGG GCCGACCCCC GCCGCTGCGT CGTGATCGGC GACATCGGCG CGGACGTGGA GGCCGCCCGG GCCGCCGGGG CCCGGGGCAT ACTCGTCCCC ACCCCCGCCA CTCGCCCGGA GGAGGTGGAG GCCGCCCCCG AGGTCGCCCC GGACCTGGAG TCGGCCGTGG AGCTCGCGCT GGGGGGCGGC GCGTGA
Protein sequence | MSRPAVDVVV PTAGRPSLRA LLEALARQPP PGRLLLVDDR RNPGGPLLPE GPPAGLAGRV EVLRGPARGP AAARNAGWRA SGAGWVAFLD DDVVPEPGWT RDLLRDLSGL GPRVAASQGQ LRVPLPAGRR PTDAERGVKG LETARWITAN IAYRREVLER LGGFDERFPR AYREDVELGL RAVRSGLRIV GGERTVLHPA RPAGPLESVR RQAGNADDAL MLLLHGRAWR REAGVPPGRR PRHLLVTAAG AAALAALAAG RRRDAALGAL LWLLGTTEFA ASRILPGPRT LPEVSSLLLS SALIPPAASA CFALGLLRAR RLAPGGARRL PEAVLFDRDG TLVHDVPYNG DPEKVVPVPG ARRALRRLRA AGVPAGVISN QSGVGRGLIS AEQVARVNRR VEELLGPFGA WALCCHRPED GCGCRKPAPG LVLRAAAQLG ADPRRCVVIG DIGADVEAAR AAGARGILVP TPATRPEEVE AAPEVAPDLE SAVELALGGG A