Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rxyl_1029 |
Symbol | |
ID | 4116715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 1063456 |
End bp | 1064343 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638035819 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_643807 |
Protein GI | 108803870 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000527946 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGACA TGGGCTTTAC AGAGGCGATA GAGGCCGGGT ACGGGGTGAG GGTGTTGCAC GGCGGGGAGC CCAGGTGGGG CCGGCGGGAG GGGGAGGAGA TCCTGCTGGA GAGCGGGGAG CGCATCCCGG AGGCGGGGGC GCGGTATCTC GCGCCGGCGG AGCCGACGAA GATCATCGCG GTGCACCTCA CCTACCGGAG CCGGGTCGAG GAGTACGGGG CGCGCGTACC GCCCGAGCCC TCGTACTTCC TCAAGCCGCC GACGGCGCTC AACGGGCACC GGGGGGTGGT GCGGTGGCCG GCGGGGACGC GCTTTCTCAA CTACGAGGGA GAGCTCGCGG TCATCGTCGG GCGCCGGATG AAGGGCGTGG GCATGGAGGA GGCGCTCTCC TGCGTCGCCG GCTACACCTG CGCCAACGAC GTGGGGCTGC ACGACTTCCG GCACGCCGAC CGCGGGTCGA TGCTGCGGGT GAAGGGGCAG GACGGCTTTC TGCCGCTCGG GCCCGAGGTG GTGCCCGCCT CGCGGTTCGA CCCCGAGGAC TTCGCGCTGC GCACCTACCT CAACGGCGAG GTCGTGCAGG AGGGCGGGGC GGAAGACCTG CTCTTCGGCG TAGCCTACCA GCTCGCCGAC CTGTGCCGCC TCATCACGCT CGAGCCCGGC GACGTGGTGC TCACCGGTAC GCCGGCCAAC TCCCGGCCCA TGCGGCCCGG CGACGTGGTG GAGGTCGAGA TAGAGGGCAT AGGCCGGCTC TCCAACACCG TCGAGGAGTG GGAGGTGGAC CTCTCCGGCC CCGGCGAGCG GCCCGAGGTC TCGGCCAGCA CGCTGCACGT CGCGCTCGCC GTCCCCGAGG ACGAGGCCGA GCGGCTGGCG GCGGGGGAGG CGCGGTGA
|
Protein sequence | MSDMGFTEAI EAGYGVRVLH GGEPRWGRRE GEEILLESGE RIPEAGARYL APAEPTKIIA VHLTYRSRVE EYGARVPPEP SYFLKPPTAL NGHRGVVRWP AGTRFLNYEG ELAVIVGRRM KGVGMEEALS CVAGYTCAND VGLHDFRHAD RGSMLRVKGQ DGFLPLGPEV VPASRFDPED FALRTYLNGE VVQEGGAEDL LFGVAYQLAD LCRLITLEPG DVVLTGTPAN SRPMRPGDVV EVEIEGIGRL SNTVEEWEVD LSGPGERPEV SASTLHVALA VPEDEAERLA AGEAR
|
| |