Gene Information |
Locus tag | Rxyl_0960 |
Symbol | |
ID | 4115923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rubrobacter xylanophilus DSM 9941 |
Kingdom | Bacteria |
Replicon accession | NC_008148 |
Strand | + |
Start bp | 997856 |
End bp | 998917 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638035745 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_643739 |
Protein GI | 108803802 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00600689 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGAACCA GTGATCTGCG CGTAGAGAGC ATCCGACCCC TGATCCCGCC CGCCATCCTC CTGGAGGAGC TCCCCCTCTC GGAGGACGGG GCCCTCACGG TGAGCCGGGC GCGGGAGGAG ATAGTGCGCA TCCTCGAGGG CGAGGACGAC CGCCTCATCG CCGTGGTCGG CCCCTGCTCG GTGCACGACA CCTCGGCGGC GCTGGAGTAC GCCCGGCGGC TGCGGGGGCT CGCGGAGGAG CTGCGCGGGG AGCTCTGCGT GATCATGCGG GTGTACTTCG AGAAGCCCCG GACCACGGTC GGGTGGAAGG GGCTCATCAA CGACCCGCAC CTGGACGGGA GCTTCGCCGT GAACGAGGGG CTCAGGATGG CGCGCGGCCT GCTGCTCGAC GTGGCGGAGC TGGGATTGCC CGCCGGCTGC GAGTTTCTGG ACCCCATCTC GCCGCAGTAC TTCACCGACG CGGTGGCCTG GGCGGCCATC GGGGCGCGCA CCACCGAGAG CCAGGTGCAC CGGCAGCTGG CCTCCGGCCT CTCCATGCCG GTGGGCTTCA AGAACGGCAC CGGCGGCGGG GTGCAGATAG CGGTGGACGC GGTGCGCGCC GCCGCCCACC CGCACAGCTT CCCCGGGGTG ACCCGGCAGG GGCTCGCGGC GGTCGTCACC ACCACCGGCA ACCCGGACTG CCACGTCATC CTGCGCGGCG GCAGGAGCGG CCCGAACTAC GACGAGCGGA GCGTCGGGGA GGCCCTGGAG GCGCTGCGCC GGGCCGGCCT CCCGCCCCGC CTGATGGTGG ACGCCAGCCA CGCCAACAGC GGCAAGGACT ACCGCCGGCA GCCGCTCGTG ATCCGGGACG TGGCCGACCA GGTCGCCCGG GGCCAGAGGG GGATCGTGGG CGTGATGCTG GAGAGCTTCC TGGTCGAGGG GAGCCAGGAG CTCACGGACC GCTCCCGGCT GACCTACGGC CAGAGCATCA CCGACTCCTG CATGGGCTGG GAGATGACGG TCCCCACCCT GCGCGAGCTG GCCGAGGCGG TCAGGGCCCG GCGCACCGTC CGCGTCGGCT GA
|
Protein sequence | MRTSDLRVES IRPLIPPAIL LEELPLSEDG ALTVSRAREE IVRILEGEDD RLIAVVGPCS VHDTSAALEY ARRLRGLAEE LRGELCVIMR VYFEKPRTTV GWKGLINDPH LDGSFAVNEG LRMARGLLLD VAELGLPAGC EFLDPISPQY FTDAVAWAAI GARTTESQVH RQLASGLSMP VGFKNGTGGG VQIAVDAVRA AAHPHSFPGV TRQGLAAVVT TTGNPDCHVI LRGGRSGPNY DERSVGEALE ALRRAGLPPR LMVDASHANS GKDYRRQPLV IRDVADQVAR GQRGIVGVML ESFLVEGSQE LTDRSRLTYG QSITDSCMGW EMTVPTLREL AEAVRARRTV RVG
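
The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative and not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(997856, 998917)
print(length)                  # 1062, matching the Gene Length field
print(protein_length(length))  # 353, matching the Protein Length field
```

Passing the full Gene sequence field to `gc_fraction` (the spaces between 10-base blocks are stripped) gives a value to compare with the reported GC content. The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an alternative start codon.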