Gene Information |
Locus tag | Pisl_1789 |
Symbol | |
ID | 4618206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 1625015 |
End bp | 1625902 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639784873 |
Product | 3-dehydroquinate dehydratase, type I |
Protein accession | YP_931281 |
Protein GI | 119873274 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0710] 3-dehydroquinate dehydratase; [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01093] 3-dehydroquinate dehydratase, type I; [TIGR01808] monofunctional chorismate mutase, high GC gram positive type |
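The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record states neither convention explicitly, though its numbers are consistent with both. The function names are illustrative only.

```python
# Values copied from the Gene Information rows of this record.
START_BP = 1625015
END_BP = 1625902
GENE_LENGTH_BP = 888
PROTEIN_LENGTH_AA = 295


def span_length(start: int, end: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, ASSUMING an unspliced CDS
    that ends in a single, untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect either check, since the length of a span is the same on both strands.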
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATATGCG GCGCAGTACC AGTTAGAAAG CCGACAGATA TATATAGAGC TCTGGACTCC CCTGTTTCCT GCCTCGAGCT AAGGCTCGAT TACCTAGAGA CCTCACTCGC CGAGGCGAAG CCAGCGTTGG AGGAGGCGGT AGCGAGGAGG ACAGTCATAT TCACCGTCAG GAGGAGGGAG GAGGGGGGCG TCTGGCGGGG CACAGAGGAG GAGAGGGCGG CCCTCTACCT AAAGCTCCTG GAGCTGACGC CCCACTTCGT AGACGTGGAG GCCGCCGCGC CGGCGGCTGA GCAGGTGGCG GCCGCCAAGG GGAGGACAAA GCTGATAGCC AGCAGACACG ACTTCGGCGG GACCCCGCCG TATGAGACCC TCCTCTCCTG GGCCCGGGAG GCGGCAGCCT TGGGCGACGT GGTGAAGATA GTCACCTACG CCAGAGAGCC CCGGGACGGC CTCGCCGTGC TCTCCCTAAT CGGCGCCGTG GAGAAACCGA CGGTGGCCTT TGCCATGGGG CCGGCCGGGG CCTACACCAG GCTGGCGGCG GCGGCCCTGG GGAGCCCCAT CATGTATGTG TCGCTGGGTG AGACGACGGC GCCTGGCCAG ATATCGGTAG ACGCCTACTA CGCCGCGCTC CTAGGCATAG GGGCCGCCCC CCGGGGGGAG GGTCTGCCGG CGCTGAGGGA GGCGCTGGAC TGGATAGACA GTGCCCTCAT GCACCTCCTC AAGAGGAGGC TGGAAGTGTG CCGCGACATG GGGAAGATAA AAAAGGCCGC CGGTCTCCCT ATATACGACG ACGTTAGAGA GACCCAGGTC TTGAAGAGGG CGGGCGACTT TAAACAGATC TTCGAGCTGG TGGTGCAGAT GTGCAAAGCA GTGCAGCTAG TCGCCTAG
|
Protein sequence | MICGAVPVRK PTDIYRALDS PVSCLELRLD YLETSLAEAK PALEEAVARR TVIFTVRRRE EGGVWRGTEE ERAALYLKLL ELTPHFVDVE AAAPAAEQVA AAKGRTKLIA SRHDFGGTPP YETLLSWARE AAALGDVVKI VTYAREPRDG LAVLSLIGAV EKPTVAFAMG PAGAYTRLAA AALGSPIMYV SLGETTAPGQ ISVDAYYAAL LGIGAAPRGE GLPALREALD WIDSALMHLL KRRLEVCRDM GKIKKAAGLP IYDDVRETQV LKRAGDFKQI FELVVQMCKA VQLVA
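The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch of that cleaning step, together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the stop codon set is the standard one (TAA, TAG, TGA), which is assumed to apply under translation table 11.

```python
# Standard stop codons; ASSUMED to apply to translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Excerpt only: the first two blocks of the Gene sequence field.
excerpt = clean_sequence("TTGATATGCG GCGCAGTACC")
print(len(excerpt))            # number of bases in the excerpt
print(gc_fraction(excerpt))    # GC fraction of the excerpt, not of the gene
```

Passing the complete Gene sequence field through `clean_sequence` gives a string whose length can be compared with the Gene Length row, and whose `gc_fraction` can be compared with the GC content row. The gene sequence as listed begins with TTG and ends with TAG, so `ends_with_stop` is a quick sanity check that the field was copied in full.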