Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0808 |
Symbol | |
ID | 6164298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 725536 |
End bp | 726423 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641667966 |
Product | 3-dehydroquinate dehydratase, type I |
Protein accession | YP_001794193 |
Protein GI | 171185274 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0710] 3-dehydroquinate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01093] 3-dehydroquinate dehydratase, type I [TIGR01808] monofunctional chorismate mutase, high GC gram positive type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.716383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATATGCG GCGCGGTACC GGTTAGAAAG CCGGCGGACG TATATAGAGC TCTGGACTCC CCCGCCCCCT GCCTCGAGCT AAGGCTCGAC TATCTGGAGA GCTCCCTCGC CGAGGCGAAG CCCGCGTTGG AGGAGGCGGT CGCGAGGAGG ACAGTCATAT TAACAGTTAG GAGGAGGGAG GAGGGGGGCG CCTGGCGGGG CACCGAGGAG GAGAGGGCGG CCCTCTACCT AAAGCTCCTG GAGCTGACGC CCCACTTCGT AGACGTGGAG GCCGCCGCGC CGGCGGCTGG GCAGGTGGCG GCCGCCAGGG GGAGGACTAA GCTTATAGCC AGCAGACACG ACTTCGGCGG GACCCCCCCG TATGAAACCC TCCTCTCCTG GGCCCGGGAG GCGGCGGCCT TGGGCGACGT GGTGAAGATA GTCACCTACG CCAGAGAGCC CCGGGACGGC CTCGCCGTTC TCTCCCTAAT CGGCGCCGTG GAGAAGCCGA CGGTGGCCTT CGCCATGGGG CCGGCCGGGG CCTACACCAG GCTGGCGGCG GCGGCCCTGG GGAGCCCCAT CATGTACGTA TCGCTGGGCG AGGCGACGGC GCCTGGCCAG ATATCCCTAG ACGCCTACTA CGCCGCGCTC CTGGGCATGG GGGCCGCCCC CGGGGGTGAG GGTCTGCCGG CGCTGAGGGA GGCGCTGGAC TGGATAGACG GCGCCCTCAT GCACCTTCTC AAGAGGAGGC TGGAGGTGTG CCGCGACATG GGGAAGATAA AGAAGTCCGC CGGTCTCCCC ATCTACGACG ACATCAGAGA GGCCCAGGTC TTGAAGAGGG CGGGCGACTT TAAACAGATC TTCGAGCTGG TGGTGCAGAT GTGCAAGGCG GTACAACTAG TCGCCTAG
|
Protein sequence | MICGAVPVRK PADVYRALDS PAPCLELRLD YLESSLAEAK PALEEAVARR TVILTVRRRE EGGAWRGTEE ERAALYLKLL ELTPHFVDVE AAAPAAGQVA AARGRTKLIA SRHDFGGTPP YETLLSWARE AAALGDVVKI VTYAREPRDG LAVLSLIGAV EKPTVAFAMG PAGAYTRLAA AALGSPIMYV SLGEATAPGQ ISLDAYYAAL LGMGAAPGGE GLPALREALD WIDGALMHLL KRRLEVCRDM GKIKKSAGLP IYDDIREAQV LKRAGDFKQI FELVVQMCKA VQLVA
|
| |