Gene Information |
Locus tag | Tneu_0788 |
Symbol | aroB |
ID | 6164698 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 707055 |
End bp | 708086 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641667946 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001794173 |
Protein GI | 171185254 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
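The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the final codon is a stop codon. Both are assumptions about this record, not something it states. A minimal Python sketch of that consistency check, with illustrative variable names:

```python
# Values copied from the Gene Information table above.
start_bp = 707055
end_bp = 708086

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the computed values match the reported Gene Length (1032 bp) and Protein Length (343 aa).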
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.989746 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAGGT TTTTCTACAG ACACAGCAGG GGGGTCACCG AGGTGGTGGT GGGGAGCGGT CTGCAGTACG GCGACTACGT CGAGCGTCCC GTGGTTTTGG CGGAGGAGGG GCTTAGGCCC CCCATACCCG GCGCCCCCAC CCTCGCGCTT AGGGGAGGCG AGGAGGTCAA AAGCCTTGAG GTTTTGACCA AGGTCTACGG CTTTTTGAAG GAGGTGGGGG CGGATAGATC CACCACGTTG GTGGCGGTCG GCGGGGGGGC TCTCCTGGAT CTGGCGACCT TCGCCGCGGG GACCTACATG AGGGGCATCC GCCTCGTCCA CATACCGACC ACCCTCCTTG CCATGGTAGA CGCAGCGCTG GGCGGCAAAG GCGCCGTCGA CTGGGGCCCC GTCAAGAACC TAATCGGAGT GTTCTACCAG CCGGCGGCTA TACTCTGCGA TCTGTCTTGG CTTGGGACCC TCCCCGAGAG GGTCTACCGC TCGGCCTTCG CCGAGGTGGT GAAGTACGGC CTGGCGCTAG ACGGGGATTT CTACAGCTGG GTGAGGGAAA ACGCCAAGGC CCTCCTGGCC AGAGACGGGG GGGCTCTGGA GTACGCGGTG TACCGCTCCC TCCAGCTCAA GGCGGGTGTT GTGGAGGTGG ACGAGTTCGA GGAGAGGGGC GTTAGGCAGG TTCTCAACGT GGGCCACACG GTGGGTCACG CCGTGGAGAG GGTGCTAGGG CTTCTACATG GGGAGGCGGT GGCCGTGGGG ATGGTTGCGG AGCTCCGCCT CTCCAGCGAG CTGGGCTACC TCCGGGAGAG CCACGTGGCC GAGGCGGCTG AGGTCCTCAG CTCCCTCGGC TTGCCCACAA GCGTGAAGGC GACTGAGCAA CAGCTGGCGG AGGCGGCGGC CCTCGTGAAG TTCGACAAGA AGAGACGCGG CGGCCACATC TACATCCCCC TGGTCGTTAG GCCGGGGAGG TGGATCCTGG AGAAGATAGC TGTGGAGGAG GTGGAGAAGG CCGTCAGGTA TGTTCTGCGT CAGGGCGGGT AG
|
Protein sequence | MRRFFYRHSR GVTEVVVGSG LQYGDYVERP VVLAEEGLRP PIPGAPTLAL RGGEEVKSLE VLTKVYGFLK EVGADRSTTL VAVGGGALLD LATFAAGTYM RGIRLVHIPT TLLAMVDAAL GGKGAVDWGP VKNLIGVFYQ PAAILCDLSW LGTLPERVYR SAFAEVVKYG LALDGDFYSW VRENAKALLA RDGGALEYAV YRSLQLKAGV VEVDEFEERG VRQVLNVGHT VGHAVERVLG LLHGEAVAVG MVAELRLSSE LGYLRESHVA EAAEVLSSLG LPTSVKATEQ QLAEAAALVK FDKKRRGGHI YIPLVVRPGR WILEKIAVEE VEKAVRYVLR QGG
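The gene sequence shown above begins with ATG and ends with TAG, so it appears to be given in the coding orientation even though the gene lies on the minus strand. The following Python sketch shows one way to check a DNA fragment against the protein sequence and to compute its GC fraction. It is not the database's own method. The function names `translate` and `gc_fraction` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
# Standard codon-to-amino-acid assignments, in NCBI "TCAG" base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence above.
first_bases = "ATGAGGAGGT TTTTCTACAG ACACAGCAGG"
last_bases = "CAGGGCGGGT AG"
print(translate(first_bases))  # compare with the start of the protein sequence
print(translate(last_bases))   # compare with the end of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and the reported GC content.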