Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0792 |
Symbol | |
ID | 6164535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 711015 |
End bp | 712001 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641667950 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_001794177 |
Protein GI | 171185258 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGTATA TAGTTGGGGG TCCTCAACAG GGAAAGTCCC TAAAGGAGGA GATCGAGGCT AGGGGAGTGC CGGCGTGGTA TGTAGAGCTG TGGGGGCACT ATATAGTGGC TACGCCGCCG AACGCCAAGC TCCAGGATCT GAAAACCCCC GTCAAAGCAG CCATAGAGCT TAGGACAGAC GGCCAACTGG TCTCCAGGGA GTGGAAGAGG GACCCCACGC CTGTCTACAT CGGAGGTAGG GAGGTGAGGG AGGGGAAGAT CTTCATAATA GCCGGCCCCT GCTCGGTGGA GACCGAGGAC CAGATCATGA AGACTGCGAG GTTTGTAAAA GAGGCTGGGG CGGACGCGTT GAGGGGCGGC GCCTTCAAGC CGAGGACAAG CCCCTACACA TTCCAAGGCC TAGGGGAGGA GGGGCTGAAG CTGTTGGCCA AGGCCAGGGA GGAGACGGGC CTCCCCGCCG TCACGGAGCT TATGGACCCG GAGGACATGC CGCTGGTGGT TAAATACGCA GACGCGATAC AGGTGGGGGC TAGGAACATG CAGAACTTCA CCCTTCTTAA GAAGCTCGGC CGGGCGGGCA AGCCGGTGCT CCTCAAGAGA GGCTTTGGAA ACACCATAGA GGAGTGGCTG CTGGCGGCTG AGTACGTGGC TCTCCACGGC AACGGCGGCA TTATCCTAGT GGAGAGGGGC ATCAGAACCT TCGACAAAAC CCTTAGGTTT ACGCTCGACG TAGGCGCGAT AGCCTACGCC AAACAACACA CCCACCTGCC AGTGATAGGC GACCCCAGCC ACCCGGCTGG AGACAGGAGA TATGTCATAC CGCTGGCCCT CGCCATATTA GCGGCTGGGG CAGACGGCCT AATCGTGGAG GTGCACCCCG ACCCCGATAG GGCGTGGAGC GACGCAAAAC AACAGCTCAC CTTCGACCAG TTCAGAGAGC TTGTACAAAA GGCTAGAGAA GTGGCGAGGG CTCTAGGCAA GAGCTAA
|
Protein sequence | MLYIVGGPQQ GKSLKEEIEA RGVPAWYVEL WGHYIVATPP NAKLQDLKTP VKAAIELRTD GQLVSREWKR DPTPVYIGGR EVREGKIFII AGPCSVETED QIMKTARFVK EAGADALRGG AFKPRTSPYT FQGLGEEGLK LLAKAREETG LPAVTELMDP EDMPLVVKYA DAIQVGARNM QNFTLLKKLG RAGKPVLLKR GFGNTIEEWL LAAEYVALHG NGGIILVERG IRTFDKTLRF TLDVGAIAYA KQHTHLPVIG DPSHPAGDRR YVIPLALAIL AAGADGLIVE VHPDPDRAWS DAKQQLTFDQ FRELVQKARE VARALGKS
|
| |