Gene Information |
Locus tag | Tpen_1231 |
Symbol | |
ID | 4601163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1168671 |
End bp | 1169714 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639774007 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_920632 |
Protein GI | 119720137 |
COG category | [C] Energy production and conversion |
COG ID | [COG0371] Glycerol dehydrogenase and related enzymes |
TIGRFAM ID | |
|
|
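The coordinate and length fields in the table above are related by simple arithmetic, which can be sketched in Python as a consistency check. The function names are illustrative and not part of the source database. The sketch assumes 1-based, inclusive coordinates (consistent with the reported 1044 bp) and an unspliced CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1168671, 1169714)
print(length, protein_length(length))  # 1044 347
```

Both values agree with the Gene Length and Protein Length fields of this record.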
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0263473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGGGCTG AGCTCCCGAA GAGGGTTGTA GTCGAGAGGG GAGCCTTGCA GTTTCTACCC GAGGTTCTAC GCGAGCTTGG ATGCTCGAAG ACCGTAGTCG TAACGGATAG TGGTGTTTGG AGCGTTGTGG GGAGCGTCGT CGAGGGTGCT TTAAGGGGGC TGGCCTACGA GGTCGTATAT ATCGAGGCGG CGGATAACTC GAACGTTGAG AGAGCGCGCT CGGCGGCTAG GAGGGTGGAG GCTTGCGCCG TGGCTGGGCT GGGGGGCGGG CGGCCCGTCG ACGTCGCGAA GTACGCGGCG TTCATGGAGG GGCTCCCCTT CGTGAGCGTG CCCACGGCGA TAAGCCACGA CGGCTTCGCC TCGCCCATAG TGGCGCTCAA GGACCCGGAG GGGAACCCCC TGTCTATATT CACGAGGCCG CCCGCCGCCG TGCTCGTGGA CCTGGCGGTC GTGTCGAGGG CTCCGAGGAG GCTCCTCGCG AGCGGGGTCG GGGACATAGT CGGAAAGGTT ACCAGCGTCG CGGACGCCAG GCTTGCCCAG AGGCTTACAG GTGAGGAGGT CCCGGAGGTA GCCCTCAGGA TGGCGGAGAC GGCGGCCAGG ATGGTCCTGG ACGAGGTGGA CGAGATAGCT TCGTGGACTG AGAGGGGTGT AGGCGTCCTG GCGCAGGCCG GGCTACTCGC AGGCATGGCC ATGGCGGTGG CCGGTAGCTC GAGGCCCTGT AGCGGCTCGG AGCACCTCTT CAGCCACTCC CTGGACAAGT ATGTGCCGTG GAAGAAGAGC CTCCACGGGG AGCAGGTAGG CGTGGGCGCG ATCATAGCGT CGTACCTCCA CGGGTTCAAC TGGAGGGTTA TCCGGGACGC CCTCGCGAAG GTCGGGGCGC CGACGACCGT GGAGGGGCTC GGGGTAACCG GGGAGGACGC GGTCCGCGCC CTCCTCAAGG CCAGGGAGCT GAGAAAGAGG TTCACGATCC TCGACGTAGT CGAGCTCAAC GAGGGGCTCG CCTGGAAGGT GCTCAGGGAA ACCGGGGTAG CGCCCACGGC CTAA
|
Protein sequence | MRAELPKRVV VERGALQFLP EVLRELGCSK TVVVTDSGVW SVVGSVVEGA LRGLAYEVVY IEAADNSNVE RARSAARRVE ACAVAGLGGG RPVDVAKYAA FMEGLPFVSV PTAISHDGFA SPIVALKDPE GNPLSIFTRP PAAVLVDLAV VSRAPRRLLA SGVGDIVGKV TSVADARLAQ RLTGEEVPEV ALRMAETAAR MVLDEVDEIA SWTERGVGVL AQAGLLAGMA MAVAGSSRPC SGSEHLFSHS LDKYVPWKKS LHGEQVGVGA IIASYLHGFN WRVIRDALAK VGAPTTVEGL GVTGEDAVRA LLKARELRKR FTILDVVELN EGLAWKVLRE TGVAPTA
|
| |
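The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal Python sketch. The helper names are hypothetical and not part of the source database. The sketch assumes the gene sequence is given in coding orientation (it begins with TTG and ends with TAA, although the gene lies on the minus strand) and that the first codon is a valid initiator read as methionine. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its set of permitted start codons, so the standard assignments are used here.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is an initiator (e.g. TTG) read as Met.
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("TTGAGGGCTG AGCTCCCGAA GAGGGTTGTA"))  # MRAELPKRVV
```

The printed residues match the first ten residues of the protein sequence in this record. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.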