Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1093 |
Symbol | |
ID | 4600960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1030326 |
End bp | 1031483 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639773870 |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_920495 |
Protein GI | 119720000 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.11899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAGTAC TCGTGAAGAA CGCCCACGTT CTCACCCCTC TGGGCGACCT CGGGGTCGTC AACGTGATGG TGAAGGACGG CGTCGTCGAA GGCTTCGACG TCGAGGCTGT CCCCGATAGG GTCGTAGACG CGGAGCGCTA CTACGTCGCG CCGGGCTTCA TAGACACGCA CATACACGGC TACGGAGGCG TAGACGTAAC CGAAGCCAGC GCGGAGGAGA TACTCGAAAT GTCCGGCGGG CTCGCGGAGC ACGGGGTCAC GGGTTTCCTC GCATCGACGG TGGCGGCGCC CCACGAGAGA CTCCTCCAGG CGTGTAGCAA CGTCGCCGCG GCGAGCTCGC GGTGGAGCCC CTCAAAGGGG GCGAGGATCC TCGGAGTCCA CCTTGAAGGT CCATACCTCA ACCCGAAGAT GAAGGGGGCT ATGAACGAGC AGTACTTCCG CAAGCCTAGC CTAAGGGAGC TCGACGAGTA CGTCTCGGCG TCGAGGGGCC TCGTGAGGCA GGTCACAGTA GCCCCCGAAG TCGAGGGTGC CTTGGAGTTC ATAGAGGAGG CGAGCAGGAG GGGCATCACG GTGAGCGTAG GGCACACGGA CGCCACGTAC GAGCAGGCGC TCAGGGCGGT CGAGGCGGGA GCCCGGAAGG CGAACCACAT CTTCAACCAG ATGAGGGGCT TCCACCACAG GGAGCCTGGC ACCGCCATGG CGTTACTCCT AGATACGGAC GTCTTCGTGG AGATGATAGT GGACTTCGTG CACCTACACC CGGCGACCGT GAGGCTTGTT TACCGCCTGG CGGGCCCCCT GAGGACAGTG CTCATAACTG ACGCAGTGCG CGCGGCGGGG CTCCCCGACG GCGAGTACAC CCTCGGGGGC TTGCGGATAG TGGTGAAGGA GGGGGTTTCC AGGCTGGCAG ACTCGGGGGC TCTCGCCGGC TCGACGCTCA CGATGGACAG GGCGGTCAGG AACATGACGA AGGTGGGTGC GAACACTCTC GAAGCCCTGA CGATGGCGAG CTACACCCCG GCGAAAAGCG TAGGGGCTCT TGGAAGGGAG AGGGTCGGCC TGCTCAGACC CGGGTACGCG GCGGACATGG TAGTCCTAGA CGAGAGGCTA GAGGTTAAGA AAACGATTAT AGCCGGAGAA GTCGTGTACG AGGCTTGA
|
Protein sequence | MRVLVKNAHV LTPLGDLGVV NVMVKDGVVE GFDVEAVPDR VVDAERYYVA PGFIDTHIHG YGGVDVTEAS AEEILEMSGG LAEHGVTGFL ASTVAAPHER LLQACSNVAA ASSRWSPSKG ARILGVHLEG PYLNPKMKGA MNEQYFRKPS LRELDEYVSA SRGLVRQVTV APEVEGALEF IEEASRRGIT VSVGHTDATY EQALRAVEAG ARKANHIFNQ MRGFHHREPG TAMALLLDTD VFVEMIVDFV HLHPATVRLV YRLAGPLRTV LITDAVRAAG LPDGEYTLGG LRIVVKEGVS RLADSGALAG STLTMDRAVR NMTKVGANTL EALTMASYTP AKSVGALGRE RVGLLRPGYA ADMVVLDERL EVKKTIIAGE VVYEA
|
| |