Gene Tpen_1093 details


Gene Information

Locus tag: Tpen_1093
Symbol:
ID: 4600960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1030326
End bp: 1031483
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 64%
IMG OID: 639773870
Product: N-acetylglucosamine-6-phosphate deacetylase
Protein accession: YP_920495
Protein GI: 119720000
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
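
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1030326
END_BP = 1031483


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1158 385
```

Both results agree with the Gene Length (1158 bp) and Protein Length (385 aa) fields.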


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.11899
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGAGTAC TCGTGAAGAA CGCCCACGTT CTCACCCCTC TGGGCGACCT CGGGGTCGTC 
AACGTGATGG TGAAGGACGG CGTCGTCGAA GGCTTCGACG TCGAGGCTGT CCCCGATAGG
GTCGTAGACG CGGAGCGCTA CTACGTCGCG CCGGGCTTCA TAGACACGCA CATACACGGC
TACGGAGGCG TAGACGTAAC CGAAGCCAGC GCGGAGGAGA TACTCGAAAT GTCCGGCGGG
CTCGCGGAGC ACGGGGTCAC GGGTTTCCTC GCATCGACGG TGGCGGCGCC CCACGAGAGA
CTCCTCCAGG CGTGTAGCAA CGTCGCCGCG GCGAGCTCGC GGTGGAGCCC CTCAAAGGGG
GCGAGGATCC TCGGAGTCCA CCTTGAAGGT CCATACCTCA ACCCGAAGAT GAAGGGGGCT
ATGAACGAGC AGTACTTCCG CAAGCCTAGC CTAAGGGAGC TCGACGAGTA CGTCTCGGCG
TCGAGGGGCC TCGTGAGGCA GGTCACAGTA GCCCCCGAAG TCGAGGGTGC CTTGGAGTTC
ATAGAGGAGG CGAGCAGGAG GGGCATCACG GTGAGCGTAG GGCACACGGA CGCCACGTAC
GAGCAGGCGC TCAGGGCGGT CGAGGCGGGA GCCCGGAAGG CGAACCACAT CTTCAACCAG
ATGAGGGGCT TCCACCACAG GGAGCCTGGC ACCGCCATGG CGTTACTCCT AGATACGGAC
GTCTTCGTGG AGATGATAGT GGACTTCGTG CACCTACACC CGGCGACCGT GAGGCTTGTT
TACCGCCTGG CGGGCCCCCT GAGGACAGTG CTCATAACTG ACGCAGTGCG CGCGGCGGGG
CTCCCCGACG GCGAGTACAC CCTCGGGGGC TTGCGGATAG TGGTGAAGGA GGGGGTTTCC
AGGCTGGCAG ACTCGGGGGC TCTCGCCGGC TCGACGCTCA CGATGGACAG GGCGGTCAGG
AACATGACGA AGGTGGGTGC GAACACTCTC GAAGCCCTGA CGATGGCGAG CTACACCCCG
GCGAAAAGCG TAGGGGCTCT TGGAAGGGAG AGGGTCGGCC TGCTCAGACC CGGGTACGCG
GCGGACATGG TAGTCCTAGA CGAGAGGCTA GAGGTTAAGA AAACGATTAT AGCCGGAGAA
GTCGTGTACG AGGCTTGA
 
Protein sequence
MRVLVKNAHV LTPLGDLGVV NVMVKDGVVE GFDVEAVPDR VVDAERYYVA PGFIDTHIHG 
YGGVDVTEAS AEEILEMSGG LAEHGVTGFL ASTVAAPHER LLQACSNVAA ASSRWSPSKG
ARILGVHLEG PYLNPKMKGA MNEQYFRKPS LRELDEYVSA SRGLVRQVTV APEVEGALEF
IEEASRRGIT VSVGHTDATY EQALRAVEAG ARKANHIFNQ MRGFHHREPG TAMALLLDTD
VFVEMIVDFV HLHPATVRLV YRLAGPLRTV LITDAVRAAG LPDGEYTLGG LRIVVKEGVS
RLADSGALAG STLTMDRAVR NMTKVGANTL EALTMASYTP AKSVGALGRE RVGLLRPGYA
ADMVVLDERL EVKKTIIAGE VVYEA
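
The gene sequence begins with TTG, while the protein begins with M. This is expected under translation table 11, where TTG is an accepted alternative start codon and the initiating codon is read as methionine. The following minimal sketch shows how the two sequences relate and how a GC fraction can be computed from the 10-base blocks. It is plain Python rather than the method used by the source database, and the helper names (`clean`, `gc_content`, `translate_cds`) are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by NCBI tables 1 and 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip(("".join(c) for c in product(BASES, repeat=3)), AMINO_ACIDS)
)

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces/newlines between 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS; the first codon is always read as M."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    # Drop a single terminal stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence above.
first_blocks = "TTGAGAGTAC TCGTGAAGAA CGCCCACGTT"
print(translate_cds(first_blocks))  # MRVLVKNAHV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_content` would allow the whole record to be checked the same way.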