Gene Information |
Locus tag | Htur_3821 |
Symbol | |
ID | 8744449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 43814 |
End bp | 45595 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 646514407 |
Product | inositol monophosphatase |
Protein accession | YP_003405354 |
Protein GI | 284167076 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0303328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGCC GTAGATTAGC GACGATAGAC GAAATTATCG CCATCGAAAG TCCGAACAGT GACGAAACGC TCGCCCGACT GGAGACTTGG GCCACGGATC GAGGTATCGG GCTCTCGACG GTCGACGTCG GAGACGATAT CAGCGACGTT TACGACGAGA CCAGCGCCAC GCTCGGCGTT ACGCTCGGCG GTGACGGAAC CTTCCTCGAA GGTATCAAAA CGTTCGCGCC ACGGAATATC CCCCTAATAG GGGTTAACAC GGGAACGCTC GCGTTCCTCG CTCGCGTCGA ACCCGACGAT CTCGAAGCGG CCCTAGACGA GACGATCCGC GGACGAGCGT CGGTTGACAG TCGCCAACAG GTGCGCGTTG ATGCACCAGA CGTCGAGGCG ACGGGGATCA ACGACGTGAT GCTCCAACAG GTTCCCCCGG AGAATCCGAT CGACCGCAAG ATCACCCGAC TGGACGTCTA CGCCGACGAC GAGTACGTCG GCGAGTTCGA CGGGACCGGC CTGGCCGTTT CGACGCCGAC GGGATCGACG GGCGTCTCGC TGTCGGCCAA CGGTCCGGTT CACTACCCCG TCAACAACCA CACGCTACAG ATCGTCCCGC TGCACACCCA CAAACTCGGC GTCCGTCCGA TCGTCGTCTC ACCGTCGACG GAGATTCGGA TCGAGACTCA GGGTCAGGCG AGTATGCTCG TCGACGGCGG ACGCGCCCAC ACCGTTCTGA GCCAGGGAGA CGAGATCGTC GTTACCGGTG CGGAGCAACT CGCTCACGTC GTCCGGACCA GCTACGACGA TCATTTCTTC ACAGCGATCT CGAAAAAACT CGGCTGGGGT ATTCGCGACG CAGGTGTTCC GGAGGCGAAA GCCCGCGACG GAACTGACGG CGCGGCGAGC GCGACCGATC CAACGGCCGA AGAGGGCGTA GATACCATCG AGCGCGCGCT GACGATCGCC ACCGAGGCGG CCGAGGCCGC CGGCGAACCG TTACGCGAAC TCCACGGGCA GGTCGAGTCG ATCGACGTCA AGAGCGACAA GTCCGACATC GTCACCGAAG CCGACCATCA GGCCGACCGC GTCATCACTA CCGTCATTCG GAACGAGTTC CCCGATCACG CGATTTTCTC GGAGGAAAGC GTTCGACAGA CGAACGCGGA CGGCGACTAC ACCTGGGTCG TCGACCCGCT CGACGGAACC GGCAACTTCG CTCACGGCAA CCCGAACTAC TCGATCTCGA TCGGGCTCCT CGAGGACGGC GTTCCGGTGA TGGGAGTCGT CTACATCCCC GAAACCGACG AGCTGTTCTC CGCGATCGCA GGCGGGGACG TTCGGAAAGA CAGCGAACCG ATCGTGACGA CCGACCGGGA CTCCTTAGAC GAGAGCATGC TCATCTCCGG GTACGATCCG GACGGAACAT TCCTCACACA TTTCTACCAG GAGTCTCGCG GCGTCCGCCG GCTCGGTTCG GCCGCGCTCA ATCTGTGTTA CCTCGCGAGC GGCAGCGCCG ATGCCGTCTG GGAGCGCGAC ACCTACCCCT GGGACATCGC TGCCGGACTC GTTATCGCGC GCGCTGCCGG CGCCGAGTTC ACCGACGAAA CGGGCGAACC GTTCGAGTTT AGCCTCGACA CCGATGAACA GCGGACGTTA CTGGGATCCA ACGGGTCGCT GCACCCCGAA ATTCTCGACC ACCTCGAGTC GGGACTCGCC GTTCCAAATC GTTCCAAATC AGTAGACTTC GCTCAAGACG GTCCGACGGA AGCGGTAGCA GACGATCAGT AG
|
Protein sequence | MKGRRLATID EIIAIESPNS DETLARLETW ATDRGIGLST VDVGDDISDV YDETSATLGV TLGGDGTFLE GIKTFAPRNI PLIGVNTGTL AFLARVEPDD LEAALDETIR GRASVDSRQQ VRVDAPDVEA TGINDVMLQQ VPPENPIDRK ITRLDVYADD EYVGEFDGTG LAVSTPTGST GVSLSANGPV HYPVNNHTLQ IVPLHTHKLG VRPIVVSPST EIRIETQGQA SMLVDGGRAH TVLSQGDEIV VTGAEQLAHV VRTSYDDHFF TAISKKLGWG IRDAGVPEAK ARDGTDGAAS ATDPTAEEGV DTIERALTIA TEAAEAAGEP LRELHGQVES IDVKSDKSDI VTEADHQADR VITTVIRNEF PDHAIFSEES VRQTNADGDY TWVVDPLDGT GNFAHGNPNY SISIGLLEDG VPVMGVVYIP ETDELFSAIA GGDVRKDSEP IVTTDRDSLD ESMLISGYDP DGTFLTHFYQ ESRGVRRLGS AALNLCYLAS GSADAVWERD TYPWDIAAGL VIARAAGAEF TDETGEPFEF SLDTDEQRTL LGSNGSLHPE ILDHLESGLA VPNRSKSVDF AQDGPTEAVA DDQ
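The coordinate, length, and sequence fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function names (`gene_length`, `protein_length`, `translate`) are hypothetical, and it assumes an unspliced CDS with inclusive 1-based coordinates and a single terminal stop codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, so a standard codon table is used here.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon and no splicing."""
    return gene_len_bp // 3 - 1


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Spaces (as used in the record's 10-base blocks) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Values taken from the record above.
    print(gene_length(43814, 45595))    # Start bp, End bp
    print(protein_length(1782))         # Gene Length
    # First 30 bases of the gene sequence.
    print(translate("ATGAAAGGCC GTAGATTAGC GACGATAGAC"))
```

Pasting the full gene sequence from the Sequence table into `translate` should reproduce the listed protein sequence once the spaces between 10-residue blocks are removed from both.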