Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2978 |
Symbol | |
ID | 8743596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3063520 |
End bp | 3064974 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513563 |
Product | sulfatase |
Protein accession | YP_003404519 |
Protein GI | 284166240 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAA ATCGTCGACC GAATATCGTC GTCCTCTGTC TCGACACCGT CAGGAAGGAC GTCTACGACC GATTCGCGAC CCGCCTCCGG GAGCGGGCGG CCGTCCGTTT CGAGGGGATG CGGGCGCTCG GCGGCTGGAG CGTTCCGAGC CACGCCGGGA TGCTGACGGG CACCGTCCCG TCGGAAACGG GTGTCCACGC CCACCAGCGG CGGTTCGACC CGATCGACCC CGAGGACACC TGGATCGCGC CCCTCGAGGG GCAGGGGTAC GAGTCGGTCT GCGTCACGTC GAACATCTAC GCCAGCCCCG TCTTCGGGTT CGACCGCTTC TTCGATCGGA CGGTTCCCAT CTCGCCGAGC CGCAGACTCC CGGAGGGGAT GGACGTTCAA GAACACATCT CCGATCGGTC GGCCGAGGGC GTGGAAGCGT ACGCCGATTT CGTGCGCGAG GCCCTCGAGC ACGACCACCC GCTGCGCTCG CTGGCCAACG GCGTCCTGCT CAAACTGGAC GACGTGAGCC GGAAGCTGCC GATCGAGAAG CCGACCGACT TCGGGGGGCG AGCGATCGCC CGCACCCTCG AGCGCGAGGT CGCCGAGCCC GACGGGCCGG TCGTCGCCTT CGCTAACGTT ATGGACGCCC ACGGCCCCCA CACCGCCTTC CGCGGGCTCG ATGACTCGAT CCACGGCGTC TCCGCGGACT TCCACTCGAG TTCGTTTCGC GACTCGGACG TCAACGTCGT GGACGGACTC GGTGCGTACG AATCGGACAT CGAGCGCGTC CGCCGGCTCT ACGCCGCGAC GGTCGACTAC CTCGACCGGG TCGTTACCGA CCTCCTGGAC GCGTTAGCCC GCGAGGACGA CCGCGAGTCG ATTCTGATCG TGACGGCCGA CCACGGCGAG AACCTCGGCT ACGAGTCCGA CGGTTACCTC ATGAACCACA TGAGCAGCCT CTCGGAAGGG CTGTTACACG TCCCGTTCGA CATCGTGGCC ACCGACGACA GCGCCCTCGA GCTCGCGGTC GACGATACTA CCCGTCCCGT CGACGTCGAC GGGCTCGCCT CCCACGCCGA CCTCGGCGAC GCGGTCCGGT CGCTCGCGGG CGAGGAGCCG TTCGATCCGT TCGCCCTCGA GCGTGAGCGT GCTCGCGCCG AGATCGTCGG CTCCGGCTCC GGCATCCCGG AGGGCGGGGA CGAATCGTAC TGGGACCGCG GCCAGCGGGT CGTCTACGAG GACGACCGGA AGTACTACCG CGATCAGCTC GGCGACGAGG CCGTCTACGA CGTCTCCGGA CCGCCGTCGA AACAGGTCGA ACTGCCCGAC GAGACGGTGC CGGACGGGCT CTTCGAGTCC GCCTTCGGCG ACTGGATCAC CGACGAAGAG CGCGACGGCC GGGACCACGC CGAGGAGGTC GACGCGGCGA GTCGCGCCCG GCTGGAGGAT CTGGGATACC TATGA
|
Protein sequence | MTENRRPNIV VLCLDTVRKD VYDRFATRLR ERAAVRFEGM RALGGWSVPS HAGMLTGTVP SETGVHAHQR RFDPIDPEDT WIAPLEGQGY ESVCVTSNIY ASPVFGFDRF FDRTVPISPS RRLPEGMDVQ EHISDRSAEG VEAYADFVRE ALEHDHPLRS LANGVLLKLD DVSRKLPIEK PTDFGGRAIA RTLEREVAEP DGPVVAFANV MDAHGPHTAF RGLDDSIHGV SADFHSSSFR DSDVNVVDGL GAYESDIERV RRLYAATVDY LDRVVTDLLD ALAREDDRES ILIVTADHGE NLGYESDGYL MNHMSSLSEG LLHVPFDIVA TDDSALELAV DDTTRPVDVD GLASHADLGD AVRSLAGEEP FDPFALERER ARAEIVGSGS GIPEGGDESY WDRGQRVVYE DDRKYYRDQL GDEAVYDVSG PPSKQVELPD ETVPDGLFES AFGDWITDEE RDGRDHAEEV DAASRARLED LGYL
|
| |