Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2973 |
Symbol | |
ID | 8743590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3057297 |
End bp | 3058739 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513557 |
Product | sulfatase |
Protein accession | YP_003404514 |
Protein GI | 284166235 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGAC ACGCGACCAT GGGAGACGAG CCATCCATCG CCCTCGTCGT GCTCGACACC CTGCGGGCGG ATTCCTTCGA CGAGCACTTC GACTGGCTAC CGGGCGTGCA GTTCACGAAC GCGTGGAGCA CGAGCCACTG GACGGCCCCG GCCCACGCCT CGCTGTTCAC CGGCCGGTAC GCGAGCGAGG CCGGCGTGAC GATCAAATCC CAGGATTTCG ACCGGGACAC GACTCGCCTC CCCGAACTCC TCCGGGACCG CGGCTACCGG ACGCGGGCGT TCAGCTGTAA CGTCAACATC TCGGAGCAGC TAGGCTGGCA CCACGGGTTC GACGAGTTCG ACGGCGGCTG GCGGCTCAGC GGCCTCGGCG AGGACGTCTT CGACTGGGAC GAGTTCATCG CCGAACACCG GGCCGACGGC CCCGAACGGT ACCTCCGAGC GCTCTGGCGC TGCGTCGACG GCGACTGCGA TACGACCCAG TCGCTGAAAC AGGGCGCCCT GATGAAGCTC CGAGACATGG GCCTCAAGGG GCGCCACCCC GACGACGGCG CGTCGGAGTT TCTGGAGTAC GTCCAGAAGC GTTCGTGGAC CCAGGACAGC GAGTTCCTCT TCGCGAATCT GATGGAGGCG CACCTGCCGT ACGACCCGCC CGACGAGTAC AAGACCTATC CGGACGAGGA GTCGCCCCAC TTCGACAGCG TGAAGGCCAC GCTCGGGGAG CCCTCGGCCG ATCCGGAGCG GATCAGGACC GCGTACGACG ACGCCGTACG GTACCTGTCG GACATCTACC GCGACATCTA CGGGGAGCTC GCCGCGGAGT TCGACTACGT CGTGACCCTC GCCGACCACG GCGAAGCGCT CGGCGAGTAC GGCGCGTGGC AACACGGCGG CGGCCTCCAT CCGCCCGTGA CGAAAGTCCC GCTCGTCGTC TCCGCGCCCG GACCGAACGC CGACGACCGA ACCCCCAACG CAGGCAGCGC GGAGCGGCCA ACGCGGGACG CCCTCGTGAA CCTGCTGGAC GTCTACGCGA CGGTGCTCGA CCTCGCCGGT ATCGAGTCGG CCCACCGTCG CGGCGAGTCG TTCCGCCCGC TGTGCTCGAG CGATCCCACC GACACCGAGC CCCACTCGAG CGACACCGTC ATCACTGAGC CGCGCTCGAG CGCGCTGCTC GAGTTCCACG GCATCTCGAA GCGACGCGCG CTCGCGCTCG AGGAGGACGG CTACGATATC GGCCCCGTCG ACCGCGAACG CCACGGCGTC GCGACGGCCG ACTGTTACTA CTTCGAGGGG CTGTCCGGGA CCGAGCTGAT CGGCGACTGT GACCCGGCGG CCCTCGAGTC GGAACTCGGT CGACTGGTCG ACGGCCTCGA GCGCCGCGAG GGACTCTCCG AGGCGGATAT GGACGGTCTG GAATCCCAGC TCGAGGAGCT GGGGTATCTG TGA
|
Protein sequence | MNGHATMGDE PSIALVVLDT LRADSFDEHF DWLPGVQFTN AWSTSHWTAP AHASLFTGRY ASEAGVTIKS QDFDRDTTRL PELLRDRGYR TRAFSCNVNI SEQLGWHHGF DEFDGGWRLS GLGEDVFDWD EFIAEHRADG PERYLRALWR CVDGDCDTTQ SLKQGALMKL RDMGLKGRHP DDGASEFLEY VQKRSWTQDS EFLFANLMEA HLPYDPPDEY KTYPDEESPH FDSVKATLGE PSADPERIRT AYDDAVRYLS DIYRDIYGEL AAEFDYVVTL ADHGEALGEY GAWQHGGGLH PPVTKVPLVV SAPGPNADDR TPNAGSAERP TRDALVNLLD VYATVLDLAG IESAHRRGES FRPLCSSDPT DTEPHSSDTV ITEPRSSALL EFHGISKRRA LALEEDGYDI GPVDRERHGV ATADCYYFEG LSGTELIGDC DPAALESELG RLVDGLERRE GLSEADMDGL ESQLEELGYL
|
| |