Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3519 |
Symbol | |
ID | 8744139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3623017 |
End bp | 3624654 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646514100 |
Product | sulfatase |
Protein accession | YP_003405054 |
Protein GI | 284166775 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACG CTCAGCAATC CAACGTCCTC TTTGTCGTGC TGGACACGGT CCGGAAGGAC CGACTGGGTC CGTACGGCTA CGAACGGGAA ACGACGCCCG AACTCTCCGC GTTCGCCGAG GAGGCGACCG TCTTCGAGTC GGCCGTCGCG CCCGCGCCGT GGACGCTGCC GGTCCACGCC TCGCTGTTTA CCGGCCGGTA TCCGAGCCAG CACGGGGCCG ATCAGGGGAG TCCGTACCTC GAAGGCGACG CCACCCTCGC GACGGCCCTC TCGGCGGCCG GCTACGACAC GGCGTGTTAC TCCTCGAACG CCTGGATCAC CCCCTACACC GGCCTCACCG AGGGGTTCGA CGCGCAGGAC TCGTTCTTCG AGGTCCTCCC CGGCGACGTC CTCTCGGGGC CGCTGGCCAG CGCCTGGCAG ACCGTCAACG ACAACGACTA CCTCCGCGAT CTGGCGTCGA AACTCGTCAG ACTCGGCGCG ATGGCCCACG AGAAACTCGC CAGCGGCGAG GGCGCCGACA CGAAGACGCC GTCGGTCATC GACCGGACGA AGTCCTTTAT CGACGACAGC GAGAGCGACG AGGGCTGGTT CGCGTTCGTC AACCTGATGG ACGCCCACCT GCCCTACTAC CCGCCCGAGG AGTATCGCGA GGAGTTCGCT CCCGGCGTCG ACCCCAGCGA GGTCTGCCAG AACTCGAAGG AGTACAACTC GGGCGCGCGC GACATCGACG ACGAGGAGTG GGACGACATC CGGAGCCTGT ACGACGCCGA GATCGCCCAC ATGGACGCCG AACTCGGCCG CTTGTTCGAC TGGCTGCGCG AGACCGGCCA GTGGGAGGAG ACGACCGTCG TCGTCTGCGC CGATCACGGC GAACTCCACG GCGAACACGA CCTCTACGGC CACGAGTTCG CCCTCTACGA CCAGTTGATC AACGTCCCGC TGCTGGTCAA ACACCCCGCC CTCGAGGCCG ACCGGCGCGA CGACCTCGTC GAGTTGCTCG ACTGCTATCA CACGGTCCTC GAGGCGCTGG ACGTCGATCC CGACGACGCG CTCGCGCCGG CGGACGACGA CATCTCCGTC ACCGGTCGCG ATCCGACGCG GTCGCTCCTG TCCGACGAGT ACCGCGCCTT CGAGGGGGTC TCGGAGCCGG ATCCCGGCCA GCAGGCGGTG CTCGACGCCG AGGGTGGCGA GGCGTCCGAC GACAGCGAGG GACGAAGTCC TTCGAGCGGC CGAACGCAGT CCAATGACGA CTACGCGTTC GTCGAGTACG CCCAGCCGGT GATCGAACTC CATCACTTAG AAGAGAAGGC CAGCGAGGCA GGGATCGAAC TGCCCGACGA TCACCGGGCC TACTCCCGCC TGCGCGCGGC CCGCAGCACC GACGCGAAGT ACGCCCGGGC CGACCGCATC CCCGACGAGG GCTACCGCCT CGACGAGGAT CCCGCGGAAT CGACGCCGGT CGATCCGGTC GACGACGGGG TCGTCGCCGA CACTGAACGC GCGCTCGCCC GCTTCGAGCA GGCCGCCGGC GGCGCGTGGA TCGATCCCAG CGAGACGGAC GCCGAGGACG CCGACGCGTT AGCCGAGGCC GACGAGGAGA CTCGCGACCG CCTGCGCGAA CTCGGCTACC TCGAGTAA
|
Protein sequence | MDDAQQSNVL FVVLDTVRKD RLGPYGYERE TTPELSAFAE EATVFESAVA PAPWTLPVHA SLFTGRYPSQ HGADQGSPYL EGDATLATAL SAAGYDTACY SSNAWITPYT GLTEGFDAQD SFFEVLPGDV LSGPLASAWQ TVNDNDYLRD LASKLVRLGA MAHEKLASGE GADTKTPSVI DRTKSFIDDS ESDEGWFAFV NLMDAHLPYY PPEEYREEFA PGVDPSEVCQ NSKEYNSGAR DIDDEEWDDI RSLYDAEIAH MDAELGRLFD WLRETGQWEE TTVVVCADHG ELHGEHDLYG HEFALYDQLI NVPLLVKHPA LEADRRDDLV ELLDCYHTVL EALDVDPDDA LAPADDDISV TGRDPTRSLL SDEYRAFEGV SEPDPGQQAV LDAEGGEASD DSEGRSPSSG RTQSNDDYAF VEYAQPVIEL HHLEEKASEA GIELPDDHRA YSRLRAARST DAKYARADRI PDEGYRLDED PAESTPVDPV DDGVVADTER ALARFEQAAG GAWIDPSETD AEDADALAEA DEETRDRLRE LGYLE
|
| |