Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3375 |
Symbol | |
ID | 8743995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3483604 |
End bp | 3485028 |
Gene Length | 1425 bp |
Protein Length | 474 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513957 |
Product | sulfatase |
Protein accession | YP_003404911 |
Protein GI | 284166632 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCCAC ACATCGTTCT CATCCACTGC CACGATCTGG GGAAGTACGT GGGCTGTTGC GGCGCCGGCG TCGAGACGCC ACGGATCGAC GGCCTCGCCG CGGCGGGCGT CCGGTTCGAT CGCCACTTCG TGACGGCCCC GCAGTGTTCG CCGAGTCGCT CGAGTCTGAT GACCGGCCGT CACCCCCACC AGAACGGGAT GCTCGGACTC GCCCACGGCA ACTGGGAGGT CGGCCCCCAC GAGCGATTCC TGCCGGAGTT ACTCGGCGAG GCCGGCTACG AGACGCACCG CTTCGGACTC CAGCACGTCA CCGAGTACCC CGAACGACTC GGCTACGATC TGACGCACAA CGAGGAGTCC CTGACGAGCG AAACCCCGAC GTCGGTCCAC GAGGGCGCCC GCGCACGCAC CGTCGCCGCC GACGTCGCGG GGTGGCTCGA GGCGGGCGAC CGCGACGATC CGTTCTTCGC CTCGGTCGGC TTCTTCGAAC TCCACCGCAT CGCGGTGGAC GGCGGGTTCA GCTTCGACGG CGAGCGGTAC GACGCCCCCG ATCCCAACGC GGTCGAGCCC CTCGAGTTCC TCCCCGATCG GCCCGGTATC CGGTCGGACA TCGCCGGAAT GAACGGGATG GTCCGCGCGA TCGACGACGG CGTCGGAACG ATCGTCGACG CCCTCGAGAA CGAGGGACTG GCCGAAGACA CCCTCCTGCT CTTCACGACC GAACACGGGC TGGCGATGCC GCGCGCGAAG GGCACCTGTT TCGACGCCGG CATCGAGGCG GCTCTGCTGA TGGCCCAACC GGGAACCCTC GCGTCGGGCC GGGTCGTCGA CGACCTCGTG AGCAACGTCG ACGTCTTCGC GACGCTGCTC GATATCGGGG ACGCGCCGGT CCCCGGCGTC GACACCGATG GGGACGACAT CGCGGGGCAG AGTTTCGCGC CGCAGTTGTT CGACGGCGGG AACGGCGGCG GCACGAACGG AGCCGCTGGC AAGGACGCCT ACAAGCCCCG CGACCGGGTC TTCTCGGGGA TGACCTGGCA CGATCGATAC AACCCGATCC GGGCCATCCG AACCGACCGC TGGAAGTATA TCCGTAATTT CTGGCACCTA CCCGCAGTCT ACATGACGAC GGACGTCTTC TGCAGCGCGG CGGGTCGGGA GGTCCACGAG GACTACTACG GCGTGCAGCG ACCCTACGAG GAACTATACG ACCTCGAGGC CGACCCGCTC GAGCGGGAGA ACCTCGCGGC GGGGGACGAC CCGGACGATC CGGCTACCGA GACCGTTCGC GACGAGCTTC GAACGGACCT GCTCGAGTGG ATGGACGCGA CCGGTGATCC GCTGCTTGAG GGCCCCGTGC TGCCGAACAA CTGGGAGACG GTCCACCCCC GGCTGGAGGA CGACCGCGAC GACATCCGGC GGTAA
|
Protein sequence | MPPHIVLIHC HDLGKYVGCC GAGVETPRID GLAAAGVRFD RHFVTAPQCS PSRSSLMTGR HPHQNGMLGL AHGNWEVGPH ERFLPELLGE AGYETHRFGL QHVTEYPERL GYDLTHNEES LTSETPTSVH EGARARTVAA DVAGWLEAGD RDDPFFASVG FFELHRIAVD GGFSFDGERY DAPDPNAVEP LEFLPDRPGI RSDIAGMNGM VRAIDDGVGT IVDALENEGL AEDTLLLFTT EHGLAMPRAK GTCFDAGIEA ALLMAQPGTL ASGRVVDDLV SNVDVFATLL DIGDAPVPGV DTDGDDIAGQ SFAPQLFDGG NGGGTNGAAG KDAYKPRDRV FSGMTWHDRY NPIRAIRTDR WKYIRNFWHL PAVYMTTDVF CSAAGREVHE DYYGVQRPYE ELYDLEADPL ERENLAAGDD PDDPATETVR DELRTDLLEW MDATGDPLLE GPVLPNNWET VHPRLEDDRD DIRR
|
| |