Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2135 |
Symbol | |
ID | 8384429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2178962 |
End bp | 2180251 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644973204 |
Product | sulfatase |
Protein accession | YP_003131035 |
Protein GI | 257053202 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATG TTGTATTACT CACAATAGAT GCCCTTCGCG CGGATCATCT CTCCTGTTAT GGGTATGATC GGACCACCAC CCCATTTTTA GACAGCTTTG CTGAACAGTC TATATTGTTC GAAAACACCT ATTCTACGAG CTCTCACACC AGGGAGGCGA TGGCATCGTT GCTTTCAGGC ACCTATCCTG ACGAGGCAAT CGAAGACGAT TATTCGATCT CCGCAGAGAC CGTGGCCAGC CATCTTGCTG ACACACATTT GTGTGGTGGA TTTCACTCGA ACCCGTATCT CTCACGAGCC TTCGGATACG ACCGAGATTT TGAGGCTTTC GACGATGATC TTCGACTGGG ACAAAATCGA ATTCTTGGAC TCATTCAACG TGCTCTCGAT AAGTTTGTTT TCAACAGGGG GTCGTATCAT GCACGGGCTT CAGAAATCAA CGAACGATCA CTGAACTGGT TGGATTCGAT TCAGGACGAA GAGCCGTATT TTCTCTGGAA TCATTATATG GACGTACACG GCCCCTATAA TCCACCGACA GACTATAACA CGTGGTCGGA TCCCATCTCT GACAGCGACG CTCAACAGTT ATATGACGCT CTCTCCGGCG GTGAAGCCGT TTCTGAGGAT GACGTAGGCA GAGCCCTGAA TCTCTACGAT GGTGAAATAT TGTATACAGA TGCCCTCATT GAAGAGTTCA TTACCGAACT AGACCGGAGA GATCATCTCG AGGATACGTT GGTACTGATC ACTGCAGACC ACGGGGATCT GTTCGGCGAA TACGATTCGT TCGCCCATCC TCGGTACGTA TATCCCGAGC TAACACGGGT GCCGCTGTTG GTGCGGACGC CGGAGACACA GACTGGGCGC GTCCAGGGGG CCTGTTCGAC GGTTGACATT CTTCCGACGA TACTCGACTG GATTGGTCAG TCAAACGAAC AGCTCGCGGG TCGATCTCTC TTCGACGATG TCGAATCGGA TCGGGTTGTC TACAGTACTG CGAGAGGGGA AGATGACAAC CAGCACTGTC GCCGGATCGC TGCAAACCAG CGTGGCCATT CGCATCTGCT GGAATATGAC ACGCAGAGGG AGACCATCCA GCACGAACGA CGCATCGTGA GTAACCCGGA ACAATCGTCT GCAAACAACT GCGACATTGA TTGGGGCATG TTGCGGGATA ACGCACGTTC GTTCGGTCAG GTCCATGCAG AGTCGAAATC CGCTCCGGGG AGTGGGGCGG AAGTTGAGGA AGAAATAGAA CGACGTCTCG ACGCCCTCGG TTACAAGTAA
|
Protein sequence | MKNVVLLTID ALRADHLSCY GYDRTTTPFL DSFAEQSILF ENTYSTSSHT REAMASLLSG TYPDEAIEDD YSISAETVAS HLADTHLCGG FHSNPYLSRA FGYDRDFEAF DDDLRLGQNR ILGLIQRALD KFVFNRGSYH ARASEINERS LNWLDSIQDE EPYFLWNHYM DVHGPYNPPT DYNTWSDPIS DSDAQQLYDA LSGGEAVSED DVGRALNLYD GEILYTDALI EEFITELDRR DHLEDTLVLI TADHGDLFGE YDSFAHPRYV YPELTRVPLL VRTPETQTGR VQGACSTVDI LPTILDWIGQ SNEQLAGRSL FDDVESDRVV YSTARGEDDN QHCRRIAANQ RGHSHLLEYD TQRETIQHER RIVSNPEQSS ANNCDIDWGM LRDNARSFGQ VHAESKSAPG SGAEVEEEIE RRLDALGYK
|
| |