Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1070 |
Symbol | |
ID | 8383344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1045827 |
End bp | 1047218 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644972135 |
Product | sulfatase |
Protein accession | YP_003129986 |
Protein GI | 257052153 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACACT CTGATATAAT CTGGGTTACG CTTGATAGCG TACGGGCAGA CCACACCTCC CTCTCTCAGT CAGACCGGGC CAAAACGCCA CATCTCGAAT CCGTAGGCGA GATAGGAACC GCTTTCGGGG AGTGCCATTC ACACGATATC TGGACCCGGT CTTCCACTGC CTCCATATTG ACCGGACATC CACCATCAGC ACACAGGACC TGGTCCAATT CGGCAAAGCT TCCAGAACAG ATTACGACAA TCCCTGAATC CCTCAGCGAG CAAGGCTACC GAACGGCCTG TGTTACGTCC AATGGACAAA TCAGTCAAAG TACTGGGCTG GGTCGAGGCT TCGATGCATT CCACTTCATT AACAGGGACA CCGTCGTCCA GGAAGCCGGA CTGCGCTCAG TCATAAAGTG GTTAACCAAA ATTCGAAGCC ATTCGATTGG TCTCTCGACG GCGGCTAACG AGCATTGTGT CGGTTACCTC TCGACACAAA TCGCTAAACG CCACATTTCG AATTCCCAGG GAGCGGACCA CCCGCTCTTC CTCTATACCC ACCTGAATGA TTCCCACCAC CCATACATTC CGCCAGGAAT GTGGGAAAGT GCGGTTTCGG AGGATCTCTC TATCTCGGTG GAACGAGCCG AGGAAATCGC TCGAGACATG AGCAACAACC TGCACGAACG GATCGCCGAC GATGATCCGT ATTCGGAGAC GGAATGGGAG GTACTGAGGG TTTTGTACGA CGCCTTGGTC GAGTACGTGG ATCACTTGAC CGGGGAGATT ATCTCAACTG CTCGGGAACA GTTACAAGAC CCCATCATTG TCGTGACCGG CGATCACGGA GAATTCTTCG GCTATCGAGG GTTGCTCGCC CACATGCTCG AACCCAGTAC GCAAGTATCG AACGTGCCTT TGGCCGTCGA AGGAATTCGG GGCCTCGAGG ACACCGGATT AGTTCAACAC AGCGACGTCA TGAAGACGAT CCTCGAGGAT GTAGGGGTCG ACCACTCAGT GCCAGCTGGT GTAGATATTC GTGATACACC TCGGGAGTAC GCATTCGTGC AACGTGGACA GGACAGGGCG GAGCAAAAGC TTGAGCAAAT AAAACAACAC AACAGCGACT TCTCTGATTC TCACTATCCG AAGGCTACAG TGACGTCGAT GATCACTCCG ACGTACCGAT TCGAAGTTTC TGAGGAAAGT GCCCGCCTCT ACGATCTCCC CGATGAATCG AACGATGTAA GCGACGAGTA TGCCGACCTG GTAGCCGAGT TTAGGGAGCG GTACGAAGAG TGGAAGACAG ATGTCGGAGA ACCGGTCGGA GACGTCCGAA AAGCCGAGTT CGACGAACAA ATGAAAAAAC AACTTCGAGG ACTGGGATAT CTCCAAGAGT GA
|
Protein sequence | MKHSDIIWVT LDSVRADHTS LSQSDRAKTP HLESVGEIGT AFGECHSHDI WTRSSTASIL TGHPPSAHRT WSNSAKLPEQ ITTIPESLSE QGYRTACVTS NGQISQSTGL GRGFDAFHFI NRDTVVQEAG LRSVIKWLTK IRSHSIGLST AANEHCVGYL STQIAKRHIS NSQGADHPLF LYTHLNDSHH PYIPPGMWES AVSEDLSISV ERAEEIARDM SNNLHERIAD DDPYSETEWE VLRVLYDALV EYVDHLTGEI ISTAREQLQD PIIVVTGDHG EFFGYRGLLA HMLEPSTQVS NVPLAVEGIR GLEDTGLVQH SDVMKTILED VGVDHSVPAG VDIRDTPREY AFVQRGQDRA EQKLEQIKQH NSDFSDSHYP KATVTSMITP TYRFEVSEES ARLYDLPDES NDVSDEYADL VAEFRERYEE WKTDVGEPVG DVRKAEFDEQ MKKQLRGLGY LQE
|
| |