Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1597 |
Symbol | |
ID | 7399546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1614554 |
End bp | 1616122 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643708664 |
Product | sulfatase |
Protein accession | YP_002566253 |
Protein GI | 222480016 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00665248 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTATCGG ACACGTCGAC CTCGGACACA GCGGTATCGA ACCCAACGTC GTCGGATTCA ACCGATTCAG ATTCAACCGA TTCAGATTCA ACCGGTTCGG ATTCGACGGA CTCGACCTCA ACATCGGACG GAGCGGTATC AAACGTCCTC CTCGTCACGA TCGATTCGCT CCGGGCGGAC GCGATCGGTC CCTACGACAA CGATCGATAT TCCCCGGTGC TCTCGGATCT CGCCGCCGAC GGAACCGTCT TCGATCGGTC GTTCGCGACC GGCAACTGGA CGCCCTTCTC GTTCCCCTCG ATCCTCGCCT CCGAGCCCGT CTTCGCCCGA AACGGCGACA TCGGTGTGAC GGGCGCTCGC ACGCTCGCGT CGGTGCTCTC CGAGGCCGGA ATCGCGACCG GCGGCTTCAA CGCCGCCAAC GGCTTCCTCA CCTCTCACTG GGGGTATCCC GAGGGGTTCG ACGAGTTCGA GCCGTTCGTC ACGAGCGTGG GATCGAGCCG GTACAGTCGG TACCTCGCGG CCCACCCGAC GGTCGAGGCG TGGATCCAAC TCGCCACGTC GCCGTTCCGC CGTCTCGGCT CCAAGCTCCG GGGCGAGAGC GACGATCGTC CCTTCCTCGA CGCCTCGCGG ATGTTCGACG TCGAGGACTC CGCGACCGAG TTTGTCGACG ACACCGACGA GCCGTTCTTC CTGTGGGTCC ACTACATGGA CACCCACACC CCGTACGTCC CCGCCCCGCG GTACATCCGC GAGGTCTCCG ACGGGCTGAT CGGCACCCAC CGGATGCTCC ACGCCCACAC GCGCACGAGC CTCGGCTGGG AGGTCGGCGA GCGGACCCTC GGCGACCTCC GAACCCTCTA CCAGGCCACG GTGCGACAGG TCGACGCCAG CGTCGGGCGC CTGCTCGACA CGCTTGAGGC GGCCGGGATC GCCGACGAGA CCGCGATCGT CGTCGCCGGC GACCACGGCG AGGAGTTCCA GGAACACGGC CACCTCGCGC ACTACCCGAA GCTGTACGAC GAGCTGATCC ACGTGCCGCT CATCGTGAAC GTCCCCGGCG AGGACGGCGG TCGCCGCGTG TCCGAACACG TCGGGCTCGA CGCGATTCCG CCGACCGTCG CCGACCTGCT CGACGTCGAA TCGCCGCCGG AGTGGCGCGG CGAATCCCTC GAACCGGCGG TCAGTGGCGG CGAGTCGCCG GATCAGGAGC CCGTCGTCTC GGTCACCGTT CGGGGAGAGG AGGTGACCGA ACAGCCGATC CCGCGATCGC TTTCCGACGG CGACCTCCTC GTGAGCGTCC GCGACGCCGA GTGGACGTAC ATCGAGAACG CGGACACGGC GGAGACGGAG CTGTACCACC GACCCTCGGA CCCGACTCAG CAGGAGGATC TGTCGGCGGA CCCGTCCGAC GAGGCGCTCG CGGTCGTCGA GCGGTTCGCG CCGATCGTCG CGGACCACGT CGCCGAACTT CGCGACAGAC AGACGGACGC GGAGGCGGCC GACGACGGCG AGGACGAGGA GGTCGACGAG CACCTCGAGG CCCGCCTCGA AGCGCTCGGC TATCGGTGA
|
Protein sequence | MVSDTSTSDT AVSNPTSSDS TDSDSTDSDS TGSDSTDSTS TSDGAVSNVL LVTIDSLRAD AIGPYDNDRY SPVLSDLAAD GTVFDRSFAT GNWTPFSFPS ILASEPVFAR NGDIGVTGAR TLASVLSEAG IATGGFNAAN GFLTSHWGYP EGFDEFEPFV TSVGSSRYSR YLAAHPTVEA WIQLATSPFR RLGSKLRGES DDRPFLDASR MFDVEDSATE FVDDTDEPFF LWVHYMDTHT PYVPAPRYIR EVSDGLIGTH RMLHAHTRTS LGWEVGERTL GDLRTLYQAT VRQVDASVGR LLDTLEAAGI ADETAIVVAG DHGEEFQEHG HLAHYPKLYD ELIHVPLIVN VPGEDGGRRV SEHVGLDAIP PTVADLLDVE SPPEWRGESL EPAVSGGESP DQEPVVSVTV RGEEVTEQPI PRSLSDGDLL VSVRDAEWTY IENADTAETE LYHRPSDPTQ QEDLSADPSD EALAVVERFA PIVADHVAEL RDRQTDAEAA DDGEDEEVDE HLEARLEALG YR
|
| |