Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_0237 |
Symbol | |
ID | 5709156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 273001 |
End bp | 274344 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641274739 |
Product | sulfatase |
Protein accession | YP_001540075 |
Protein GI | 159040823 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.813396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAATG ATCATTTCAA CGTTATCCTA ATAGCTATAG ATACTTTGAG GGCTGATAGG GTTGGTTGCC TCGGCTCAGG TTACCCAACT ATGCCTAATG TTGACTCCCT CTGCCGTGAC TCCGTGGCGT TTACGAATCA TTACGCTGAA GCCATACCCA CTCACCCATC CTTCACAACA ATATTTACTG GAACAACACC ATTAATACAC AGTATAGTTA GTCATGGTGG TAAGGTTCAA TTAAGTGGAA GCATATTAAC TCTACCACAG ATACTTAATG AAGCCGGCTA CTTAACTATA GCTGTTGATA ACTTAGCCAC ACACATGTAT GCTGGATGGT TCGCTAGGGG CTACAGGTAC TACATTGATA TTGGTGGTTT AGTGGTGATT TCTAACGGTA TTAAGGTTAA TGGTGAGTAT GTTAATGCTA AGGTTAAGGA TGCCTTCGAA TTAATTAATA GGTATAAGAA TGAACCCTTC TTACTCTTTA TACATTACTG GGATCCCCAC GCCCCATATA TACCACCTAA GCCTTATGCT GAGAAGTTCT ATCATGGGGA TTACAGTAAG GGTGATTTGG TTAGTAGGCT TAACTCAACA GCTTGGGGTA GATTACTACT TAAGGATAGT TGGATAAGGG ATTTAATTAA TTCCGGCGTT AATGACCCTG ATTACATTAG GGCTCTTTAC GATAGTGAGG CTGCTTACGT TGATGAGAGG ATTGGTGAAT TAATGAGCAT TATTAATAAT ACTGGCTTAC TTGAGGATAC ATTAATAGTA TTAACATCAG ATCATGGTGA GGGTCTTGGT GAACACAACG TATACTACGA GCACCACGGC TTATATGAGT GGGATGTTAA GACACCATTA ATCATTAGGC TACCTGATAA GTTAATTGAT GAGGTTGGGC GTGGTAAAGC CAAGGGTGTT AAGTATGATG CATTTGTTCA AAACACCGAC ATAACACCAA CAATATTAGA TTCACTAGGT TTAAAGATTC CCGAATACAT GACTGGCTTA AGCCTACTTA AGGTTATTAG GGGTGAGTCT AAGGGTCATG ATGCAGTGTT CAGTCTTGAG AATACTAGGC AGACTGCTAG GATGATTAGG GTTGGTGAAT GGAAGCTAAT ACAGTGGATT AGGAATGATA CATACGGTAG GAGGAGTGGC CACGTTGAAT TATATAATTT AGCTAAAGAC CCAACTGAAT CAAGGAACAT GGCAACCGAG GAAGGTGAAA TTACATTAAG GCTACTTGGA TTAATTGAGA GGAGGTATAG GGAGGTGGCT GGAGCCAATG ACCCATTAAT ACTGCAGGAA ATAAGCGTAC CAATAAAACC ATGA
|
Protein sequence | MVNDHFNVIL IAIDTLRADR VGCLGSGYPT MPNVDSLCRD SVAFTNHYAE AIPTHPSFTT IFTGTTPLIH SIVSHGGKVQ LSGSILTLPQ ILNEAGYLTI AVDNLATHMY AGWFARGYRY YIDIGGLVVI SNGIKVNGEY VNAKVKDAFE LINRYKNEPF LLFIHYWDPH APYIPPKPYA EKFYHGDYSK GDLVSRLNST AWGRLLLKDS WIRDLINSGV NDPDYIRALY DSEAAYVDER IGELMSIINN TGLLEDTLIV LTSDHGEGLG EHNVYYEHHG LYEWDVKTPL IIRLPDKLID EVGRGKAKGV KYDAFVQNTD ITPTILDSLG LKIPEYMTGL SLLKVIRGES KGHDAVFSLE NTRQTARMIR VGEWKLIQWI RNDTYGRRSG HVELYNLAKD PTESRNMATE EGEITLRLLG LIERRYREVA GANDPLILQE ISVPIKP
|
| |