Gene Information |
Locus tag | Nmag_1354 |
Symbol | |
ID | 8824187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1388691 |
End bp | 1390556 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003479495 |
Protein GI | 289581029 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
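The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the listed Gene Length) and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of the record.
length = gene_length(1388691, 1390556)
print(length)                  # 1866
print(protein_length(length))  # 621
```

Both results agree with the Gene Length (1866 bp) and Protein Length (621 aa) rows.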
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCCTTCC TCGCTGTGTC GCATAGATCG TTGGAGCGCT GCAATCCAAA TCGGTTGATC AACCAGTTAC GTTCGGCTGT CCGCGTTGAA TCGGCAACTA CTGTTCAGGG CGGGGAGGAT GTCAAGGCCC GTGTCGCGAC CAAACGCTTA CTCAGTTGCT CACTGTATCT CATCCGCATG ACCAACGACC GGCCGAACAT CGTTCTCGTC CACTGCCACG ACCTCGGGAC GTATCTGGGC TGTTACGGTG TCGACGTGGA GACACCCCAC ATCGACAGTC TTGCCACCGA CGGCATCCGG TTCGACCGGC ACTTCGTCAC CGCACCCCAG TGCTCGCCGA GCCGGGCGAG TCTGTTCACT GGCCGCCACC CCCACCAGAA CGGTATGCTC GGCCTCGCGC ACGCCGACTG GGAGCTCGGC CCCGACGAGC GCGTCCTCCC GGACCTGCTC TGCGATGCTG GCTACGAAAC CCACCTCTTC GGCCTCCAGC ACATCACCGA GTACCCCGAC CAGCTCGGCT ACGACCACAT TCACACCGAA CAGCCCCTGA CCGTCGAGGC GTCGCCGGCC GTCCACGAAA CCGCCCGCGC GAACGCCGTC GCAGACGAGT TCGCATCCGT ACTCGAGTCC GACGGTCTCG GCGACCCGTT CTTCGCCTCG GTCGGCTTCT TCGAACTCCA CCGCGTCGAG GAAAACGGTG GCTTCGGCTT CGAGGGCGAC CGGTACGACG CGCCGGCCCC CGAGGACGTG GCCCCACTCG AGTTCCTGCC GGACAGGCCC GGCATTCGGT CGGATATCGC CGAGATCAAC GGGATGTTGA ACGCGTTAGA CGAAGCGACG GGGACGGTAC TCGAGGCGCT CGACGAGGCA GGTGTCGCAG ACGAGACGCT GGTCGTCTTT ACGACCGAGC ACGGGCTGGC GATGCCGCGT GCGAAGGGGT GTTGCTTCGA TCCGGGGATC GAGGCGGCGT TGCTCATGCG CTATCCATCA CGAATTGACG GTGGCCAGAC GGTTGACGAT CTCATCAGCA ACGTCGACGT GTTCGCGACA CTCATCGCGG TTGCCGACGC ACCGGTGCCG GAGACGCAGC TTGCCGGGGA ACGGTTCACG CCGCTGTTGT TCGGTGACGC GGGCGACGAG GGCGACGAGG GCAACGCGGG TGACGCGGGT GACGCGGGCG ACGAGGGCAA CGCAGGTGAC ACGGATGACG CGGGCGACGA GGGCAACGCA GGTGACACGG ATGACGCGGG CGACGAGGGC AACGCAGGTG ACACGGATGA CGCGGGCGAC GAGGGCAACG CAGGTGACAC GGATGACGCG GGCGACGAGG GCAACGCAGG TGACGCGGAT GACACGGATG ACGCGGGCGA CGAGGGTGAC AAGCGCGAGA AGCGCGAGGA AAATAGCGAA GACAGTGACC GCGGCGACTA CGAGCCCCGT GACCGCATCT TCGCCGGCAT GACCTGGCAC GACCGCTACA ATCCGATGCG GGCGATCCGG ACGGAGCGCT GGAAGTACGT CCGCAACTTC TGGCACCTCC CCCACGTCTA CATGACGACG GATATCTACT GCAGCGCCGC CGGCCGGGAG ATGCGCGAGG AGTTCACCGG CGATCAGCGC GCCTACGAGG AACTGTACGA CCTCGAGGCC GACCCGCTCG AGCAGGAGAA TCTTCTACTG GCGGACACGC CCGATACAGT GGACACTCCC GACCAGCATG GGAGCCGGGA CGTTGACGAT GTCCGTACCC GACTTCGGGA CGACCTCGTC GACTGGATGA CCGAGACGGA CGATCCGCTA 
CTCGACGGCC CGGTCGTCCC CAGCGACTGG GAACGCATTC ATCCCGAAAT GGGCGACGAC CGCTAG
|
Protein sequence | MPFLAVSHRS LERCNPNRLI NQLRSAVRVE SATTVQGGED VKARVATKRL LSCSLYLIRM TNDRPNIVLV HCHDLGTYLG CYGVDVETPH IDSLATDGIR FDRHFVTAPQ CSPSRASLFT GRHPHQNGML GLAHADWELG PDERVLPDLL CDAGYETHLF GLQHITEYPD QLGYDHIHTE QPLTVEASPA VHETARANAV ADEFASVLES DGLGDPFFAS VGFFELHRVE ENGGFGFEGD RYDAPAPEDV APLEFLPDRP GIRSDIAEIN GMLNALDEAT GTVLEALDEA GVADETLVVF TTEHGLAMPR AKGCCFDPGI EAALLMRYPS RIDGGQTVDD LISNVDVFAT LIAVADAPVP ETQLAGERFT PLLFGDAGDE GDEGNAGDAG DAGDEGNAGD TDDAGDEGNA GDTDDAGDEG NAGDTDDAGD EGNAGDTDDA GDEGNAGDAD DTDDAGDEGD KREKREENSE DSDRGDYEPR DRIFAGMTWH DRYNPMRAIR TERWKYVRNF WHLPHVYMTT DIYCSAAGRE MREEFTGDQR AYEELYDLEA DPLEQENLLL ADTPDTVDTP DQHGSRDVDD VRTRLRDDLV DWMTETDDPL LDGPVVPSDW ERIHPEMGDD R
|
| |
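The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the blocks can be joined and the GC fraction computed as shown below. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not the GC content of the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, copied from the record.
fragment = clean_sequence("GTGCCCTTCC TCGCTGTGTC")
print(fragment)               # GTGCCCTTCCTCGCTGTGTC
print(gc_fraction(fragment))  # 0.65
```

Passing the full Gene sequence field through the same two functions would give the value to compare against the GC content row.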