Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_1529 |
Symbol | |
ID | 8824363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1558382 |
End bp | 1559881 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003479667 |
Protein GI | 289581201 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.856317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGACG CCGACACCCG ACCGAACGTC CTGCTCGTGC TCACGGATCA GGAACGCTAC GACTGCAGCG CTCTCGACGG GCCGGTCGCC GAAACGGTCG AAACGGAGAC GATAGACCAC CTCTCGGCGA CCGGTACGCA CTTCGAGCGC GCGTTCACTC CGATCAGCAT CTGCTCGAGC GCTCGCGCAT CGCTCCTGAC CGGCCAGTTC CCCCACGGTC ACGGCATGCT GAACAACTGC CACGAGGACG ACGCCCTCCA ACCTAACCTG CCACCCGGCG TCCCGACGTT CTCGGAGAAA CTCGACGACG CTGGCTACCA CCTGACCTAC ACCGGGAAGT GGCACGTCGG CCGCGATCAA ACTCCCGAGG ACTTCGGGTT CTCCTATCTC GGCGGGAGCG ACAAACACCA CGACGACATC GACGACGCGT TCCGCGAGTA CCGCGCCGAA CGCGGGACGC CCGTCGGTGA GGCCGATCTC GACGATGTCA TCTACACCGG CACCAATCCG CGGGACGACA GCAACGGAAC GTTCGTCGCC GCCACAACAT CGGTCGAGGT CGAGGAGACG CGCGCCTGGT TCCTGGCCGA GCGTACTATC GACGCAATCG AGGAACACGC GAGCCGCGAC CGCGACGCTC CATTTTTCCA CCGAGCGGAC TTCTACGGCC CACACCACCC CTACGTCGTC CCCGAACCCT ACGCCTCGAT GTACGACCCC GAGAACATTG ATCTTCCCGA GAGCTACGCC GAAACCGACG CCGGGAAACC CCGAGTCCAC GCGAACTACC GCTCCTACCG CGGTGTCGAA CAGTTCGACC GAGACGTCTG GAAAGAGGCC ATCGCGAAGT ACTGGGGTTT CGTCACCCTG ATCGACGACC AGTTCGGCCG GATTCTGGAT GCACTCGAGT CCACCGGCCT CACGGACGAG ACGGTGGTCG TCCACGCCTC GGATCACGGC GATTTCGCTG GAGGGCACCG CCAGTTCAAC AAGGGGCCGC TGATGTACGA CGATACGTAT CACATCCCGC TGCAGGTGCG CTGGCCGGGC GTCACGGAGC CCGGATCGGT TCGCGAGGAA CCGGTTCACC TCCACGATCT GGCAGCGACG TTCCTCGAGA TGGGTGGCGT GGCGATCCCC GAGAGTTTTG ATTCGCGGAG CCTCGTGCCG TTGCTGGACG CTGACGGTCC GGAACAGGAA TCAGCACCGT CGGCATGGCC TGATTCCGTC TTCGCCCAGT ACCACGGCGA CGAGTTCGGG CTCTACACCC AGCGAATGGT CCGAACGGAC CGGTACAAGT ACGTCTACAA CGCGCCGGAC GTAGACGAGT TGTACGACCT CGAAGCGGAT CCGGCGGAGT TGCAGAATTT GATCGACCAC CCCGACTACG CCGACGTTCG TCGAGAGCTC CGAACCAGGC TCATCGACTG GATGGAGGAG ACCGATGATC CAAATCGGCA GTGGGTGCCG GACGTGCTGC GGGCGGCAGA GGAGTCGTGA
|
Protein sequence | MVDADTRPNV LLVLTDQERY DCSALDGPVA ETVETETIDH LSATGTHFER AFTPISICSS ARASLLTGQF PHGHGMLNNC HEDDALQPNL PPGVPTFSEK LDDAGYHLTY TGKWHVGRDQ TPEDFGFSYL GGSDKHHDDI DDAFREYRAE RGTPVGEADL DDVIYTGTNP RDDSNGTFVA ATTSVEVEET RAWFLAERTI DAIEEHASRD RDAPFFHRAD FYGPHHPYVV PEPYASMYDP ENIDLPESYA ETDAGKPRVH ANYRSYRGVE QFDRDVWKEA IAKYWGFVTL IDDQFGRILD ALESTGLTDE TVVVHASDHG DFAGGHRQFN KGPLMYDDTY HIPLQVRWPG VTEPGSVREE PVHLHDLAAT FLEMGGVAIP ESFDSRSLVP LLDADGPEQE SAPSAWPDSV FAQYHGDEFG LYTQRMVRTD RYKYVYNAPD VDELYDLEAD PAELQNLIDH PDYADVRREL RTRLIDWMEE TDDPNRQWVP DVLRAAEES
|
| |