Gene Nmag_1529 details


Gene Information

Locus tag: Nmag_1529
Symbol:
ID: 8824363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1558382
End bp: 1559881
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: sulfatase
Protein accession: YP_003479667
Protein GI: 289581201
COG category:
COG ID:
TIGRFAM ID:
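
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1558382
END_BP = 1559881


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1500 499
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.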


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.856317
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGACG CCGACACCCG ACCGAACGTC CTGCTCGTGC TCACGGATCA GGAACGCTAC 
GACTGCAGCG CTCTCGACGG GCCGGTCGCC GAAACGGTCG AAACGGAGAC GATAGACCAC
CTCTCGGCGA CCGGTACGCA CTTCGAGCGC GCGTTCACTC CGATCAGCAT CTGCTCGAGC
GCTCGCGCAT CGCTCCTGAC CGGCCAGTTC CCCCACGGTC ACGGCATGCT GAACAACTGC
CACGAGGACG ACGCCCTCCA ACCTAACCTG CCACCCGGCG TCCCGACGTT CTCGGAGAAA
CTCGACGACG CTGGCTACCA CCTGACCTAC ACCGGGAAGT GGCACGTCGG CCGCGATCAA
ACTCCCGAGG ACTTCGGGTT CTCCTATCTC GGCGGGAGCG ACAAACACCA CGACGACATC
GACGACGCGT TCCGCGAGTA CCGCGCCGAA CGCGGGACGC CCGTCGGTGA GGCCGATCTC
GACGATGTCA TCTACACCGG CACCAATCCG CGGGACGACA GCAACGGAAC GTTCGTCGCC
GCCACAACAT CGGTCGAGGT CGAGGAGACG CGCGCCTGGT TCCTGGCCGA GCGTACTATC
GACGCAATCG AGGAACACGC GAGCCGCGAC CGCGACGCTC CATTTTTCCA CCGAGCGGAC
TTCTACGGCC CACACCACCC CTACGTCGTC CCCGAACCCT ACGCCTCGAT GTACGACCCC
GAGAACATTG ATCTTCCCGA GAGCTACGCC GAAACCGACG CCGGGAAACC CCGAGTCCAC
GCGAACTACC GCTCCTACCG CGGTGTCGAA CAGTTCGACC GAGACGTCTG GAAAGAGGCC
ATCGCGAAGT ACTGGGGTTT CGTCACCCTG ATCGACGACC AGTTCGGCCG GATTCTGGAT
GCACTCGAGT CCACCGGCCT CACGGACGAG ACGGTGGTCG TCCACGCCTC GGATCACGGC
GATTTCGCTG GAGGGCACCG CCAGTTCAAC AAGGGGCCGC TGATGTACGA CGATACGTAT
CACATCCCGC TGCAGGTGCG CTGGCCGGGC GTCACGGAGC CCGGATCGGT TCGCGAGGAA
CCGGTTCACC TCCACGATCT GGCAGCGACG TTCCTCGAGA TGGGTGGCGT GGCGATCCCC
GAGAGTTTTG ATTCGCGGAG CCTCGTGCCG TTGCTGGACG CTGACGGTCC GGAACAGGAA
TCAGCACCGT CGGCATGGCC TGATTCCGTC TTCGCCCAGT ACCACGGCGA CGAGTTCGGG
CTCTACACCC AGCGAATGGT CCGAACGGAC CGGTACAAGT ACGTCTACAA CGCGCCGGAC
GTAGACGAGT TGTACGACCT CGAAGCGGAT CCGGCGGAGT TGCAGAATTT GATCGACCAC
CCCGACTACG CCGACGTTCG TCGAGAGCTC CGAACCAGGC TCATCGACTG GATGGAGGAG
ACCGATGATC CAAATCGGCA GTGGGTGCCG GACGTGCTGC GGGCGGCAGA GGAGTCGTGA
 
Protein sequence
MVDADTRPNV LLVLTDQERY DCSALDGPVA ETVETETIDH LSATGTHFER AFTPISICSS 
ARASLLTGQF PHGHGMLNNC HEDDALQPNL PPGVPTFSEK LDDAGYHLTY TGKWHVGRDQ
TPEDFGFSYL GGSDKHHDDI DDAFREYRAE RGTPVGEADL DDVIYTGTNP RDDSNGTFVA
ATTSVEVEET RAWFLAERTI DAIEEHASRD RDAPFFHRAD FYGPHHPYVV PEPYASMYDP
ENIDLPESYA ETDAGKPRVH ANYRSYRGVE QFDRDVWKEA IAKYWGFVTL IDDQFGRILD
ALESTGLTDE TVVVHASDHG DFAGGHRQFN KGPLMYDDTY HIPLQVRWPG VTEPGSVREE
PVHLHDLAAT FLEMGGVAIP ESFDSRSLVP LLDADGPEQE SAPSAWPDSV FAQYHGDEFG
LYTQRMVRTD RYKYVYNAPD VDELYDLEAD PAELQNLIDH PDYADVRREL RTRLIDWMEE
TDDPNRQWVP DVLRAAEES
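
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not part of the original record. It assumes that the codon assignments of translation table 11 match those of the standard code for elongation codons, and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in NCBI ordering (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence in 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGGTCGACG CCGACACCCG ACCGAACGTC"
print(translate(first_blocks))  # MVDADTRPNV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.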