Gene Nmag_1354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1354 
Symbol 
ID8824187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1388691 
End bp1390556 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content65% 
IMG OID 
Productsulfatase 
Protein accessionYP_003479495 
Protein GI289581029 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCTTCC TCGCTGTGTC GCATAGATCG TTGGAGCGCT GCAATCCAAA TCGGTTGATC 
AACCAGTTAC GTTCGGCTGT CCGCGTTGAA TCGGCAACTA CTGTTCAGGG CGGGGAGGAT
GTCAAGGCCC GTGTCGCGAC CAAACGCTTA CTCAGTTGCT CACTGTATCT CATCCGCATG
ACCAACGACC GGCCGAACAT CGTTCTCGTC CACTGCCACG ACCTCGGGAC GTATCTGGGC
TGTTACGGTG TCGACGTGGA GACACCCCAC ATCGACAGTC TTGCCACCGA CGGCATCCGG
TTCGACCGGC ACTTCGTCAC CGCACCCCAG TGCTCGCCGA GCCGGGCGAG TCTGTTCACT
GGCCGCCACC CCCACCAGAA CGGTATGCTC GGCCTCGCGC ACGCCGACTG GGAGCTCGGC
CCCGACGAGC GCGTCCTCCC GGACCTGCTC TGCGATGCTG GCTACGAAAC CCACCTCTTC
GGCCTCCAGC ACATCACCGA GTACCCCGAC CAGCTCGGCT ACGACCACAT TCACACCGAA
CAGCCCCTGA CCGTCGAGGC GTCGCCGGCC GTCCACGAAA CCGCCCGCGC GAACGCCGTC
GCAGACGAGT TCGCATCCGT ACTCGAGTCC GACGGTCTCG GCGACCCGTT CTTCGCCTCG
GTCGGCTTCT TCGAACTCCA CCGCGTCGAG GAAAACGGTG GCTTCGGCTT CGAGGGCGAC
CGGTACGACG CGCCGGCCCC CGAGGACGTG GCCCCACTCG AGTTCCTGCC GGACAGGCCC
GGCATTCGGT CGGATATCGC CGAGATCAAC GGGATGTTGA ACGCGTTAGA CGAAGCGACG
GGGACGGTAC TCGAGGCGCT CGACGAGGCA GGTGTCGCAG ACGAGACGCT GGTCGTCTTT
ACGACCGAGC ACGGGCTGGC GATGCCGCGT GCGAAGGGGT GTTGCTTCGA TCCGGGGATC
GAGGCGGCGT TGCTCATGCG CTATCCATCA CGAATTGACG GTGGCCAGAC GGTTGACGAT
CTCATCAGCA ACGTCGACGT GTTCGCGACA CTCATCGCGG TTGCCGACGC ACCGGTGCCG
GAGACGCAGC TTGCCGGGGA ACGGTTCACG CCGCTGTTGT TCGGTGACGC GGGCGACGAG
GGCGACGAGG GCAACGCGGG TGACGCGGGT GACGCGGGCG ACGAGGGCAA CGCAGGTGAC
ACGGATGACG CGGGCGACGA GGGCAACGCA GGTGACACGG ATGACGCGGG CGACGAGGGC
AACGCAGGTG ACACGGATGA CGCGGGCGAC GAGGGCAACG CAGGTGACAC GGATGACGCG
GGCGACGAGG GCAACGCAGG TGACGCGGAT GACACGGATG ACGCGGGCGA CGAGGGTGAC
AAGCGCGAGA AGCGCGAGGA AAATAGCGAA GACAGTGACC GCGGCGACTA CGAGCCCCGT
GACCGCATCT TCGCCGGCAT GACCTGGCAC GACCGCTACA ATCCGATGCG GGCGATCCGG
ACGGAGCGCT GGAAGTACGT CCGCAACTTC TGGCACCTCC CCCACGTCTA CATGACGACG
GATATCTACT GCAGCGCCGC CGGCCGGGAG ATGCGCGAGG AGTTCACCGG CGATCAGCGC
GCCTACGAGG AACTGTACGA CCTCGAGGCC GACCCGCTCG AGCAGGAGAA TCTTCTACTG
GCGGACACGC CCGATACAGT GGACACTCCC GACCAGCATG GGAGCCGGGA CGTTGACGAT
GTCCGTACCC GACTTCGGGA CGACCTCGTC GACTGGATGA CCGAGACGGA CGATCCGCTA
CTCGACGGCC CGGTCGTCCC CAGCGACTGG GAACGCATTC ATCCCGAAAT GGGCGACGAC
CGCTAG
 
Protein sequence
MPFLAVSHRS LERCNPNRLI NQLRSAVRVE SATTVQGGED VKARVATKRL LSCSLYLIRM 
TNDRPNIVLV HCHDLGTYLG CYGVDVETPH IDSLATDGIR FDRHFVTAPQ CSPSRASLFT
GRHPHQNGML GLAHADWELG PDERVLPDLL CDAGYETHLF GLQHITEYPD QLGYDHIHTE
QPLTVEASPA VHETARANAV ADEFASVLES DGLGDPFFAS VGFFELHRVE ENGGFGFEGD
RYDAPAPEDV APLEFLPDRP GIRSDIAEIN GMLNALDEAT GTVLEALDEA GVADETLVVF
TTEHGLAMPR AKGCCFDPGI EAALLMRYPS RIDGGQTVDD LISNVDVFAT LIAVADAPVP
ETQLAGERFT PLLFGDAGDE GDEGNAGDAG DAGDEGNAGD TDDAGDEGNA GDTDDAGDEG
NAGDTDDAGD EGNAGDTDDA GDEGNAGDAD DTDDAGDEGD KREKREENSE DSDRGDYEPR
DRIFAGMTWH DRYNPMRAIR TERWKYVRNF WHLPHVYMTT DIYCSAAGRE MREEFTGDQR
AYEELYDLEA DPLEQENLLL ADTPDTVDTP DQHGSRDVDD VRTRLRDDLV DWMTETDDPL
LDGPVVPSDW ERIHPEMGDD R