Gene Nmag_1352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1352 
Symbol 
ID8824185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1385643 
End bp1387289 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content63% 
IMG OID 
Productsulfatase 
Protein accessionYP_003479493 
Protein GI289581027 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAGG AATCAGCAGC CGAGTCTAAC GGACGTGAAT CGGAGTCACA TTCAACAGCC 
AGGAACGTGG TTCTCGTCGT ACTCGATACG GCACGTGCGA GGAGTGTCGG CGAGTGGCCG
ACCACGGACG CAAGTTCGAA CGATCCAGCC GAAGACGACC CAGACGAGCG CCACGATCCG
ACCCAACCAA CACCGACGCT GTGTCGGCTC GCCGAAACTG GCACCGTTTT CGAGAACGCG
TTTGCGACGG CTCCGTGGAC ACTTCCCTCC CACGGCTCGA TGTTCACCGG CCTGTATGCC
TCCGAACACG GCACGCACGG CGGGCACACG TTCCTCGATC CGGAACTTCG AACGCTTCCC
GAGGCGTTCG CCGACGCCGG CTACGAGACC GTCGGTATCT CGAACAACAC CTGGATAACC
GAGGAGTTCG GCTTTGACCG CGGTTTCGAC GACCTCCGGA AAGGATGGCA GTACATCCAG
TCCGACGCCG ACATGGGGGC CGTCGTCCGG GGTGAGGATC TGCGCGAAAA GCTCCAGGCG
ACCCGGAACC GACTGTTCGA CGGCAATCCC GTCGTCAACG CGGCGAACAT CCTCTACAGC
GAAGTCCTGC AGCCATCGGG TGACGACGGT GCCGCCCGCT CTGCGGACTG GGTCGACGGC
TGGCTCGGCG ACCGCGACGA CGACAAGCCG TTCTTCCTCT TCTGTAACTT CATCGAACCC
CACGTCGAGT ACGACCCGCC GCAAGAGTAC GCAGAACGCT TCCTCCCCGA GGACGCGACC
TACGAGGAGG CGACCGCGAT CAGACAGGAC CCCCGCGCCT ACGATTGCGA GGACTACGAA
ATCACCGAGC GTGAGTTCGA ACTGCTCCGT GGCCTCTACC GCGCCGAACT CGCCTACGCC
GACGCCCAGG TTGGTCGTCT CCGGGAGGCA CTCGAGTCCC ACGGCGAATG GGAGGATACC
CTCTTCGTGG TCTGTGGCGA CCACGGCGAG CATATCGGCG AACACGATTT CTTCGGCCAC
CAGTACAACC TGTACGATAC GTTGATCAAC GTGCCGCTGG TCTGTCACGG GGGGCCGTTT
ACTGACGCCG ATTTCGAGTC CGGAAGTGGA ACCGAAACTG GAACTGAATC CGGAACCACC
ACTGACGATG TGACGGGTAC CCACCGCGAC GACCTCGTCC AACTGCTCGA CCTGCCGCTC
ACACTCCTCG ACGCCGTCGG TGTTTCTGAT CCCGAACTGC GGGAACAGGG AAGCGGGCGC
TCACTCCACC CCGCGTCGGA CGACGATCCG AGAGACGCTG TCTTCGCCGA GTACGTCGCC
CCACAGCCGT CGATCGACCG GCTCGAAGCC CGATTCGGCG ATATTCCGGA CCGCGTCCGC
GAGTTCGACC GCCGACTTCG TGCGATTCGG ACACACGAGT ACAAGTACGT CCGTGGCGAC
GACGGCTTCG AACGGCTCCA TCACGTCCCG ACCGATCCAG CCGAGCAGTC GAACCTCGTC
CAGGCCGAAC CCGACACCGT CAGCGCGCTC CAGGAGCAAC TCGAGGAGCG CTTCGATCCG
CTGGCCGAGT CGGAACCCGA CTCGACGGAC GAGGTGGCGA TGCGAGAGGG GACGAAAGAG
CGACTGGCTG ATCTCGGTTA TCTGTAA
 
Protein sequence
MAEESAAESN GRESESHSTA RNVVLVVLDT ARARSVGEWP TTDASSNDPA EDDPDERHDP 
TQPTPTLCRL AETGTVFENA FATAPWTLPS HGSMFTGLYA SEHGTHGGHT FLDPELRTLP
EAFADAGYET VGISNNTWIT EEFGFDRGFD DLRKGWQYIQ SDADMGAVVR GEDLREKLQA
TRNRLFDGNP VVNAANILYS EVLQPSGDDG AARSADWVDG WLGDRDDDKP FFLFCNFIEP
HVEYDPPQEY AERFLPEDAT YEEATAIRQD PRAYDCEDYE ITEREFELLR GLYRAELAYA
DAQVGRLREA LESHGEWEDT LFVVCGDHGE HIGEHDFFGH QYNLYDTLIN VPLVCHGGPF
TDADFESGSG TETGTESGTT TDDVTGTHRD DLVQLLDLPL TLLDAVGVSD PELREQGSGR
SLHPASDDDP RDAVFAEYVA PQPSIDRLEA RFGDIPDRVR EFDRRLRAIR THEYKYVRGD
DGFERLHHVP TDPAEQSNLV QAEPDTVSAL QEQLEERFDP LAESEPDSTD EVAMREGTKE
RLADLGYL