Gene Nmag_3913 details

Gene Information

Locus tag: Nmag_3913
Symbol:
ID: 8826783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 310755
End bp: 315887
Gene Length: 5133 bp
Protein Length: 1710 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: peptidase S8 and S53 subtilisin kexin sedolisin
Protein accession: YP_003482016
Protein GI: 289583606
COG category:
COG ID:
TIGRFAM ID:
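
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the recorded gene length) and that the gene length includes one terminal stop codon. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(310755, 315887)
print(length)                  # 5133
print(protein_length(length))  # 1710
```

Both results agree with the Gene Length and Protein Length fields of the record.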


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.806199
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGACT CACGAGAACA ACTGACCGCC CTCCTGCTCA CCGCCCTCAT GGTGCTTTCG 
CTCGTTGCGA TCGCTGGCAC CGGGCTGGCA GGGAGCGTTG CGGCAACGGA GCACAGTGAC
GCCCAATCAG CTGTCGATGC CGACGCCGAC GGACCGGCAT CGATTGACGC CGAACTCGAA
GAAGCGAGTG GGACAGAACA GATATACATC GTCCTCGATC AGTACGATGG GCAACTGAGT
GACGATCGCG ACCGTGCGAT CGCACAGTTG CAGGAGCACG CTGAGGTGTC GCAGGCGGCT
GCGACCGAGG ATATCGACGC ACTCGCTGGC GTCGACGTAA TCAACGACTA CTGGATCACG
AACGCGATTC GCGCCGAGGT TGACACGAGC GCAGTCGACA CGGCTGAACT TGCGATGATC
GACGGCGCAG AAACGATTCA GCTCCGCCCG GACTACGAGG TGCCGGAACC GGAGCCATCC
GTCGAGCCAG CCGTTGAACC TGACGAAGAC GACTACACGT ACGGTCTCGA TCAGGTCAAC
GCCTCTGAGA CCTGGGCTGA CTTCAACACC CAGGGTGAGG ACGTGAAGGT CGCAGTGCTC
GACACTGGCT TCGACATTGA TCACCAGGAT CTGGACCTCT ACACCGAGGA TGCAGATGAC
CCGACCTACC CAGGTGGCTG GGTCGAGATC GACGCGGACG GTGACCCAGT CGAGGGCTCC
GAGCCATACG ACACGCACTA TCACGGCACG CACGTCGGTG GCACGGTTGG TGCAGCCGCA
CCGGCTGACG ACGATACGCC GGCCTACGGT GTCGCGCCGA ACGTCGACCT GCAGCACGGC
CTCGTGCTGC CGGACGGCTC GGGTGCAGAC TCCGACACCA TCGCTGGCTT CGAGTACGTC
GTCGAAGAGA TGGATTCGGA CGTGGTCAGC ATGAGCTTCG GTGCCGGCTG TGGCCTCACC
GGTCCAGTGT ACGAAGACGC CTGGATCCCG GTGATTCAGA ACGCCAACGA CATGGATGTC
GTCGCAGTCA CCTCCTCGGG TAACTCCGGC GAGGGCTGTG TCGGCTCGCC AGCGAACACC
TACGACTCGT TCAGTATCGG TGCCTCGAAC GAAGCCGGCG ATATCACCGA CTTCTCCAGC
GGCAACACCA TCGCCGCCGA CAACTGGGAC AACCCCGACC CAGAGTGGCC AGACGAGTGG
GTCAAACCAG ACGTCTCCGC ACCCGGCGAA GACGTCCTGA GCGCGATGCC AGACGACGAG
TACGACTACC TGGACGGCAC CTCGATGTCC GCACCGCACG TCTCCGGCGT GATCGCCCTC
ATGCTCTCGG CCAACGACGA CCTCACACAG GAGGAAATCG AGGAGACACT CGAGGAGACC
GCCTGGAAGC CTGACGGCGA ACCCGACGAA AAGGACGTTC GCTACGGCCA CGGCATCGTC
GACGCTTACG CCGCTGTCGA CGAAGTTGCT GCGGGTGCAC TCGAGTACGA ACTCGGTGAC
GTCGACCAGG ACGGCGATGT AACGGTCCAG GACGTGCAGC TAACCCAGCA GTACCTCCAG
GATATGGATC CTGAGCCGTT CGCCGAGGAC CTCGCGGACA TGGACCGCGA CGGTGAGGTC
ACGACGGACG ACCTGAGCCT GCTCCAGCAG AAGGTACAGG GCATGCTCGA CGAGGGTGAG
ATCGACATTA CGGGACTCGA CGTGCCCGAC GAAGTCGACG ATGGTGAGAT GTTCGAAGTC
ACCGTCGACC TCGAAAACCT CGGTGAGGAG GGTGCTGTCC AGGAAGTTGA CCTCTTCCTT
GGCGACGACG ATGAGCCGGT CGACACCGAA GTCGTCGACA TGGCAGCACC CGGCGTCGAC
GACCCGATTG ACCACCCAGC AGAGACGACG ATCACGTTCG AACTCGACGC GGGTGACCTC
GACGGCGGCG ACCACACTAT CACCGTCGAA ACCGAGGACG CGGACGCGAG CGACACGGTC
ACAGTCCTTG CGTCGAACTT CGAACTCTCG AACCTCGACG CACCTGACGA AGCCGACCGC
GGTGACGAAA TTACCGTCAG CGCAGATGTC GAGAACACGG GCAACGTCGA CGACACCCAG
GCCGTCGAGT ACCGCTTCGA CGACCTCGGT GACGCACTGT ACGCACAGAA CGTCACGCTC
GAGGCCGACG AGGAGACGAC GGTCACGTTC GATGTCGGCA CGGAGAACAT CACTGAAGGG
ACCTACGACC ACGGTGTCTT CACCGAGGAC GATGAGGCAA CGGCCACAAT CGACATCCTC
GAAGCGTTCT TCAGCGTCGA CATCACCGAC GCACCTGAGG AACTCGCACC TGGCGAGACC
TATAACGTCA GCGCGGATGT CGTGAACTCC GGCGACGCGC CGGACGAACA GACGATCTCT
TACGAGGTCA CGGAAGGCGA AACCGACGTT GCGGTCGTCG ACGGCGACGT CGACGAAGAG
GAACTCCGCG AGGAACTCGC CGACCGCGGT GTCGATGATG TCGATCTCGA CGAACTCGTC
GAGGGAGCCG ACAACCTGGC AGATACGCTA CAGGACGAAC TCGACGACTC CTACGACGTA
GAGACGGTCA ACGCAGATGA CCTGCTCGAC GAGGTCGATG CCTACGATGT CTTCGTCGTC
AACGACTTCG ACGGCGCAGA CGTCGATACG TTCCTCGACG AACTCGCAGA CGACCAGGGC
GTCATCTACC TCGAGAACTG GGGCAGCAGC TCGAACGCAG TGAGCGACCT GTCCGATGCT
ACTGACGACC CATCGTCCGT CGAGGACGGC TTCAGTGGCT CTGCACCGAT TATGCTCGAC
ATCACCGACG ACCACGCACT GTTCGATGGC GTCGCTGATG CGGGCGAGAC GATCGAACTT
CACGACGCGT CCGATGCTGA CCGTGCATGG TTTGACGGCT ACAGCGGCGA CGTCATCGCT
GACGTCGGCG ATGGCACGCC AGACGGCGCC TCCGTCGGCG TGAGCGAGGA CGACGACCAC
GTCCTGCTCG CCGCGTTCGG TCGCTCCTCG TTCATCACGA ACGCGGACTT CAGCGACGAG
TCGAACGCCA TCATGGCAAA CGCTGTCGAG CACGTCGACG ACTACGCTGG CACAGCCAGC
GCAACCGACC ACACGAGTGA GGTCGTCTCG CTCGAACCGA ACGAATCGGC CACCGTCGAG
TTCACGAACA CCCTCGCCGA AGACGACGGC GAACTCGAGT GGCACCACGT TGTCGAGAGC
GACGACGATA TTGCGACTGC ACCGTTCACC ATCGACGACG GACCGAACTG GAACGTCAAG
GGAACGGTCT CCGACGATGT CACCGACGAA CCGATCGAAA ACGCGAGTGT CGAACTGGAA
GCTGGCAACG AAACGTACAC GAACGTGACC GATGCCGACG GCGAATTCGG TCTCGCAAAC
GTTCCTGCCG GCGAGCACAA CCTGACGGTT GACGCCGAGG GCTACGCGGC CCACACTGAG
TCCGTCGAGG TTCCCGAAGA CGACATCGTC ACCGTCGACG TTGGTCTCGA GGAACTGCCC
GGAACGATCA GCGGTGACGT GACCGCAAGC GACGACGACG CACCAGTCGA GAATGCAACC
ATCGTCGCCG AGAACGACGA CGGCGATGTC CACGAGGCGA CGACCGACGA GAACGGCTCG
TACGAACTCG ACGGCGTCTC GGCAGGCACG TACGTCGTCA ACGTCGTCGA CACACCACCG
GGCTACGAGA TTGACGAGAT CGTCACCGTC GCACCCGGCG AACACGTCGA CGATGTCGAC
TTCGTCGTCG ACCGCACCGC CGGTTCGATC GAAGGCACCG TCACGAACGC CGCTGGCGTC
CCAATCGCTG ACGCGAACGT GATTGACGCC GACGACGGCG CGTTCAACGT GACGACCGCC
GAGGACGGCT CCTACGAAAT CGAGGACGTC ACGCCCGGCA CGAACGCGCT CCGCGCGGTC
GCTGATGGCT ACGACGACTC GAACGTCGAG TTCGTCGACG TTGAGACCGG CGAGACGACG
ACCGCGAACC TCACGCTCGG CACCTACTTC GAGGTCGACG ACCTCGCAGC ACCTGACACC
GCCGAGCAGG GTGAGGAGAT CACCGTCAAC GCGACGGTCA CCAACACCGG CGAGCAGGAA
GACACCCGAA CGGTGTTCTA CTTCCCGCCG GGCACTGACT TCGGCACCGA CGTTATCGAC
TACCAGCCTG AACTGGCCGA GACGGTCACA CTCGAGGGCG GCGAGTCGAC GACGGTCGAG
TTCACCTACG AGGTCGGTGC TGACGACGAA CCGGGCGAGT ACGAACACGG TGTCTCGGCT
GACGAGGTCG AATCGACGTT CATCACCATC GAAGGCGACG AAGACGACGG CGAACCCGTC
TTCGACGTGT CGGACCTCGA CGCACCGGCA GATGCTGAAC CGAACGAGAC CGTGTCGGTC
ACCGCCACGC TCGAAAACAC CGGCGACGAC GCCGGCACGC AGACGGTCAC CTACGCGTTC
GACGACGAGA CCGTCGCCAA CGAGACCGTC TCGCTCGACG CAACCGAGTC GACGGAACTC
GAGTTCAGTA CCGAACTCCC AGCGGATGAA GGTACGTACG AACACACTGT CTCGTCCGAG
AACGACTCTG CGACGGCACA GACCGACGTT GAAGAGGCTG GCGAGCCGGA GCCAGCGTAC
TTCGAAGTGA CCGAGCACGA CGCACCGGAG TCCGTTGACG CAGGCGAGGA ACTGACGGTC
AACGCGACGA TCACGAACAC CGGCGACGAA GCGGACCAGC AGGACATCTT CCTGTTCTGG
GACGCTGCGA TGAGCGACCT CGAGGACGCG GAGTCTGTGG CTGAACTCCG CGAGGCAGGT
ATCGACTCGA TGGAATCGAT CGAACTCGAG GGTGGCGAAT CCGACACGGT GACGCTCACC
CACGAGGTCG ATTCGGAGAC GGAACCGGGT ACCTACCAGT ACACGGTTTC GACGCTGCAG
GAGATGGCTG ACGGTGAAGT GACGGTTGAT GACGCTGCAG AAGTCGCCTT CCCAGCAGAT
CCGGGCGGTT TCGATGGGGC ACCACCGGGA GCAGCGCTCG GTGCGAACCC GTTTGAAAAC
GGTATCGCGA ACGGAATCGG ACTGCTCGGC TAA
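
The gene sequence above is displayed in blocks of 10 bases, 60 per line, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction helper of the kind that would produce the GC content field. The helper names are hypothetical, and only the last displayed line is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence as displayed above.
# Paste the full block instead to analyse the whole gene.
last_line = "GGTATCGCGA ACGGAATCGG ACTGCTCGGC TAA"
tail = clean_sequence(last_line)
print(len(tail))   # 33
print(tail[-3:])   # TAA
```

Applying `gc_fraction` to the cleaned full sequence should give a value comparable to the GC content reported in the record.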
 
Protein sequence
MTDSREQLTA LLLTALMVLS LVAIAGTGLA GSVAATEHSD AQSAVDADAD GPASIDAELE 
EASGTEQIYI VLDQYDGQLS DDRDRAIAQL QEHAEVSQAA ATEDIDALAG VDVINDYWIT
NAIRAEVDTS AVDTAELAMI DGAETIQLRP DYEVPEPEPS VEPAVEPDED DYTYGLDQVN
ASETWADFNT QGEDVKVAVL DTGFDIDHQD LDLYTEDADD PTYPGGWVEI DADGDPVEGS
EPYDTHYHGT HVGGTVGAAA PADDDTPAYG VAPNVDLQHG LVLPDGSGAD SDTIAGFEYV
VEEMDSDVVS MSFGAGCGLT GPVYEDAWIP VIQNANDMDV VAVTSSGNSG EGCVGSPANT
YDSFSIGASN EAGDITDFSS GNTIAADNWD NPDPEWPDEW VKPDVSAPGE DVLSAMPDDE
YDYLDGTSMS APHVSGVIAL MLSANDDLTQ EEIEETLEET AWKPDGEPDE KDVRYGHGIV
DAYAAVDEVA AGALEYELGD VDQDGDVTVQ DVQLTQQYLQ DMDPEPFAED LADMDRDGEV
TTDDLSLLQQ KVQGMLDEGE IDITGLDVPD EVDDGEMFEV TVDLENLGEE GAVQEVDLFL
GDDDEPVDTE VVDMAAPGVD DPIDHPAETT ITFELDAGDL DGGDHTITVE TEDADASDTV
TVLASNFELS NLDAPDEADR GDEITVSADV ENTGNVDDTQ AVEYRFDDLG DALYAQNVTL
EADEETTVTF DVGTENITEG TYDHGVFTED DEATATIDIL EAFFSVDITD APEELAPGET
YNVSADVVNS GDAPDEQTIS YEVTEGETDV AVVDGDVDEE ELREELADRG VDDVDLDELV
EGADNLADTL QDELDDSYDV ETVNADDLLD EVDAYDVFVV NDFDGADVDT FLDELADDQG
VIYLENWGSS SNAVSDLSDA TDDPSSVEDG FSGSAPIMLD ITDDHALFDG VADAGETIEL
HDASDADRAW FDGYSGDVIA DVGDGTPDGA SVGVSEDDDH VLLAAFGRSS FITNADFSDE
SNAIMANAVE HVDDYAGTAS ATDHTSEVVS LEPNESATVE FTNTLAEDDG ELEWHHVVES
DDDIATAPFT IDDGPNWNVK GTVSDDVTDE PIENASVELE AGNETYTNVT DADGEFGLAN
VPAGEHNLTV DAEGYAAHTE SVEVPEDDIV TVDVGLEELP GTISGDVTAS DDDAPVENAT
IVAENDDGDV HEATTDENGS YELDGVSAGT YVVNVVDTPP GYEIDEIVTV APGEHVDDVD
FVVDRTAGSI EGTVTNAAGV PIADANVIDA DDGAFNVTTA EDGSYEIEDV TPGTNALRAV
ADGYDDSNVE FVDVETGETT TANLTLGTYF EVDDLAAPDT AEQGEEITVN ATVTNTGEQE
DTRTVFYFPP GTDFGTDVID YQPELAETVT LEGGESTTVE FTYEVGADDE PGEYEHGVSA
DEVESTFITI EGDEDDGEPV FDVSDLDAPA DAEPNETVSV TATLENTGDD AGTQTVTYAF
DDETVANETV SLDATESTEL EFSTELPADE GTYEHTVSSE NDSATAQTDV EEAGEPEPAY
FEVTEHDAPE SVDAGEELTV NATITNTGDE ADQQDIFLFW DAAMSDLEDA ESVAELREAG
IDSMESIELE GGESDTVTLT HEVDSETEPG TYQYTVSTLQ EMADGEVTVD DAAEVAFPAD
PGGFDGAPPG AALGANPFEN GIANGIGLLG
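
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds a codon table from the standard code; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last displayed lines of the gene are translated.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGACAGACT CACGAGAACA ACTGACCGCC"))      # MTDSREQLTA
# Last 33 bases; they are in frame because 5100 is a multiple of 3.
print(translate("GGTATCGCGA ACGGAATCGG ACTGCTCGGC TAA"))  # GIANGIGLLG
```

Both fragments match the first and last ten residues of the protein sequence listed above.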