Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3913 |
Symbol | |
ID | 8826783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013923 |
Strand | - |
Start bp | 310755 |
End bp | 315887 |
Gene Length | 5133 bp |
Protein Length | 1710 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003482016 |
Protein GI | 289583606 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.806199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACT CACGAGAACA ACTGACCGCC CTCCTGCTCA CCGCCCTCAT GGTGCTTTCG CTCGTTGCGA TCGCTGGCAC CGGGCTGGCA GGGAGCGTTG CGGCAACGGA GCACAGTGAC GCCCAATCAG CTGTCGATGC CGACGCCGAC GGACCGGCAT CGATTGACGC CGAACTCGAA GAAGCGAGTG GGACAGAACA GATATACATC GTCCTCGATC AGTACGATGG GCAACTGAGT GACGATCGCG ACCGTGCGAT CGCACAGTTG CAGGAGCACG CTGAGGTGTC GCAGGCGGCT GCGACCGAGG ATATCGACGC ACTCGCTGGC GTCGACGTAA TCAACGACTA CTGGATCACG AACGCGATTC GCGCCGAGGT TGACACGAGC GCAGTCGACA CGGCTGAACT TGCGATGATC GACGGCGCAG AAACGATTCA GCTCCGCCCG GACTACGAGG TGCCGGAACC GGAGCCATCC GTCGAGCCAG CCGTTGAACC TGACGAAGAC GACTACACGT ACGGTCTCGA TCAGGTCAAC GCCTCTGAGA CCTGGGCTGA CTTCAACACC CAGGGTGAGG ACGTGAAGGT CGCAGTGCTC GACACTGGCT TCGACATTGA TCACCAGGAT CTGGACCTCT ACACCGAGGA TGCAGATGAC CCGACCTACC CAGGTGGCTG GGTCGAGATC GACGCGGACG GTGACCCAGT CGAGGGCTCC GAGCCATACG ACACGCACTA TCACGGCACG CACGTCGGTG GCACGGTTGG TGCAGCCGCA CCGGCTGACG ACGATACGCC GGCCTACGGT GTCGCGCCGA ACGTCGACCT GCAGCACGGC CTCGTGCTGC CGGACGGCTC GGGTGCAGAC TCCGACACCA TCGCTGGCTT CGAGTACGTC GTCGAAGAGA TGGATTCGGA CGTGGTCAGC ATGAGCTTCG GTGCCGGCTG TGGCCTCACC GGTCCAGTGT ACGAAGACGC CTGGATCCCG GTGATTCAGA ACGCCAACGA CATGGATGTC GTCGCAGTCA CCTCCTCGGG TAACTCCGGC GAGGGCTGTG TCGGCTCGCC AGCGAACACC TACGACTCGT TCAGTATCGG TGCCTCGAAC GAAGCCGGCG ATATCACCGA CTTCTCCAGC GGCAACACCA TCGCCGCCGA CAACTGGGAC AACCCCGACC CAGAGTGGCC AGACGAGTGG GTCAAACCAG ACGTCTCCGC ACCCGGCGAA GACGTCCTGA GCGCGATGCC AGACGACGAG TACGACTACC TGGACGGCAC CTCGATGTCC GCACCGCACG TCTCCGGCGT GATCGCCCTC ATGCTCTCGG CCAACGACGA CCTCACACAG GAGGAAATCG AGGAGACACT CGAGGAGACC GCCTGGAAGC CTGACGGCGA ACCCGACGAA AAGGACGTTC GCTACGGCCA CGGCATCGTC GACGCTTACG CCGCTGTCGA CGAAGTTGCT GCGGGTGCAC TCGAGTACGA ACTCGGTGAC GTCGACCAGG ACGGCGATGT AACGGTCCAG GACGTGCAGC TAACCCAGCA GTACCTCCAG GATATGGATC CTGAGCCGTT CGCCGAGGAC CTCGCGGACA TGGACCGCGA CGGTGAGGTC ACGACGGACG ACCTGAGCCT GCTCCAGCAG AAGGTACAGG GCATGCTCGA CGAGGGTGAG ATCGACATTA CGGGACTCGA CGTGCCCGAC GAAGTCGACG ATGGTGAGAT GTTCGAAGTC ACCGTCGACC TCGAAAACCT CGGTGAGGAG GGTGCTGTCC AGGAAGTTGA CCTCTTCCTT GGCGACGACG ATGAGCCGGT CGACACCGAA GTCGTCGACA TGGCAGCACC CGGCGTCGAC GACCCGATTG ACCACCCAGC AGAGACGACG ATCACGTTCG AACTCGACGC GGGTGACCTC GACGGCGGCG ACCACACTAT CACCGTCGAA ACCGAGGACG CGGACGCGAG CGACACGGTC ACAGTCCTTG CGTCGAACTT CGAACTCTCG AACCTCGACG CACCTGACGA AGCCGACCGC GGTGACGAAA TTACCGTCAG CGCAGATGTC GAGAACACGG GCAACGTCGA CGACACCCAG GCCGTCGAGT ACCGCTTCGA CGACCTCGGT GACGCACTGT ACGCACAGAA CGTCACGCTC GAGGCCGACG AGGAGACGAC GGTCACGTTC GATGTCGGCA CGGAGAACAT CACTGAAGGG ACCTACGACC ACGGTGTCTT CACCGAGGAC GATGAGGCAA CGGCCACAAT CGACATCCTC GAAGCGTTCT TCAGCGTCGA CATCACCGAC GCACCTGAGG AACTCGCACC TGGCGAGACC TATAACGTCA GCGCGGATGT CGTGAACTCC GGCGACGCGC CGGACGAACA GACGATCTCT TACGAGGTCA CGGAAGGCGA AACCGACGTT GCGGTCGTCG ACGGCGACGT CGACGAAGAG GAACTCCGCG AGGAACTCGC CGACCGCGGT GTCGATGATG TCGATCTCGA CGAACTCGTC GAGGGAGCCG ACAACCTGGC AGATACGCTA CAGGACGAAC TCGACGACTC CTACGACGTA GAGACGGTCA ACGCAGATGA CCTGCTCGAC GAGGTCGATG CCTACGATGT CTTCGTCGTC AACGACTTCG ACGGCGCAGA CGTCGATACG TTCCTCGACG AACTCGCAGA CGACCAGGGC GTCATCTACC TCGAGAACTG GGGCAGCAGC TCGAACGCAG TGAGCGACCT GTCCGATGCT ACTGACGACC CATCGTCCGT CGAGGACGGC TTCAGTGGCT CTGCACCGAT TATGCTCGAC ATCACCGACG ACCACGCACT GTTCGATGGC GTCGCTGATG CGGGCGAGAC GATCGAACTT CACGACGCGT CCGATGCTGA CCGTGCATGG TTTGACGGCT ACAGCGGCGA CGTCATCGCT GACGTCGGCG ATGGCACGCC AGACGGCGCC TCCGTCGGCG TGAGCGAGGA CGACGACCAC GTCCTGCTCG CCGCGTTCGG TCGCTCCTCG TTCATCACGA ACGCGGACTT CAGCGACGAG TCGAACGCCA TCATGGCAAA CGCTGTCGAG CACGTCGACG ACTACGCTGG CACAGCCAGC GCAACCGACC ACACGAGTGA GGTCGTCTCG CTCGAACCGA ACGAATCGGC CACCGTCGAG TTCACGAACA CCCTCGCCGA AGACGACGGC GAACTCGAGT GGCACCACGT TGTCGAGAGC GACGACGATA TTGCGACTGC ACCGTTCACC ATCGACGACG GACCGAACTG GAACGTCAAG GGAACGGTCT CCGACGATGT CACCGACGAA CCGATCGAAA ACGCGAGTGT CGAACTGGAA GCTGGCAACG AAACGTACAC GAACGTGACC GATGCCGACG GCGAATTCGG TCTCGCAAAC GTTCCTGCCG GCGAGCACAA CCTGACGGTT GACGCCGAGG GCTACGCGGC CCACACTGAG TCCGTCGAGG TTCCCGAAGA CGACATCGTC ACCGTCGACG TTGGTCTCGA GGAACTGCCC GGAACGATCA GCGGTGACGT GACCGCAAGC GACGACGACG CACCAGTCGA GAATGCAACC ATCGTCGCCG AGAACGACGA CGGCGATGTC CACGAGGCGA CGACCGACGA GAACGGCTCG TACGAACTCG ACGGCGTCTC GGCAGGCACG TACGTCGTCA ACGTCGTCGA CACACCACCG GGCTACGAGA TTGACGAGAT CGTCACCGTC GCACCCGGCG AACACGTCGA CGATGTCGAC TTCGTCGTCG ACCGCACCGC CGGTTCGATC GAAGGCACCG TCACGAACGC CGCTGGCGTC CCAATCGCTG ACGCGAACGT GATTGACGCC GACGACGGCG CGTTCAACGT GACGACCGCC GAGGACGGCT CCTACGAAAT CGAGGACGTC ACGCCCGGCA CGAACGCGCT CCGCGCGGTC GCTGATGGCT ACGACGACTC GAACGTCGAG TTCGTCGACG TTGAGACCGG CGAGACGACG ACCGCGAACC TCACGCTCGG CACCTACTTC GAGGTCGACG ACCTCGCAGC ACCTGACACC GCCGAGCAGG GTGAGGAGAT CACCGTCAAC GCGACGGTCA CCAACACCGG CGAGCAGGAA GACACCCGAA CGGTGTTCTA CTTCCCGCCG GGCACTGACT TCGGCACCGA CGTTATCGAC TACCAGCCTG AACTGGCCGA GACGGTCACA CTCGAGGGCG GCGAGTCGAC GACGGTCGAG TTCACCTACG AGGTCGGTGC TGACGACGAA CCGGGCGAGT ACGAACACGG TGTCTCGGCT GACGAGGTCG AATCGACGTT CATCACCATC GAAGGCGACG AAGACGACGG CGAACCCGTC TTCGACGTGT CGGACCTCGA CGCACCGGCA GATGCTGAAC CGAACGAGAC CGTGTCGGTC ACCGCCACGC TCGAAAACAC CGGCGACGAC GCCGGCACGC AGACGGTCAC CTACGCGTTC GACGACGAGA CCGTCGCCAA CGAGACCGTC TCGCTCGACG CAACCGAGTC GACGGAACTC GAGTTCAGTA CCGAACTCCC AGCGGATGAA GGTACGTACG AACACACTGT CTCGTCCGAG AACGACTCTG CGACGGCACA GACCGACGTT GAAGAGGCTG GCGAGCCGGA GCCAGCGTAC TTCGAAGTGA CCGAGCACGA CGCACCGGAG TCCGTTGACG CAGGCGAGGA ACTGACGGTC AACGCGACGA TCACGAACAC CGGCGACGAA GCGGACCAGC AGGACATCTT CCTGTTCTGG GACGCTGCGA TGAGCGACCT CGAGGACGCG GAGTCTGTGG CTGAACTCCG CGAGGCAGGT ATCGACTCGA TGGAATCGAT CGAACTCGAG GGTGGCGAAT CCGACACGGT GACGCTCACC CACGAGGTCG ATTCGGAGAC GGAACCGGGT ACCTACCAGT ACACGGTTTC GACGCTGCAG GAGATGGCTG ACGGTGAAGT GACGGTTGAT GACGCTGCAG AAGTCGCCTT CCCAGCAGAT CCGGGCGGTT TCGATGGGGC ACCACCGGGA GCAGCGCTCG GTGCGAACCC GTTTGAAAAC GGTATCGCGA ACGGAATCGG ACTGCTCGGC TAA
|
Protein sequence | MTDSREQLTA LLLTALMVLS LVAIAGTGLA GSVAATEHSD AQSAVDADAD GPASIDAELE EASGTEQIYI VLDQYDGQLS DDRDRAIAQL QEHAEVSQAA ATEDIDALAG VDVINDYWIT NAIRAEVDTS AVDTAELAMI DGAETIQLRP DYEVPEPEPS VEPAVEPDED DYTYGLDQVN ASETWADFNT QGEDVKVAVL DTGFDIDHQD LDLYTEDADD PTYPGGWVEI DADGDPVEGS EPYDTHYHGT HVGGTVGAAA PADDDTPAYG VAPNVDLQHG LVLPDGSGAD SDTIAGFEYV VEEMDSDVVS MSFGAGCGLT GPVYEDAWIP VIQNANDMDV VAVTSSGNSG EGCVGSPANT YDSFSIGASN EAGDITDFSS GNTIAADNWD NPDPEWPDEW VKPDVSAPGE DVLSAMPDDE YDYLDGTSMS APHVSGVIAL MLSANDDLTQ EEIEETLEET AWKPDGEPDE KDVRYGHGIV DAYAAVDEVA AGALEYELGD VDQDGDVTVQ DVQLTQQYLQ DMDPEPFAED LADMDRDGEV TTDDLSLLQQ KVQGMLDEGE IDITGLDVPD EVDDGEMFEV TVDLENLGEE GAVQEVDLFL GDDDEPVDTE VVDMAAPGVD DPIDHPAETT ITFELDAGDL DGGDHTITVE TEDADASDTV TVLASNFELS NLDAPDEADR GDEITVSADV ENTGNVDDTQ AVEYRFDDLG DALYAQNVTL EADEETTVTF DVGTENITEG TYDHGVFTED DEATATIDIL EAFFSVDITD APEELAPGET YNVSADVVNS GDAPDEQTIS YEVTEGETDV AVVDGDVDEE ELREELADRG VDDVDLDELV EGADNLADTL QDELDDSYDV ETVNADDLLD EVDAYDVFVV NDFDGADVDT FLDELADDQG VIYLENWGSS SNAVSDLSDA TDDPSSVEDG FSGSAPIMLD ITDDHALFDG VADAGETIEL HDASDADRAW FDGYSGDVIA DVGDGTPDGA SVGVSEDDDH VLLAAFGRSS FITNADFSDE SNAIMANAVE HVDDYAGTAS ATDHTSEVVS LEPNESATVE FTNTLAEDDG ELEWHHVVES DDDIATAPFT IDDGPNWNVK GTVSDDVTDE PIENASVELE AGNETYTNVT DADGEFGLAN VPAGEHNLTV DAEGYAAHTE SVEVPEDDIV TVDVGLEELP GTISGDVTAS DDDAPVENAT IVAENDDGDV HEATTDENGS YELDGVSAGT YVVNVVDTPP GYEIDEIVTV APGEHVDDVD FVVDRTAGSI EGTVTNAAGV PIADANVIDA DDGAFNVTTA EDGSYEIEDV TPGTNALRAV ADGYDDSNVE FVDVETGETT TANLTLGTYF EVDDLAAPDT AEQGEEITVN ATVTNTGEQE DTRTVFYFPP GTDFGTDVID YQPELAETVT LEGGESTTVE FTYEVGADDE PGEYEHGVSA DEVESTFITI EGDEDDGEPV FDVSDLDAPA DAEPNETVSV TATLENTGDD AGTQTVTYAF DDETVANETV SLDATESTEL EFSTELPADE GTYEHTVSSE NDSATAQTDV EEAGEPEPAY FEVTEHDAPE SVDAGEELTV NATITNTGDE ADQQDIFLFW DAAMSDLEDA ESVAELREAG IDSMESIELE GGESDTVTLT HEVDSETEPG TYQYTVSTLQ EMADGEVTVD DAAEVAFPAD PGGFDGAPPG AALGANPFEN GIANGIGLLG
|
| |