Gene Information |
Locus tag | Nmag_4209 |
Symbol | |
ID | 8828943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | - |
Start bp | 247861 |
End bp | 252717 |
Gene Length | 4857 bp |
Protein Length | 1618 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003482279 |
Protein GI | 289937677 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
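The length fields above are consistent with the coordinates if Start bp and End bp are read as inclusive, 1-based positions and the CDS ends in a single stop codon. The following is a minimal sketch of that arithmetic under those assumptions; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(247861, 252717)
print(length)                  # 4857
print(protein_length(length))  # 1618
```

Both results match the Gene Length and Protein Length values recorded for Nmag_4209.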
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGACG AAATGGGAAC GGCAGCGACT GACAAATATG ACTTCTCTTT AGATGAATCA ATACCATCGT CTATTGGTAC TGATGATCCT GACCAGTCAA ATAAAGGCGA AGTGATAATT GATGAGAACA CTTTTCAGGC TGAAGGCACC GTCGAAGTGC TTGTATCTGC CGAAGATGTC GAACTTCCAA TTGTAACCGA TGTTGACGTC AAGACGGCCC TTCAAACGAC TGCGGAAGAA AGCCAGGAAC CAATAGTCGA ATACGCGGAG TCGACTGAGG GAGTAGAGGT TCTGAACCAG TTCTGGATTA CGAACGCATT GCTACTTGAA CTGGATAAAG ACCAAGTTAG TGCAACCGAG ATTGCTGCCC AAGAAGGGGT CGAGTCAATC GACTACAACG CCGATATTGA ACTGGATGAA CCAGTAGAGT ATGACGACGA CGTCAATACT ACCTACGGAC TTGATCAAAT CAATGCACCC GATGTGTGGG ATGACCACGA CACAATGGGC GAAGGTGCAG AGATTGCAAT CCTTGATACC GGTGTCGACC CCGACCATCC CGATATAGAC ATCGAAGATG AAAATTGGGC CGAATTTGAC GAAGATGGCG AGCAGGTTGA CAGCGACCCG TATGAATCTC ACTCCAACGG GCACGGGACT CACGTTTCCG GTACTGCGAC CGGCGGTAAT GCATCAGGAG AGTACATTGG CGTTGCTCCT GAAGCTGATC TGATGCATGG TCTCGTGCTG GACGCTGGCA GTGGTAGTTT GGCACAGATC ATCGGAGGAA TCGAATGGGC CGTTGAAGAA GATGCCGACG CTGTTTCGAT GAGTCTCGGT GTATCCGCCT ACGAAGAAGC GTTCATCGAG CCTCAGATGA ACGCGTTAGA TGCTGGTACA TTGGTGATCG CTTCTTCTGG TAACGATGGC GAGGGTAGCT CAAGCTCACC TGGCAATGAT TACGATTCAT TTGCCATTGG TGCCACTGAC GAATCAGAAG ATATCGCTCA GTTCTCAAGT GGTGAATGGA TCGACACCCA AATTGCCTGG GGTTCTGACG CTCCTGACCA CTGGCCTGAA GAGTACGTTG TGCCAGACAT TGCAGCACCA GGAGTTGGTG TTAACAGTGC TCAACCCGAT GGCGAGTACG ATACGTTGTC TGGCACCTCC ATGGCAGCGC CTCACGTTGC TGGAACTGTT GGACTGATGG CCGCAGCAAG TGAAGATGAT CTCTCTCCGG AACAGTACGA AACTGCGATG ACCGCTCACG CTTGGCAGCC TGATAATGAC CTAGATGACC CCAATGATCG ATTTGGCCAA GGCATTCTAG ACGCAAGCCA CTCGGTTGAT CAGGTTGCAG CTGACCAAGG GATTAACGGA ACCATCACGG ATAGCAATGG AGATCCAATA GAGGGTGTTG ATGTCGAACT TGACAGCGGT TACTCGGTCT CAACCGATGA AAACGGGGAA TACACTCTTC GAGGGACGGA AGACACATTT GAGTTTGAAG CTAGCGCCTT TGGGTATGAG TCCGTTACAG AGACTGTCGA GGTTTCCGAG GGTGAGTTTG AGCAGATAGA CATCACGCTC GAAGACTTCG TTGATGCAGA ACCAACAACA TTGCCCACCG GGATCGTTTC TGGACAGAAT AACTCCTTTG CGTTCGAAAC TGCTCATCTC GAAGAGTTGT CCGTATCGAT GGATGGGAAC TACGATGAAG AGAATGCAAC CCTCTACGTC GATGGACAAG AGATAGACTT CAACGAAGCA TGGGAGGCTG AGGAACCATT CGACGAGGTA 
GAGATTGATC TCTACGCGGA AGATGGTACC ACCGGCGAGA TGATGCTCGA GTTCGTATTT GCCGATGGTG ATGATGAATT CGTGTTCGAA ACGGACGAAC TCGAGGTCTT CGCTGAGGAA ATCGATACTG CAGTTGTCGA CGCCCCAGAC GGTGAATTTG GCGCCTTGCT TGCCAGCGAT CTCGATGACA ACCTTGATGC GAATTATAAC ATCACAACCG TTGACTCAGA CGAGGTGGTA GACCGTGTCG ACGATTTCGA CACGTTCGTT GTTCAACGGT TCGCTGATTC AAACGAACAA GATGAGGACC TAGTGCGGGA GTTCTTCGAG GCCACTGACC GAGCGGATAT TGGCGTTGTG TACCTCGATC AGTACGACGA GTCAGGGTCC TGGAGTGACA GTATCCATGA ACTGTCTCGT GCGACCGGTG TACCTGGTTA CACTGATCAA GACTTTGTTG GCGAAATTGA ACCCGTAGAG TACGAGGCAA CAGAAGAGCA CCCCATTCTT GATGGCGTTG TTGAAGAAGG TGAGCGGGTT GAACTATATG ACACACCAGA CCACACCTGG TTCGGCGGCC ATCACGGAAA TGACATCGCC GATGTAGGTT GGAAAAACAA CAGCGCTAGC GGTCATGGTC TAGCGACAGA CGATGCACGA AACAATGTCT ATGCCGCCTC GCTGGGGTAC TCCACGCTGG TTGGGGATGT AATTCCGTTC ACTGAGGAAG CCGAGGAAGT GCTGACCAAC TCCGTTGAAC ATGTTAGCGA GAGTCGTACT GCGACACTCG ATGAGGATCA ACCCGATCGT GTAGATCCCG GTGAAGCCAT CTCCGCAGAG TTCAGTGTTG ACGATATCGA TGATGTCGAA ATCACCGTTG AGTTAGATGA CTACCACACT ACAACTGCGG AAGAAAACCT GACACTGTAC GTCCAAGAAA CAGAATATGC CTTCGGAGAA CCTATCGAAC CAGATGAGCC AATCGACCCA ACCGATACAC GGTTCAACGA TGCAATTAAC GTTACCGTGG TTCCGGAAGA TGGCGAAACG GGAATTGTGG CGCTCGAACA TGATATCGAC GTTGGCGGAG AATCAGTCGA CGGGTTCACT GGTCCAACAT CGGTTTATGA ACCACCAATC GAAGTAGGTG AGGAGGGCGA CGCAGAGACG ATACAAGATG CAGTTGATCT GGCTATTCCT GAAGCGACTA TCGTCGTTGA GGATGGGACC TACAAAGAGC AGGTGGTAAT GTGGGCGGAC GCGAATCACG ACGTGACAAT CGAAGCAGCT GAAGACGCAG AGCCATCGCT CGAGCTTCCT GATGATGCGG TCGAAATGCC CGCCTATAAC TTACAGACTG ACGATCAAAC TCCAGTCGTA TATGTACGAC ATAACGACGG AGTCACGATC GACGGATTCG ACATTGACGC AGGTGGAGAG GTCGGCTCCT ACGCGACCGA AGCGCACAAC TACACGGTCA GTGATGTCAC CGTAACAAAT GCTTCGACTG GTTTGTGGTC TGATCTCAGT CACATTGGCC ATGTTGTAGA AGATTCCTAT ATCGAAGCTG ATGAGACAGG CGTCTTCTAC TACTGGGCTG CCGACGATGG ATTAATCCAG AATAACACCA TTACCGGAGC TGATACGGGA ATCATGACCG ACCACGTCTC TTCGGGTCAC GATGTGGTCG ATAACGAGAT TTACGATGTC GGAACCGGGA TGCTAATCGA CACCGATGAT GGTGAGACCG ATCGCAACGT CATCACAGAC GCGGATGTTG GTATTCAGTT AGGCGTTTCA GGTAACGTTG 
AGTCTGTCGA TGATAACCAC ATCGAAAACG CATCTGTCGG CCTTGATGAG GGATCGGTCA ACATGCCCCC TGAAGTCCAT CACAACTACT TCGATTCGGA GGTCGGTGTC ATCGATGGCG ACTGGAATGG CCCACCATCG TACTATCTAA ACGATTTCTC CGAATCTGAC ACGGTCATAG AAACGGACCA AAGGATCGAT GTGCGCATGA ACTACCTCGG CGAGCGCGAT GGACAAGAAC CCATCGCTGA CGGCCCAACC GACTACAGTC CTCACTTGAC CAGCCCACCA AGCGTTGAGG AGGGCATGGA TACGACTGAG ATTGGATACC ATCTTCAGGT CGAAGCCAAC GAATCCTACG GTATCGGGTT GCCTGGTCCA AGCGAACAGA CAATTGGAGA GATCGTCCCA GCCAACTTCG AAGGCGCCAT CTACGGATTT GACGCTGATG AGCAAGAGTG GAAAATGAAG TCTGGTGACG ACTCGGTAGA TACGCTCGCG GCACTCGTGG TCGTATCTGA AAGCGACGCC AAATTGGACT TCAACTTCCA GTCAGACGAT GATAATCCAT CGGCACCAGG ACAGCACGAA TTCGAAGAGG GATGGAACTA CGTGACCGCG CCAGCACACC TAGACGCAGA ATCCGCCTTT GAAATTGGCT CATTTGAACC AGAGATGATA ATGAGCCTCC AGGAAGCCCC TGGAAACCAG CTCGGGCCTG AAGGGGAATT AGATCGGACG CATATCTTCG GTGACGGTGA TGCTGGGTAT GTCAGTCCCT TTGAAGGGTA CTTCGTCTAC AGCGAGTCGG ATGGCACGAT GCCATCTCAA GTCGCAGCCG ACCCAACTGC CGAAGAGATG TTCGATGAGC TGAATATCTC TGCCCACGAA GACCATCCGG AAACCGCGAC CATCGAGTCA GTCATCGAGA CTGAAGGAGT CTCTGATAAG GAAACTGAAG AGGCCCTGAT CGCGCTCATT AAAAATGATC TCATGGATGA AATTGAGGAT GCAAATAACT CAACAGAGCA ATTCAAAGCG CTCGATGATG CCGCCGAGAC TATTGTCTCG GATGCACCAG ACGAGCATGA GACGAAAGTG CAGAATGCCA CCTCTCAATC GATTGAACTA GTCATTCAGG CTGAGTTCGG GCAGAATATT GACGTAGAAC ATTTCGATCA GGTTGTTGAT GATGAGGACG AGGGCGCTGC ATCCTCAGTA TCTTCACCAT CCCTGGTAAC TCCCTGA
|
Protein sequence | MGDEMGTAAT DKYDFSLDES IPSSIGTDDP DQSNKGEVII DENTFQAEGT VEVLVSAEDV ELPIVTDVDV KTALQTTAEE SQEPIVEYAE STEGVEVLNQ FWITNALLLE LDKDQVSATE IAAQEGVESI DYNADIELDE PVEYDDDVNT TYGLDQINAP DVWDDHDTMG EGAEIAILDT GVDPDHPDID IEDENWAEFD EDGEQVDSDP YESHSNGHGT HVSGTATGGN ASGEYIGVAP EADLMHGLVL DAGSGSLAQI IGGIEWAVEE DADAVSMSLG VSAYEEAFIE PQMNALDAGT LVIASSGNDG EGSSSSPGND YDSFAIGATD ESEDIAQFSS GEWIDTQIAW GSDAPDHWPE EYVVPDIAAP GVGVNSAQPD GEYDTLSGTS MAAPHVAGTV GLMAAASEDD LSPEQYETAM TAHAWQPDND LDDPNDRFGQ GILDASHSVD QVAADQGING TITDSNGDPI EGVDVELDSG YSVSTDENGE YTLRGTEDTF EFEASAFGYE SVTETVEVSE GEFEQIDITL EDFVDAEPTT LPTGIVSGQN NSFAFETAHL EELSVSMDGN YDEENATLYV DGQEIDFNEA WEAEEPFDEV EIDLYAEDGT TGEMMLEFVF ADGDDEFVFE TDELEVFAEE IDTAVVDAPD GEFGALLASD LDDNLDANYN ITTVDSDEVV DRVDDFDTFV VQRFADSNEQ DEDLVREFFE ATDRADIGVV YLDQYDESGS WSDSIHELSR ATGVPGYTDQ DFVGEIEPVE YEATEEHPIL DGVVEEGERV ELYDTPDHTW FGGHHGNDIA DVGWKNNSAS GHGLATDDAR NNVYAASLGY STLVGDVIPF TEEAEEVLTN SVEHVSESRT ATLDEDQPDR VDPGEAISAE FSVDDIDDVE ITVELDDYHT TTAEENLTLY VQETEYAFGE PIEPDEPIDP TDTRFNDAIN VTVVPEDGET GIVALEHDID VGGESVDGFT GPTSVYEPPI EVGEEGDAET IQDAVDLAIP EATIVVEDGT YKEQVVMWAD ANHDVTIEAA EDAEPSLELP DDAVEMPAYN LQTDDQTPVV YVRHNDGVTI DGFDIDAGGE VGSYATEAHN YTVSDVTVTN ASTGLWSDLS HIGHVVEDSY IEADETGVFY YWAADDGLIQ NNTITGADTG IMTDHVSSGH DVVDNEIYDV GTGMLIDTDD GETDRNVITD ADVGIQLGVS GNVESVDDNH IENASVGLDE GSVNMPPEVH HNYFDSEVGV IDGDWNGPPS YYLNDFSESD TVIETDQRID VRMNYLGERD GQEPIADGPT DYSPHLTSPP SVEEGMDTTE IGYHLQVEAN ESYGIGLPGP SEQTIGEIVP ANFEGAIYGF DADEQEWKMK SGDDSVDTLA ALVVVSESDA KLDFNFQSDD DNPSAPGQHE FEEGWNYVTA PAHLDAESAF EIGSFEPEMI MSLQEAPGNQ LGPEGELDRT HIFGDGDAGY VSPFEGYFVY SESDGTMPSQ VAADPTAEEM FDELNISAHE DHPETATIES VIETEGVSDK ETEEALIALI KNDLMDEIED ANNSTEQFKA LDDAAETIVS DAPDEHETKV QNATSQSIEL VIQAEFGQNI DVEHFDQVVD DEDEGAASSV SSPSLVTP
|
| |
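The sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before computing lengths or base composition. A minimal sketch, assuming plain-text input copied from this record; the helper names `normalize` and `gc_fraction` are hypothetical, and the sample below uses only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the 51% reported for the whole gene.

```python
def normalize(seq_text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence shown in this record.
sample = normalize("ATGGGTGACG AAATGGGAAC GGCAGCGACT")
print(len(sample))          # 30
print(gc_fraction(sample))  # GC fraction of this 30-base sample only
```

Applied to the full gene sequence, `len(normalize(...))` can be compared against the Gene Length field as a quick integrity check on a copied sequence.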