Gene Nmag_4209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_4209 
Symbol 
ID8828943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013924 
Strand
Start bp247861 
End bp252717 
Gene Length4857 bp 
Protein Length1618 aa 
Translation table11 
GC content51% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003482279 
Protein GI289937677 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGACG AAATGGGAAC GGCAGCGACT GACAAATATG ACTTCTCTTT AGATGAATCA 
ATACCATCGT CTATTGGTAC TGATGATCCT GACCAGTCAA ATAAAGGCGA AGTGATAATT
GATGAGAACA CTTTTCAGGC TGAAGGCACC GTCGAAGTGC TTGTATCTGC CGAAGATGTC
GAACTTCCAA TTGTAACCGA TGTTGACGTC AAGACGGCCC TTCAAACGAC TGCGGAAGAA
AGCCAGGAAC CAATAGTCGA ATACGCGGAG TCGACTGAGG GAGTAGAGGT TCTGAACCAG
TTCTGGATTA CGAACGCATT GCTACTTGAA CTGGATAAAG ACCAAGTTAG TGCAACCGAG
ATTGCTGCCC AAGAAGGGGT CGAGTCAATC GACTACAACG CCGATATTGA ACTGGATGAA
CCAGTAGAGT ATGACGACGA CGTCAATACT ACCTACGGAC TTGATCAAAT CAATGCACCC
GATGTGTGGG ATGACCACGA CACAATGGGC GAAGGTGCAG AGATTGCAAT CCTTGATACC
GGTGTCGACC CCGACCATCC CGATATAGAC ATCGAAGATG AAAATTGGGC CGAATTTGAC
GAAGATGGCG AGCAGGTTGA CAGCGACCCG TATGAATCTC ACTCCAACGG GCACGGGACT
CACGTTTCCG GTACTGCGAC CGGCGGTAAT GCATCAGGAG AGTACATTGG CGTTGCTCCT
GAAGCTGATC TGATGCATGG TCTCGTGCTG GACGCTGGCA GTGGTAGTTT GGCACAGATC
ATCGGAGGAA TCGAATGGGC CGTTGAAGAA GATGCCGACG CTGTTTCGAT GAGTCTCGGT
GTATCCGCCT ACGAAGAAGC GTTCATCGAG CCTCAGATGA ACGCGTTAGA TGCTGGTACA
TTGGTGATCG CTTCTTCTGG TAACGATGGC GAGGGTAGCT CAAGCTCACC TGGCAATGAT
TACGATTCAT TTGCCATTGG TGCCACTGAC GAATCAGAAG ATATCGCTCA GTTCTCAAGT
GGTGAATGGA TCGACACCCA AATTGCCTGG GGTTCTGACG CTCCTGACCA CTGGCCTGAA
GAGTACGTTG TGCCAGACAT TGCAGCACCA GGAGTTGGTG TTAACAGTGC TCAACCCGAT
GGCGAGTACG ATACGTTGTC TGGCACCTCC ATGGCAGCGC CTCACGTTGC TGGAACTGTT
GGACTGATGG CCGCAGCAAG TGAAGATGAT CTCTCTCCGG AACAGTACGA AACTGCGATG
ACCGCTCACG CTTGGCAGCC TGATAATGAC CTAGATGACC CCAATGATCG ATTTGGCCAA
GGCATTCTAG ACGCAAGCCA CTCGGTTGAT CAGGTTGCAG CTGACCAAGG GATTAACGGA
ACCATCACGG ATAGCAATGG AGATCCAATA GAGGGTGTTG ATGTCGAACT TGACAGCGGT
TACTCGGTCT CAACCGATGA AAACGGGGAA TACACTCTTC GAGGGACGGA AGACACATTT
GAGTTTGAAG CTAGCGCCTT TGGGTATGAG TCCGTTACAG AGACTGTCGA GGTTTCCGAG
GGTGAGTTTG AGCAGATAGA CATCACGCTC GAAGACTTCG TTGATGCAGA ACCAACAACA
TTGCCCACCG GGATCGTTTC TGGACAGAAT AACTCCTTTG CGTTCGAAAC TGCTCATCTC
GAAGAGTTGT CCGTATCGAT GGATGGGAAC TACGATGAAG AGAATGCAAC CCTCTACGTC
GATGGACAAG AGATAGACTT CAACGAAGCA TGGGAGGCTG AGGAACCATT CGACGAGGTA
GAGATTGATC TCTACGCGGA AGATGGTACC ACCGGCGAGA TGATGCTCGA GTTCGTATTT
GCCGATGGTG ATGATGAATT CGTGTTCGAA ACGGACGAAC TCGAGGTCTT CGCTGAGGAA
ATCGATACTG CAGTTGTCGA CGCCCCAGAC GGTGAATTTG GCGCCTTGCT TGCCAGCGAT
CTCGATGACA ACCTTGATGC GAATTATAAC ATCACAACCG TTGACTCAGA CGAGGTGGTA
GACCGTGTCG ACGATTTCGA CACGTTCGTT GTTCAACGGT TCGCTGATTC AAACGAACAA
GATGAGGACC TAGTGCGGGA GTTCTTCGAG GCCACTGACC GAGCGGATAT TGGCGTTGTG
TACCTCGATC AGTACGACGA GTCAGGGTCC TGGAGTGACA GTATCCATGA ACTGTCTCGT
GCGACCGGTG TACCTGGTTA CACTGATCAA GACTTTGTTG GCGAAATTGA ACCCGTAGAG
TACGAGGCAA CAGAAGAGCA CCCCATTCTT GATGGCGTTG TTGAAGAAGG TGAGCGGGTT
GAACTATATG ACACACCAGA CCACACCTGG TTCGGCGGCC ATCACGGAAA TGACATCGCC
GATGTAGGTT GGAAAAACAA CAGCGCTAGC GGTCATGGTC TAGCGACAGA CGATGCACGA
AACAATGTCT ATGCCGCCTC GCTGGGGTAC TCCACGCTGG TTGGGGATGT AATTCCGTTC
ACTGAGGAAG CCGAGGAAGT GCTGACCAAC TCCGTTGAAC ATGTTAGCGA GAGTCGTACT
GCGACACTCG ATGAGGATCA ACCCGATCGT GTAGATCCCG GTGAAGCCAT CTCCGCAGAG
TTCAGTGTTG ACGATATCGA TGATGTCGAA ATCACCGTTG AGTTAGATGA CTACCACACT
ACAACTGCGG AAGAAAACCT GACACTGTAC GTCCAAGAAA CAGAATATGC CTTCGGAGAA
CCTATCGAAC CAGATGAGCC AATCGACCCA ACCGATACAC GGTTCAACGA TGCAATTAAC
GTTACCGTGG TTCCGGAAGA TGGCGAAACG GGAATTGTGG CGCTCGAACA TGATATCGAC
GTTGGCGGAG AATCAGTCGA CGGGTTCACT GGTCCAACAT CGGTTTATGA ACCACCAATC
GAAGTAGGTG AGGAGGGCGA CGCAGAGACG ATACAAGATG CAGTTGATCT GGCTATTCCT
GAAGCGACTA TCGTCGTTGA GGATGGGACC TACAAAGAGC AGGTGGTAAT GTGGGCGGAC
GCGAATCACG ACGTGACAAT CGAAGCAGCT GAAGACGCAG AGCCATCGCT CGAGCTTCCT
GATGATGCGG TCGAAATGCC CGCCTATAAC TTACAGACTG ACGATCAAAC TCCAGTCGTA
TATGTACGAC ATAACGACGG AGTCACGATC GACGGATTCG ACATTGACGC AGGTGGAGAG
GTCGGCTCCT ACGCGACCGA AGCGCACAAC TACACGGTCA GTGATGTCAC CGTAACAAAT
GCTTCGACTG GTTTGTGGTC TGATCTCAGT CACATTGGCC ATGTTGTAGA AGATTCCTAT
ATCGAAGCTG ATGAGACAGG CGTCTTCTAC TACTGGGCTG CCGACGATGG ATTAATCCAG
AATAACACCA TTACCGGAGC TGATACGGGA ATCATGACCG ACCACGTCTC TTCGGGTCAC
GATGTGGTCG ATAACGAGAT TTACGATGTC GGAACCGGGA TGCTAATCGA CACCGATGAT
GGTGAGACCG ATCGCAACGT CATCACAGAC GCGGATGTTG GTATTCAGTT AGGCGTTTCA
GGTAACGTTG AGTCTGTCGA TGATAACCAC ATCGAAAACG CATCTGTCGG CCTTGATGAG
GGATCGGTCA ACATGCCCCC TGAAGTCCAT CACAACTACT TCGATTCGGA GGTCGGTGTC
ATCGATGGCG ACTGGAATGG CCCACCATCG TACTATCTAA ACGATTTCTC CGAATCTGAC
ACGGTCATAG AAACGGACCA AAGGATCGAT GTGCGCATGA ACTACCTCGG CGAGCGCGAT
GGACAAGAAC CCATCGCTGA CGGCCCAACC GACTACAGTC CTCACTTGAC CAGCCCACCA
AGCGTTGAGG AGGGCATGGA TACGACTGAG ATTGGATACC ATCTTCAGGT CGAAGCCAAC
GAATCCTACG GTATCGGGTT GCCTGGTCCA AGCGAACAGA CAATTGGAGA GATCGTCCCA
GCCAACTTCG AAGGCGCCAT CTACGGATTT GACGCTGATG AGCAAGAGTG GAAAATGAAG
TCTGGTGACG ACTCGGTAGA TACGCTCGCG GCACTCGTGG TCGTATCTGA AAGCGACGCC
AAATTGGACT TCAACTTCCA GTCAGACGAT GATAATCCAT CGGCACCAGG ACAGCACGAA
TTCGAAGAGG GATGGAACTA CGTGACCGCG CCAGCACACC TAGACGCAGA ATCCGCCTTT
GAAATTGGCT CATTTGAACC AGAGATGATA ATGAGCCTCC AGGAAGCCCC TGGAAACCAG
CTCGGGCCTG AAGGGGAATT AGATCGGACG CATATCTTCG GTGACGGTGA TGCTGGGTAT
GTCAGTCCCT TTGAAGGGTA CTTCGTCTAC AGCGAGTCGG ATGGCACGAT GCCATCTCAA
GTCGCAGCCG ACCCAACTGC CGAAGAGATG TTCGATGAGC TGAATATCTC TGCCCACGAA
GACCATCCGG AAACCGCGAC CATCGAGTCA GTCATCGAGA CTGAAGGAGT CTCTGATAAG
GAAACTGAAG AGGCCCTGAT CGCGCTCATT AAAAATGATC TCATGGATGA AATTGAGGAT
GCAAATAACT CAACAGAGCA ATTCAAAGCG CTCGATGATG CCGCCGAGAC TATTGTCTCG
GATGCACCAG ACGAGCATGA GACGAAAGTG CAGAATGCCA CCTCTCAATC GATTGAACTA
GTCATTCAGG CTGAGTTCGG GCAGAATATT GACGTAGAAC ATTTCGATCA GGTTGTTGAT
GATGAGGACG AGGGCGCTGC ATCCTCAGTA TCTTCACCAT CCCTGGTAAC TCCCTGA
 
Protein sequence
MGDEMGTAAT DKYDFSLDES IPSSIGTDDP DQSNKGEVII DENTFQAEGT VEVLVSAEDV 
ELPIVTDVDV KTALQTTAEE SQEPIVEYAE STEGVEVLNQ FWITNALLLE LDKDQVSATE
IAAQEGVESI DYNADIELDE PVEYDDDVNT TYGLDQINAP DVWDDHDTMG EGAEIAILDT
GVDPDHPDID IEDENWAEFD EDGEQVDSDP YESHSNGHGT HVSGTATGGN ASGEYIGVAP
EADLMHGLVL DAGSGSLAQI IGGIEWAVEE DADAVSMSLG VSAYEEAFIE PQMNALDAGT
LVIASSGNDG EGSSSSPGND YDSFAIGATD ESEDIAQFSS GEWIDTQIAW GSDAPDHWPE
EYVVPDIAAP GVGVNSAQPD GEYDTLSGTS MAAPHVAGTV GLMAAASEDD LSPEQYETAM
TAHAWQPDND LDDPNDRFGQ GILDASHSVD QVAADQGING TITDSNGDPI EGVDVELDSG
YSVSTDENGE YTLRGTEDTF EFEASAFGYE SVTETVEVSE GEFEQIDITL EDFVDAEPTT
LPTGIVSGQN NSFAFETAHL EELSVSMDGN YDEENATLYV DGQEIDFNEA WEAEEPFDEV
EIDLYAEDGT TGEMMLEFVF ADGDDEFVFE TDELEVFAEE IDTAVVDAPD GEFGALLASD
LDDNLDANYN ITTVDSDEVV DRVDDFDTFV VQRFADSNEQ DEDLVREFFE ATDRADIGVV
YLDQYDESGS WSDSIHELSR ATGVPGYTDQ DFVGEIEPVE YEATEEHPIL DGVVEEGERV
ELYDTPDHTW FGGHHGNDIA DVGWKNNSAS GHGLATDDAR NNVYAASLGY STLVGDVIPF
TEEAEEVLTN SVEHVSESRT ATLDEDQPDR VDPGEAISAE FSVDDIDDVE ITVELDDYHT
TTAEENLTLY VQETEYAFGE PIEPDEPIDP TDTRFNDAIN VTVVPEDGET GIVALEHDID
VGGESVDGFT GPTSVYEPPI EVGEEGDAET IQDAVDLAIP EATIVVEDGT YKEQVVMWAD
ANHDVTIEAA EDAEPSLELP DDAVEMPAYN LQTDDQTPVV YVRHNDGVTI DGFDIDAGGE
VGSYATEAHN YTVSDVTVTN ASTGLWSDLS HIGHVVEDSY IEADETGVFY YWAADDGLIQ
NNTITGADTG IMTDHVSSGH DVVDNEIYDV GTGMLIDTDD GETDRNVITD ADVGIQLGVS
GNVESVDDNH IENASVGLDE GSVNMPPEVH HNYFDSEVGV IDGDWNGPPS YYLNDFSESD
TVIETDQRID VRMNYLGERD GQEPIADGPT DYSPHLTSPP SVEEGMDTTE IGYHLQVEAN
ESYGIGLPGP SEQTIGEIVP ANFEGAIYGF DADEQEWKMK SGDDSVDTLA ALVVVSESDA
KLDFNFQSDD DNPSAPGQHE FEEGWNYVTA PAHLDAESAF EIGSFEPEMI MSLQEAPGNQ
LGPEGELDRT HIFGDGDAGY VSPFEGYFVY SESDGTMPSQ VAADPTAEEM FDELNISAHE
DHPETATIES VIETEGVSDK ETEEALIALI KNDLMDEIED ANNSTEQFKA LDDAAETIVS
DAPDEHETKV QNATSQSIEL VIQAEFGQNI DVEHFDQVVD DEDEGAASSV SSPSLVTP