Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_1273 |
Symbol | |
ID | 8824105 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1301728 |
End bp | 1303401 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003479415 |
Protein GI | 289580949 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0267289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCACAA GTTCAGCCAC ACGAACACGC GTTCTCGGCA TCGAAGGCAC CGCCTGGGCC GCCAGCGCGG CCGTCTTCGA CACCGAGTCC GACGACGTTT TTATCGAAAC CGACGCCTAC GAGCCAGACA GCGGCGGCAT TCACCCCCGC GAGGCCGCAG AACACATGCA CGACGCCATC CCGCGCGTCG TCGAGACGGC ACTCGCACAC GCCCGGGAGA CGTTCGACGG GCCCGACACA GAGCCGCCCG TAGACGCCGT CGCCTTCTCG CGTGGCCCCG GCCTCGGACC GTGTCTTCGG ACGGTCGGCA CGGCAGCGCG GGCGCTCGCC CAGTCGCTCG ACGTGCCGCT TATCGGCGTG AATCACATGG TGGCCCACCT CGAGATCGGT CGCCACACAG CCGACTTCGA CTCGCCGGTC TGTCTGAACG CGAGCGGCGC GAACGCGCAC CTGCTCGCGT ACCGCAACGG CCGCTATCGC GTGCTCGGGG AGACGATGGA CACCGGCGTC GGCAACGCCA TCGACAAGTT CACCCGCCAC GTCGGCTGGT CCCATCCCGG CGGACCGAAG GTCGAGGCGG CCGCGAAAGA CGGCGAACTC ATCGACTTGC CCTACGTCGT GAAGGGAATG GATTTCTCCT TCTCGGGTAT TATGAGTGCC GCGAAGCAGC GCTACGATAA TGGGATTCCT GTCGAGGACA TCTGTTACTC GCTGCAGGAG ACCATCTTCG CCATGCTGAC GGAAGTCGCC GAACGCGCGC TCTCGCTGAC CGGCAGCGAC GAACTCGTCC TCGGCGGCGG TGTCGGACAG AACGCCCGCC TCCGTGAGAT GCTCGCAGAC ATGTGCGACC AGCGCGGGGC GGACTTTCAC GCACCCGAGC CGCGATTCCT GCGCGACAAC GCGGGGATGA TCGCCGTCCT CGGCGCGAAG ATGTACGAGG CCGGCGAGAC GCTTGCGATC GAAGACTCGC GCGTCGACCC GAACTTCCGG CCGGATCAGG TACCCGTCAC CTGGCGTACG GACGAGCCAG AGCTCGCGGT CGGCCGTGGA GGAGACAGCG CAGGCGAGGA AACAGAACAG GTCCAGGGCG CTGAGGCCGT CGTCAACCTC GACTCCACCA CCGGCCGCGT CACCAAACGC CGACGGCCCA AAGCCTACCG CCACCCCGAC CTCGACGAAC GCCTCCGCAC GGAGCGGACT CGTCTCGAAG CCCGCCTGAC GAACCTCGCC CGCCGCGAGG GCGTCCCTAC ACCCGTGCTC TCGGACATCG ATCCGAAGGA GTCGGTACTC GAGTTCGCCT TCGTCGGCGA CTGCGATCTG CGCGCGGTTC TCGACGACGA GTCGGGGGAG ACGCACGTCC GGAACGTGGG TCGCCATCTC GCGCGACTCC ACCGGGCTGG CATCGTTCAC GGAGATCCGA CGACACGAAA CGTACGGATC GCTGCAGACC GCACCTATCT CATCGACTTC GGACTGGGCT ACCACACCGA CCACGTCGAG GACTACGCGA TGGACCTACA CGTCTTCGAC CAGAGTCTCG TCGGGACCGC GAACGACCCC GAGCCGCTCC GCGAAGCGGT CCGTGAGGGG TATCGCGAGG TCGGCGAGGA GCGGGTACTC GAGCGGCTGC TCGACGTCGA GGGACGCGGC CGGTACGTGG GCGGAGAGAG CTGA
|
Protein sequence | MTTSSATRTR VLGIEGTAWA ASAAVFDTES DDVFIETDAY EPDSGGIHPR EAAEHMHDAI PRVVETALAH ARETFDGPDT EPPVDAVAFS RGPGLGPCLR TVGTAARALA QSLDVPLIGV NHMVAHLEIG RHTADFDSPV CLNASGANAH LLAYRNGRYR VLGETMDTGV GNAIDKFTRH VGWSHPGGPK VEAAAKDGEL IDLPYVVKGM DFSFSGIMSA AKQRYDNGIP VEDICYSLQE TIFAMLTEVA ERALSLTGSD ELVLGGGVGQ NARLREMLAD MCDQRGADFH APEPRFLRDN AGMIAVLGAK MYEAGETLAI EDSRVDPNFR PDQVPVTWRT DEPELAVGRG GDSAGEETEQ VQGAEAVVNL DSTTGRVTKR RRPKAYRHPD LDERLRTERT RLEARLTNLA RREGVPTPVL SDIDPKESVL EFAFVGDCDL RAVLDDESGE THVRNVGRHL ARLHRAGIVH GDPTTRNVRI AADRTYLIDF GLGYHTDHVE DYAMDLHVFD QSLVGTANDP EPLREAVREG YREVGEERVL ERLLDVEGRG RYVGGES
|
| |