Gene Information |
Locus tag | Nmag_3971 |
Symbol | |
ID | 8828705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | + |
Start bp | 7934 |
End bp | 9949 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003482069 |
Protein GI | 289937467 |
COG category | |
COG ID | |
TIGRFAM ID | |
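The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of this unspliced CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length_bp(7934, 9949)
print(length)                     # 2016
print(protein_length_aa(length))  # 671
```

Both values match the Gene Length and Protein Length fields of the record.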
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.330522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATCAA CACAACTGTG GGGTATGCTC ACTCTCAACG ATGTTCTCGA TCTTGAATGG CCCACTGCCC CCAAGTGGTC GTCGTGCGGT TCGTTTCTCG CCGCAACGGT ACACGAGGAC GATGGGAAGG TATTGCTCGT TGGTCACCCA GGAAAAGAGC CGTTGCGCGT CCGACCGGCC GACGAACACG TCGCCGAGTT CACGTGGTCG CCAACGGAGC CACAACTCGT CTGTACGACC GAGGATGGAT TGACTGCCCT CGTTGATCCA CGTGAGCAAT CCGTCTCGGA GATTGCGCGC ACACCCGACG GCGACATCTT GCCCACCTGG TCGCACGATG GGTCGCGCCT CGCGTTCTAT CGGGATGGCA AACCCCGAAC TAAGTCACTG ACCGACGGCG TCGAACGCGG GTTCGATGTT CCGGAGCGGG GCCCATTTCT TGGTGGAGAA CGCGCGTTCG CCTGGCGTGG GGACGGGCTG CTTGCCTACC GATTCACAGA CTGCGAGACG AAGTGCGTCG GCGTGATCAA CACCGAAAGC GGGGACCTCG TCTGGCGAAC CCGCCCTGAT TCTTCCTCTC ATTCACCGTT CTGGCTCGAC GATGGACGGC TGTGCTACGA GCGCCGAGGT GAGTACGGTA CCGTCCGCGA GTTTATCGCC GTCGACATCG ATGATCCTGC GACCGAGAGT ACACTCTTCC AGGAGGTCGA CGAAGAGACC GGCGCACTCT CGCGTGGGTC ACCACGTCTC TCCCCCGATG GAACACTGCT GGCCGCGGCG CTCCCAGTCG ATGGCTACGA ACACATCCAC GTGATCGATG TCTCCTCAGG CGAGCGAACG CAACTCACTG AGGGGGCCTT CGAGGACAAA GGACTCGCCG ATTCTACGCC GCGATGGATC GACGATGAAC GATTAGTTTT CGCATCAAAC CGGCGAGACA CAGGACAGCG ACAGCTCTAC ACGGTCACGC TCGATGGCAC GACAGAACCG CTGGTCGAAA CGGCGGGAAC AAACGTCGAA CCGCGACCGT CGCCAGCCGG CGATCACGTC GCCTACCTCC ATGCAAGCCG CGACCGGTCT CCGGAGGTTC GTGTCCGCGA ACTCGAGTCC GATTCGGCTG ATGCTGCCCC GACCGAACCA GCGACGCAAA CCGACGCCAA GCTGACCCAC TCCGGTCTCC GCGACTGGCC GGAACCACCG ATCGAACCGG AGCGAGTCTC GTTCGAGAGT GGTGAGCTGA CGATCGAGGG ATATCTCCTC GACCCACGCC AAAGCGAGTC CGTGCCCGAC GACGCAACCG ACCTCCCGAG TGTCGTCTAC GTCCACGGCG GGCCGATGCG ACAGATACGT GATGGCTTCC ATCCCTCCCG ATCGTACGGG CTTGCGTACG CGTACCAGCA GTACCTCGCC ACGAACGGGT ACGTCGGTCT GTTCGTGAAC TACCGTGGTG GCATCGGCTA CGGCCGGGCG TTCAGAGGAG CGATCGGCGG CGACCGAGGG AGGGTCGAGA TGGACGACAT CGCTCGCGCC GCTGACTATC TGCGTGCTCT CGAGTACACC GCTGATTCAG TAGGACAATG GGGACTCTCC TACGGCGGCT ACGCCGCGCT TCAGCTCCCC GGAACCCACC CTGGAACATT TGATGTCACG GTCAACATCG CCGGACTCGC CGATACAGCG AACTACCACG AGTGGGCGAC CGAGACGAAA TTCCCCGCCA TCGCCTCCGC AGCGACGACC GTGATGGGCC ATCCACTCGA GAACCCCGAC CGGTGGGATG ACGCTAGCCC GGTCACCCAC 
ATGGACCGAT ACGAGACGCC GGTGTACAAC TTCCACGGCA CGGCTGACCG GTACGTGAAC GTCGAGCAGC AGGATATCGT GGTGAACACG CTGCTCGATC TCGATGTTGA GTTCGAGGCC GAGTACTATC CTGACGAAGG GCATGTGTTC TCGAAGCGAT CGACGTGGCG GCGAACGTTC GAGAAGATCG AAGCGGCGTT CGACGAGCAC CTCTAG
|
Protein sequence | MESTQLWGML TLNDVLDLEW PTAPKWSSCG SFLAATVHED DGKVLLVGHP GKEPLRVRPA DEHVAEFTWS PTEPQLVCTT EDGLTALVDP REQSVSEIAR TPDGDILPTW SHDGSRLAFY RDGKPRTKSL TDGVERGFDV PERGPFLGGE RAFAWRGDGL LAYRFTDCET KCVGVINTES GDLVWRTRPD SSSHSPFWLD DGRLCYERRG EYGTVREFIA VDIDDPATES TLFQEVDEET GALSRGSPRL SPDGTLLAAA LPVDGYEHIH VIDVSSGERT QLTEGAFEDK GLADSTPRWI DDERLVFASN RRDTGQRQLY TVTLDGTTEP LVETAGTNVE PRPSPAGDHV AYLHASRDRS PEVRVRELES DSADAAPTEP ATQTDAKLTH SGLRDWPEPP IEPERVSFES GELTIEGYLL DPRQSESVPD DATDLPSVVY VHGGPMRQIR DGFHPSRSYG LAYAYQQYLA TNGYVGLFVN YRGGIGYGRA FRGAIGGDRG RVEMDDIARA ADYLRALEYT ADSVGQWGLS YGGYAALQLP GTHPGTFDVT VNIAGLADTA NYHEWATETK FPAIASAATT VMGHPLENPD RWDDASPVTH MDRYETPVYN FHGTADRYVN VEQQDIVVNT LLDLDVEFEA EYYPDEGHVF SKRSTWRRTF EKIEAAFDEH L
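The relationship between the gene sequence and the protein sequence can be illustrated with a short, dependency-free sketch. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; since this gene begins with ATG, alternative starts are ignored here as a simplification. The helper names `translate` and `gc_fraction` are hypothetical, and no output is claimed for the full-length gene.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGAATCAA CACAACTGTG GGGTATGCTC"))  # MESTQLWGML
print(translate("GACGAGCAC CTCTAG"))                  # DEHL
```

The two printed fragments correspond to the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.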