Gene Information |
Locus tag | Nmag_0714 |
Symbol | |
ID | 8823542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 710447 |
End bp | 712498 |
Gene Length | 2052 bp |
Protein Length | 683 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003478861 |
Protein GI | 289580395 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
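The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the listed gene length includes a single terminal stop codon (the gene sequence in this record ends in TGA). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 710447
END_BP = 712498
GENE_LENGTH_BP = 2052
PROTEIN_LENGTH_AA = 683


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both checks agree with the listed gene length of 2052 bp and protein length of 683 aa.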
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.207998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCAG AACGACGCGA TACGCGAAGT CGTCGGTCGA TACTCAGGGG TGCTGGGGCC GTCGGTGGAC TGATCGGAGC CACCGGTCTC TCGAGTGCCA CGCCCGGTCG CAGCCCCGGG CCGAAACAGA CAGAACTGCT TGTCGGGATC AACGACGAGT ACGCCCGCAG TTCAGCAAAC GGTACCGAAC GAACAGCGCG CGAAGCGGTG CCGGCCGATG CGACGGTCGT TCACTCGAAC GAGCAACTCG AGTACGTCGC CATCGAGGTG CCGGCGGAAC TCCCATCGGA GGCGGTCGAA CACGTTCGCG ACGTGCTCGA CTCGAACGAC GCAATCGAGT ACGTCGAGGA GAACGCGACG CTGCAGTCGC TGTACACGCC GAACGACCCG TACTACGACA GCCAGCACGC GCCACAGCAG GTCAACTGCG AGGACGCCTG GGCAGAGACG CTCGGTGACG AAGACGTCAC GATCGCAGTC GTCGACCAGG GAGTCGAATA CGAGCACGAG AACCTGTCCG GAAACGTCGA CGATCGAATC GGCGAGGTCT TCGTCGGCCG GGGAAGTGAT CCCGGGCCAC TCTCGAGAGG CGACACGCAC GGGACGCTCG TTGCCGGCAT CGCCGCCGGC GAAACGGGGA ACGGAACGGG ACAGGCGGGA ATCAGCAACT GCTCACTGCT CGCCGCACGC GCGCTCGACG AGCAGGGTCA GGGTGCGCTT TCGGACATTG CCGATGCAAT TCAGTGGGCA GCAGACGAGG GAGTCGACGT GATCAACCTC TCGCTGGGAA GTTCTAGCGG GCTGTGGACG CTCGAGAACG CCTGTGAGTA CGCCACCGAA CGGGGCTGTC TCGTCGTCGC AGCGGCCGGC AACAGCGGCG GCAGCGTCAT GTACCCGGCT GCATACGACG ACGTGCTGGC CGTCTCGGCG CTCACGTCGC GCGACAGGCT CGCGTCGTTC TCGAACCGTG GCCCCGAGAT CGATCTTGCC GCACCCGGCC AGAACCTCCT CTCGACGACG CTCAACGACG GCTACGACCG AATCTCCGGC ACATCGGCGG CCGCGCCCGT CGTCGCCGGC GTCGCCGGAC TGGTCCGGTC TGTCTATCCG GGGCTCTCGA GTAGCGCGCT GCGCGAGCAC CTGCGGGAGA CAGCAGTCGA TGTCGGCCTC AGGTCAGCCG CACAGGGTGC TGGCCGCGTC GATGCTGGGA ACGCAGTCAC GACAGTGCCG GAGGGGTACG ACGGGCCAGA TGAACGAGAT GGCGACACAG ATGAGGATGA AGAGGACGAC GAGGATGACG AGGAGACAGA CACCGACGGT CATCTGCTCT CGTTCGTTAC CGAGCCAGAG GCATCGTTTG CAGACTACGA ATTCACCGCG ACGGGTCCGG TCGAGTTCAC GACCGCCCCT GGACAGACGC CCTCCGGAGG CACCATCGAG GGAGGCACGT ACGCCGCGGA GGACTACATC GAGGAAGACG ACGAAACCTG GCATGCCGGC GGCGTGACTG GCGGCGGACA GGGTGATGCC TTCAGCGTCG AGGGTGCGAT CACCTCGATC GAGGTCGGCC AGCCAGACGT GATGTGGGTG GAACTCGACG GCGAGCGGCT ATCACCCGAG GAGGTTATCG AGGAGACGAC GGGCGACGAT ACGGACGAAA CGGACGGGGG AGAGGAAGAC GAGAGCGACG AGACCGACGA GACCGACGAG CCACAGTGTG GCGACGAAAC CAACACCGCC CGCGTCGAGG GCGACCTCGA CGGCAGCGGC TGGTGGCCCG ACACCGCGCG CTGGCAATAC 
ACGCCGCAGA CGGAGAACCC GTGCGAACTG ACGCTGACCG TCGACGGACC GTCCGGTGCC GACGTTGAAC TCTACATGAC GCGTGACGGG CGGCGACCGA CCCAGTGGGA CGCCGACGAG TCTGCCACCG CCACCGGCGA CACGCAGTCA CTCACGACGG CCCTCGAGTC CGACGACACG GTTCGGATTC TCATCACTGC GACGGGCGGG AGTGGAACGT ACGAACTCGA AATTGTCGAA CAGGGCTACT GA
|
Protein sequence | MQPERRDTRS RRSILRGAGA VGGLIGATGL SSATPGRSPG PKQTELLVGI NDEYARSSAN GTERTAREAV PADATVVHSN EQLEYVAIEV PAELPSEAVE HVRDVLDSND AIEYVEENAT LQSLYTPNDP YYDSQHAPQQ VNCEDAWAET LGDEDVTIAV VDQGVEYEHE NLSGNVDDRI GEVFVGRGSD PGPLSRGDTH GTLVAGIAAG ETGNGTGQAG ISNCSLLAAR ALDEQGQGAL SDIADAIQWA ADEGVDVINL SLGSSSGLWT LENACEYATE RGCLVVAAAG NSGGSVMYPA AYDDVLAVSA LTSRDRLASF SNRGPEIDLA APGQNLLSTT LNDGYDRISG TSAAAPVVAG VAGLVRSVYP GLSSSALREH LRETAVDVGL RSAAQGAGRV DAGNAVTTVP EGYDGPDERD GDTDEDEEDD EDDEETDTDG HLLSFVTEPE ASFADYEFTA TGPVEFTTAP GQTPSGGTIE GGTYAAEDYI EEDDETWHAG GVTGGGQGDA FSVEGAITSI EVGQPDVMWV ELDGERLSPE EVIEETTGDD TDETDGGEED ESDETDETDE PQCGDETNTA RVEGDLDGSG WWPDTARWQY TPQTENPCEL TLTVDGPSGA DVELYMTRDG RRPTQWDADE SATATGDTQS LTTALESDDT VRILITATGG SGTYELEIVE QGY
|
| |
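The GC content field can be recomputed from the gene sequence as displayed, in space-separated blocks of ten bases. This record does not state how its 65% figure was derived, so the sketch below assumes the simplest definition: the share of G and C among all bases, after removing whitespace. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First two blocks of the gene sequence shown in this record.
    print(gc_percent("ATGCAACCAG AACGACGCGA"))
```

Passing the full gene sequence from the Sequence table (both lines, with the block spacing left in place) gives the value to compare against the listed GC content.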