Gene Information |
Locus tag | Msed_2126 |
Symbol | |
ID | 5104419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2045997 |
End bp | 2046890 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640508015 |
Product | methionine aminopeptidase |
Protein accession | YP_001192189 |
Protein GI | 146304873 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0024] Methionine aminopeptidase |
TIGRFAM ID | [TIGR00500] methionine aminopeptidase, type I; [TIGR00501] methionine aminopeptidase, type II |
|
|
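The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers agree under that assumption), and that the final codon of the CDS is a stop codon that is not counted in the protein length. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2045997
end_bp = 2046890
gene_length_bp = 894
protein_length_aa = 297

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read in triplets; the last codon is assumed to be the stop.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)  # 894 298 0 297
```

Under these assumptions the span matches the reported Gene Length, and 298 codons minus one stop codon gives the reported Protein Length.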
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.110507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0130853 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGG ACGAATTAAA GCTAGTAAAG ACTGCAGGCG AGATAGCTTC TAGGGCCAGG GATATGGGTG CTAGAATGAT AAAACCAGGT GTCAAGGTCA TTGATGTCTG TGAGACCGTA GAAAAGGCCA TAATAGAGGC TGGAGCTAAA CCCGCCTTTC CCTGCAACCT TTCCATAAAC CACGAGGCGG CGCACTACAG CCCAGTAATT GGAGACGAGA AAGTGATTCC TGAGGGAGCA ATAGTAAAGC TGGACATAGG TGCTCATATA GAGGGTTACA TAACCGACAC TGCGGTAACT GTTTACCTCG ACGATAGAAT GGAAAGATTA GCTGAGGCTG CCAAGGACGC ACTGAAATCC GCCATTTCCA ATTTCAAGAT GGGCGCATCA CTCTCGGATA TTGGGAGGGT TATTGAGAAG ACTATAAAGG GATACGGATT CAAACCCATA AGGAATCTTG GGGGACATCT AATCAGAAGA TACGAGCTCC ACGCTGGCAT CTTTGTACCT AACGTCTTCG AAAGGATCTC GGGAAGAATC CAGGGAGGAA ATACCTACGC GATTGAACCA TTTGCCACTG ATGGTGGTGG AGAGGTCGTT GAGGGAAAGG ACGTGACAAT TTACTCATTG CGATCGAAAC AGTTGAGGGG GTTAACCGAA ATCGAGAGGA AATACCTGGA GGAGATTGAT AAAAGGTTCA AGACATTGCC CTTCTCTGAA AGGTGGCTAG CGGACCTTGG TGGCAAAGAA GAGGTTGAGC AGACCCTTAG GAACCTAAGC AAACGCGGAG CACTTCACGC CTACCCAGTT CTACTAGAGG TCAGAAAGGG TATGGTGTCC CAATTTGAAC ATACAGTTTA CGTCGACGAG AAGGGAACTC TCGTTTTAAC CTGA
|
Protein sequence | MTEDELKLVK TAGEIASRAR DMGARMIKPG VKVIDVCETV EKAIIEAGAK PAFPCNLSIN HEAAHYSPVI GDEKVIPEGA IVKLDIGAHI EGYITDTAVT VYLDDRMERL AEAAKDALKS AISNFKMGAS LSDIGRVIEK TIKGYGFKPI RNLGGHLIRR YELHAGIFVP NVFERISGRI QGGNTYAIEP FATDGGGEVV EGKDVTIYSL RSKQLRGLTE IERKYLEEID KRFKTLPFSE RWLADLGGKE EVEQTLRNLS KRGALHAYPV LLEVRKGMVS QFEHTVYVDE KGTLVLT
|
| |
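The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence might be cleaned, summarised for GC content, and translated is given below. It uses the standard codon assignments, which translation table 11 shares for all amino acids (table 11 differs only in its permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and only the first three and last three blocks of the gene sequence are used as input, each translated separately.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()

def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Excerpts copied from the Gene sequence field: first three and last three blocks.
first_blocks = "ATGACCGAGG ACGAATTAAA GCTAGTAAAG"
last_blocks = "AAGGGAACTC TCGTTTTAAC CTGA"

print(translate(first_blocks))  # MTEDELKLVK
print(translate(last_blocks))   # KGTLVLT
```

The two translated excerpts agree with the first and last residues of the Protein sequence field. Passing the full Gene sequence string to `gc_percent` and `translate` would allow the reported GC content and protein sequence to be checked the same way.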