Gene Information |
Locus tag | Msed_0575 |
Symbol | |
ID | 5103735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 529405 |
End bp | 531741 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506479 |
Product | peptidase M1, membrane alanine aminopeptidase |
Protein accession | YP_001190674 |
Protein GI | 146303358 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
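The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the arithmetic is consistent with it) and that the protein length excludes the stop codon. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 529405
end_bp = 531741
gene_length_bp = 2337
protein_length_aa = 778


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record: 531741 - 529405 + 1 = 2337,
# and 2337 / 3 - 1 = 778.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

The second check only applies because the record lists the gene as unspliced and not a pseudogene.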
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.509843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAA ACTCGTACGA TATTCATGTG ATCTTCAATT TCAAGGAATC TACCTACAAG GGAACCGAGA TCATTAACCT GGATACTGAG GACGGGGTAG AGTTAGATGC GGTAGGACTA GAGATCCACT CGGTGGAGAT TGACGGAAGA TCAGCTGACT TCAAACTGGA AGACAACAAG GTGAAGGTAA AAACGGGTAA GTTCTCTGGG GACCTGAGGG TAACCTTTTC AGGCAAGGTA AGGGATACCC TCGTCGGGAT ATACAGGGCC CCCTACAACG GAAGTTACAT GTTCTCAACC CAATTTGAAT CAAGTCATGC AAGGGAATTC ATACCGTGCG TGGACCATCC AGCATACAAG GCCAAGTTCA GGCTATCAGT AACGGTAGAT AGGGGACTAC AGGTAATCTC TAACATGCCC GTTAAGGAGA CCAGGGAGGA AGGGGATCAG GTTACCTACG TTTTTCACGA AACTCCTCCC ATGTCTACTT ACCTTCTATA CGTGGGAGTG GGTAAATTCG AGGAATTTAG ATTGCAAAAC GTTCCCGAGA TAATCGTTGC CACAGTTCCA GGAAAGATTA GCAAGGCGAA GTTACCGGCT GAGTTTGCGA GGGATTTCAT CAGGAAATAT GAAGAGTATT ATGGCATAAA GTACCAACTA CCTAAGGTTC ATCTCATTGC GGTTCCAGAG TTCGCCTTTG GCGCAATGGA GAACTGGGGC GCGATCACCT TTAGGGAAAC TGCCCTTCTC GCAGACGAAA AATCTGGCTT CTCCAACATT AGACGCGTAG CCGAGGTAGT AGCACATGAG CTGGCCCATC AATGGTTTGG GAATTTAGTT ACCATGAAAT GGTGGAACGA TCTATGGCTC AATGAGAGCT TTGCCACGTT CATGAGCTAC AAGATAATAG ACATGTTACA TCCAGAATGG TACATGTGGG GGGAGTTCCT ACTGGACGAG ACTGCTGGTG CACTCCTGAA GGATTCCATT CCCACTACCC ATCCCATTGA GACCAAGGTG AATTCTCCGG AGGAGGTTGA ACAAATCTTT GACGATATAA GCTATGGGAA GGGTGCGTCA ATCCTGAGAA TGATAGAGTC CTACATTGGC AAGGATGAAT TCAGAAGGGG GATTTCCAAG TACTTGCAGA AGTTCAGTTA CGGAAACGCC GAGGGAAAAG ACCTATGGAA CAGCTTGGAG GAGGCTTCTG GTAAGCCAGT CTCCAAGATA ATGCCCCACT GGGTATTGGA GGATGGATAC CCCATGGTTA AGGTTCAGAT AGTCGGCAAC CAGCTTGAAC TTACACAGGA GAGGTTTGGA CTTCACCCTG TTCCAGAAAA GACCTATCCC ATACCCATAA CTCTCATGGT CAACGGAGAG AAGAAAGACC TAGTTATGGA AGGGAAGAGC GTCAGAATCG AGGTGGGCCA CGTAAACGAG CTCAAGGTTA ACTTGGATAA GGCCGGATTC TACAGGGTCA TGTACTTTGA TCTGGGACCT GTCCTGGCAT CCGAGCTAAC ACCTGAGGAG CAATGGGGCT TAGCCAACGA TTATTTCGCT TTCCTCCTTG CAGGTAAGGT GTCCCGGGAT GAGTACTTCA AGGTAGTGAG GAGTTTAATG AGCGCTAAGC ATCACCTCCC CGTTCTAGAG CTTGCTGACC AGCTCTCCTT TTTGTATGCC GTAAATAGCC AGAAGTACGG GGAGATAGCG AGGGAGTTCC ATTCCAAGCA GGTAAAGGAG TGGAGTACTA GGCAGGATCC AGTGGGTAGA AGGACTTACT CCACGCTGGC CATGAACCTC 
TCAAAGATGG ATCCCAAGTT TGCTACATCC CTTTCAGCCC AATTCAGCCA GTATGACCAA CTTGACGGGG ACCTGAAGAG CGCGGTTGCA ATAGCCTATG CCGTGTCCGC AGGATCGCAA GCGCTTGATC AACTCTTGAC CATGTATAGA CAGTCAAAGT TTGATGAGGA TAAGACCAGG CTCCTCAATG CGTTGCTTTC CATGAATTCC CCACACTCGG TAGTTAACGT CCTTAGTATG GTCTTCACCG GGGAGATGAA AAAACAGGAC ATCATCAGGT CTCTCCAGTA CTCATTATTC TATCCCAACG TTAGAGATGC CGTATGGGAA TGGATAAAGA TACACTCCAA GAAGGTTGCG GAGATCTACC AGGGGACCGG AATATTCGGC AGAGTTATGG CAGATGTAAT ACCTCTCCTC GGGATAGGCA GGGTAGAGGA AGTGGAGAGA TTCTTCGAGG CCAATCCAAT AAAGGGTGCG GAAAAAGGGA TAAGACAGGG AATAGAGATT CTCAAGGCCG TTTCAAGGAT TGTATAG
Protein sequence | MKINSYDIHV IFNFKESTYK GTEIINLDTE DGVELDAVGL EIHSVEIDGR SADFKLEDNK VKVKTGKFSG DLRVTFSGKV RDTLVGIYRA PYNGSYMFST QFESSHAREF IPCVDHPAYK AKFRLSVTVD RGLQVISNMP VKETREEGDQ VTYVFHETPP MSTYLLYVGV GKFEEFRLQN VPEIIVATVP GKISKAKLPA EFARDFIRKY EEYYGIKYQL PKVHLIAVPE FAFGAMENWG AITFRETALL ADEKSGFSNI RRVAEVVAHE LAHQWFGNLV TMKWWNDLWL NESFATFMSY KIIDMLHPEW YMWGEFLLDE TAGALLKDSI PTTHPIETKV NSPEEVEQIF DDISYGKGAS ILRMIESYIG KDEFRRGISK YLQKFSYGNA EGKDLWNSLE EASGKPVSKI MPHWVLEDGY PMVKVQIVGN QLELTQERFG LHPVPEKTYP IPITLMVNGE KKDLVMEGKS VRIEVGHVNE LKVNLDKAGF YRVMYFDLGP VLASELTPEE QWGLANDYFA FLLAGKVSRD EYFKVVRSLM SAKHHLPVLE LADQLSFLYA VNSQKYGEIA REFHSKQVKE WSTRQDPVGR RTYSTLAMNL SKMDPKFATS LSAQFSQYDQ LDGDLKSAVA IAYAVSAGSQ ALDQLLTMYR QSKFDEDKTR LLNALLSMNS PHSVVNVLSM VFTGEMKKQD IIRSLQYSLF YPNVRDAVWE WIKIHSKKVA EIYQGTGIFG RVMADVIPLL GIGRVEEVER FFEANPIKGA EKGIRQGIEI LKAVSRIV
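The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is an illustrative sketch, not the database's own pipeline. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAG, even though the strand is listed as "-") and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares. Table 11's alternative start codons are not handled, which does not matter here because the gene starts with ATG. All names in the block are hypothetical helpers.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGAAGATAA ACTCGTACGA TATTCATGTG"
print(translate(first_groups))  # MKINSYDIHV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and the GC content field of this record.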