Gene Information |
Locus tag | Msed_1300 |
Symbol | |
ID | 5104551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1278069 |
End bp | 1279781 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507189 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001191382 |
Protein GI | 146304066 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
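The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only.

```python
# Values copied from the Gene Information table above.
start_bp = 1278069
end_bp = 1279781


def gene_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1713 570
```

Both results agree with the Gene Length (1713 bp) and Protein Length (570 aa) rows.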
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATCCCT TTGAATACAT TGAAAACCTA GAAGATCCTA GAACTAAGGC ATTCATAGAG GAGGAAACGA GGAACTCCTC TTTCTTTCAG GAGAGGGCAA AACTTCACTA TCAGCCCATT CTCGAGAGAC TCACCGAGGA AAGGCCCATC ACGTTGGTGG GCACGGAAAA GGGAGTGGCA ATTTTAGTTA GGTCCAAGAG TGGAGTCCAC GCTGAGGTCA ACGGGAACAT CATCAGGAGT GAGAGGGAAC AAGACATCTT CAATTCCCTG GAGAGGGTAT GGAACTCAGA CCTGGTGAGA ATAGGGGTAG GGATAGGAGG ATCTGATCAG GGTTACTCGA TCCTAGTGAA TGAGCAGGGT AAGGTAGTGA GAAGGGTTGA GGGGCTCGTT AACCAGTTTT TCTTCTTAAG GGGCAAGCTG TGTTACGTTA GGGAGTATAG GACAGAGAGC TCACCTGATG GAGTTCCTCC TGCAGTGGAA AGGTTGTTCT GTGGGGAGGA GATGCTCCCC TTTTACCCTG GAAGGGGTGA GTGGATCTCA GTTAAGGCTG AGGGAGATAA CCTTCTCCTG GTTAGGGGAA TAGGTTGGAG CAAGAAGGTA CTCTATCGAG ACTTTGAAAA GGTGGATGAA GGCGATATCA CCTCCTACGA CATGAAGGGA GGAAGGATAT ATTACGTGAA GGGAAACTCT CTCATGCGTG ATGGTGTGGA GTTATTCAAG ATTTCAAGAC CCACACTGGA CATGAAGGTT ATGGACGATG GGATTCTGAC CCTCGAGATC AGGAATTACA AGACGTCTCT AGTGAAGTAC TCAGAGGAGG GGAGGGAGAC CTGGAACTAC ACGACGGACC ACATCCTCAC CTTCGATACA GTTGGCGATC AGATCTACGT CCTGGAGACA TCATTTGACA CGTCATACAC CATCTCCAGG ATAAAGGATC AGAGAGTCGA GGTGCTGAGA AGGGGGAGGG AGGAGAGGCT CACGGTCAAG GAGATTTACG TCCAGGGAGA CGTCCTCCTG CACGGGTTCC TCCTAAGTAA GGGAGGTAAT AGGGGAGTTG TGGTTTACGG TTACGGTGGG TTCGCGATCC CGCTCCTTCC CAGTTACAAT CCTCTATTCC TCGAACTTAT GGACTCTGGT TACTCCGTCC TAGTCACAAA CCTCAGGGGA GGCTTTGAGA ACGGGGAGGA GTGGCACAAG GCGGGGATGC TCAGGAACAA GATGAACGTA TTCAAGGATT TCTCGGAGTT CCTACAGACC GTGAAAATGA TGGGAGGAAG GACAATAGCC ATGGGTGGAA GTAACGGTGG ACTGCTGGTG GGAGCTACCC TTAACCTCTA CACGTCCCTG GTGGACTGTG GAGTCATAGG TTACCCTGTC CTTGATATGT TGAAATTTCA CAAGTACCTC GCTGGTATGT ATTGGGTACC CGAGTACGGT GACCCTGAAA AGGACTCCGA GTTCCTCCTT TCCTACAGTC CCTATCACAA CCTGAAGAAA GGGCTACCTC CAACCCTAGT GTACACAGGG CTTAATGACG ATAGGGTCCA TCCCATGCAC GCCTTGAAAT ACGTTGCTAA GTCTAGGGAG ATGGGAAACA AGGTTTACCT CTTCGTAAAT AGGAGAGCTG GACATAACTT GAGCAGACCG GAGGCAAGTG CCGAGGAGAT GTCCACCGTG GTGGCGTTCG TGGAACAGTG TCACTCACTC TGA
Protein sequence | MDPFEYIENL EDPRTKAFIE EETRNSSFFQ ERAKLHYQPI LERLTEERPI TLVGTEKGVA ILVRSKSGVH AEVNGNIIRS EREQDIFNSL ERVWNSDLVR IGVGIGGSDQ GYSILVNEQG KVVRRVEGLV NQFFFLRGKL CYVREYRTES SPDGVPPAVE RLFCGEEMLP FYPGRGEWIS VKAEGDNLLL VRGIGWSKKV LYRDFEKVDE GDITSYDMKG GRIYYVKGNS LMRDGVELFK ISRPTLDMKV MDDGILTLEI RNYKTSLVKY SEEGRETWNY TTDHILTFDT VGDQIYVLET SFDTSYTISR IKDQRVEVLR RGREERLTVK EIYVQGDVLL HGFLLSKGGN RGVVVYGYGG FAIPLLPSYN PLFLELMDSG YSVLVTNLRG GFENGEEWHK AGMLRNKMNV FKDFSEFLQT VKMMGGRTIA MGGSNGGLLV GATLNLYTSL VDCGVIGYPV LDMLKFHKYL AGMYWVPEYG DPEKDSEFLL SYSPYHNLKK GLPPTLVYTG LNDDRVHPMH ALKYVAKSRE MGNKVYLFVN RRAGHNLSRP EASAEEMSTV VAFVEQCHSL
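The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the nucleotide sequence; it assumes the spacing is purely presentational. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so a standard codon map is used here. The helpers `clean`, `gc_fraction`, and `translate` are hypothetical names, and only the first 30 bases of the gene are included to keep the example short.

```python
from itertools import product

# Codon-to-residue map built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
fragment = "ATGGATCCCT TTGAATACAT TGAAAACCTA"
print(translate(fragment))  # MDPFEYIENL
print(round(gc_fraction(fragment), 2))
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row.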