Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0395 |
Symbol | |
ID | 5103638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 343473 |
End bp | 345224 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506301 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001190496 |
Protein GI | 146303180 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCCCA AGGAAGCGTA TTCCTTGAAA GTCATATCTG AAGTCAAACT TGAGTCCCAG GGACTCATAC ACGGTGAGAC TTGGATAAAG GACAATGCCT ACTTCACGAC CATCTTCCTG AACAATAGAC CAATCCTAGA GGGCAAGGTC TCCCTCCCTA GGTTCCTGGG AGAAGATCTA TATTACGTTA GGAATGATGG TTCAGCAACA CTGCTGGTTC AGTCACCCTA TGGGGAACCC AGAAAGCTCG CGGAGCTGGG TAAGATACTC AAGTTCGAAA AGCACGAGAA AGGCCTACTC ATCCTGGGAG AAGATCTTCT GGATAAGCAA GCTCCCTCAG CACCCTTTAT CACGGAAAAG AGGAAGTACA GGTTTGACGG AAGGGGGCTC CTCAGGACGA GGACCTCCCT CTACCTAGTG AAGGGCAACG ACGTGGTCAA GGTCCTGGGA GGAGATTTTG ACGTCACTGA CTTCTCCACT AACGGTAAGA GGGTAGTTGT ATCTACTACC CAACCAAATG ACGACCTAGG CTTGAATGCC CTTTACGAGC TTGATCTTGA GACCGGAGAG ACCAGGAGGA TAACCAAGGA GGACGGTATG ATAGTCGCAG TTGCCATGAA CTCTGACGGG GACGTTGCGT ACCTAGGACA TGATAAGGGG AAGTCTCCGT GGGCAGTGAG GGAGGTGATC TTCCCTGAAA GAGGGGAGAG ATACCTGTGC GGAAACACGT GCGGTTCCAC GGTCCTCACA GACGTCTTTG ATGGGGCTAA GGAAAGGCTA GTTTTCCTGA AGAACCAGGT CATCACCTTG GGCCAGATGG GAGGAGAGGT AAACCTGTAC CGGATAAGCG ACAGGAAGGT TGACAAGTTG ACTGAAGGGA AACAGGTGGT GAGGTTATTC GACTACGACG GGAACTCCCT GGTTTACTCC TTCATGACAC CTGAAAAGCC CTCCCTTCTG TTCCGTGGAG AGGTATACGA TCCGGACCCA AATGTGAAAG GGCTCATGCC CGTGAGGGTT AGCTCCAAGA TTGAGGGGTG GGGCATCATC ACGGGAGATA AGCCCACAAT CCTCTTCATT CACGGTGGGC CACATATGGC CTACGGTTAC GGTTACTTCA TCGAGTTTCA ATTCTTCGCC TCAAACGGTT TCAACGTGAT TTACGCTAAC CCAACAGGAA GCCAGGGTTA TGGAGAGGAG TTCGCCAAGG GATGCGTTGG GGACTGGGGA GGAAGGGACA TGGCAGAACT ACTGGAGTTT GTGGAGGACG CTAGGAGGCA GTTTAACCTG ACTAAGAGGA TGGGAGTCAC GGGAGGGTCC TATGGAGGTT TCATGACAAA CTGGATCATT ACTCACTCTG AGATCTTTTC AGCTGCAGTG AGTGAGAGGG GTATCTCGAA CCTAGTTAGC ATGTGCGGTA CGAGCGACAT AGGCTTCTGG TTCAATGCCG TGGAGTCAGG GGTCGATGAT CCTTGGAATC CAGAAAACAT GGAGAAGTTA ATGAGAATGT CCCCAATATA CTACGTTGGG AAAGTAAGTA CTTCCACCAT GTTCATTCAT GGGGAAGAGG ATTACAGGTG CCCCATAGAA CAGGCGGAGC AGTTTCACGT GGCCCTTAGA TCTAGGGGAG TCGAGAGCAA GCTGGTGAGA TATCAGGGAG ACGGGCATGA ACACGCAAGG AGAGGGAGAC CAGACAACAT GATGCACAGG TTAACAATAA AGTTACAGTG GTTCAAGGAC CACCTCACGT AA
|
Protein sequence | MDPKEAYSLK VISEVKLESQ GLIHGETWIK DNAYFTTIFL NNRPILEGKV SLPRFLGEDL YYVRNDGSAT LLVQSPYGEP RKLAELGKIL KFEKHEKGLL ILGEDLLDKQ APSAPFITEK RKYRFDGRGL LRTRTSLYLV KGNDVVKVLG GDFDVTDFST NGKRVVVSTT QPNDDLGLNA LYELDLETGE TRRITKEDGM IVAVAMNSDG DVAYLGHDKG KSPWAVREVI FPERGERYLC GNTCGSTVLT DVFDGAKERL VFLKNQVITL GQMGGEVNLY RISDRKVDKL TEGKQVVRLF DYDGNSLVYS FMTPEKPSLL FRGEVYDPDP NVKGLMPVRV SSKIEGWGII TGDKPTILFI HGGPHMAYGY GYFIEFQFFA SNGFNVIYAN PTGSQGYGEE FAKGCVGDWG GRDMAELLEF VEDARRQFNL TKRMGVTGGS YGGFMTNWII THSEIFSAAV SERGISNLVS MCGTSDIGFW FNAVESGVDD PWNPENMEKL MRMSPIYYVG KVSTSTMFIH GEEDYRCPIE QAEQFHVALR SRGVESKLVR YQGDGHEHAR RGRPDNMMHR LTIKLQWFKD HLT
|
| |