Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0440 |
Symbol | |
ID | 5105436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 392237 |
End bp | 393847 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506346 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001190541 |
Protein GI | 146303225 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTAT CTTACCTTAA CTACACGCCC GTTCAAGACT TCGATGTTAA GGATGGGCGA GTCGCATACG TGATCCTGAA GGACTCTCCT AAGGTTGAAA TTCTCGGAGT GGGAGAGGTG CAGATAGAGG AGCCAGAGAC CGTTCACTGG GTGGGATCTA GGCTGGCTGT GGTCGCCGAT CAGGGTGGGG CAGAGGTTAG GTCCATTTAC CTGGTGGATG AAGCTCCACA TCCCTTACTT TCCGACGGAT TTGACAACAT GGAACCTGTC TTCCTCAAGG AGGACAAATT CTACTTCCTT TCCAATAGGG ATAGGGAAAC GATTCGCCTC TACCTATATG ACGGAGGACA GATCACGAAG GTAAGCAAGG GTAACCTTCC AGTATCTGAC GTTTGTGTCT CCCCTGGGGG AAGATGGGTA GCCTACTCCT CAGGAATCTA CGATAATGAC CTTTACCTTC TTGATAGCAA GACAGGGGAA GAGGTAGTCG TGTCATATCC TAACTCAGAG CAGTATCCAA GTTCGTCTCA ATGCTTCACA GGGGATTCGC TTCTCTTTCT TAGCAATCAC AACGGCTTCC TTGATGTCGG GAAACTCTCG TTAAGGGATC ACACAGTTTC TTGGTTAGTC ACGAGCAAGG AGGACAAGTT CGAGGCTCTG ATGTGGAGGG ACAGGTTGGT GTACACTGTG GACGTGCGAG GTCATATTCT GCTCATGGTG GACGGAAAGC CCCTAACTGA CCAGGGGGTA GTGACTGACG TGAAGGTCGA TAGAGACCTC TTCTTCCTTC ACTCGTCATA CGACAGAGCA TACGATCTTT ACAGGCATTC CACCGTGACT GAAAGGTTAA CGGACTCAAT GAGGGAGGTT AAGGGAGAAT TCGTGAAACC CACTCTCGTA AAGTACGTCT CCCTAGGAGA GGAGATTGAT GGGCTCCTTT ACCAAAGGGG AGGGGAGAAA CGTGGAGTAG TCTACATCCA CGGAGGTCCA GATTACGAGT GCCTAAGTAA CTACTCAGCT GAGATTCAGA TGTTGGTGGA CCAAGGATTC AAGGTCATAT GCCCGAATTA TAGGGGCTCG ACCGGTAGGG GAAGGAGGTT CAATCACCTC AATGACAGGG ACCTTGGTGG AGGCGACCTA GTGGATGTGG TGGAGTCAGC TAGCCTCTTG AAGGTTCCCA AGGTTGCGGT GACAGGGGCG AGTTATGGAG GATACCTAAC CATGATGGCA GTAACCAAGT ACCCTGAGAA ATGGTGTGCT GCAGCTGCTG TGGTTCCATT CGTTAACTGG TTCACGGAAA AGAAGATGGA AAGAGAGGTA CTCAGGCAGT ACGACGAGGT AAAGATAGGT AATGACGAGG AACTACTCAG GGATAGGTCC CCCGTGTATT TCCTGGACAG GGTTAGGGCA CCACTCCTCC TTCTAGCTGG GGAAAACGAT CCCAGATGTC CTGCTGAGGA GACGCTACAG GTAGTGGAGA AGATGAAGGA GATGGGAAGA ACCGTGGAGT ACAAGATCTA CGAGAACGAG GGTCATGGAT TCGTTAAGAG GGAAAACCTG GTGGATTCCA TAATAAGGGT AGTGGAATTT CTAGATAAAA ATTGTAAATA G
|
Protein sequence | MNLSYLNYTP VQDFDVKDGR VAYVILKDSP KVEILGVGEV QIEEPETVHW VGSRLAVVAD QGGAEVRSIY LVDEAPHPLL SDGFDNMEPV FLKEDKFYFL SNRDRETIRL YLYDGGQITK VSKGNLPVSD VCVSPGGRWV AYSSGIYDND LYLLDSKTGE EVVVSYPNSE QYPSSSQCFT GDSLLFLSNH NGFLDVGKLS LRDHTVSWLV TSKEDKFEAL MWRDRLVYTV DVRGHILLMV DGKPLTDQGV VTDVKVDRDL FFLHSSYDRA YDLYRHSTVT ERLTDSMREV KGEFVKPTLV KYVSLGEEID GLLYQRGGEK RGVVYIHGGP DYECLSNYSA EIQMLVDQGF KVICPNYRGS TGRGRRFNHL NDRDLGGGDL VDVVESASLL KVPKVAVTGA SYGGYLTMMA VTKYPEKWCA AAAVVPFVNW FTEKKMEREV LRQYDEVKIG NDEELLRDRS PVYFLDRVRA PLLLLAGEND PRCPAEETLQ VVEKMKEMGR TVEYKIYENE GHGFVKRENL VDSIIRVVEF LDKNCK
|
| |