Gene Information |
Locus tag | Msed_0189 |
Symbol | |
ID | 5103933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 152722 |
End bp | 153777 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506094 |
Product | peptidase M24 |
Protein accession | YP_001190290 |
Protein GI | 146302974 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
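The coordinate and length fields above agree with one another, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though the reported gene length is consistent with it) and that the final codon is a stop codon not counted in the protein length. The function name `cds_lengths` is illustrative and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(152722, 153777)
print(gene_len, protein_len)  # prints: 1056 351
```

The result matches the Gene Length (1056 bp) and Protein Length (351 aa) fields.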
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0275902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.05907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATTATA AGAACAGGGT ACAAAGGGTA AAGGAGCGTC TAAAGGGAAA GGCGGACTAC CTTGTTCTGG GCCCTGGAAG TAACATGTTT TACCTCACGG GTTTCACGGA GGAACCAATG GAGAGACCGA TCCTCCTGAT ACTCGGAGAA CAGGATTACA TGATAGCCCC AAAGATGTAT GAACAACAGT TATCAGGTCT CAGCCTAGAA GTAAGAACTT ACGTGGACGG AGAAGATCCC TATTCTCTTT TACAGATCAA GAAAGGTTCT TCTCTTGCTA TCGACGACCA ACTTTGGTCA ATGTTTCTAG TTAGTATACT TAATAGGTTC TCCCCATCGG ACCTAATCCT GGTTTCACCA CTCATAGCTC CAATAAGATC AGTTAAGGAT GAGGAGGAGA TAGGGATAAT GAAGGAAGGG TTGAAAATTG CAGAGCAATC CTTCATGGAA TTTATTTCGA GGGTTAAGGA GGGGGAGACG GAATGTCGCT TGTCGCAGAT ATTGGAGGGG ATTTTCAGGG AGAATGGAGT AACGCCATCC TTCTCTACAA TCCTCACCTC AGGTCCAAAC ACGGCAATGC CACACCTGAG ATGCACTGAG AGGAAAGTGC GTAAAGGAGA ACCTGTGATT GTGGATTTTG GTATCAAATA CCATGGGTAT TCCACAGATA CCACAAGGGT CGTTACTATT GGGAAGCCAT CACAGGAGGT GACAAAAATT TGGGAAATAG TTCACGAGGC TGTGGTAAAG GCTGAGGAGT CCACCTATGG ATTGTCGGGG ATGAAGATAG ACCAAAGGGC TAGAGGCGTC ATAGAAGGTA GGGGCTACGG TAAATACTTC ATTCATAGAA CCGGACATGG AATTGGAATA GACGTTCACG AGTTCCCCTA CATCTCTCCC GACAACGGCG ATGTGATACC TAGGAACTCC GTTTTTACCA TAGAACCTGG GATCTACATA CCCGAGAAAT TTGGGATTAG AATCGAGGAC ATGGTCATTA TGCGAGACAG GGCGGAGGTT CTCTCTTCCT TGCCTAAGGA GATCTATCAA GTCTAG
Protein sequence | MNYKNRVQRV KERLKGKADY LVLGPGSNMF YLTGFTEEPM ERPILLILGE QDYMIAPKMY EQQLSGLSLE VRTYVDGEDP YSLLQIKKGS SLAIDDQLWS MFLVSILNRF SPSDLILVSP LIAPIRSVKD EEEIGIMKEG LKIAEQSFME FISRVKEGET ECRLSQILEG IFRENGVTPS FSTILTSGPN TAMPHLRCTE RKVRKGEPVI VDFGIKYHGY STDTTRVVTI GKPSQEVTKI WEIVHEAVVK AEESTYGLSG MKIDQRARGV IEGRGYGKYF IHRTGHGIGI DVHEFPYISP DNGDVIPRNS VFTIEPGIYI PEKFGIRIED MVIMRDRAEV LSSLPKEIYQ V
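As a spot check on the two sequences above, the sketch below translates the first 30 and last 18 bases of the gene sequence and compares the results with the two ends of the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted). The helper `translate` is illustrative only, not a library function.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups and final 18 bases of the gene sequence above.
head = translate("ATGAATTATA AGAACAGGGT ACAAAGGGTA")
tail = translate("GAGATCTATCAAGTCTAG")
print(head)  # prints: MNYKNRVQRV
print(tail)  # prints: EIYQV
```

Both fragments match the corresponding ends of the listed protein sequence, and the gene ends in the stop codon TAG.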