Gene Information |
Locus tag | Msed_0493 |
Symbol | |
ID | 5103655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 449008 |
End bp | 450183 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640506399 |
Product | amidohydrolase |
Protein accession | YP_001190594 |
Protein GI | 146303278 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
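
The length fields above follow from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the stop, which encodes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(449008, 450183)
print(gene_len, protein_len)  # 1176 391
```

Under these assumptions the result agrees with the Gene Length (1176 bp) and Protein Length (391 aa) fields of this record.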
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.642244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCTC AAAGGATTTA CAACGAAGCG AGGGAAATAG AGGATAAGGT AATAGAGTTA AGAAGGAAGA TACACGAAAA TCCAGAACTC TCATACCAGG AGTATGAGAC TGCGAAGTTA GTCGCCAACT ACCTGAGGTC TCTTGGGATA GACGTCAGGG AAGGGGTAGG GACCGAGACA GGGGTCCTCG GCGTCATCAA GGGAAGGAGA AGCGGAACAG TTGCCCTAAG GGCAGACATG GACGCCCTCC CCGTGACCGA GGAGACAGGT CTACCCTTCG CCTCAAAGAA ACCAGGAGTT ATGCATGCAT GTGGTCACGA TGCCCACACT GCCATGCTCC TCGGTGCAGC CACAATACTC TCACGTCACC TTGACGAGAT CGGAGAGGTT AGACTCATCT TCCAGCCGGC AGAGGAGGAT GGAGGAAGGG GTGGTGCGCT TCCCATGATA GAGGCCGGAG TAATGGAAGG CGTTGACTAT GTTTTCGGGT TACATGTGAT GTCGGGCTAT CCCTCAGGCA CCTTGGCAAC GCGTGGAGGG GCTATAATGG CCTGTCCCGA TTCCTTCAGG GTTGAGGTAG TGGGAAGAGG AGGTCATGGA TCTGCTCCTC ATGAGACAAT TGACCCAGTC TTCATCTCCG CCATGATAGT TAACGCGTTG CAGGGCATAA GGTCGAGGCA GATCAATCCC CTTGAACCCT TTGTCCTTTC CGTGACCAGT ATCCATTCTG GTACTAAGGA CAACATAATA CCTGACAGGG CGGTGATGGA GGGAACAATC AGGACCCTAA ACGAGAAGGT GAGAGAGACG GCACTTAAGT CCTTCAGAAA CATTGTGAAA TCGGTGTGTG AGGCATACGG AGCCGAGTGC TTGGTTCAGT TCAAGGAGGA CGCATACCCT GTCACCGTGA ACGATCCCGA TACCACAAAG AGAGCCATGG AAATACTCAA GGATATTCCT GGGGCAGAGG TGAAGGAAAC GCAACCTGTG ATGGGTGGAG AGGACTTCTC GCGTTTTCTG CAAAGGGCTA AGGGATCTTT CATCTTCCTC GGAACCAGAA ATGAGAAGAA GGGAATAGTC TATCCTAACC ATAGCTCTAA GTTCACAGTG GATGAGGATG CCCTAAAGGT TGGGGTAACT GCGTTAGCAC TTCTGGCAAG TAAGTTCTCG TCATGA
|
Protein sequence | MDAQRIYNEA REIEDKVIEL RRKIHENPEL SYQEYETAKL VANYLRSLGI DVREGVGTET GVLGVIKGRR SGTVALRADM DALPVTEETG LPFASKKPGV MHACGHDAHT AMLLGAATIL SRHLDEIGEV RLIFQPAEED GGRGGALPMI EAGVMEGVDY VFGLHVMSGY PSGTLATRGG AIMACPDSFR VEVVGRGGHG SAPHETIDPV FISAMIVNAL QGIRSRQINP LEPFVLSVTS IHSGTKDNII PDRAVMEGTI RTLNEKVRET ALKSFRNIVK SVCEAYGAEC LVQFKEDAYP VTVNDPDTTK RAMEILKDIP GAEVKETQPV MGGEDFSRFL QRAKGSFIFL GTRNEKKGIV YPNHSSKFTV DEDALKVGVT ALALLASKFS S
|
| |
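
The sequences above are printed in space-separated blocks of 10 residues. The sketch below shows one possible way to join the blocks and compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
example = clean_sequence("ATGGACGCTC AAAGGATTTA")
print(len(example), gc_percent(example))  # 20 40.0
```

Passing the full Gene sequence string through the same two helpers gives a value that can be compared against the reported GC content.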