Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1217 |
Symbol | |
ID | 5103831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1189803 |
End bp | 1191254 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507109 |
Product | Acetyl-CoA hydrolase |
Protein accession | YP_001191302 |
Protein GI | 146303986 |
COG category | [C] Energy production and conversion |
COG ID | [COG0427] Acetyl-CoA hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.872767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGGA TAGCTTGCCG TGAACTGAGG TCGTTAGTCA CGGACCCGAT GGACGCAGTG AAGAGTCACC TTCCCAGACA TGGGGTAATT GCCTTCGGTG GAATGTCCGG CTCCTCGGTT CCCAAGGAGT TTCCCAGGGC CATGGCCCAG CATCTGGAGG GAGAGAGTGG ATACGCCTTA ACCCTTTTCA CGGGAGGAGC CACATCGGAG GAGTTCGAGA GGAACATCTC TAGGATAGGT CACGCATTTC GAAGAAGGTA CCCTTTTCTC AGTGGAGGTG TGCATAGGGA ACTTGTGAAC AAAAACGAGC TCGAGTTCTT CGATTACGGT CTCCACAAGT TTAACAGAAA AATCAGATCA GGAGAACTGG GACCAATAGA CCTCGCAGTA ATTGAGGCAA CTGCGATTCT CGAGGACTGC TCCGTGATAC CCTCTCTCTC CCTGGACTCG TCTATCTCCT TTATTAGCGT CGCTAAGAAG GTGATCCTGG AAATCAATGA GAGTAAACCT GAATTGGAGG GCCTTCATGA CATTTACTTG GGGGATAACG TATCACTCGC GAAGGTTTCC CAAAGGATTG GGGACACTAG GCTGACTGTA CCCCCCTCAA GGATAGGGGC AATACTGGTC ACTAGGGAGG AAGATGGATC TCCTGGAGCA TATAAGTCTC CTGGAGACGT GGAGGCGAGG ATCTCGGAGA ACATCATGGA GTTCATGATG AACGAGATCA GAGAGGGCTT CTCCAGAGAA CTATACGAAC GGGGGGATTT CGTGATCCAA CCTGGGGCTG GATCCCTCTC TAGCAAGCTC GCAGACGTGT TACCCCAGTC GGGTTTGACC CTAAGTGTGT GGGCTGAAAC CCTACCCGCA AAGTGGGCCC TCATTCCTGA GGATAACGTG AAGTTCATTA GCTCCTCAGC CTTTTTTACC CTACGTGGAG AGGAGAACTT CCTTCGTAAA TTCTTTGAAA CCGGGGACTT CAGACGGATA GTGCTACGCG GACAGAACGT CACGAACAAC CCCGAGGTGA TTCAAAGGCT TAACCTGATG TCCATACTAC AGGCAATCGA ACTAGATATT TACGGTAATG CCAATATATC TCACATACAC GGAAATATAT ATAATGGCGT CGGTGGGTCT GTGGACTTTG CCCCAAACGC TTACATTACC GTGATCGCAC TACCCTCGGT CACGGGAGAC GGTAAGATCT CTAGGATTGT CCCACTCACG ACTCACGTAG ACGTTCCAGA GCACTACGTG GATGTGGTGG TAACGGAACA GGGATATGCG GACTTAAGGG GAAAATCCCC CCACGAGAGG GCTGAGGAAA TAATAAACAA GTGCGCTCAC CCTAGGTTTA GGGACTCCCT TCTACAATAT TATGGGAAGG TCAGGGAGAT GGGACATGAG GCGCATGACC TGAGGCTGGC CCATGAACTG TTCGGTCTTT GA
|
Protein sequence | MERIACRELR SLVTDPMDAV KSHLPRHGVI AFGGMSGSSV PKEFPRAMAQ HLEGESGYAL TLFTGGATSE EFERNISRIG HAFRRRYPFL SGGVHRELVN KNELEFFDYG LHKFNRKIRS GELGPIDLAV IEATAILEDC SVIPSLSLDS SISFISVAKK VILEINESKP ELEGLHDIYL GDNVSLAKVS QRIGDTRLTV PPSRIGAILV TREEDGSPGA YKSPGDVEAR ISENIMEFMM NEIREGFSRE LYERGDFVIQ PGAGSLSSKL ADVLPQSGLT LSVWAETLPA KWALIPEDNV KFISSSAFFT LRGEENFLRK FFETGDFRRI VLRGQNVTNN PEVIQRLNLM SILQAIELDI YGNANISHIH GNIYNGVGGS VDFAPNAYIT VIALPSVTGD GKISRIVPLT THVDVPEHYV DVVVTEQGYA DLRGKSPHER AEEIINKCAH PRFRDSLLQY YGKVREMGHE AHDLRLAHEL FGL
|
| |