Gene Information |
Locus tag | Msed_0278 |
Symbol | |
ID | 5104914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 237774 |
End bp | 239060 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640506184 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_001190379 |
Protein GI | 146303063 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | |
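
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 237774
end_bp = 239060

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1287
print(protein_length_aa)   # 428
```

Both results agree with the Gene Length (1287 bp) and Protein Length (428 aa) fields.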
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.914364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0835983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCTCG CTGAAGTTTT CTCAGAATTC GCTACCTCAA CCTCGTACTC CGACCTTCCC GAGAAGTCGG TGCACGAGGC GAAAAGGAGG GTTCTTGACT CCCTTGCGGT GGCCTACGCT TCAACGTCGT CACCGCCAGC TGAGGTCGTC AGAAAGGCAA TTCCAAGTTT TCAGGGACAG GGCTTGCTCC TAGGGGGAGG AAACTCTTCC CCAGACATGG CTGCGTTCTA CAACACCCTC CTCATCAGGT ATCTGGACTT CAATGATACC TATCTTTCCC TTGAACCTCT CCATCCCTCT GACATGATTG GCGGTCTCCT TGCAGTTAAC CCTAGGCTAA GCGGAAAGGA ATTGATAAGG GCTATCGTTT TAGGGTATGA GGTTTCCACT AGGCTATGCG ATTCCACGTC CCTTAGGAAA AAGGGATACG ACCACGTAAA TTTCCTCCAG GTTGGGTCGG CAGTGGCACT TGGAGTGGCC TTGGGTCTTA ACAAGGAGCA ATTGGTGAAC GCAATCTCGA TCACCACTGT GCCACACGTG GCCCTCCGGG AAACTCGTTC AGGAAGTCTT AGTATGTGGA AAGCTGGGGC AACCGCTGAG GCAGTGAGGA ACTCCGTCTT TGCTGTGCTC TTGGCAAAGG CTGGATTCAC GGGTCCTTCG ACTCCCTTTT CAGGGAAGAT GGGATTCAGG AACGTAATTG CACCGGACAT GTCAGATGCC CCCTTCAAGA GCCTGGGGAC CACCAAGATC CTAGAGACGT ACATAAAGAA ATATCCTGTG GAGTATCACG CTCAAGCAGC TGTTGAGGCA GGTATCAAGT TAAGGAAACA GCTGATGGGG GATATAACCA AGGTAACCGT GGAGACCTAT GAGGCTGGAA GGACTATCCT AGCTGACGAG GGAAAGTGGG ATCCTAAGAA CAAGGAAACC GCTGATCACA GCCTTCCCTT CATAGTAGCA GTCACCCTGC TAACTGGTAA GTTCTGGCTC GATGCATACG ATCTTGTGGG GGATCCCAAG GTCACGGAAC TCATGAAGAA GATTGAGGTC GTGGAGAACG AGGAGTACAC CAAGGTCTAC CCTAGCGAGC TACCCACCAA GATAGTGGTG AAGACCACCT CTGGTACTTT CTCAGAGGAG GTTAGAATTC CTAGGGGTCA CCACAAGAAC CCCATGAGCG ACGAGGAGGT GGAGGAAAAG GCAATGAAAC TGGGTCTAGG GAAGGACATC GTGAATAAGA TCTGGAACCT GGAGAAAATG GAGGTGAAGG ACATTGTCTC TTGGTAA
|
Protein sequence | MELAEVFSEF ATSTSYSDLP EKSVHEAKRR VLDSLAVAYA STSSPPAEVV RKAIPSFQGQ GLLLGGGNSS PDMAAFYNTL LIRYLDFNDT YLSLEPLHPS DMIGGLLAVN PRLSGKELIR AIVLGYEVST RLCDSTSLRK KGYDHVNFLQ VGSAVALGVA LGLNKEQLVN AISITTVPHV ALRETRSGSL SMWKAGATAE AVRNSVFAVL LAKAGFTGPS TPFSGKMGFR NVIAPDMSDA PFKSLGTTKI LETYIKKYPV EYHAQAAVEA GIKLRKQLMG DITKVTVETY EAGRTILADE GKWDPKNKET ADHSLPFIVA VTLLTGKFWL DAYDLVGDPK VTELMKKIEV VENEEYTKVY PSELPTKIVV KTTSGTFSEE VRIPRGHHKN PMSDEEVEEK AMKLGLGKDI VNKIWNLEKM EVKDIVSW
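
The relationship between the two sequences can be spot-checked by translating the ends of the gene sequence. The following is a minimal sketch, not the pipeline used to produce this record: it builds the codon table from the compact amino-acid string used for NCBI translation table 11 (whose codon-to-residue assignments match the standard code), and the `translate` helper is a hypothetical name introduced here. It ignores alternative initiator codons, which is sufficient for this gene because it begins with ATG.

```python
# Codon table in NCBI ordering: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Assumption: the first codon is ATG, so no initiator-codon handling is needed.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence, copied from the record.
first_residues = translate("ATGGAGCTCG CTGAAGTTTT CTCAGAATTC")
last_residues = translate("GACATTGTCTCTTGGTAA")

print(first_residues)  # MELAEVFSEF
print(last_residues)   # DIVSW
```

The translated fragments match the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to the same helper would allow a complete comparison.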