Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1108 |
Symbol | |
ID | 5103582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1035797 |
End bp | 1037266 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507003 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001191196 |
Protein GI | 146303880 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0811747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTAG CAGAATCAAT ATTCAAAACC CTGTCAAGTT CTACCACAAC CGTGTACGGG AACCCAGGAA CCACGGAGAT TTCCTTTCTC AAGTATCTAC CGAGCGAATT TCGATATTTC CTAGCCCTCC ACGATGGCCC AGCCATAGGT ATGGCCAGTG GTTACTCTCT CATGACAGGT AAGGTAGGGG TAACCAACAC GCATGCAGCT CCTGGGTTAA TGAATTCCTT GGGTTACGTT TATTCGGCAA GACTTGACAG AACTCCCCTT CTCATCACGG TGGGGCAACA GTCTTCTACC CAGTTGTTGG ATGAGCCCAT ACTTTCCGTA GATCTAAGGA CCGTTCCATA TGCAAAGGAT GTGATAGAGG TGAGAAGGAA GGAGGAAGTT AGTAAAGCCT TGATTAGGGG AATCAAAACG GCTGTTTCTC TTCCGCCCGG ACCGGTTATC CTCGGGCTAC CATATGACAT CATGGAGGAG GAGATAGGGA ACACGGAGAG TTACATCTCT GGAAGAGTTG AATGGAACTG TCCCTGCAAT CTCCTAGACG TAGAGATGGT AGCCGAGGAG ATCAATGCAG TCAATAAAGT TGCAGTAGTT GCAGGATACG AATTGGACAT AGTGGATGCT CACGAGGAAG TTGTGGAGTT GGCAAGAAAG GTGGGTTCAC CTATCTTCAC AGAACCCCAC TTCTCCCGGT CTCCAGGTTC AAAGATCGAC GTTATATTAC CAAGAAGTGC CAGTGGGATA AACAGGATTC TTGGTCAATA CGATCTAGTC CTCCTCCTCG GAGGTACCCT TCACAACGTG TTGTACATGG ACCAAGAGTT CAGGTTCAAC ATACTTCAGA TTACCATGGA CCCAGAGGAG AAATCCAAGA GGATTTGGAG AACCGTCCTC TGTAATCCAA AGGACTTCCT GAGACACCTT CTCCCTAAGG TAAGGGAAAA AGTCGGTTCT CACGATCTCA AGCCGGATAA CAAAAATAAG GTCACGGAGC TAATGGAGTA CCTGGTTTCA AAGCTAAACG GACACGCCAT ATTTGAAGAG ACTCCGTCCC ATAAGGAGGT AGTTAAGAAA GTAATTGGGA TTAGGAAACA TCTCTTCTTC TCCAATAGAT CTGGATTCCT GGGTTGGGCT CTACCTGCAT CACTGGGCTA CGTTACTGCC GGAGGTAAGG CTGTCACTCT CATAGGAGAT GGAAGTTTCC ACTTTTCTCC ACAGACACTT TGGACCGCAT CCTACTATGA CCTAGAAATG AGAATAATGA TACTTAACAA CCATGGGTAT GAATCGTTGA GGGGGAGAGC TGATTATCAA GCTAACTTCT TCAATCCAAG GACACAACCC CTAAAAGTCG CTGAGGCTTA TGGATTTGAG ACGTTCGAGA CTGACCATTT AGCAGATGGT GTGGATTGGC TAATGGAAAA GGGAGGGAAG AGGAGAGTTG TGGAAATTGT ACTAAAATAA
|
Protein sequence | MNVAESIFKT LSSSTTTVYG NPGTTEISFL KYLPSEFRYF LALHDGPAIG MASGYSLMTG KVGVTNTHAA PGLMNSLGYV YSARLDRTPL LITVGQQSST QLLDEPILSV DLRTVPYAKD VIEVRRKEEV SKALIRGIKT AVSLPPGPVI LGLPYDIMEE EIGNTESYIS GRVEWNCPCN LLDVEMVAEE INAVNKVAVV AGYELDIVDA HEEVVELARK VGSPIFTEPH FSRSPGSKID VILPRSASGI NRILGQYDLV LLLGGTLHNV LYMDQEFRFN ILQITMDPEE KSKRIWRTVL CNPKDFLRHL LPKVREKVGS HDLKPDNKNK VTELMEYLVS KLNGHAIFEE TPSHKEVVKK VIGIRKHLFF SNRSGFLGWA LPASLGYVTA GGKAVTLIGD GSFHFSPQTL WTASYYDLEM RIMILNNHGY ESLRGRADYQ ANFFNPRTQP LKVAEAYGFE TFETDHLADG VDWLMEKGGK RRVVEIVLK
|
| |