Gene Information |
Locus tag | Msed_1175 |
Symbol | |
ID | 5104471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1144036 |
End bp | 1145616 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640507067 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001191260 |
Protein GI | 146303944 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.312541 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAAA CCACTGCGGA ACTCCTGATG GACGCGATCT CCTCCCAGGT TGAGGACGTT TTCGGGATAC CTGGGACTCA CGGCCTCTCC CTCTATGAGG AGATAAGGAA GAGGGTTGAG AGACAAGAGG TTCGTTACTA CATGCCGAGA CTGGAGTACG GGGGCGCCAT AATGGCCGAC TACTACGCCA GACTGAAGGG AAACGTGGGG ATCTTCATCT CGGTGAATGG TCCAGGATTC ACGAACTCCT TGACGGGGCT AGCAGAGGCC TTCTCCGAGG GTTCACCCCT TGTCCTAATC TCCTTCAACA AGGAGTTCAG GTATAGGCAT AGGAGACAAC TTCACGACAC TGGATACTAC GACGCACAGA TCGAGGTAGC TAAGCAGATA ACTAAGGCGT CATTCAGGAT CTATTCCCCC GAGGAGGTAC CCCTAGTCAT GGAAAAGTCC TTCAGGATAG CCCTTCAGGA CAGGATGGGG CCTGTCTACG TTGAGATCCC TGTTGACGTC CTCGAGGAGA AGGGCGAGGC TGAGGCGGAG AGAGTGAAGG TTCCAAGGAG TCTGGTCTAC CCTAGTAAGG ACGAGGTGAG GGAGGCCGTG AACTTTCTGA GTGAGTGCTC CAAGCCCGTC CTACTCCTGG GATATGGAGC GTCCAGATCC GATCTGGTTC CCTACCTGGA AAAGCTGGGA ATTCCTGTCC TGACCACGGT TAGGGGTAAG GGAAGCATCC CTGAGAATCA CCCTCTCTAC GCGGGCACAA CCTTCAACCT AGCCGAGATC CCTGGGGACT GCCTGATCGC CATTGGAACC TCATTCAATG ACCTCGAGAC TAGGAGGTGG AGCATGAAGC TTCCTAGGAC TCTTCACGTC GATCCGGACC CCTCAGTCTT CAACACCTCC TTCAGGGCAG ACGTCGTGGT GAGGGCTAGC GCCGAGGCCT TCTTAACCGA GGTGGTGGAG AGGGTCAAGT TGCCCAGGTG GAGTTTCAGG GTTGAGCGAA GGGAGACCAA CCTGCAGGGT GAGGGAATAA CTCACGACCT TCTCGCCAAG GTTCTAAACG AGGCGTTAGG GGAGGACAGG GTGGTGATCG CAGATGCGGG CACAAATCAG GTAATGGCTA TTGACGTACA GGTGTATAGG CCCAACTCCT ACTTCAACTC CCTGATCTTC AACGCCATGG GTTCAGCTAT CCCAGCGGGT ATAGGGGCAA AGATTGCAGT CCCAGAGAGG CAGGTCGTGA GCATAATAGG TGACATGGGA TTTCAGGGGT GTTTTCAGGA GTTGATCACT GCGGTAGAGA ACGGGATCAA CTTCCTGACA GTCCTCGTGG AGGACGGCGT TCAGCATTTC CTAAGGATGA ACCAGAACAT GAGATATGGA ACCACCTTCA CGACGCAGGT CTTTCCAATT GACTACACGA AGGTTCTCGA GGGGATTGGG GTGAAGGTTG TGGAGGCTAG GGACAGGGAA GAGTTGAGGA GGGCAACGGA GGAGGCAGTC AGCTGGTCAG CCAAGATGCC AACGGTACTC AGGGTTAGGG TGAACCCGAA CAGCGTCCCA TCTAGGCTAA CGCGAAGATG A
|
Protein sequence | MAKTTAELLM DAISSQVEDV FGIPGTHGLS LYEEIRKRVE RQEVRYYMPR LEYGGAIMAD YYARLKGNVG IFISVNGPGF TNSLTGLAEA FSEGSPLVLI SFNKEFRYRH RRQLHDTGYY DAQIEVAKQI TKASFRIYSP EEVPLVMEKS FRIALQDRMG PVYVEIPVDV LEEKGEAEAE RVKVPRSLVY PSKDEVREAV NFLSECSKPV LLLGYGASRS DLVPYLEKLG IPVLTTVRGK GSIPENHPLY AGTTFNLAEI PGDCLIAIGT SFNDLETRRW SMKLPRTLHV DPDPSVFNTS FRADVVVRAS AEAFLTEVVE RVKLPRWSFR VERRETNLQG EGITHDLLAK VLNEALGEDR VVIADAGTNQ VMAIDVQVYR PNSYFNSLIF NAMGSAIPAG IGAKIAVPER QVVSIIGDMG FQGCFQELIT AVENGINFLT VLVEDGVQHF LRMNQNMRYG TTFTTQVFPI DYTKVLEGIG VKVVEARDRE ELRRATEEAV SWSAKMPTVL RVRVNPNSVP SRLTRR
|
| |
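The length fields in this record can be cross-checked against the coordinates with a few lines of Python. This is a minimal sketch, not part of the source database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence includes a single terminal stop codon (the sequence shown ends in TGA). The helper names `span_length`, `expected_protein_length`, `clean_sequence`, and `gc_fraction` are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1144036
END_BP = 1145616
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Drop the whitespace that groups the sequence into 10-letter blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Both checks compare a derived value with the value stated in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the coordinates give 1145616 - 1144036 + 1 = 1581 bp, and 1581 / 3 - 1 = 526 aa, matching the stated gene and protein lengths. Passing the Gene sequence field to `gc_fraction` would give a value to compare with the reported GC content of 55%.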