Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0201 |
Symbol | |
ID | 5103945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 164663 |
End bp | 166321 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506106 |
Product | thiamine pyrophosphate enzyme, central region |
Protein accession | YP_001190302 |
Protein GI | 146302986 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.582625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0361256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCAGTT TCTGGTATTA TAGCGATCCA ACTTTGAGCC AGCCTAAGAG GAAAGAGGAG ACCGTAGGTA GGGAGATGAC GGGAGATGAG GCATTAGCTT ACGTCCTTAA GGAAATAGGA GTTAAGCGAG TGTTCACCTC GAACGCAGTC CCAGATTTCC TTAGGGAGAG ATTAGCCCAG TATGGCCTTG AAATAGATAT TTCCTTGAGT GTAAGGGAGG CTCTGGAACT GGCAGACGCT TTCGCCAGGG ATTCAGGAGA TGCTGGAGTT GTAATCAGCA CTCCAGGAAG CTCATTACTC GAGGGTAGCA GTGTAGTGGC TCAGGCATTC TCCGATTCCG TTCCGCTACT CATGATCGGT ACATTGAGGT CCTATAGGGA TGTGGGAAAG GCTAGAGTTG GCGAACTGAA GTCACCTGAC GACGTATCAA GTTCCCTCTC ACCCTTCATC AAGTTCAAGG AGAGGGTAAT CAGCATAGAG GAGATTACAG TCACTGTGGA GAAGGGGTAC AAGGAAGCTC TGAGCAATAG AATGAGGCCC GCTCTTGTGG AGATAGCAGA GGAGCTTTTC CGGTTAAAGG CATATCCACT CTCTACCGCA GAGCAGAAGC CTGAGAGGAA GACGCCAGAC AAAAACACAG TGGCCAAGGT GGCTGAGGTA ATGGGAAACT CCAAGTTGCC TGTAGTGGTT GCAGGGTATG GAGTTAGGGC AAGCAATGCG TCACCTCAAT TATTGGAACT CGCGGAGTTA CTTGACGCGC CGGTGATCAC AACCTTTAGA GCCAAGGGAG TTTTCCCGGC CTCACATCCG CTCTACGCAG GCGAGGGATT GGGAGCATTT TCCACGGAAG TTGCTTCCAA GCTCATGATG GAAGCAGACT CGATTCTAGT ACTTGGGTCT AGATTGCCTC AACTTAGTAC TGCCGGCTGG TCCATGAGGT ATAAGGGTTT CCTCATGCAC AACAATGTGG ATGGAGAGGA TATAGGCAAG GTAGTAATGC CACAACTTCC CATTGTTGCA GACACAGGCC TCTTCCTTAA GGAACTGATA ACAATACTCA AACAGAAACT AAAGGAAAAC ATCAAGAGGG AGGTGAGAAG CGAGATAGCG TCAAGCAGGA GAGTGTTCAC CATGAAACCC CACTCGGGAC TATGGCCATA TGACGTTACT AGGCTTCTAC AACAGTTCAA GTTTTCGAGA TACTTCGTGG ATTTGAGTGC CCCAACTCTC GACCTGGTTA GACTGCCCAT CGAGAGCCCT GTGTGGAACA CGAGCGAATC AATTCTCGAG AAGGGAATAG GTGTCGCTGG TGTGCTGCAG TCCAACGATC CAGGTGCCCT CGGGATTACT GACCTAGCTG GTGTACTAAG AAATGTTGGC CTCATTCAGC AAAGGGCTGA AAAGGCGAAG GGAGTAATCC TTGTGCTCAA TGACGGGGGA GCCACTTACC TTGACACGTT CAAATCGGAC ATACCGTCTA TAGGAAAATC GGGAACGTTT GTGGACGTGG ATGAATTCCT AGAGAGATCC ACGGGGGCAG TCACAGTGGA TACCTACGGA GGGTTGAAGG ACATCCTGGA GCGGAGAGAC CCTAAGCTCA AGGTAATAAA CGTGAAGATA GATCCGGATT ACGAGTCAAT CGTTCTTCTA AAACCATAA
|
Protein sequence | MTSFWYYSDP TLSQPKRKEE TVGREMTGDE ALAYVLKEIG VKRVFTSNAV PDFLRERLAQ YGLEIDISLS VREALELADA FARDSGDAGV VISTPGSSLL EGSSVVAQAF SDSVPLLMIG TLRSYRDVGK ARVGELKSPD DVSSSLSPFI KFKERVISIE EITVTVEKGY KEALSNRMRP ALVEIAEELF RLKAYPLSTA EQKPERKTPD KNTVAKVAEV MGNSKLPVVV AGYGVRASNA SPQLLELAEL LDAPVITTFR AKGVFPASHP LYAGEGLGAF STEVASKLMM EADSILVLGS RLPQLSTAGW SMRYKGFLMH NNVDGEDIGK VVMPQLPIVA DTGLFLKELI TILKQKLKEN IKREVRSEIA SSRRVFTMKP HSGLWPYDVT RLLQQFKFSR YFVDLSAPTL DLVRLPIESP VWNTSESILE KGIGVAGVLQ SNDPGALGIT DLAGVLRNVG LIQQRAEKAK GVILVLNDGG ATYLDTFKSD IPSIGKSGTF VDVDEFLERS TGAVTVDTYG GLKDILERRD PKLKVINVKI DPDYESIVLL KP
|
| |