Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1212 |
Symbol | |
ID | 5103826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1185502 |
End bp | 1186500 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507104 |
Product | transketolase, central region |
Protein accession | YP_001191297 |
Protein GI | 146303981 |
COG category | [C] Energy production and conversion |
COG ID | [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATGA AGGGAATTTC CCAGGCAATA GCCCAGGCAA TATCTCAGGA GATGGAGAGG AGGAGTGACA TCGTCGTCTT GGGGGAGGAC GTGACTTACT GGGGCGCAGT TTTCGGGTTC ACCATGGGAC TCTTTGACAA GTTCGGCAGG AAGAGGGTGG TGGACACACC TATCACGGAG CAAACTTTCA TGGGAATGGC AGTGGGAATG GCCTCAGTGG GACTTCATCC CGTAGTGTCG CTCATGTTCG TAGACTTTCT GGGGGCCGGG TTTGATCAGA TGTACAACCA TATGGCTAAG AACCACTACA TGTCGGGCGG ACAATTCCCC ATGCCGGTAA CTGTGATCAC AGCAATAGGA GGAGGTTATG GAGACGCGGA ACAACACTCT CAAGTGCTTT ATGGACTCTT CGCTCACGTC CCAGGTTTCA AGGTTGTGGT TCCGTCTAAC GCATACGACG CTAAGGGTTT AACCATTAGG GCGTTGAGGG ATCCTAACCC TGTGGTCATA TTTGGTCACA AACTCCTCAC AGGGTTACCC TTTCTACCCT ATGAGGGAGG AGAGGACGAG GTTCCTGAGG AACCCTATGA GCTGGAGTTT GGGAAGGCCT CCGTGAGGAT GGAGGGGAGC GATCTGACCG TGGCATCTGC CGGACTGATG GTCCACAGGG CAATGAGGGT TGCAGAGAAG TTGAGAAAGG AGGGAATTTC CGTGGAGGTT GTGGATTTGA GGACCCTAGT CCCCCTGGAC GAGGAGACAC TCTCTAGGTC AGTAAAGAAG ACTGGTAGGT TGCTCATCCT TGACGAGGAT TACATGAGCT ATGGGATGAC CGGCGAGGTC ACGTTCAGGG TTCAGTCCAG AGCCTTAAGG GATCTGAAGG CCCCGATCCA AAGATTGGCA GTCCCTGATG TTCCAATCCC CTTCAGTGAG CCCCTAGAGA AGGAGGTGAT TCCAGGGGAG GCGAGAATAG AGGCCAAAAT CAGGGAAATG GTAAGCTAG
|
Protein sequence | MRMKGISQAI AQAISQEMER RSDIVVLGED VTYWGAVFGF TMGLFDKFGR KRVVDTPITE QTFMGMAVGM ASVGLHPVVS LMFVDFLGAG FDQMYNHMAK NHYMSGGQFP MPVTVITAIG GGYGDAEQHS QVLYGLFAHV PGFKVVVPSN AYDAKGLTIR ALRDPNPVVI FGHKLLTGLP FLPYEGGEDE VPEEPYELEF GKASVRMEGS DLTVASAGLM VHRAMRVAEK LRKEGISVEV VDLRTLVPLD EETLSRSVKK TGRLLILDED YMSYGMTGEV TFRVQSRALR DLKAPIQRLA VPDVPIPFSE PLEKEVIPGE ARIEAKIREM VS
|
| |