Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1878 |
Symbol | |
ID | 5104146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1821153 |
End bp | 1822091 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640507764 |
Product | transketolase subunit B |
Protein accession | YP_001191942 |
Protein GI | 146304626 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3958] Transketolase, C-terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCAAG GCAGTTTCTC CTCAATTCGT GAGGCCTTCG GTAAAACCCT GGTTAAACTG GGGGAGAAGG ACAACGACAT CGTAGTGATT ACTGCTGACG TTGGGGACTC GTCAAGGGCT TCCTACTTTA AGGAGAAGTT CCCCGACAGG TACTTCAACA TAGGCATCTC TGAGCAGGAC ATGGTAAACT TCGGCGCCGG TCTCTCGGCT GTGGGTAAGA AGCCCGTGGT GGTGGGATTC GCCATGTTCC TTATGAGAGC ATGGGAACAG ATGAGGAACA GCATAGGGAG AATGAACCTC AACGTTAAGG TGTTCGTGAC CCACTCGGGC TACAGCGATA GCGGTGACGG TTCAAGTCAT CAGGCCCTTG AGGACATCGC GCTCATGAGG GTAATCCCGA ACTTCAAGGT AGTTATACCG GCTGACGCTG CCGAGGTGGA GAGAAGTATG CCCGTAGTTC TTGAGGACAA GGGACCTCTC TACTACAGGA TGGGGAGAGA CTATTCTCCT CCGATCACCT CAACCATGGA CTACAAATTC GAGATAGGTA AGGCATACGT TCTCAGGGAA GGAGATGACG TGGCACTGAT GGGAGCAGGT GTGGTTCTTT GGGACGCACT AAAGGCTGCT GAGGAACTGG AGAAGATGGG AATAAGCGCG GCCGTGATTA ACGTACCCAC GGTGAAGCCA ATAGATCAAT CCACAATAGA GTACTACGCG AGGAAGACGG GGAGGATAGT TACCGTTGAG GAGCACAACG TCATGGGGGG AGTGGGATCA GCAATAGCTG AGACAGTGGT GAAAACTTAC CCTGTTCCCA TGAGGTTCGT GGGAGCCACA ACCTATGGTA GGTCGGCGAG GAGCCAGAGG GAACTCCTTG ATTATTACGG TATAACGCCC AAGACAATCG TGAACTCAGC CCTCGAGTTG ATCAAGTAA
|
Protein sequence | MLQGSFSSIR EAFGKTLVKL GEKDNDIVVI TADVGDSSRA SYFKEKFPDR YFNIGISEQD MVNFGAGLSA VGKKPVVVGF AMFLMRAWEQ MRNSIGRMNL NVKVFVTHSG YSDSGDGSSH QALEDIALMR VIPNFKVVIP ADAAEVERSM PVVLEDKGPL YYRMGRDYSP PITSTMDYKF EIGKAYVLRE GDDVALMGAG VVLWDALKAA EELEKMGISA AVINVPTVKP IDQSTIEYYA RKTGRIVTVE EHNVMGGVGS AIAETVVKTY PVPMRFVGAT TYGRSARSQR ELLDYYGITP KTIVNSALEL IK
|
| |