Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0906 |
Symbol | |
ID | 5103552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 837089 |
End bp | 838396 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506809 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001191002 |
Protein GI | 146303686 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.598412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATTAT ATATGACACA AATCCTCGAG GCGAGAAGGG GAAATGTCAC CGAGGAGATG AGGAAGATCG CTGAAATCGA GGGAGTATCT CCGGAAAAGA TTAGGGATAG GGTCGCTACA GGCAGAGTAG TAATTCTCAA GAACTTGAAA AGAAACTTGA GGAAGTACAC CGCAGTAGGG GAGGGGCTCT CAACGAAGGT GAACGTTAAT ATTGGTGCCT CGACTGATCA TTACAATATA GAAGAGGAAC TGAGGAAGGT GGAGATAGCG AATAGATACG GAGCGGATTC CCTAATGGAC CTAACCGATG GCGGTGACAT AGATTCCATG AGGAGGATGG TAATAGAACA CGCAGAAATG CCCGTGGGGA CGGTCCCCAT TTATCAGGTA TATTATGAGA TGGTCACAAG AAGGAAGTAC GTAATCGACT TCACGCCTGA TGAACTTTTC AGGGTGATAA GGAAACAGTT AGATGACGGA GTTGACTTCA TCACTGTTCA CACTGGAATT ACCCTCGAGC TCTCCAGAAA ACTAGTTGAG AAGAAAAGAG TGGCAGGTAT CGTGAGCAGG GGTGGAACTA TCATGGCTGC ATGGTCCCTA CATAATCAAA GGGAGAACCC GCTTTATTCA GAGTTCGATT ATCTCCTGGA AATAGCAAGG GAATATGATG TTACCCTGAG CCTTGGAGAT GCGCTCAGAC CTGGAGGAAT TGCTGATGCG CATGACGAGT TCCAGGTAGC TGAGCTCATA AATAACGCTA GGTTGGCCAG GAGGGCGGTC GAAAAGGGAG TCCAGGTAAT GATAGAGGGA CCTGGTCACA TGCCCTTAGA TCAGATAGAG ATGGACATAA AACTTGAGAA GGAACTCTCT GGAGGAGTTC CATATTACGT CCTAGGAATT TTACCTACCG ATATAGCTGC AGGCTATGAT CACATTGCCG GAGCCATAGG AGGTGCAGTC GCCTCAGCTC ACGGAGCTGA TATGTTATGT TATCTTACAC CTGCAGAACA TCTTTCCTTG CCTACCCCAG AACAAGTGAA GGAGGGACTC ATAGCATTTA AGATAGCTGC CCATGTGGGG GATACGATTA AACTCGGGGA GAGGGCTAGG GAAAAGGACA GAGAAATGAG CGTGGCTAGG GCCTCTCTCA ATTGGTTGAA GATGTTCTCG CTCACCTTCG ATCAGGATAG GGCGAAACAG ATCTACACTC AATACAAAGA CAAGCCTCTC GGTTCATGCA CCATGTGTGG AGACCTTTGC GTCTACCTGG TACTACCCAG AGTTACCGAG AAAATGAAGA GAACTTGA
|
Protein sequence | MLLYMTQILE ARRGNVTEEM RKIAEIEGVS PEKIRDRVAT GRVVILKNLK RNLRKYTAVG EGLSTKVNVN IGASTDHYNI EEELRKVEIA NRYGADSLMD LTDGGDIDSM RRMVIEHAEM PVGTVPIYQV YYEMVTRRKY VIDFTPDELF RVIRKQLDDG VDFITVHTGI TLELSRKLVE KKRVAGIVSR GGTIMAAWSL HNQRENPLYS EFDYLLEIAR EYDVTLSLGD ALRPGGIADA HDEFQVAELI NNARLARRAV EKGVQVMIEG PGHMPLDQIE MDIKLEKELS GGVPYYVLGI LPTDIAAGYD HIAGAIGGAV ASAHGADMLC YLTPAEHLSL PTPEQVKEGL IAFKIAAHVG DTIKLGERAR EKDREMSVAR ASLNWLKMFS LTFDQDRAKQ IYTQYKDKPL GSCTMCGDLC VYLVLPRVTE KMKRT
|
| |