Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1845 |
Symbol | |
ID | 5104116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1790868 |
End bp | 1791917 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640507733 |
Product | glucose-1-phosphate thymidyltransferase |
Protein accession | YP_001191912 |
Protein GI | 146304596 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylylransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0548966 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGGC TTATCCTCGC GGGTGGACAC GGAACTAGGT TAAGACCCTT AACTCACACT GGGAACAAGC ACGCGATCCC CATCGCCAAC AAGCCCATGG TCTTGTACGC AGTCGAGAAC CTAGTGAACG CGGGGATACG CGACATTGTG GTGATCTTGG GTCCGCTCAA GGAGGGGATA AAGGAGGCCA TTGACGGGAA CTACCCCGCT AATTTCACCT ACGTGGAGCA GGAACCCCTC GGGCTAGCCC ACGCGGTCAT GAAGGCTGAG AAGTACCTAG ATGAGCCCTT CGTCATGCAC CTTGGCGACA ACCTCCTGCA GAACGGGATC TCCCAGTTCG TGAACAAGTT CCATGAAACC AAGGCAGACG CAGTGATTGG CGTAACTCCC GTGAAGGACC CGAGGCAGTA CGGTGTCGTT GTAATCGAGA ATGGGAGGGT GAAGAGGCTT ATGGAGAAAC CCAGGGACCC GCCCTCTAAC CTGGCACTCG TGGGAGTTTA CGTTTTCACT CCCGTGGTCC ACGACTATAC GAAGAGGCTG AAGCCGAGCT GGAGGGGAGA GTACGAGATT ACAGACGTGT TACAGCTCAT GGTTGAGGAT GGTAGGAGGG TTGAGGTGGT TCAGGTGGAG GGATGGTGGA AGGACACGGG GAAGCCAGAG GACCTGCTTG AGGCGAACCA GTTGGTGCTG GACTCTCTTC ACGGTAGCTT TAGACACGAT CACGCGAAGA TCGAGGGCAG GGTACAGGTC GGGGAAGGGA CAGTCTTGAG GGAGAACGTC ATAATTCGCG GACCCGCGAT TATAGGGAAG AACTGCGTCA TAGGGCCTAA CGTATTCATT GGTCCATATA CCTCGATCTG GGATGATTGC GAACTCAGTG ATGTAGAGAT AGAGAACTCG ATCGTCATGA AGGGCGTTAA GATAAAAGGG GTTTCCAGGA TAAGCTATAG TATTATAGGT AACGATGTGG TCGTTGAGAG CAGATCGGGA GTACCCAGGA TCAAGCGACT CGTGGTCGGG GATAGGTCAA GGATAACGCT GTCAAGTTGA
|
Protein sequence | MKGLILAGGH GTRLRPLTHT GNKHAIPIAN KPMVLYAVEN LVNAGIRDIV VILGPLKEGI KEAIDGNYPA NFTYVEQEPL GLAHAVMKAE KYLDEPFVMH LGDNLLQNGI SQFVNKFHET KADAVIGVTP VKDPRQYGVV VIENGRVKRL MEKPRDPPSN LALVGVYVFT PVVHDYTKRL KPSWRGEYEI TDVLQLMVED GRRVEVVQVE GWWKDTGKPE DLLEANQLVL DSLHGSFRHD HAKIEGRVQV GEGTVLRENV IIRGPAIIGK NCVIGPNVFI GPYTSIWDDC ELSDVEIENS IVMKGVKIKG VSRISYSIIG NDVVVESRSG VPRIKRLVVG DRSRITLSS
|
| |