Gene Msed_1845 details

Gene Information

Locus tag: Msed_1845
Symbol:
ID: 5104116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1790868
End bp: 1791917
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 54%
IMG OID: 640507733
Product: glucose-1-phosphate thymidyltransferase
Protein accession: YP_001191912
Protein GI: 146304596
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1209] dTDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
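
The gene and protein lengths in this record follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon (the function names are illustrative, not part of any database API):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1790868
end_bp = 1791917


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues in a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1050
print(protein_length(length_bp))  # 349
```

Both values agree with the Gene Length and Protein Length fields of the record.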


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0548966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGGC TTATCCTCGC GGGTGGACAC GGAACTAGGT TAAGACCCTT AACTCACACT 
GGGAACAAGC ACGCGATCCC CATCGCCAAC AAGCCCATGG TCTTGTACGC AGTCGAGAAC
CTAGTGAACG CGGGGATACG CGACATTGTG GTGATCTTGG GTCCGCTCAA GGAGGGGATA
AAGGAGGCCA TTGACGGGAA CTACCCCGCT AATTTCACCT ACGTGGAGCA GGAACCCCTC
GGGCTAGCCC ACGCGGTCAT GAAGGCTGAG AAGTACCTAG ATGAGCCCTT CGTCATGCAC
CTTGGCGACA ACCTCCTGCA GAACGGGATC TCCCAGTTCG TGAACAAGTT CCATGAAACC
AAGGCAGACG CAGTGATTGG CGTAACTCCC GTGAAGGACC CGAGGCAGTA CGGTGTCGTT
GTAATCGAGA ATGGGAGGGT GAAGAGGCTT ATGGAGAAAC CCAGGGACCC GCCCTCTAAC
CTGGCACTCG TGGGAGTTTA CGTTTTCACT CCCGTGGTCC ACGACTATAC GAAGAGGCTG
AAGCCGAGCT GGAGGGGAGA GTACGAGATT ACAGACGTGT TACAGCTCAT GGTTGAGGAT
GGTAGGAGGG TTGAGGTGGT TCAGGTGGAG GGATGGTGGA AGGACACGGG GAAGCCAGAG
GACCTGCTTG AGGCGAACCA GTTGGTGCTG GACTCTCTTC ACGGTAGCTT TAGACACGAT
CACGCGAAGA TCGAGGGCAG GGTACAGGTC GGGGAAGGGA CAGTCTTGAG GGAGAACGTC
ATAATTCGCG GACCCGCGAT TATAGGGAAG AACTGCGTCA TAGGGCCTAA CGTATTCATT
GGTCCATATA CCTCGATCTG GGATGATTGC GAACTCAGTG ATGTAGAGAT AGAGAACTCG
ATCGTCATGA AGGGCGTTAA GATAAAAGGG GTTTCCAGGA TAAGCTATAG TATTATAGGT
AACGATGTGG TCGTTGAGAG CAGATCGGGA GTACCCAGGA TCAAGCGACT CGTGGTCGGG
GATAGGTCAA GGATAACGCT GTCAAGTTGA
 
Protein sequence
MKGLILAGGH GTRLRPLTHT GNKHAIPIAN KPMVLYAVEN LVNAGIRDIV VILGPLKEGI 
KEAIDGNYPA NFTYVEQEPL GLAHAVMKAE KYLDEPFVMH LGDNLLQNGI SQFVNKFHET
KADAVIGVTP VKDPRQYGVV VIENGRVKRL MEKPRDPPSN LALVGVYVFT PVVHDYTKRL
KPSWRGEYEI TDVLQLMVED GRRVEVVQVE GWWKDTGKPE DLLEANQLVL DSLHGSFRHD
HAKIEGRVQV GEGTVLRENV IIRGPAIIGK NCVIGPNVFI GPYTSIWDDC ELSDVEIENS
IVMKGVKIKG VSRISYSIIG NDVVVESRSG VPRIKRLVVG DRSRITLSS
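
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a plain-Python illustration, not the database's own pipeline; it assumes the sense-codon assignments of translation table 11 match the standard code, and the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It translates only the first 60 bases shown above; the full sequence can be passed in the same way.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same sense-codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record.
first_line = "ATGAAAGGGC TTATCCTCGC GGGTGGACAC GGAACTAGGT TAAGACCCTT AACTCACACT"
print(translate(first_line))  # MKGLILAGGHGTRLRPLTHT
```

The output matches the first 20 residues of the protein sequence. Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field.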