Gene Msed_1698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1698 
Symbol 
ID5105344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1637253 
End bp1638437 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content47% 
IMG OID640507592 
Productthreonine synthase 
Protein accessionYP_001191777 
Protein GI146304461 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.80409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.346904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTGTA TTGAATGTGG GTTCCAAAGC GAATTGGACC AAAAAATGAT CACTTGCCCA 
AGATGTGGGG GAATACTCGA GATATCAGTA AAGTTACCCC CTACATTCTC GTTCTCCAAG
TTAAGAGGTA GAGGGGTCTG GAGATACTCC CCTGCTATAG CTGGAAATTA CAAGAAGATT
GTGAGTATAA GTGAAGGCGG AACACCCCTA ATCAGATCAA GGGAGAACAG TGAGGTGTAT
TATAAGTTTG AGGGCGCAAA CCCTACTGGT AGCTTCAAGG ATAGGGGAAT GACAGTTGCC
ATCAGCTCTG CAGTAAGCGA GGGATACAAG ATTGTGGTTG CAGCGTCCAC GGGAAACACT
GCAGCCTCAG CCGCAGCTTA CTCGGCGAGG GCTGGACTCA AGATTTACCT AGTTCTACCT
AAGGACAAAG TTGCCATAGG TAAGTTGGCC CAATCTATCC TATATGGTGC CACGATCCTA
GAGGTCGAGG GGAGTTTCGA CGTCGGTATG AAGGCTGTGA TGAGACTGTA TAAGGACGTG
GGAATAGTTT ATCCGCTGAA CTCATTCAAT CCCTGGAGAC TTGAGGGCCA GAAGACTATT
GCGTATGAGA TTACGGAGGA GATAGGTGTT CCCGATTACG TGTTCGTACC AGTTGGGAAT
GCTGGGAATA TTTATGCAAT TTGGAAGGGC TTTACAGAAT TAAGGGATGC CGGAGTAATT
GACAGGGTAC CAAGAATGGT TGGAGTACAA GCTGAAGGGG CTTCGCCAAT TGCCAAGGCA
ATTCTAAACA ACCAGGACAC GCCCCAGTTT GTGGAGAACC CAGAAACCGT GGCAACAGCA
ATCAGGATAG GGAAACCAGT AAATTGGAAG AAGGCCATGA AGGCCATCAA GGAATCGCAG
GGAACTGCAA TCTACGTCAG CGATAACGAG ATAATGGAGG CACAGAGGGA ACTGGCCAGG
AAGGAGGGAA TAGGGGCTGA ACCGGCGTCG GTGGCCTCCT TTGCTGGATA TAAGAAGGCC
TTGGAACACG GACTTGTGGA TAGGACAAGC AAGACCGTCA TGATACTAAC TGGACATGCA
TTAAAGGACC CCGACTCCAT GATAAAATCT TCAGCTCGAC GTATAATTGT AAATCCTGAT
CATTTAGAAA ATATAATTCT AGGTGATCTG AATGTTAGTG GTTAA
 
Protein sequence
MRCIECGFQS ELDQKMITCP RCGGILEISV KLPPTFSFSK LRGRGVWRYS PAIAGNYKKI 
VSISEGGTPL IRSRENSEVY YKFEGANPTG SFKDRGMTVA ISSAVSEGYK IVVAASTGNT
AASAAAYSAR AGLKIYLVLP KDKVAIGKLA QSILYGATIL EVEGSFDVGM KAVMRLYKDV
GIVYPLNSFN PWRLEGQKTI AYEITEEIGV PDYVFVPVGN AGNIYAIWKG FTELRDAGVI
DRVPRMVGVQ AEGASPIAKA ILNNQDTPQF VENPETVATA IRIGKPVNWK KAMKAIKESQ
GTAIYVSDNE IMEAQRELAR KEGIGAEPAS VASFAGYKKA LEHGLVDRTS KTVMILTGHA
LKDPDSMIKS SARRIIVNPD HLENIILGDL NVSG