Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1698 |
Symbol | |
ID | 5105344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1637253 |
End bp | 1638437 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507592 |
Product | threonine synthase |
Protein accession | YP_001191777 |
Protein GI | 146304461 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0498] Threonine synthase |
TIGRFAM ID | [TIGR00260] threonine synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.80409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.346904 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTGTA TTGAATGTGG GTTCCAAAGC GAATTGGACC AAAAAATGAT CACTTGCCCA AGATGTGGGG GAATACTCGA GATATCAGTA AAGTTACCCC CTACATTCTC GTTCTCCAAG TTAAGAGGTA GAGGGGTCTG GAGATACTCC CCTGCTATAG CTGGAAATTA CAAGAAGATT GTGAGTATAA GTGAAGGCGG AACACCCCTA ATCAGATCAA GGGAGAACAG TGAGGTGTAT TATAAGTTTG AGGGCGCAAA CCCTACTGGT AGCTTCAAGG ATAGGGGAAT GACAGTTGCC ATCAGCTCTG CAGTAAGCGA GGGATACAAG ATTGTGGTTG CAGCGTCCAC GGGAAACACT GCAGCCTCAG CCGCAGCTTA CTCGGCGAGG GCTGGACTCA AGATTTACCT AGTTCTACCT AAGGACAAAG TTGCCATAGG TAAGTTGGCC CAATCTATCC TATATGGTGC CACGATCCTA GAGGTCGAGG GGAGTTTCGA CGTCGGTATG AAGGCTGTGA TGAGACTGTA TAAGGACGTG GGAATAGTTT ATCCGCTGAA CTCATTCAAT CCCTGGAGAC TTGAGGGCCA GAAGACTATT GCGTATGAGA TTACGGAGGA GATAGGTGTT CCCGATTACG TGTTCGTACC AGTTGGGAAT GCTGGGAATA TTTATGCAAT TTGGAAGGGC TTTACAGAAT TAAGGGATGC CGGAGTAATT GACAGGGTAC CAAGAATGGT TGGAGTACAA GCTGAAGGGG CTTCGCCAAT TGCCAAGGCA ATTCTAAACA ACCAGGACAC GCCCCAGTTT GTGGAGAACC CAGAAACCGT GGCAACAGCA ATCAGGATAG GGAAACCAGT AAATTGGAAG AAGGCCATGA AGGCCATCAA GGAATCGCAG GGAACTGCAA TCTACGTCAG CGATAACGAG ATAATGGAGG CACAGAGGGA ACTGGCCAGG AAGGAGGGAA TAGGGGCTGA ACCGGCGTCG GTGGCCTCCT TTGCTGGATA TAAGAAGGCC TTGGAACACG GACTTGTGGA TAGGACAAGC AAGACCGTCA TGATACTAAC TGGACATGCA TTAAAGGACC CCGACTCCAT GATAAAATCT TCAGCTCGAC GTATAATTGT AAATCCTGAT CATTTAGAAA ATATAATTCT AGGTGATCTG AATGTTAGTG GTTAA
|
Protein sequence | MRCIECGFQS ELDQKMITCP RCGGILEISV KLPPTFSFSK LRGRGVWRYS PAIAGNYKKI VSISEGGTPL IRSRENSEVY YKFEGANPTG SFKDRGMTVA ISSAVSEGYK IVVAASTGNT AASAAAYSAR AGLKIYLVLP KDKVAIGKLA QSILYGATIL EVEGSFDVGM KAVMRLYKDV GIVYPLNSFN PWRLEGQKTI AYEITEEIGV PDYVFVPVGN AGNIYAIWKG FTELRDAGVI DRVPRMVGVQ AEGASPIAKA ILNNQDTPQF VENPETVATA IRIGKPVNWK KAMKAIKESQ GTAIYVSDNE IMEAQRELAR KEGIGAEPAS VASFAGYKKA LEHGLVDRTS KTVMILTGHA LKDPDSMIKS SARRIIVNPD HLENIILGDL NVSG
|
| |