Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1904 |
Symbol | |
ID | 5103291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1848920 |
End bp | 1849945 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507791 |
Product | L-myo-inositol-1-phosphate synthase |
Protein accession | YP_001191968 |
Protein GI | 146304652 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1260] Myo-inositol-1-phosphate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0868076 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATAAGAG TTGGCATAGT GGGCGTAGGA AACGTTGCTT CCTCCCTTGT TCAGGGCGTG GAATACGTTA AGCAAGGAGG ACGCGTGCCA GGAATACTTG ACCTAGAGTA TAGGCCAGAG GAAATTGATA TTGTTACTGC CTTTGACATT GATGCGAGAA AAGTGGGTAA GAAGCTGTCC CAGGCTATCT TTGAGAAACC TAACGTCGTG GAAAAGTACG TGGATGTGAG GTCTGACGTT GTCGTGCTTA GGGGACCTAC CCTGGACGGG ACTGAGGGAA TACTGGGTAA GGTGATTGAG GAGTCCAAGG AGAGGCCAGT GGACGTTAAA TCCGTTCTCA GGGAGAACAA GGTAGACGTG GTTGTTAACC TTCTTCCCAC GGGGGTTGAG AGGGCGAGCG AGTACTACGC AGTTAACTCG CTGGAGGCAG GCTCTTCCCT AGTTAATGCC TCGCCTGCTC CTCTAGTGGA GAGGTTTGAG GAGAAGTTCA AATCCGCTGG TTTACCTCTC CTAGGTGACG ACCTCATAAG CCAGATAGGT GGAACTGCTC TTCACGCAGG TATAATCAAC TTCCTTACCG AGAGGGGGGT AAAGGTAACT AGGTCCTATC AGATTGATAT CTCAGGAACC ACTGAGACTC TCGTGACCCT AGAGGATTCC AGAAAGGAAC TCAAGAAGAG GATAAAGTCA TCATACATCT CCAGCCAGCA AGATGGAGTT GAAGTGGTTG CTGGAACTTC CGACTACGTA GAATTTCTAG GGGATAGAAG GGTAAGTTAC ATGGTGATAG AGGGAGAATA TGCCCTAGGT GCTAAGGTAA GGATTGACGT TTCCATGAAG AGCTTGGATG GGCCTAACGC TGTGGCACCT CTCCTCGACC TGATTAGGCT CGCAAAGCTG TTGAAGGATA GGGGTATAGG TGGATCTCCT CCTCAGATAT GCTCCCATTA CTTTAAGGGG TACCACGGAA AAGTAGGTGG AGATACTAGG GCTAGTCTCA TCAATTTCAT TCAAGCCCTG AAGTGA
|
Protein sequence | MIRVGIVGVG NVASSLVQGV EYVKQGGRVP GILDLEYRPE EIDIVTAFDI DARKVGKKLS QAIFEKPNVV EKYVDVRSDV VVLRGPTLDG TEGILGKVIE ESKERPVDVK SVLRENKVDV VVNLLPTGVE RASEYYAVNS LEAGSSLVNA SPAPLVERFE EKFKSAGLPL LGDDLISQIG GTALHAGIIN FLTERGVKVT RSYQIDISGT TETLVTLEDS RKELKKRIKS SYISSQQDGV EVVAGTSDYV EFLGDRRVSY MVIEGEYALG AKVRIDVSMK SLDGPNAVAP LLDLIRLAKL LKDRGIGGSP PQICSHYFKG YHGKVGGDTR ASLINFIQAL K
|
| |