Gene Msed_1904 details


Gene Information

Locus tag: Msed_1904
Symbol:
ID: 5103291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1848920
End bp: 1849945
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 49%
IMG OID: 640507791
Product: L-myo-inositol-1-phosphate synthase
Protein accession: YP_001191968
Protein GI: 146304652
COG category: [I] Lipid transport and metabolism
COG ID: [COG1260] Myo-inositol-1-phosphate synthase
TIGRFAM ID:
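
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1848920
end_bp = 1849945

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1026, matching "Gene Length"
print(protein_length)  # 341, matching "Protein Length"
```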


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0868076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATAAGAG TTGGCATAGT GGGCGTAGGA AACGTTGCTT CCTCCCTTGT TCAGGGCGTG 
GAATACGTTA AGCAAGGAGG ACGCGTGCCA GGAATACTTG ACCTAGAGTA TAGGCCAGAG
GAAATTGATA TTGTTACTGC CTTTGACATT GATGCGAGAA AAGTGGGTAA GAAGCTGTCC
CAGGCTATCT TTGAGAAACC TAACGTCGTG GAAAAGTACG TGGATGTGAG GTCTGACGTT
GTCGTGCTTA GGGGACCTAC CCTGGACGGG ACTGAGGGAA TACTGGGTAA GGTGATTGAG
GAGTCCAAGG AGAGGCCAGT GGACGTTAAA TCCGTTCTCA GGGAGAACAA GGTAGACGTG
GTTGTTAACC TTCTTCCCAC GGGGGTTGAG AGGGCGAGCG AGTACTACGC AGTTAACTCG
CTGGAGGCAG GCTCTTCCCT AGTTAATGCC TCGCCTGCTC CTCTAGTGGA GAGGTTTGAG
GAGAAGTTCA AATCCGCTGG TTTACCTCTC CTAGGTGACG ACCTCATAAG CCAGATAGGT
GGAACTGCTC TTCACGCAGG TATAATCAAC TTCCTTACCG AGAGGGGGGT AAAGGTAACT
AGGTCCTATC AGATTGATAT CTCAGGAACC ACTGAGACTC TCGTGACCCT AGAGGATTCC
AGAAAGGAAC TCAAGAAGAG GATAAAGTCA TCATACATCT CCAGCCAGCA AGATGGAGTT
GAAGTGGTTG CTGGAACTTC CGACTACGTA GAATTTCTAG GGGATAGAAG GGTAAGTTAC
ATGGTGATAG AGGGAGAATA TGCCCTAGGT GCTAAGGTAA GGATTGACGT TTCCATGAAG
AGCTTGGATG GGCCTAACGC TGTGGCACCT CTCCTCGACC TGATTAGGCT CGCAAAGCTG
TTGAAGGATA GGGGTATAGG TGGATCTCCT CCTCAGATAT GCTCCCATTA CTTTAAGGGG
TACCACGGAA AAGTAGGTGG AGATACTAGG GCTAGTCTCA TCAATTTCAT TCAAGCCCTG
AAGTGA
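
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as the GC content field can be recomputed. The following is a minimal sketch; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first two blocks of the sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the percentage of G and C bases in a sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as displayed.
first_blocks = "TTGATAAGAG TTGGCATAGT"
print(clean_sequence(first_blocks))
print(gc_content(first_blocks))
```

Passing the full sequence text, line breaks included, to `gc_content` gives a value that can be compared with the reported GC content.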
 
Protein sequence
MIRVGIVGVG NVASSLVQGV EYVKQGGRVP GILDLEYRPE EIDIVTAFDI DARKVGKKLS 
QAIFEKPNVV EKYVDVRSDV VVLRGPTLDG TEGILGKVIE ESKERPVDVK SVLRENKVDV
VVNLLPTGVE RASEYYAVNS LEAGSSLVNA SPAPLVERFE EKFKSAGLPL LGDDLISQIG
GTALHAGIIN FLTERGVKVT RSYQIDISGT TETLVTLEDS RKELKKRIKS SYISSQQDGV
EVVAGTSDYV EFLGDRRVSY MVIEGEYALG AKVRIDVSMK SLDGPNAVAP LLDLIRLAKL
LKDRGIGGSP PQICSHYFKG YHGKVGGDTR ASLINFIQAL K
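
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is an illustration, not the database's own method: it builds the codon table from the standard compact ordering, and it assumes that an alternative initiation codon such as TTG in the first position is read as methionine. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Codon table in T, C, A, G order; sense codons in table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, as displayed.
first_row = "TTGATAAGAG TTGGCATAGT GGGCGTAGGA AACGTTGCTT CCTCCCTTGT TCAGGGCGTG"
print(translate(first_row))  # MIRVGIVGVGNVASSLVQGV
```

The output matches the first twenty residues of the protein sequence, including the leading M that comes from the TTG start codon.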