Gene Msed_1693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1693 
Symbol 
ID5105339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1630961 
End bp1632040 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content48% 
IMG OID640507587 
Productmyo-inositol-1-phosphate synthase 
Protein accessionYP_001191772 
Protein GI146304456 
COG category[I] Lipid transport and metabolism 
COG ID[COG1260] Myo-inositol-1-phosphate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.23989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGA TAGCAATCGC TGGCCTTGGA AATTGTGCAT CCATGCTTGT GCAAGGAATA 
GAATACTACA GGAAAATGGG CGAGAATTAC TTTGATGGAT TGATTACACC CATCATAGGC
GGTTACAAGG TAACTGACAT CAAGGTAGTA GCTGCCTTTG ATGTCTCAGT CAACAAGATA
GGAAAAGATG TGGCTGAGGC GATATTTGAG AAGCCTAACA TAACTCCTAG AATAGTTGAG
ATGGACAAGT TGGGAGTTGA GGTCTCTCCA GGTCCTGTAT TGGATGGCGT TGCCCCACAC
ATGATGAACG TGTTTAATCC CTCAAGCGAG GGGAAGATTG AGTCCGTTGT GGATGAACTG
AAAAGTAGTG GAGCAGATCT GCTCGTCAAC ATGTTGCCAG TGGGTAGCGA GATGGCCTCG
AGGGCATATG CGAGGGCATC ACTTGAGGCC AGAATAGGGT TTGTTAACGC TATCCCCGTC
TTCATAGCGA GTGACCCCAC AGGTGAATTC CCAAGGAGGT TCAGAGAGCT TGGATTACCC
ATTGCTGGTG ACGACGTGAA GGGACAAGTT GGCGCAACCA TATTTCATAG GGCCATCACC
TCGCTATTCA GATTAAGGGG GGTTAAGGTA GAGGAGACAT ATCAGCTAAA CGTAGGAGGA
AACACGGATT TCCTCAACAT GAAGACTGAG GAGAGGCTCC ACTCCAAGAG GATCAGCAAG
ACCAAGGCCG TAACGAGTAC CCTTGATAAT GAGCAGGAAA TAGAGACCCA AGGAAGGATA
AGGATAGGGC CCAGCGATTA CGTTCCATTC CTGGGAAACA CTAAGGTGGC ATACATCTAC
GTTAACGGGT CTGGGTTTGC TGGAAGGCCA GTGAAGGTGG AGGCAACCCT AGAGGTTGAC
GATAAGGCTA ACTGTGCCTC AGTACTGGTA GATGTAATAA GGGCAGTGAA GTTAGCCAAG
GACAGGGGAA TAGGAGGCCC CCTGAACGAG GTTTCTGCGT TCTACTTCAA ACATCCACCT
AAGCAGGCTA AGGATGATGA GGAGGCCTAT CTTTGGTTTA AGAAATTCAT TGAAATGTGA
 
Protein sequence
MIKIAIAGLG NCASMLVQGI EYYRKMGENY FDGLITPIIG GYKVTDIKVV AAFDVSVNKI 
GKDVAEAIFE KPNITPRIVE MDKLGVEVSP GPVLDGVAPH MMNVFNPSSE GKIESVVDEL
KSSGADLLVN MLPVGSEMAS RAYARASLEA RIGFVNAIPV FIASDPTGEF PRRFRELGLP
IAGDDVKGQV GATIFHRAIT SLFRLRGVKV EETYQLNVGG NTDFLNMKTE ERLHSKRISK
TKAVTSTLDN EQEIETQGRI RIGPSDYVPF LGNTKVAYIY VNGSGFAGRP VKVEATLEVD
DKANCASVLV DVIRAVKLAK DRGIGGPLNE VSAFYFKHPP KQAKDDEEAY LWFKKFIEM