Gene Mbar_A1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1040 
Symbol 
ID3625041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1267652 
End bp1268752 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content45% 
IMG OID637699930 
Productmyo-inositol-1-phosphate synthase 
Protein accessionYP_304588 
Protein GI73668573 
COG category[I] Lipid transport and metabolism 
COG ID[COG1260] Myo-inositol-1-phosphate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA TAAAAATAGC AATTGCAGGA ATCGGGAACT GTGCAAGCTC TTTGATACAG 
GGCATTGAGT ACTATAAAGT CGACGATAAA GAACCCATAG GACTGATGCA CAGGGAAATT
GGAGGGTATA AACCTGGCGA CATTAAGGTT GTTGCTGCCT TTGACATTGA TGCCAGAAAA
GTAGGAAAAG ACGTCTCTGA GGCCATCTTT GCTCCCCCGA ACTGCACAGC AGTTTTTTGT
CCTGATGTTC CGTCCACAGG TGTGAAGGTT AAAATGGGCA GGGTTCTCGA TGGGGTCTCT
GACCATATGA AAAACTATAA AGAAAGTCAA ACTTTTGTTG TCAGCAAGGA ACAGGAAGCT
ACAAAAGCCG ATATCGTGAA CGAAATTAAA AACTCGGGTG CTGATATGCT TCTCAATTAC
CTGCCTGTAG GTTCTGAAGA AGCGGTCCGT TTTTATGCCG AGTGTGCTCT TGAAGCAGGT
GTGGCTTTAG TTAATAATAT GCCTGTTTTT ATCGCAAGCA ATCCAGAATG GGCAAAAAAA
TTTGAGGAAA AGAATATTCC TATCATTGGT GATGATGTTA AAGCTCAACT CGGAGCTACC
ATCACGCATA GGATTCTTGC AGACCTTTTC GAAAAACGAG GCGTAAAACT TGAGAGAACA
TACCAGCTAA ATACCGGGGG AAATACGGAT TTCCTGAATA TGCTCAACAG GAACAGGCTC
GCTTCCAAAC GGGAATCAAA GACCGAAGCT GTTCAGTCCG TGCTTTCCAG GAAGCTTGAT
GACGACAATA TCCACATCGG ACCCAGCGAC TACGTAGCCT GGCAAAAGGA CAACAAGATT
TGCTTCCTGA GAATGGAAGG TAAGCTTTTT GGAGATGTGC CTATGAACCT TGAACTCCGC
CTTTCCGTAG AAGACTCTCC TAACTCCGGA GGTGTGGTAA TCGATGCCAT CCGATGCTGT
AAGCTGGCTC TTGATCGCGG AATAGGAGGC GTGTTGTATT CCCCGGCTTC CTACTTTATG
AAACACCCGG CAATTCAGTA CCCTGATGAC GAGGCTTACA GGCGGACTGA AGAATTCATA
TCTGGAACCA GAGAACGTTA A
 
Protein sequence
MTKIKIAIAG IGNCASSLIQ GIEYYKVDDK EPIGLMHREI GGYKPGDIKV VAAFDIDARK 
VGKDVSEAIF APPNCTAVFC PDVPSTGVKV KMGRVLDGVS DHMKNYKESQ TFVVSKEQEA
TKADIVNEIK NSGADMLLNY LPVGSEEAVR FYAECALEAG VALVNNMPVF IASNPEWAKK
FEEKNIPIIG DDVKAQLGAT ITHRILADLF EKRGVKLERT YQLNTGGNTD FLNMLNRNRL
ASKRESKTEA VQSVLSRKLD DDNIHIGPSD YVAWQKDNKI CFLRMEGKLF GDVPMNLELR
LSVEDSPNSG GVVIDAIRCC KLALDRGIGG VLYSPASYFM KHPAIQYPDD EAYRRTEEFI
SGTRER