Gene Mpal_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0354 
Symbol 
ID7272659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp376600 
End bp377808 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content34% 
IMG OID643569006 
Productglycosyl transferase group 1 
Protein accessionYP_002465458 
Protein GI219851026 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000079613 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.480518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAC TTCAGATCAT CTCCTCTTTC CCACCCGCTT ATTCGTATGG GGGGCCCCTT 
CAGGTTGCAT ATAATTTATC GAAAAATCTT GTGAAACAAG GGGATGAAGT TACTGTATAT
AGCACAGATG TCCTCGATCA AACTCATAGG AACAAAAATG TTGAAAATCC TGGGTTTCTT
GATGGGATAA AAATATTCAG GTTTAAAAAT ATCAATAATC GACTGGCATC AAAATATCGA
ATATCCTGTG CTCCACGTAT TGCATATGCA TTGAATCTCA ATATTAAAAA TTATGATATT
GTTCATTGTC ATGAATATCG CGCATTTGAA GCAATTGTGA TGCATCATTA TGCAAAAAAA
TATCATATCC CTTATATTGT TCAGGCTCAC GGTGCGGTAC TGCCAGTCTT TGAAAAACAA
GGAATAAAAA AAATATATGA TATTCTTTGG GGAAACCAAA TATTAAGAGA TGCCTCAAAA
TTAATTGCGG TATCAAAAGT TGAGAAAGAC CAGTACTTGA AAATGGGAAT GCCGGAAAAC
AAAATTGTAA TTATTCCAAA TGGAATTGAT GTGTCTGAAT ATGAAACACT TCCGGAACGA
GGAATATTTC GGAAAAAATA TGGAATTGCA TCCGATGAAA AAGTCGTTCT CTATTTGGGA
CGGTTGCATA AAAGAAAAGG AATTGATTTT CTTATTAATA CTTTTTCAAG TCTTTTGGAC
TTAAAAAAGG ACAATAAACT AATTATTGCA GGTCCGGATG ATGGATTTCT CGAGATCCTT
ATGGAAATTA TAAAAAAATT AAAAATTGGT AAGAATGTTT TGGTTACCGG CTCTCTTTCT
AAAGTTGAAA AAATCGAAGC ATTCGTTGAT GCTGATGTGT TAGTATATCC TGGAATATTG
GAAATTTTTG GGCTGGTACC ATTTGAAGCT ATAATGTGCG GAACACCGGT TATAGTGACG
GACGACTGTG GGTGTGGCGA GGTAATTAAA GAAGCTAAAT GCGGATATTT AGTAAAGTAC
GGAGATACTG ATGATTTGAG AGGAAGAATA TTAAATGTCC TTAATGATGA CATCCAATCA
AACCTGTTCG TGAGGGATGG GCAAAAATTT ATTAAAGATA ATTTATCCTG GTCATATCTC
ATTGAAGTTG TAAAACAAAC ATATTTGGAA TGTATTTCTC AATCAAATGG AGTAAAACAT
GAATTATAA
 
Protein sequence
MKILQIISSF PPAYSYGGPL QVAYNLSKNL VKQGDEVTVY STDVLDQTHR NKNVENPGFL 
DGIKIFRFKN INNRLASKYR ISCAPRIAYA LNLNIKNYDI VHCHEYRAFE AIVMHHYAKK
YHIPYIVQAH GAVLPVFEKQ GIKKIYDILW GNQILRDASK LIAVSKVEKD QYLKMGMPEN
KIVIIPNGID VSEYETLPER GIFRKKYGIA SDEKVVLYLG RLHKRKGIDF LINTFSSLLD
LKKDNKLIIA GPDDGFLEIL MEIIKKLKIG KNVLVTGSLS KVEKIEAFVD ADVLVYPGIL
EIFGLVPFEA IMCGTPVIVT DDCGCGEVIK EAKCGYLVKY GDTDDLRGRI LNVLNDDIQS
NLFVRDGQKF IKDNLSWSYL IEVVKQTYLE CISQSNGVKH EL