Gene Mpal_1207 details


Gene Information

Locus tag: Mpal_1207
Symbol:
ID: 7271485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1240875
End bp: 1242347
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 56%
IMG OID: 643569844
Product: homoserine O-acetyltransferase
Protein accession: YP_002466268
Protein GI: 219851836
COG category: [E] Amino acid transport and metabolism
    [R] General function prediction only
COG ID: [COG0517] FOG: CBS domain
    [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
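
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state this, but the reported gene length matches that convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1240875, 1242347)
print(length)                  # 1473
print(protein_length(length))  # 490
```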


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0144114
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCAGAGAG GCTCAGTAGG GATCAGTACC ACCTCAACCT TTACCCTTGC CACACCCCTC 
CTACTGGAGA GTGGTGCTTC ACTGTTCTCC GTTCAGATTG CGTACGAGAC CTATGGAACG
CTGAACCATG ATAAGAGCAA TGCAATCCTG GTCTGTCATG CCCTGACTGG TGACGCCCAT
GCAGCAGGCC ACCATGGGGA CGAGTCACGT CCTGGCTGGT GGGACGGGGT GATCGGCCCG
GGAAAGGCCT TCGATACGGA TAAGTATTTT GTGATCTGTT CGAACGTCCT CGGGGGCTGT
AAGGGGACGA CCGGGCCGGC ATCACAAAAT CCTGATACAG GAAAACCCTA CGGCACCTCA
TTCCCGGTAG TGACGATTCG GGACATGGTG AACGTACAGA AGGCACTGAT CGATCACCTG
GGCATCAGCC AGCTCTTTGC AGTCGCCGGC GGATCGATGG GAGGCATGCA GGTGCTGCAG
TGGATGGTCT CCTATCCATC GATGGTCAGG AAGGCGATCG CCATAGCGGC AACAGGGTCT
TCAACCCCAC AGCAGATCGC GTTCAACGAA GTAGGAAGGA AGGCGATCAC TGCCGACCCT
GCATGGTGTG GTGGTGACTA CTATGGAAAG GAGCACCCGG TGAAGGGGCT TTCGCTCGCA
CGGATGGTCG CCCATATCAC CTACCTGAGC GATGCTTCAA TGCACACCAA GTTCGGACGG
GCCCTGCAGG ACCGGGAGTT CAGAGGGTTC GACTTCGACA CCGAATTTCA GGTCGAGAGT
TATCTGCACC ACCAGGGCAC CTCTTTCACC AAACGGTTCG ATGCGAACTC ATACCTGTAT
CTGACCAAGG CTGTCGACTA CTTCGATCTC TCCGTCGACG ACTCGTTGAT CAGCGGGTTC
GCTCCAACGA AAGCGACGGT GCTGATCATA TCGGTCACCT CGGACTGGCT GTACCCACCG
TATCAGTCAC AGGAGATCGT ATCGGCCCTC TCGGCCAACG AATGCGATGT TCATTACTGC
GAACTCCGCT CCCAGTTCGG GCATGATGCG TTCCTGATTG AGACCGGGCA ACTCAACTAC
AGTATCAGTA GATTCCTCGA CCACACCCTG GTCAGGGATG TGATGAACAC ACAGGTGCCG
GTGATCAGCG AGCAGTCGAC GATCGCTGTC GCTGCCCGGA TGATGATCAC ACAGGGAGTG
AACCACCTCC CGGTTCTCGC CCCGGATCAG AGTCTAGTTG GGATTGTGAC CTCATGGGAT
ATCGCAAACG CGGTAGCCTG CGGATATACC AGCCTCGATC AGATCATGTC CTCACAGGTG
ATCACAACAA CAGGAGACGA GACGATCGAG GTGGCCGCAT CCCGTATGGA GCAGCATCGG
ATATCAGCCC TCCCGGTGAT CGACCAGGCA CAGCATGTGA TCGGACTAAT CTCAAGCGAT
GGACTCAGCA AGTTGATCGG TAGGGGTCCA TAA
 
Protein sequence
MQRGSVGIST TSTFTLATPL LLESGASLFS VQIAYETYGT LNHDKSNAIL VCHALTGDAH 
AAGHHGDESR PGWWDGVIGP GKAFDTDKYF VICSNVLGGC KGTTGPASQN PDTGKPYGTS
FPVVTIRDMV NVQKALIDHL GISQLFAVAG GSMGGMQVLQ WMVSYPSMVR KAIAIAATGS
STPQQIAFNE VGRKAITADP AWCGGDYYGK EHPVKGLSLA RMVAHITYLS DASMHTKFGR
ALQDREFRGF DFDTEFQVES YLHHQGTSFT KRFDANSYLY LTKAVDYFDL SVDDSLISGF
APTKATVLII SVTSDWLYPP YQSQEIVSAL SANECDVHYC ELRSQFGHDA FLIETGQLNY
SISRFLDHTL VRDVMNTQVP VISEQSTIAV AARMMITQGV NHLPVLAPDQ SLVGIVTSWD
IANAVACGYT SLDQIMSSQV ITTTGDETIE VAASRMEQHR ISALPVIDQA QHVIGLISSD
GLSKLIGRGP
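
Both sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not the database's own tooling: it joins the blocks, computes a GC fraction, and translates in frame until the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 uses the same amino acid assignments and differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 assigns the same residues).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last display lines of the gene sequence in this record.
first_line = "ATGCAGAGAG GCTCAGTAGG GATCAGTACC ACCTCAACCT TTACCCTTGC CACACCCCTC"
last_line = "GGACTCAGCA AGTTGATCGG TAGGGGTCCA TAA"

print(translate(clean(first_line)))  # MQRGSVGISTTSTFTLATPL
print(translate(clean(last_line)))   # GLSKLIGRGP
```

The two printed fragments match the start and end of the listed protein sequence. Each full display line holds 60 bases, so every line begins in frame and can be translated on its own.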