Gene Mpal_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2019 
Symbol 
ID7272000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2140968 
End bp2142119 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content64% 
IMG OID643570633 
Productproposed homoserine kinase 
Protein accessionYP_002467043 
Protein GI219852611 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3635] Predicted phosphoglycerate mutase, AP superfamily 
TIGRFAM ID[TIGR00306] 2,3-bisphosphoglycerate-independent phosphoglycerate mutase, archaeal form
[TIGR02535] proposed homoserine kinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.734369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.229701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTACC TGCTCGTCCT CGGAGACGGA ATGGCCGACG AACCGATCCC GGAGCTGGGT 
AACCGGACGC CGCTCGCCTA TGCGAACACC CCGAACATGG ATCGGATCGC ACGCGAGGGG
AGGTCCGGGC AGGTGCAGAC GGTCCCGGAC GGTTTTGAAC CCGGCAGCGA TGTCGCCAAC
CTCTCGATCC TCGGCTATCA TCCGGCCCGG TTCTATACCG GCCGGGGTCC ACTCGAGGCC
GTGAACATGG GGGTCGACCT GACCGACGAC CAGATCGCCT ACCGCTGCAA CCTGGTCACG
ATCAGAGACG GGGTGATGCA GGACTTCAGC GCCGGGCATA TCACCTCAGC CGAGGGGGCG
GCCCTGTTCA AATCCCTGCA GGAGTACCTG CCGGAGGTGA AGCTCGTCTC AGGGGTCAGT
TACCGGAATC TGCTGGTCGT CGACAGGGGG AGGGGGGCCG AGGGAAAGGC ACCCCACGAC
ATCGTCGGCG AGGAGATCGA GCAGTACCTG CCGCACGGTG AGGATGCACC ACTGCTTCGG
GCCTGCATCG AGAAGAGCAT CGAGGTCTTC GCCGATCACC CGGTGAACCG GGACCGCCTG
GCCAGGGGAT TGCCGGCTGC GACGATGATC TGGCCGTGGA GCGGTGGCAA GCGCCCGGCT
CTGATCCCAT TTCAGGAGAA GTACGGAAAG AAGGGCGGGA TGATCTCGGC GGTCGACCTG
CTGAACGGGA TCGCCCGGTA CGCGGATATG AAGGTGATCA CCGTCCCCGG GGCGACCGGT
TACCTGGACA CCGACTACCA GGCCAAGGCC CGGTATGCCA TCGAGGCGCT CAAAGACCTC
GACTTTCTGT ACCTGCATGT CGAGGCCCCG GACGAGGCCG GGCATCTCGG CTCGCTCAAG
GAGAAGGTGA AGGCGATCGA ACGGGTCGAC GAGATGATCG GCACCATCAT GGCCGGCTTC
GACGGCGTGA TTGCCGTGCT CCCCGACCAT GCCACCCCAA TCCGGCTGAA GACCCATACG
CGAGGTCCGG TCCCCTGTGC AGTGCTCGGA AAGGGAAAGG ATGAGACAGA AGTATTCTCA
GAAGAAGCGG CGGCGAACGG GTCGCTCGGG ATGATCCGGG GGGATCTGTT CCTGACGGAA
CTCTTCTCCT GA
 
Protein sequence
MKYLLVLGDG MADEPIPELG NRTPLAYANT PNMDRIAREG RSGQVQTVPD GFEPGSDVAN 
LSILGYHPAR FYTGRGPLEA VNMGVDLTDD QIAYRCNLVT IRDGVMQDFS AGHITSAEGA
ALFKSLQEYL PEVKLVSGVS YRNLLVVDRG RGAEGKAPHD IVGEEIEQYL PHGEDAPLLR
ACIEKSIEVF ADHPVNRDRL ARGLPAATMI WPWSGGKRPA LIPFQEKYGK KGGMISAVDL
LNGIARYADM KVITVPGATG YLDTDYQAKA RYAIEALKDL DFLYLHVEAP DEAGHLGSLK
EKVKAIERVD EMIGTIMAGF DGVIAVLPDH ATPIRLKTHT RGPVPCAVLG KGKDETEVFS
EEAAANGSLG MIRGDLFLTE LFS