Gene Mpal_0416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0416 
Symbol 
ID7271442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp432579 
End bp433877 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID643569061 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002465513 
Protein GI219851081 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.741705 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGTTA TGCATAGCCT GATCCATGAA TGTCTCCATG GTGTGCCTTC AGAGGTCGAA 
ACCATTGCAC GGGACGAAGG ACTTGTACCC CGACAGGCCG CACGCGCCGT CGCACGGGGC
AGGATCGCCG TGCTTGCCAA CCCGGTTCGG CCACATCGGA TCGCTGCGGT CGGCGAGGGG
TGCCGGGTCA AGGTGAACGT GAACATCGGC ACCTCTGGAC AGCGGTGCGA CCCGGTTCTT
GAGATGGAGA AGGGGCGGGC GGCTCTGGCC GAGGGTGCCG ACGCGCTGAT GGATCTATCG
ACCGGCGGGG ATCTGCAGGC GATTCGAAAG GAGATCCTGA CCCTCGACGC TCCGGTCGGG
ACGGTTCCGA TCTATGAGGC GGTCCGGCGA GCCGGGAATG TCACCGACGT CGACGCCGAC
CTGCTCTTCC GGGTGATCCG CGAGCACTGC AAGCAGGGGG TCGACTTCCT GACCCTCCAC
TGCGGGGTGA ACCTGCAGGC CCTCGATGCG CTGAAGTCCG ACCCGCGGAT CATGGGCGTC
GTCTCCCGCG GCGGGTCGTT CCATGTCGCG ATGATGCTCT CGAGCGGTGA GGAGAACCCG
CTCTATGCCG AGTTCGATTA CCTGCTCGAG ATACTCGCCG ACACCGGCGT CGTGATCAGT
CTCGGTGACG GGATGCGCCC GGGGTGCATG CAGGATGCCG AGCGGCATGC AAAGGCGACC
GAGTACCTCA CGCTCGGCCG GCTGGCCAAA CGCTCGCTCG ATGCTGGGGT GCAGCGGATG
ATCGAGGGGC CGGGGCACAT TCCGATCACG CAGGTCGGTT ACAATGTGAA GATGATCAAG
GAACTGACCG ATGACGCTCC CCTGTATCTG CTCGGGCCGC TGGTCACCGA TATCGGGGCC
GGCTACGACC ACGTGGTCGG TGCGATCGGA GGGGCGATCG CCTGTATGAA CGGGGCCGAC
TTCCTCTGCA TGGTCTCCCC GAGTGAGCAC CTGGCCCTGC CGGACGTGCA GGACATCATC
GAGGGGACCC GCGTTGCCTG CCTGTCTGCT CATATCGGGG ACCTGGCACG GGACCCGAAG
GGGCATTACA TGCAGCGCGA GGTGCAGATG GCAGAGGCCC GGCGGAGGCT CGACTGGGAC
GAGCAGTTCA AGCTGGCCCT CTTCGGTGAG CGGGCGAAGG CGGTCCATGT CCGTGACGGC
GAGACCGAGA CCTGTTCTAT GTGCGGGGAT CTCTGTGCAC TCAAGATCGT GGATAAACTG
CTCCCTGCAC CTGATGTGAC TCCAAAAAAA GAAGAATAA
 
Protein sequence
MCVMHSLIHE CLHGVPSEVE TIARDEGLVP RQAARAVARG RIAVLANPVR PHRIAAVGEG 
CRVKVNVNIG TSGQRCDPVL EMEKGRAALA EGADALMDLS TGGDLQAIRK EILTLDAPVG
TVPIYEAVRR AGNVTDVDAD LLFRVIREHC KQGVDFLTLH CGVNLQALDA LKSDPRIMGV
VSRGGSFHVA MMLSSGEENP LYAEFDYLLE ILADTGVVIS LGDGMRPGCM QDAERHAKAT
EYLTLGRLAK RSLDAGVQRM IEGPGHIPIT QVGYNVKMIK ELTDDAPLYL LGPLVTDIGA
GYDHVVGAIG GAIACMNGAD FLCMVSPSEH LALPDVQDII EGTRVACLSA HIGDLARDPK
GHYMQREVQM AEARRRLDWD EQFKLALFGE RAKAVHVRDG ETETCSMCGD LCALKIVDKL
LPAPDVTPKK EE