Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0416 |
Symbol | |
ID | 7271442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 432579 |
End bp | 433877 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643569061 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002465513 |
Protein GI | 219851081 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.741705 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTGTTA TGCATAGCCT GATCCATGAA TGTCTCCATG GTGTGCCTTC AGAGGTCGAA ACCATTGCAC GGGACGAAGG ACTTGTACCC CGACAGGCCG CACGCGCCGT CGCACGGGGC AGGATCGCCG TGCTTGCCAA CCCGGTTCGG CCACATCGGA TCGCTGCGGT CGGCGAGGGG TGCCGGGTCA AGGTGAACGT GAACATCGGC ACCTCTGGAC AGCGGTGCGA CCCGGTTCTT GAGATGGAGA AGGGGCGGGC GGCTCTGGCC GAGGGTGCCG ACGCGCTGAT GGATCTATCG ACCGGCGGGG ATCTGCAGGC GATTCGAAAG GAGATCCTGA CCCTCGACGC TCCGGTCGGG ACGGTTCCGA TCTATGAGGC GGTCCGGCGA GCCGGGAATG TCACCGACGT CGACGCCGAC CTGCTCTTCC GGGTGATCCG CGAGCACTGC AAGCAGGGGG TCGACTTCCT GACCCTCCAC TGCGGGGTGA ACCTGCAGGC CCTCGATGCG CTGAAGTCCG ACCCGCGGAT CATGGGCGTC GTCTCCCGCG GCGGGTCGTT CCATGTCGCG ATGATGCTCT CGAGCGGTGA GGAGAACCCG CTCTATGCCG AGTTCGATTA CCTGCTCGAG ATACTCGCCG ACACCGGCGT CGTGATCAGT CTCGGTGACG GGATGCGCCC GGGGTGCATG CAGGATGCCG AGCGGCATGC AAAGGCGACC GAGTACCTCA CGCTCGGCCG GCTGGCCAAA CGCTCGCTCG ATGCTGGGGT GCAGCGGATG ATCGAGGGGC CGGGGCACAT TCCGATCACG CAGGTCGGTT ACAATGTGAA GATGATCAAG GAACTGACCG ATGACGCTCC CCTGTATCTG CTCGGGCCGC TGGTCACCGA TATCGGGGCC GGCTACGACC ACGTGGTCGG TGCGATCGGA GGGGCGATCG CCTGTATGAA CGGGGCCGAC TTCCTCTGCA TGGTCTCCCC GAGTGAGCAC CTGGCCCTGC CGGACGTGCA GGACATCATC GAGGGGACCC GCGTTGCCTG CCTGTCTGCT CATATCGGGG ACCTGGCACG GGACCCGAAG GGGCATTACA TGCAGCGCGA GGTGCAGATG GCAGAGGCCC GGCGGAGGCT CGACTGGGAC GAGCAGTTCA AGCTGGCCCT CTTCGGTGAG CGGGCGAAGG CGGTCCATGT CCGTGACGGC GAGACCGAGA CCTGTTCTAT GTGCGGGGAT CTCTGTGCAC TCAAGATCGT GGATAAACTG CTCCCTGCAC CTGATGTGAC TCCAAAAAAA GAAGAATAA
|
Protein sequence | MCVMHSLIHE CLHGVPSEVE TIARDEGLVP RQAARAVARG RIAVLANPVR PHRIAAVGEG CRVKVNVNIG TSGQRCDPVL EMEKGRAALA EGADALMDLS TGGDLQAIRK EILTLDAPVG TVPIYEAVRR AGNVTDVDAD LLFRVIREHC KQGVDFLTLH CGVNLQALDA LKSDPRIMGV VSRGGSFHVA MMLSSGEENP LYAEFDYLLE ILADTGVVIS LGDGMRPGCM QDAERHAKAT EYLTLGRLAK RSLDAGVQRM IEGPGHIPIT QVGYNVKMIK ELTDDAPLYL LGPLVTDIGA GYDHVVGAIG GAIACMNGAD FLCMVSPSEH LALPDVQDII EGTRVACLSA HIGDLARDPK GHYMQREVQM AEARRRLDWD EQFKLALFGE RAKAVHVRDG ETETCSMCGD LCALKIVDKL LPAPDVTPKK EE
|
| |