Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_0044 |
Symbol | |
ID | 5411344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 38425 |
End bp | 39702 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640867258 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001403211 |
Protein GI | 154149593 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTATCC GCAGCTGCTC CTCTGGTATT CCTGCTGTTG TCGCAGAAGC GGCACGGGCA GAGGGCATGG ATGCGGAACG GATGGCCCGG CGCATTGCAG CCGGGCGGAT TGTGATCCCG GCAAACCCCG CACGCCGGCA CCGCCCCTGC GCCATCGGGC AGGGCTGCAC GGTTAAGGTG AACGTGAACG TCGGGACCTC CGGCTCCCGG TGTGAGTATG CCCTTGAGGA AAAGAAGGCA AAAGCGGCGC TCGATAACGG CGCCGATGCC ATCATGGACC TTTCCACCGG GGGGGACCTT GTTGCGATCC GCAAAAAAAT GCTTGCCCTT GATACCACGG TCGGGACGGT GCCGGTGTAC GAGGCGGTGC GCCGGGCGGG AAACGCTGCC GATGTGGATG CCGACCTGCT CTTTAAGGTC ATCCGGGAGC ACTGCAAACA GGGCGTGGAT TTCCTCACCC TCCACTGCGG GGTGAACCGG CAGGCGCTCG ATGCCCTGAA GCTCGACCCG CGCCTGATGG GGGTGGTGAG CCGGGGCGGG ACATTCCATT GTGCAATGAT GATGCTCCGG GACGAGGAGA ACCCGCTTTT TTCGGAGTTC GACTACCTGC TGGAGATCCT CGCAGAACAC GATGTCACCA TCAGCCTCGG GGACGGGATG CGGCCCGGCT GCCTGCAGGA CGCGGGAAAA CTTGCCAAAT CCGTGGAGTA CGTGACGCTC GGGACGCTTG CACAGCGTGC CCTTGCCGCG GGCGTCCAGC GGATGATCGA GGGACCGGGG CACATGCCCT ATGACCAGGT GGGCTACAAC GTGCGGATGA TAAAAGAGAT CACCGCCCAT GCGCCGCTCT ACCTGCTCGG CCCCATCGTC ACCGACATCG CGCCAGGCTA CGACCACGTT GTCGCGGCCA TCGGCGGGGC GGAGGCCTGC CGGAACGGGG CTGACTTCCT CTGCATGGTC TCTCCCAGCG AACACCTCGC TCTTCCGGAC GTTGAGGATA TCATCGAAGG AACCCGGATC GCAAAGATCG CGGCGCATGT AGGCGATACC GTCCGGCGGC ACGAAGGATA CCGGATGGAA CGCGAGGTGC AGATGGCAGA GGCCCGGCAC GATCTTGACT GGGACGCCCA GTTCCGGCTG GCACTCTATG GCGATCACGC AAAGGCGATC CATGTCCGGG ACGGGGACAC GGACACCTGC TCGATGTGCG GGGACCTCTG TGCGATCAAG ATGGTGCGGG AACTCTTCGA AGGTGCAGCC TCAAAGAAGA AAGAATAA
|
Protein sequence | MLIRSCSSGI PAVVAEAARA EGMDAERMAR RIAAGRIVIP ANPARRHRPC AIGQGCTVKV NVNVGTSGSR CEYALEEKKA KAALDNGADA IMDLSTGGDL VAIRKKMLAL DTTVGTVPVY EAVRRAGNAA DVDADLLFKV IREHCKQGVD FLTLHCGVNR QALDALKLDP RLMGVVSRGG TFHCAMMMLR DEENPLFSEF DYLLEILAEH DVTISLGDGM RPGCLQDAGK LAKSVEYVTL GTLAQRALAA GVQRMIEGPG HMPYDQVGYN VRMIKEITAH APLYLLGPIV TDIAPGYDHV VAAIGGAEAC RNGADFLCMV SPSEHLALPD VEDIIEGTRI AKIAAHVGDT VRRHEGYRME REVQMAEARH DLDWDAQFRL ALYGDHAKAI HVRDGDTDTC SMCGDLCAIK MVRELFEGAA SKKKE
|
| |