Gene Mboo_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0044 
Symbol 
ID5411344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp38425 
End bp39702 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content63% 
IMG OID640867258 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001403211 
Protein GI154149593 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTATCC GCAGCTGCTC CTCTGGTATT CCTGCTGTTG TCGCAGAAGC GGCACGGGCA 
GAGGGCATGG ATGCGGAACG GATGGCCCGG CGCATTGCAG CCGGGCGGAT TGTGATCCCG
GCAAACCCCG CACGCCGGCA CCGCCCCTGC GCCATCGGGC AGGGCTGCAC GGTTAAGGTG
AACGTGAACG TCGGGACCTC CGGCTCCCGG TGTGAGTATG CCCTTGAGGA AAAGAAGGCA
AAAGCGGCGC TCGATAACGG CGCCGATGCC ATCATGGACC TTTCCACCGG GGGGGACCTT
GTTGCGATCC GCAAAAAAAT GCTTGCCCTT GATACCACGG TCGGGACGGT GCCGGTGTAC
GAGGCGGTGC GCCGGGCGGG AAACGCTGCC GATGTGGATG CCGACCTGCT CTTTAAGGTC
ATCCGGGAGC ACTGCAAACA GGGCGTGGAT TTCCTCACCC TCCACTGCGG GGTGAACCGG
CAGGCGCTCG ATGCCCTGAA GCTCGACCCG CGCCTGATGG GGGTGGTGAG CCGGGGCGGG
ACATTCCATT GTGCAATGAT GATGCTCCGG GACGAGGAGA ACCCGCTTTT TTCGGAGTTC
GACTACCTGC TGGAGATCCT CGCAGAACAC GATGTCACCA TCAGCCTCGG GGACGGGATG
CGGCCCGGCT GCCTGCAGGA CGCGGGAAAA CTTGCCAAAT CCGTGGAGTA CGTGACGCTC
GGGACGCTTG CACAGCGTGC CCTTGCCGCG GGCGTCCAGC GGATGATCGA GGGACCGGGG
CACATGCCCT ATGACCAGGT GGGCTACAAC GTGCGGATGA TAAAAGAGAT CACCGCCCAT
GCGCCGCTCT ACCTGCTCGG CCCCATCGTC ACCGACATCG CGCCAGGCTA CGACCACGTT
GTCGCGGCCA TCGGCGGGGC GGAGGCCTGC CGGAACGGGG CTGACTTCCT CTGCATGGTC
TCTCCCAGCG AACACCTCGC TCTTCCGGAC GTTGAGGATA TCATCGAAGG AACCCGGATC
GCAAAGATCG CGGCGCATGT AGGCGATACC GTCCGGCGGC ACGAAGGATA CCGGATGGAA
CGCGAGGTGC AGATGGCAGA GGCCCGGCAC GATCTTGACT GGGACGCCCA GTTCCGGCTG
GCACTCTATG GCGATCACGC AAAGGCGATC CATGTCCGGG ACGGGGACAC GGACACCTGC
TCGATGTGCG GGGACCTCTG TGCGATCAAG ATGGTGCGGG AACTCTTCGA AGGTGCAGCC
TCAAAGAAGA AAGAATAA
 
Protein sequence
MLIRSCSSGI PAVVAEAARA EGMDAERMAR RIAAGRIVIP ANPARRHRPC AIGQGCTVKV 
NVNVGTSGSR CEYALEEKKA KAALDNGADA IMDLSTGGDL VAIRKKMLAL DTTVGTVPVY
EAVRRAGNAA DVDADLLFKV IREHCKQGVD FLTLHCGVNR QALDALKLDP RLMGVVSRGG
TFHCAMMMLR DEENPLFSEF DYLLEILAEH DVTISLGDGM RPGCLQDAGK LAKSVEYVTL
GTLAQRALAA GVQRMIEGPG HMPYDQVGYN VRMIKEITAH APLYLLGPIV TDIAPGYDHV
VAAIGGAEAC RNGADFLCMV SPSEHLALPD VEDIIEGTRI AKIAAHVGDT VRRHEGYRME
REVQMAEARH DLDWDAQFRL ALYGDHAKAI HVRDGDTDTC SMCGDLCAIK MVRELFEGAA
SKKKE