Gene Mboo_2201 details


Gene Information

Locus tag: Mboo_2201
Symbol:
ID: 5411221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2276222
End bp: 2277211
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 59%
IMG OID: 640869451
Product: radical SAM domain-containing protein
Protein accession: YP_001405358
Protein GI: 154151740
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR03550] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofG subunit
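
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2276222
end_bp = 2277211
gene_length_bp = 990
protein_length_aa = 329

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("multiple of 3:", span_bp % 3 == 0)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the span matches the listed Gene Length and the derived residue count matches the listed Protein Length.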


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.524494
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCCCC GGGTGATCAC GTACACAAAG AACGTCTTTT TGCCGCTCAC CAGCGTCTGC 
CGGAACCGGT GCGGGTACTG CTCGTTCCGC ACCCCGGTTC AGGAAGGATG TGTCATGCTG
CCTGAAGAGG TGGAAGCGGT TCTTGCGCAG GGGCAGGCGG CCGGGTGCAC CGAGGCGCTC
TTTACCTTCG GCGAGCATCC CGAAGAAGAG GAAGGTTTTC GCGCATACCT GGAAAAGACG
GGTTACGATA CCATCCTCGA TTACTGCGAG GCAATGTGCC GGCTTGCTCT CCGGTACGGG
ATCCTCCCGC ACACCAACGC CGGTATCCTC ACGTATGACG AGATGAAACG GCTCCGGCCC
ACAAACGCCA GCATGGGCCT GATGCTTGAG ACTACGGCAC GGATCCCGGC GCACCAGGGA
TCGAAAGGAA AGGAACCGGA AGTGCGCCTT GCAATGATGG AAGACGCGGG CCGGCTGAAG
ATCCCGTTCA CCACCGGCCT GCTCCTCGGG ATTGGCGAGA CTGCGGCCGG CCGCGAGGAC
TCACTTATTG CAATCCGGGA CATCCATAAG AAGTACGGGC ATATCCAGGA GATCATCCTC
CAGAATTTCT GCCCCAAGAA CAATACACCC ATGGCTGCGT TCCGGGTGCC GGATACACAG
GAGATCTGCA ACACGATCCT GATGGCTCGC CGGATCCTGC CAGAGGAGAT CTCCATCCAG
GTAGCCCCCA ATCTCATCGA TGCGTCCCGG CTCATTGGTT GCGGGGTCAG TGATCTGGGG
GGGATATCCC CGGTAACCAT CGATTATGTG AATCCTGAAC ATCCCTGGCC GGCGTTCAAC
GACCTCAAAA AGATCGTTGG GGACGCAACA CTTCAGGAGC GCCTCTGCAT CTATCCACGG
TTCATCCGGC CGGGCTGGTA CGACCCTGGC CTGCAACCTC TAATAAACAG GCTCAACCAA
CGTATAAGCA GAGGGAGCAG CCAACCGTGA
 
Protein sequence
MEPRVITYTK NVFLPLTSVC RNRCGYCSFR TPVQEGCVML PEEVEAVLAQ GQAAGCTEAL 
FTFGEHPEEE EGFRAYLEKT GYDTILDYCE AMCRLALRYG ILPHTNAGIL TYDEMKRLRP
TNASMGLMLE TTARIPAHQG SKGKEPEVRL AMMEDAGRLK IPFTTGLLLG IGETAAGRED
SLIAIRDIHK KYGHIQEIIL QNFCPKNNTP MAAFRVPDTQ EICNTILMAR RILPEEISIQ
VAPNLIDASR LIGCGVSDLG GISPVTIDYV NPEHPWPAFN DLKKIVGDAT LQERLCIYPR
FIRPGWYDPG LQPLINRLNQ RISRGSSQP
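
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal example, not the database's own method. It builds a codon table from the standard amino acid assignments, which translation table 11 shares for sense codons (table 11 differs in which codons may act as starts). The helper name `translate` is hypothetical. Only the first and last 30 bases of the gene sequence are used.

```python
# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last 30 bases of the gene sequence listed above.
first_block = "ATGGAGCCCC GGGTGATCAC GTACACAAAG"
last_block = "CGTATAAGCA GAGGGAGCAG CCAACCGTGA"

print(translate(first_block))  # MEPRVITYTK
print(translate(last_block))   # RISRGSSQP*
```

The first output matches the opening ten residues of the protein sequence, and the second matches its final nine residues followed by the stop codon TGA.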