Gene Mboo_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0038 
Symbol 
ID5411338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp34139 
End bp35227 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content61% 
IMG OID640867252 
Productradical SAM domain-containing protein 
Protein accessionYP_001403205 
Protein GI154149587 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily
[TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACCTG CACTGATCCC CCTGTTAAAC GATTGTCTGG CCGGCCACCG CCTTACCCCG 
GAAGAGGGCG AGATCCTCAT GAAGGCCACC GGCCGGGACA TCTTCCGGAT CACCGCGGCC
GCTGACGAGC TGCGCGAGAA GAAAGTCGGC GATATCGTCA CGTATGTGCG CAACCAGAAC
CTGCATGTGA CCAACATCTG CAAGAACCTC TGCGGATTCT GCGGATTTGG CAAGAAGGCA
ACTGACCCCG GTGCCTACTG CCTTGACCGG GATACAATCC AGGCAGGTGT CCGGCTGGCA
GAGAAACGAA AGGTCACCGA GATCTGTTTC CTCTCCGGGG TCCACCCGGG GTTTGGCCTG
GAAAACTACA CGGACCTTAT CGCTGCAGTG CACGAGATCG CCCCGGAGAT CCATATCCAT
GCCTTCAGCC CGGACGAGGT GGCCCACGCG GCAAAGCGGG GCAAACTAAC GACAGCCGAG
GTGCTTGCCG CGCTCAGGGA CGCAGGTCTT GGCTCCCTGC AGGGAACGGC TGCGGAGATC
CTCATCGAGC CGGTCCGAAA AGTCATCTGC CCGCGGAAGG TCTCCGGGCA GGAATGGGCA
CGGATTATCA AGGAGGCTCA TAAACTGGGC ATCCGCTCCT CTGCCACCAT TATGTACGGC
TCGTACGAAT CGGCCCGGGA CCAGGTGGAC CACCTGGCGA TCATCCGGGA GATCCAGGAC
GAAACCCACG GGTTTACCGA GTTTATCCCA ATGTCCTACA TCCACCCCAA CACCCCGCTC
TTTACCGAAG GGATCGCCCG GGCCGGGGCA ACGGGCAGGG AAGACCTCCT CATGATCGCG
GTCTCGCGAC TCTTTCTGGA CAACTTCGAC AATGTCCAGG TCTCGTGGGG CAAGCTCGGC
CTCAAGATGA CGCAGCTTGC GCTCCTCTGC GGTGGAAACG ACCTTGCGGG TACGATGTTC
ACGGACGAGG TCTCCGTGGA TGCGGGAGCC GGGGATGCCA GTTACCTTGC CCCTGAAACC
ATGGAGCGGA TGACCTCCGA TCTTGGCCGG ACCCTCAGGC AGCGGACAAC GCTCTACGAA
CTCGTGTAA
 
Protein sequence
MAPALIPLLN DCLAGHRLTP EEGEILMKAT GRDIFRITAA ADELREKKVG DIVTYVRNQN 
LHVTNICKNL CGFCGFGKKA TDPGAYCLDR DTIQAGVRLA EKRKVTEICF LSGVHPGFGL
ENYTDLIAAV HEIAPEIHIH AFSPDEVAHA AKRGKLTTAE VLAALRDAGL GSLQGTAAEI
LIEPVRKVIC PRKVSGQEWA RIIKEAHKLG IRSSATIMYG SYESARDQVD HLAIIREIQD
ETHGFTEFIP MSYIHPNTPL FTEGIARAGA TGREDLLMIA VSRLFLDNFD NVQVSWGKLG
LKMTQLALLC GGNDLAGTMF TDEVSVDAGA GDASYLAPET MERMTSDLGR TLRQRTTLYE
LV