Gene Mboo_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0040 
Symbol 
ID5411340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp35710 
End bp36798 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content59% 
IMG OID640867254 
Productradical SAM domain-containing protein 
Protein accessionYP_001403207 
Protein GI154149589 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily
[TIGR03551] 7,8-didemethyl-8-hydroxy-5-deazariboflavin synthase, CofH subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGGG ATCGTGCCAC ACTCCTCTCC GGTGCAGTGG AAGGGGACCG GATTAGCGAA 
AAGGACGCAC TGCGCCTCTT TTCAACACGC GACCGCGACG TCTGGAAGAT CGCGGCTGCC
GCGGATGAGA AACGGGAGCA GGTTGTCGGT GATGCAGTTA CCTATGTCAG GAACCAGAAT
ATCAACGTGA CCAACCTATG CGTCAACGCT TGCGGGTTCT GCGGTTTTGG AAAAAAGCCC
GGCGATGAGG GCATCTATTT CCATGATGAG GCGGTTATCC GGGAGAAGGC TGCGCTTGCA
AAATCGCGGA ATGTTACCGA GATCTGTACG GTAAGCGGCC TGCACCCGGC CTTTACCGCA
CAATCCTATA TCGATGTGTA CCGCTGGATC GGGGAGGCCG CCCCGGGGGT ACACCTGCAC
GCGAGTAACC CGATGGAGGT GGCATATGCG GCGCGGAAAA GCGGCATGAG CACAAAAGAG
GTGCTCGCGG CGATGAAGGA CGCCGGCCTT GCCTCCATGT GCGGGACAGC GTCCGAGATC
CTCGTTGATT CCGTCCGGGA GAAAATCTGC GCTCAGAAGA TCCCGACCGC GGAATGGGTA
CGTATTATCA GAGAGGCGCA CCAGCTCGCG ATTCCCACCA CTGCCACTAT TATGTACGGC
CATTGCGAGA CCGATGCAGA CCGGGTGCGC CACCTTGCCA TACTCCGGGA GATCCAGGAC
GAAACCAAAG GATTTACGGA ATTTGTCCCG CTCTCGTTTA TCCACATGAA CACACCCATC
TTCCGGAACG GGACCGCCCG GGCAGGGGCC ACCGGCCGGG AAGATCTCCT GATGGTTGCA
GTTGCCCGGC TCTTCCTTGA CAATTTCGCA AACATCCAGG TCTCGTGGGT CAAGGAAGGG
ATCAAGATGG CGCAGCTTGG CCTCCTTGCC GGGGCAAACG ATCTCGGGGG CACGATGTTT
GAAGAGAGCA TCTCCAAGGG TGCCGGCGCT ACAAATACCG ATTACCTGGA CCCGGCCGAG
ATGCAGCGGG TAGCAGAAGA CCTGGGCCGG ACGCTTTGCC GGCGCACAAC CCTCTATAAA
CCGGTCTGA
 
Protein sequence
MDRDRATLLS GAVEGDRISE KDALRLFSTR DRDVWKIAAA ADEKREQVVG DAVTYVRNQN 
INVTNLCVNA CGFCGFGKKP GDEGIYFHDE AVIREKAALA KSRNVTEICT VSGLHPAFTA
QSYIDVYRWI GEAAPGVHLH ASNPMEVAYA ARKSGMSTKE VLAAMKDAGL ASMCGTASEI
LVDSVREKIC AQKIPTAEWV RIIREAHQLA IPTTATIMYG HCETDADRVR HLAILREIQD
ETKGFTEFVP LSFIHMNTPI FRNGTARAGA TGREDLLMVA VARLFLDNFA NIQVSWVKEG
IKMAQLGLLA GANDLGGTMF EESISKGAGA TNTDYLDPAE MQRVAEDLGR TLCRRTTLYK
PV