Gene Mboo_1214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_1214 
Symbol 
ID5410386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp1232890 
End bp1235061 
Gene Length2172 bp 
Protein Length723 aa 
Translation table11 
GC content56% 
IMG OID640868441 
Producthypothetical protein 
Protein accessionYP_001404375 
Protein GI154150757 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.372382 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.0723033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAAG GACGATGGAA CCGTATGCAG CCCGGAGAAT ACCCGGACAC TCATAATTCA 
GGCACCATGC AAACGCGCCG GTCATCCAAT CGTCTGGCCC GTGAGACAAG CCCGTACCTG
CTCCAGCATG CGTCCAACCC GGTGGATTGG TACCCCTGGG GAGGGGAGGC ATTTTCCCGT
GCCAAACGTG AAGACCGGCC ACTCTTCCTT TCTATAGGAT ACTCTGCCTG CCACTGGTGT
CATGTGATGG CACGGGAATC TTTCGAGAAC AACGAAGTTG CCGGAATTCT CAACAAACAT
TTTGTCTGCA TCAAAGTGGA CCGCGAGGAA CGCCCGGATG TCGACAGCGT GTACATGGGG
ATCTGCCAGC AGCTGACCGG GCAGGGGGGC TGGCCGCTTA CCATTATCAT GACACCGGAG
AAAAAACCGT TCTTTGCCGG GACATATTTC CCCAAAACCG GCAGGGCCGG GATGCCGGGG
CTTACGGATA TTCTCATCAC TATCGCCAAT CTTTGGGAAA CAAGACGTGA TGAACTGTAT
GCAGCCGCGG AACAGATCCT TTCTGATGCA CACCTTTTGC ACAAAAGCCC GTCAGGGGAT
CCGGACCGGC ACCTGCTGGA TAAAGGCTTT CGGGAACTTG CTGCGCAGTT CGATTCTGCA
AATGGAGGGT TTGGCCGCGC ACCGAAATTT CCGGCTCCCC ATAACATACT ATTCCTCCTC
CGGTACTGGC AGATGACAGG TGAGAACCGG GCGCTTGATA TGGCAGAGCA GACACTGGAT
GCGATCAGGC AGGGTGGGAT CTGGGACCAT GTCGGAGGCG GCATGCACCG GTATGCAACC
GATGCCCGTT GGCTCGTCCC GCATTTTGAG AAGATGCTCT CTGATCAGGC AATGCTTGTG
CTTGCCAGCA CTGAAGCGTA TGCTGCAACC GGAAAGATCC GGTACCGCAC CATTGCCGAG
GAATGCATTG CCTACGTACT CCGCGAACTA CGGGATCCCG GGGGAGGGTT TTACACTGCC
GAGGATGCTG ACAGTCCGGC AGGAGAAGGG GCATACTACC TGTGGACAGA AGAGGAGATC
GCCCGGATTC TTGGCCTGGA CGCTGCATTC GCATCCATCC TGTTTTCGTT GACGCCGCTT
CCCGGTTCCG AAAAACACGC CAGTATTATT TCTGCTGCCG GGCCGGACCC GGTTCTCCTG
AAAAATCTTG GGATCACAGA GCAGGAACTT ATTTCCCGCC GGGCTGGTAT CTTACGCCGG
CTCGCACACG AGCGGGAGAA GCGTCCTAAA CCGGCCCGTG ACACCAAGAT CCTGACAGAC
ACAAATGCCC TCTTCTGCAC TGCCCTTGCC CGGGCCGGCC GGGTATTGGG AAATCCTTCA
TACACCGATG CCGCAGCCTG CACCCTCCGG TTTCTCCTGC AAAATATGAG AAATGGTGAG
GGCAGGATCC TGCACCACTC CGGTGGAGGA GAACATGCAG TTCCCGGTTT TGCTGATGAT
TATGCGCACC TTGTCGCTGC ACATATTGAA CTTTACAAGG CAACATCCGA CATTGCCTGT
ATCAAAGAAG CCGTTACGAT CAATGCCCTG CTCCTTACGC ACTACCGTGA CAAAGAGGGC
GGGGGATTTT TTACTACTGC GGATACCGCT GTGGATCTGC CGGTGCAAAA AAAAGAATGG
TATGATGGCG CAGTCCCGTC AGCCAACACG ACCGCCTTTG AAAATCTCAC CGCTCTTTAC
CGGCTCACCG GCAATGATGT ATTTAACGAA GCGGCGCTTG AGTGCGCCAG GTTTATCACC
GGTGCTGCTT CCAGGGCACC CCATGCGGTC ACCGGGTTCC TTGCAGCGCT CGCATGTTCC
CCCTTAACTG GAAATACGCA GGATCTTGTG ATTGCCGGTG ATCCAGCAAA TGCCGGCACG
CAGACCCTGC TTGCCGTGGC ACGCAGGCAG TACCTCCCCG GTCTGCTTAT CCTGCTCCGG
CCACCGGGCA AAGCCGGCGA TGAAGTGGAT ACAGTTTTTC CGGTTGTACA GGGCAAAGTT
CCTCATGAGG GAAAGGCAAC TGCATATCTT TGTACCGGTT TGGCGTGTCT GCCCCCGGTA
AGCGATCCGC AGGAACTGGT AAATCAACTC TCCATGCGGG ATAAAAAAAA CCGGCCCCTA
AACAAAGGTT AG
 
Protein sequence
MGKGRWNRMQ PGEYPDTHNS GTMQTRRSSN RLARETSPYL LQHASNPVDW YPWGGEAFSR 
AKREDRPLFL SIGYSACHWC HVMARESFEN NEVAGILNKH FVCIKVDREE RPDVDSVYMG
ICQQLTGQGG WPLTIIMTPE KKPFFAGTYF PKTGRAGMPG LTDILITIAN LWETRRDELY
AAAEQILSDA HLLHKSPSGD PDRHLLDKGF RELAAQFDSA NGGFGRAPKF PAPHNILFLL
RYWQMTGENR ALDMAEQTLD AIRQGGIWDH VGGGMHRYAT DARWLVPHFE KMLSDQAMLV
LASTEAYAAT GKIRYRTIAE ECIAYVLREL RDPGGGFYTA EDADSPAGEG AYYLWTEEEI
ARILGLDAAF ASILFSLTPL PGSEKHASII SAAGPDPVLL KNLGITEQEL ISRRAGILRR
LAHEREKRPK PARDTKILTD TNALFCTALA RAGRVLGNPS YTDAAACTLR FLLQNMRNGE
GRILHHSGGG EHAVPGFADD YAHLVAAHIE LYKATSDIAC IKEAVTINAL LLTHYRDKEG
GGFFTTADTA VDLPVQKKEW YDGAVPSANT TAFENLTALY RLTGNDVFNE AALECARFIT
GAASRAPHAV TGFLAALACS PLTGNTQDLV IAGDPANAGT QTLLAVARRQ YLPGLLILLR
PPGKAGDEVD TVFPVVQGKV PHEGKATAYL CTGLACLPPV SDPQELVNQL SMRDKKNRPL
NKG