Gene Mboo_0993 details

Gene Information

Locus tag: Mboo_0993
Symbol:
ID: 5411670
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 972617
End bp: 973525
Gene Length: 909 bp
Protein Length: 302 aa
Translation table: 11
GC content: 62%
IMG OID: 640868219
Product: formylmethanofuran--tetrahydromethanopterin formyltransferase
Protein accession: YP_001404154
Protein GI: 154150536
COG category: [C] Energy production and conversion
COG ID: [COG2037] Formylmethanofuran:tetrahydromethanopterin formyltransferase
TIGRFAM ID: [TIGR03119] formylmethanofuran--tetrahydromethanopterin N-formyltransferase
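
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the stated 909 bp) and that the final codon is a stop codon; the variable names are illustrative only.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 972617
end_bp = 973525
protein_length_aa = 302

# Assumption: coordinates are 1-based and inclusive, so the span is
# end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is the stop
# codon and does not contribute a residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # 909 0 302
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.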


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.945463
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.149431
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGGAGGTG AGCGGACCAT GGAGATCGGC GGGGCCGTTA TCCTTGACAC CTTTGCCGAG 
GCATTCCCGG TCTGGATTTC AAGGGTCCTT GTGACCGCGG ACACTCCCGG GTGGGCGCTT
GCTGCCGCCG CCGAGGCAAC GGGGTTTGCA ACATCGAAGA TCGCCTGCCC CTGTGAGGCC
GGAATCGAGC GGCCCCTTGC CCGGGGCGAG ACCCCGGACA AGCGGCCCGG ATACTCAATC
CTGATCTGTA CCGAGAAGAA GGAGATGAAG GCGCAGGTTG CCGCACGGGT CAGCCAGTGC
ATCCTTCCGG CCCCGACCGC GTCCGCATTC GACGGGCTTC CCGCTGCAAA GGACCGGTTC
TACACCCGGA TGCATTACTT TGGCGACACC TACGAAGAGC GATGTGTGGT CGGCGGCCGG
CAGTGCTGGA AGATCCCGGT GATGGAAGGC TGGTACACCG GCGAGGAGCG CTTCGGGCTT
ATGAAAGGGA TTGCCGGCGG GAACTTCCTT GTCATGGCAG AGGACCGGGC TGCGGCACTT
TCCGGGGCAG AGGCGGCGAT GGCAAAGGTC GCCGGCACGC CCGGGATAAT CGCAAGTTTT
CCCGGAGGGA TTGTCGGGAG TGGATCGAAA GTCGGGTGTA AGAATTACCG GTTCCCGATG
CCGGCAAGCA CGAACCACCG CTGGTGCCCG GCGCTTAAAA ATAAAATTCC GGACTCGCTT
GTCCCTGACG GCGTGGGCGC GGTATACGAG ATCGTGATCA ACGGCTTTGA TGAGGCCGCA
ATTGCCGGGG CAATGCGTGA GGGAATCCGG GCTGCCGCGG CAACCGGAAA GGTCAGCTGC
ATTGGTGCCT CGAACTTTGA GGGAAAACTC GGGCAGACCC GGATAAACCT TCACGCACTT
TTCTCCTGA
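
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows; the function name `gc_content` is hypothetical, and the code ignores the spaces and line breaks used to group the sequence into blocks of ten. How the database itself rounds the value is not stated in the record.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove the whitespace used to group bases into blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example on the first ten bases of the gene sequence only.
first_block = "ATGGGAGGTG"
print(f"{gc_content(first_block):.0%}")  # 60%
```

Passing the full gene sequence as one string, line breaks included, gives the value for the whole gene.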
 
Protein sequence
MGGERTMEIG GAVILDTFAE AFPVWISRVL VTADTPGWAL AAAAEATGFA TSKIACPCEA 
GIERPLARGE TPDKRPGYSI LICTEKKEMK AQVAARVSQC ILPAPTASAF DGLPAAKDRF
YTRMHYFGDT YEERCVVGGR QCWKIPVMEG WYTGEERFGL MKGIAGGNFL VMAEDRAAAL
SGAEAAMAKV AGTPGIIASF PGGIVGSGSK VGCKNYRFPM PASTNHRWCP ALKNKIPDSL
VPDGVGAVYE IVINGFDEAA IAGAMREGIR AAAATGKVSC IGASNFEGKL GQTRINLHAL
FS
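
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table in the standard NCBI ordering (bases T, C, A, G). Translation table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI ordering: each position cycles through T, C, A, G,
# with the first base varying slowest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the whitespace used to group bases into blocks of ten.
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First thirty and last nine bases of the gene sequence.
print(translate("ATGGGAGGTG AGCGGACCAT GGAGATCGGC"))  # MGGERTMEIG
print(translate("TTCTCCTGA"))  # FS
```

The two fragments translate to the first ten and last two residues of the listed protein sequence.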