Gene Mbur_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_1994 
Symbol 
ID3996946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp2095822 
End bp2097012 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content49% 
IMG OID637959735 
Productbifunctional formaldehyde-activating enzyme/3-hexulose-6-phosphate synthase 
Protein accessionYP_566623 
Protein GI91773931 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0269] 3-hexulose-6-phosphate synthase and related proteins
[COG1795] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03126] formaldehyde-activating enzyme 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGTTGA TTGGAGAAGC ACTTATTGGC GAGGCACCAG AGCTCGCACA CGTTGATCTT 
ATGATCGGGG ACAAGGAGGG GCCGGTAGGG CAGGCATTCG CAACAGGAAT GACCCAGCTT
TCAGCAGGCC ACACTCCCGT TCTTTCTGTC ATCCGCCCGA ACTTACCTAC AAAGCCATCC
ACACTTATCG TTCCAAAAGT GACCGTTAAG GGAATGGATC AGGCTTCACA GATATTCGGT
CCTGCACAGG CCGCTGTTTC AAAGGCAGTT GCTGATGCAG TGGAAGAAGG ACTGATCCCT
AAGGAAAAAG CAGAAGACCT TGTCATCATT GCAAGCGTTT TCATTCACCC GCAGGCAGTG
GACTATAACC GTATCTACAG GTACAATTAC GGAGCTACTA AATTGGCACT TAAACGCGCA
CTTGATGGTT TCCCTGACAT TGATACAGTT CTTCATGAGA AGGACCGGGC TGCACACGCT
GTCATGGGAT TCAAGATATC CAAACTTTGG GATGCTCCAT ACTTGCAGGT CGCACTTGAC
AATCCAAACC TTCCTGTTAT CCTTAATATC ATCAAGCAGC TCCCTAAGAG CGACCACTTG
ATACTGGAAG CAGGTACACC CCTTATCAAA CGCTATGGTG TGGATGTCAT TTCCAAAATA
CGTGAGGTCA GACCGGACGC GTTCATCGTT GCAGATCTTA AGACCCTCGA CACAGGTAAC
CTTGAGGCAC GTATGGTGGC GGATGCAACC GCTGATGCTA TTGTAGTATC CGCTCTTGCA
CCTATCGCAA CACTCAACAA GGTAATTGAA GAGGCACACA AGACCGGTAT CTATGCTGTC
ATGGATACAT TGAACACTCC TGATCCAGTA GCTGTCCTTG AACAATTGGA CGTACTTCCT
GATGTAGTTG AACTACACCG TGCAATTGAC ATCGAGGGCA CTGCTCACGC ATGGGGCAGT
ATCGAAGGTA TCAAGGCACT TGCAGTGAAG CGTTCTTCCA AGGTCCTTGT AGCAGTCGCT
GGTGGTGTAC GTGTTGACAC TATCTCTGAT GCACTTGGAG CAGGTGCTGA TATCCTTGTC
GTTGGCAGGG CTATCACCAA TTCAAAGGAT GTCAGGCAGG CAGCTGACCG GTTCATTGAA
GGCTTGAACA AGCCTGAGAT CGACCAGTTC AGAATAATGA CCGATTTTTA A
 
Protein sequence
MMLIGEALIG EAPELAHVDL MIGDKEGPVG QAFATGMTQL SAGHTPVLSV IRPNLPTKPS 
TLIVPKVTVK GMDQASQIFG PAQAAVSKAV ADAVEEGLIP KEKAEDLVII ASVFIHPQAV
DYNRIYRYNY GATKLALKRA LDGFPDIDTV LHEKDRAAHA VMGFKISKLW DAPYLQVALD
NPNLPVILNI IKQLPKSDHL ILEAGTPLIK RYGVDVISKI REVRPDAFIV ADLKTLDTGN
LEARMVADAT ADAIVVSALA PIATLNKVIE EAHKTGIYAV MDTLNTPDPV AVLEQLDVLP
DVVELHRAID IEGTAHAWGS IEGIKALAVK RSSKVLVAVA GGVRVDTISD ALGAGADILV
VGRAITNSKD VRQAADRFIE GLNKPEIDQF RIMTDF