Gene Mpal_1036 details

Gene Information

Locus tag: Mpal_1036
Symbol:
ID: 7271770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1063181
End bp: 1064362
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 59%
IMG OID: 643569673
Product: bifunctional formaldehyde-activating enzyme/3-hexulose-6-phosphate synthase
Protein accession: YP_002466107
Protein GI: 219851675
COG category: [S] Function unknown
              [G] Carbohydrate transport and metabolism
COG ID: [COG0269] 3-hexulose-6-phosphate synthase and related proteins
        [COG1795] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03126] formaldehyde-activating enzyme
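
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1063181
END_BP = 1064362
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1182
print(coding_length_aa(GENE_LENGTH_BP))  # 393
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.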


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCAGA TAGGTGAAGC ACTCGTCGGC GATGGCGCAG AACTCGCGCA CATCGACCTT 
ATCATTGGAG AGAAGTCAGG ACCAGTAGGG ATCGCGTTTG CGAACGGACT GTCCCAGCTA
TCGGCAGGGC ACACCCCGCT GCTGGCTGTG ATCCGGCCGA ACCTGCTGAC AAAGCCAGCC
ACGCTGATCA TTCCGAAGGT CACTCTGAAG AACCAGACGC AGGTAACCGA GATGTTCGGC
CCGGTGCAGG CCGCGATCGC CAAGGGGATT GCAGACTGTA TCGAGGAGGG GACATTCAAG
GAGTACGACA TCGAGGACCT CGTGATCCTG GCATCGGTGT ACCTCGCCCC TGAAGCGAAG
GACTATAACA AGATCTACCG CTACAACTAT GGGGCCATTA AACTGGCGCT GAACCGCGCA
CTCGAAGGGT TTCCCCCCGA GAAGACGATC CTCTACGAGA AGGATCGCGG GGCACACGCT
GTCATGGGAT TCAAGGTCCA GAGGCTCTGG GATGCCCCAT ATCTGCAGGT CGCTATGGAC
CTTGTGGACA TGGGCAAGGT CGCGAAGGTA CTCAAGGAAG TGCCCGACAA CGACCACGTG
ATCATCGAGG CCGGCACCCC GCTGATCAAG CGGTTCGGTC TGAGTGTTAT CAGTGAGATC
CGGAAGCTCC GGCCGAACGC GTTCATCATC GCGGATATGA AGATCCTCGA CACTGGGAAC
CTCGAGTCAA GGATGGCCGC CGACGCCTCT GCCGATGCTG TCGTGATCTC CGGTCTCGCC
CCGGCCTCGA CGATCGAGAA GGCGATCGAG GAGACCAAGA AGACGGGTAT CTACTCGATC
ATCGATATGC TGAACGTGCC TAACCCGGTC GAGCTGATTG CATCGCTGAA GATCAAGCCC
GACATCGTCG AACTGCACCG TGCCATCGAC TGCGAGACCT CCTGCCATGC CTGGGGCGAC
ATCGTGGCCA TCAAGAAGGC CGCAGGCGGC AAACTGCTGG TCGCGACGGC TGGTGGGGTC
CGTGTCGAGG TCGTCAAGGA GGCACTCGCA TCAGGAGCCG ATATCCTGGT CGTCGGCAGA
GCGATCACTG CGAGCAAGGA CATTCGCCAT GCTACCGAGG AGTTCCTTGA GCAGCTGCAC
AAGGATGAGA TCGACCAGTT CAGGGTCATG ACCGATTTCT GA
 
Protein sequence
MYQIGEALVG DGAELAHIDL IIGEKSGPVG IAFANGLSQL SAGHTPLLAV IRPNLLTKPA 
TLIIPKVTLK NQTQVTEMFG PVQAAIAKGI ADCIEEGTFK EYDIEDLVIL ASVYLAPEAK
DYNKIYRYNY GAIKLALNRA LEGFPPEKTI LYEKDRGAHA VMGFKVQRLW DAPYLQVAMD
LVDMGKVAKV LKEVPDNDHV IIEAGTPLIK RFGLSVISEI RKLRPNAFII ADMKILDTGN
LESRMAADAS ADAVVISGLA PASTIEKAIE ETKKTGIYSI IDMLNVPNPV ELIASLKIKP
DIVELHRAID CETSCHAWGD IVAIKKAAGG KLLVATAGGV RVEVVKEALA SGADILVVGR
AITASKDIRH ATEEFLEQLH KDEIDQFRVM TDF
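
The space-separated 10-character blocks used in the sequence listings have to be joined before analysis. The sketch below is one way to do that, then compute GC content and translate a coding sequence. It assumes the standard amino acid assignments, which NCBI translation table 11 shares with table 1 (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are illustrative.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGTATCAGA TAGGTGAAGC ACTCGTCGGC"
print(translate(first_bases))  # MYQIGEALVG
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` allows the same comparison against the GC content field and the complete protein listing.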