Gene Mpal_2234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2234 
Symbol 
ID7272531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2383415 
End bp2385127 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content55% 
IMG OID643570846 
Productformylmethanofuran dehydrogenase subunit A 
Protein accessionYP_002467250 
Protein GI219852818 
COG category[C] Energy production and conversion 
COG ID[COG1229] Formylmethanofuran dehydrogenase subunit A 
TIGRFAM ID[TIGR03121] formylmethanofuran dehydrogenase subunit A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.459579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0518322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAT ACCTGATCAA GAACGGCTTT GTCTTTGACG CAGTGAGCGG TGTCAAAGGA 
GACAAGGCTG ACATCGCAAT AAAGGACGGC AGGATCGTCG AGACGTCCGA GCTCTCATCG
AAGGCGGAGC AGATCGACGC AAACGGCAAG ACGGTCATGG CCGGTGCGGT TGAGATTCAT
GCCCACGTGG CTGGACCAAA GGTGAACGAA GGAAGAAACT ACCGTCCAGA AGATAAACTC
TTTGGCTGTA CTCCCAAGAC TGCGACCTCT CGGATGGGTG GTGGTTTCTC GATTCCAACC
ACGTTTAAGA CTGGCTATAC CTATGCAAAG ATGGGCTATA CGACGGTGAT GGAGGCTGCG
ATGCCCCCGT TGTACGCACG TCATGTGCAC GAGGAGATAT GCGACACGCC GATCATTGAC
CAGGGTGCCT TTCCGGTCTT TGGAAACAAC TGGTTCATGC TCGAGTACCT CAAGAACAAT
GAGATCGAGA ATGCGGCTGC GTACATTGCC TGGCTGCTCC GTGCAGCCAA GGGGTACGCG
GTCAAGGTTG TGAACCCGGG TGGCACTGAA GCATGGGCAT GGGGACTGAA CTGTCTCTCT
GTCAATGATC CGGTTCCATA CTTTGACATT ACCCCAGCGC AGATCGTGAA GGGGCTGCTC
GAAGCGAACG AGTACCTCGG TCTCCCGCAC TCGATCCATG TCCACTCCAA CAACCTCGGG
AACCCCGGAA ACTATACGAC CACGCTCGAC ACGTTGAAGA TCGCTGAGGG GTATAAGACC
CACAACAAGT TCGGCCGTGA ACAGGTGATG CACCACACGC ATCTCCAGTT CCACTCGTAT
GGTGGCGATA GCTGGCTGAA TTTCGAGTCC AAGGCCAAAG AGATGATGGA CTACGTGAAC
GCTCAGAAGA ACCTGACGAT CGACCTCGGG TGTGTGACCC TCGATGAGAC AACGACGATG
ACCGCCGACG GACCGTTCGA GCACCACCTG ACCGAATTGA ACCATCTGAA GTGGGCGAAC
GTCGACGTCG AACTTGAAAC TGCAGCAGGG ATTGTACCGT ACATTTACAG TCCGAATGTC
AAGGTCTGCG GTATCCAGTG GGCCATCGGA CTTGAACTGG CACTCTTCGC CAAGGACCCG
ATGCGGACCT TTATCACCAC TGACCACCCG AATGCAGGGC CGTTCACCCG GTATCCCCGC
ATATTTAAGT GGTTGATGAG CCAGGAAGCC CGGCAGGAGC GGCTTGATAC GTTCAAGTGG
AGCCAGAAGG TAATCGACGC CACAAACCTC GCAGAGATCG ACCGTGAGAT CACGCTCTAT
GAACTGTCGC AGATGACCCG GGCAGGGCCG GCGAAAGCAC TCGGTCTGAC CGAGATGTGC
GGCGGGTTAA AGCCAGGGAT GGACGCGGAC GTTGTCGTCT ACAACTTCAA CCCTGAAGCA
CCCTTCACGC CGGACCAGAT CGAGACCGCG TTCACCTGTG CGGATGATGT CTTCAAGTGC
GGTGTCCATG TGGTCAAGAA TGGCGAGGTC ATCTCTAATG GCAACAAACG GACTCTCTGG
GTCAACGCCA AAGTCAGGGA CAACCCGCAG GTCATGCATG ATGTTGAGGA GAAATTCCTG
AAATACTACA GTGTCAACCA GAACAACTAC GAAGTCAGCG GGCACCACTA TCTCCCGAAC
CCGTATGTGC TCGAGGTCGA TGCGACAGAG TGA
 
Protein sequence
MTEYLIKNGF VFDAVSGVKG DKADIAIKDG RIVETSELSS KAEQIDANGK TVMAGAVEIH 
AHVAGPKVNE GRNYRPEDKL FGCTPKTATS RMGGGFSIPT TFKTGYTYAK MGYTTVMEAA
MPPLYARHVH EEICDTPIID QGAFPVFGNN WFMLEYLKNN EIENAAAYIA WLLRAAKGYA
VKVVNPGGTE AWAWGLNCLS VNDPVPYFDI TPAQIVKGLL EANEYLGLPH SIHVHSNNLG
NPGNYTTTLD TLKIAEGYKT HNKFGREQVM HHTHLQFHSY GGDSWLNFES KAKEMMDYVN
AQKNLTIDLG CVTLDETTTM TADGPFEHHL TELNHLKWAN VDVELETAAG IVPYIYSPNV
KVCGIQWAIG LELALFAKDP MRTFITTDHP NAGPFTRYPR IFKWLMSQEA RQERLDTFKW
SQKVIDATNL AEIDREITLY ELSQMTRAGP AKALGLTEMC GGLKPGMDAD VVVYNFNPEA
PFTPDQIETA FTCADDVFKC GVHVVKNGEV ISNGNKRTLW VNAKVRDNPQ VMHDVEEKFL
KYYSVNQNNY EVSGHHYLPN PYVLEVDATE