Gene Mpal_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0049 
Symbol 
ID7272218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp49008 
End bp51887 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content56% 
IMG OID643568707 
ProductHEAT domain containing protein 
Protein accessionYP_002465167 
Protein GI219850735 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATTT TTGATATATT TACCCCGGAT ATCCAGAAGA TGCGCGATGC AGGTGATGTC 
CGGGGGCTGG CGAAGGCGCT TGGGGATTCG AATCTGGAGA TCGCCCGGAA GGCGGCAGCA
GCGCTGGTGG GTCTTCCCTC GTCCAGTGCC GTGATTCCGT TGATCCGGTC ACTTTCATCC
CCTGACAAGG ATATCAGGCG TCTGAGCACG GCTGCACTTG GTGCGACCGG GGACCCACAT
GCGCTCCCTG CCGTTCTGGA AGCAACGGAG GATGGAGATC TCGGCGTAAG GCTGGAGGCA
GTAAAAGCAC TTGCAAAATT CCACGATCCT GAAGCAAATT TGCTTATTAC CCGGTTTACC
GCTGACAGCA ATCTGGACAT CCGGATGGCG GCAGTCTCTG CACTGGGGCA GACCGGGGAT
CCCACATCGA TCGAACCTCT CCTCCATCTG CTCGTGGATC CCCATTACGG CATACGTGAG
GTCTCTGCAT ATGCACTCGA CTCTCTCGGC TGGGTGCCGG CGAACGATCG GGATAAGGCT
TTCTATTTTA TCGCAAAACG GGAATGGAGA GGCCTGTTCA ACCTCCAGTC GGTGGCAGTA
AAAGTTCTTG TCTGGGCATT GAAAGATGAA TATTATGCTG TCAGGCAGGG GGCGGCCTCG
ACCCTGGGAA AACTCAAGGA TCTCCGGGCT GTCAGGGCAC TGGTCTCTGC TCTTTCCGAT
GAAGAGAGCA GTGTTCGGAT GGAAGTGGTC TCCGCTCTGG GTGAGATAGG CGACCTTCGG
ACTGTCCCTA TCCTGGTACG GGTACTTGAC GATGATTATA TCGGTGTGCG CATGACTGCC
GCCTCTGTTC TCGATGGCAT GGGATGGAAA CCGTCAACTG AGAACGACCT GATCCTCTAT
CTCCTCGCCA AGGAACGATG GATGGATATT GCAGTTATCG GGAAACGATC GACGCAGGTT
CTGGCAAAAC GACTCAATGA TCCTAACTAT AGCACCCGTG ATGAAGTCGG AAAGATTCTG
CAGAGACTGG GTGAGCATGC ACAAGAACCG ATGCTTTCTG CATTGAAGAA TCCTGACCCG
GATGTCCGGT CAAGGGCTGT GTGGATCCTC GGGAATATCA GGACCAGGCA GGCTGTTGGC
CCGATCATTC GAATCCTCAG CGATGACAAT CCAGCATGTA GAGAGGAAGC TGTCAGAGCG
CTGGGAAAGA TCGGGGACCC CCGTGCCATC CCGTTCCTCA ACCGTGTCCT CGGGCGTGAA
ATTCTTCTCG CCCCGGTGGC GATCCGGGCG CTGGGGCAGA TCGCTCATCC CGCTTCTGTC
AAGGCATTGA TGCCGTATCT TGGGAGTGCA GACCGGGAGA TCCGGCTGCA TACGATCCTG
GCACTGGGAG AGAACGGTGA TAAAAAGATA TCAGGTGCCA TCGCTCATGC GGTGAAGGAT
TCCGATCCTG AGGTACGGGT TGCTGCTATT ACCGTTATCA GCAAGTTCCC TTCGAACGAG
GTGTACGCCC TGATCCGGGC TGCACTCGAA GATGTTCATC CCGATGTCCG GTATGCAGCC
CTCCTCGCCA TCTCCACGTG GCAGGCCGAC GACACGATCC CACTGATGGT CAGGCGACTG
GAAGATGAAG ATCAGAAGAT CTGCAGGATT GCTGCACAGG CGCTGAATCG ACGGCGCTGG
CAGCCGGCAT CCCTGCCGGA GCGAAGAAAT TACCTGATTG CTATGGGTCA GTGGCGGGAT
CTGGAACGTC TGGACTATGT CAATAAAGAG CTGCAGAAGA AGGAATCGGA TCTCGCATGC
CAGATCGTCC CCCGGGGAAG CATCATGGAT CCTGTACCTC CGCTGTCGGG ATCTCCAGGA
GAGCGGGTTG ATGGAGATCC AACGAGACCG GAGGGTGCAG GAATAACCGG TGAACCCGAT
AGCCTACTGG CACTGATCCG GACGCTGGGG GACCAGAAGG AGATGCAGGC ACTCCGGTGG
AGATCTGCTG AAGGGCTCGG GGGACTGGGC GATAAGCGGG GGGTTGAGCC ACTCATTGTC
GCGTTATTTG ACCCGGATTC TGAGTTACGT TGGCGTGCTG CGTTTTCTCT AGGCCTGCTG
CATGATGAGC GTGCCATCGA ACCACTTGCA ACCGCCCTCC GCAGTGATGA TCAGCTGACT
GTCAGGGTAC GGGTTGCAGA AGCGCTCGGA CAATTTAAAA AACCGGTCGT CATCAGGCCG
CTCATTCACT CACTTGGTGA CGTTCATCCG GATGTCAGGG ATATGGCAAT CCGTTCGCTT
GGAGAGATCG GAACTGAGAG TGCGATTAAC GCAATCCTGA CCGGGCTCCT CGATGCGGAT
GAGACTGTCC GCGAGAGCGT CATCGATACC CTCTTAAAAC TCGGTGCCAT GGCAGGCAGG
TCGTTGGTGA AGAATCTCAA AAATAGAAAC CCAGAGGTCA AAAAAGGAGT TTTGACCGTT
TTTATGCGGA TGAAACCAGC GATCAGTTTC CGCATCCTGG TCTCTGAACT GGAGAATGGT
GACTGGGAAG TCCGCCAGAT GGTTGCCGCT GCACTGGATT CTCTTGACTG GCAACCAGGG
GATCCCTTTC AGAGGGCGAT ATATCTGTTT GCACAGCGCG ACTGGAGAGC CCTTGAAGCG
CAGGGGAAGA CCGCTGAGGG AATTCTGATC CGGGGGACTG CCGACAGCGA TCCCGCGATT
CGGAGGGCTT CAGTGGAACT TCTGGGTTTG ATTGGGGACC GTCGTACTAT TCCTTCTCTG
ACCGAGGTCA TGTATGACGA GAACCGGGAA GTCCGTCTCT CCTCGATAAA AACTCTGCTG
AAGAGGCAGG GCGGGGAGTC CTCCAGGCTT ATCTCCACTC TGAAACGAAA CGTGAAATAA
 
Protein sequence
MGIFDIFTPD IQKMRDAGDV RGLAKALGDS NLEIARKAAA ALVGLPSSSA VIPLIRSLSS 
PDKDIRRLST AALGATGDPH ALPAVLEATE DGDLGVRLEA VKALAKFHDP EANLLITRFT
ADSNLDIRMA AVSALGQTGD PTSIEPLLHL LVDPHYGIRE VSAYALDSLG WVPANDRDKA
FYFIAKREWR GLFNLQSVAV KVLVWALKDE YYAVRQGAAS TLGKLKDLRA VRALVSALSD
EESSVRMEVV SALGEIGDLR TVPILVRVLD DDYIGVRMTA ASVLDGMGWK PSTENDLILY
LLAKERWMDI AVIGKRSTQV LAKRLNDPNY STRDEVGKIL QRLGEHAQEP MLSALKNPDP
DVRSRAVWIL GNIRTRQAVG PIIRILSDDN PACREEAVRA LGKIGDPRAI PFLNRVLGRE
ILLAPVAIRA LGQIAHPASV KALMPYLGSA DREIRLHTIL ALGENGDKKI SGAIAHAVKD
SDPEVRVAAI TVISKFPSNE VYALIRAALE DVHPDVRYAA LLAISTWQAD DTIPLMVRRL
EDEDQKICRI AAQALNRRRW QPASLPERRN YLIAMGQWRD LERLDYVNKE LQKKESDLAC
QIVPRGSIMD PVPPLSGSPG ERVDGDPTRP EGAGITGEPD SLLALIRTLG DQKEMQALRW
RSAEGLGGLG DKRGVEPLIV ALFDPDSELR WRAAFSLGLL HDERAIEPLA TALRSDDQLT
VRVRVAEALG QFKKPVVIRP LIHSLGDVHP DVRDMAIRSL GEIGTESAIN AILTGLLDAD
ETVRESVIDT LLKLGAMAGR SLVKNLKNRN PEVKKGVLTV FMRMKPAISF RILVSELENG
DWEVRQMVAA ALDSLDWQPG DPFQRAIYLF AQRDWRALEA QGKTAEGILI RGTADSDPAI
RRASVELLGL IGDRRTIPSL TEVMYDENRE VRLSSIKTLL KRQGGESSRL ISTLKRNVK