Gene Mpal_1989 details


Gene Information

Locus tag: Mpal_1989
Symbol:
ID: 7270795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2113701
End bp: 2115173
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 58%
IMG OID: 643570604
Product: protein of unknown function DUF344
Protein accession: YP_002467015
Protein GI: 219852583
COG category: [S] Function unknown
COG ID: [COG2326] Uncharacterized conserved protein
TIGRFAM ID:
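
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2113701
end_bp = 2115173

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1473
print(protein_length)  # 490
```

Under these assumptions the computed values match the listed Gene Length (1473 bp) and Protein Length (490 aa).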


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.210937
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGACC GATACGATCT CTCTAAGAAG GCGGATCAGA AAGAGTACGA CAAGACAGTC 
CCGGCACTGC AGGTGAGGTT CGGTGAACTG CAGCGGGAAC TGAGAACGGC GGGGATCCCG
TTGATCCTGG TCGTCGAGGG GTGGAATGCT TCGGGGATCT CGGATGCGGT CAGCGAGTTG
ATCCATGCCC TGGACCCACG TGGATTCACC TTTTATGCAA CCGGGAGTCC GAATGACGAG
GAGAAGGCGC ATACCTTTCT CTGGCGGTTC TGGGTGAAGA CCCCGGCGAA AGGGAGGATC
GCGATCTTCG CCCGGAGTTG GTACAGCCGG CTGCTCGCCG AGCGGATGGG CGGGATCAGC
TGGAAAGAGA ATGAGAAGCA GTCCCTTCGG ACAATTCGGG CCTTTGAGCA GCAACAGGCT
GACGACGGGA CGATTGTGCT GAAGTTCTTC CTGCACATCA GCAAGGAGGA GCAGAAGCGA
AGGCTTGAGG AACGTGAAAG GGATCACCTG ACCTCCTGGA TGATCACCCG TGGGGACTGG
GATTTTCACA ACCAGTATGA CCTGTATCTG CCTCTGATCG AGGATGTCAT CAAGGATACC
GATAGCAAGG ACGCCCCCTG GACGATCGTT GAGGCGACGG ATCCCCGGTT TGCAGCCATC
AGGGTCTACA CGGTCCTGAT CAAGACACTC GAGGCGCGGC TCTCTACCGC GAAGAAGGAA
GAGAAGCAGA GCGATCACAA GAAAGACGAC CAGAAACGGT CCGGCTCGAT CCTCTCTCCG
GTCGACCATT CCCTCTCCCT CTCCAAGCCG GAGTATCTTG AGCAGTTGAC GATCGTTCAG
GGGCGGGTCC GCGAACGCCA GTATCAGATC TTCAAACGTG GGATACCGCT GATGATCGTG
TACGAAGGCT GGGATGCCGC CGGTAAGGGG GGAAACATCC TCCGACTGAC GCAGAATCTG
AATCCCCGCG GGTATTCGGT GGTGCCGGTA GCGGTGCCGA ACGATATTGA AAAGGCACAC
CATTACCTCT GGCGGTTTTA CACCCACGCC CCGTCGGCCG GCTCGATCCG GATCTTTGAC
CGTTCCTGGT ACGGCAGGGT GCTGGTCGAA CGAGTCGAGG GGTTCTGCAC TGACGAGGAG
TGGGGGCGGG CGTATAACGA GATCAACCAG ATGGAGGAGG CGTTCCTCGC CAGCGGCGGC
GGGCTTGTCA AGTTCTGGCT CGAGATCGAC AAGGACGAAC AGCTTCGTCG TTTCGAGCAG
CGCCAGAACG ACCCTGCCAA GCAGTGGAAG ATCACCCCCG ATGACTGGCG TAACCGTGAA
AAATGGGACC AGTATACGCT GGCCGTCGAC GAGATGCTGG CTAAGACCAG CACTAAGCAG
GCGCCCTGGA CGATCATCGA GTCCGATGAC AAGTACTATG CACGGATCAA AGCACTCAAT
ACGGTCGTCT CCTATATCGA CACCCTGCTC TGA
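
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs light cleaning before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such text; `clean_sequence` and `gc_percent` are hypothetical helper names, and the record does not say how its own 58% figure was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGTTCGACC GATACGATCT CTCTAAGAAG GCGGATCAGA AAGAGTACGA CAAGACAGTC"
print(len(clean_sequence(first_row)))  # 60
```

Passing the complete gene sequence to `gc_percent` gives a value that can be compared against the GC content listed in the Gene Information section.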
 
Protein sequence
MFDRYDLSKK ADQKEYDKTV PALQVRFGEL QRELRTAGIP LILVVEGWNA SGISDAVSEL 
IHALDPRGFT FYATGSPNDE EKAHTFLWRF WVKTPAKGRI AIFARSWYSR LLAERMGGIS
WKENEKQSLR TIRAFEQQQA DDGTIVLKFF LHISKEEQKR RLEERERDHL TSWMITRGDW
DFHNQYDLYL PLIEDVIKDT DSKDAPWTIV EATDPRFAAI RVYTVLIKTL EARLSTAKKE
EKQSDHKKDD QKRSGSILSP VDHSLSLSKP EYLEQLTIVQ GRVRERQYQI FKRGIPLMIV
YEGWDAAGKG GNILRLTQNL NPRGYSVVPV AVPNDIEKAH HYLWRFYTHA PSAGSIRIFD
RSWYGRVLVE RVEGFCTDEE WGRAYNEINQ MEEAFLASGG GLVKFWLEID KDEQLRRFEQ
RQNDPAKQWK ITPDDWRNRE KWDQYTLAVD EMLAKTSTKQ APWTIIESDD KYYARIKALN
TVVSYIDTLL
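
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do that, not the database's own pipeline. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons), and it assumes the sequence is given in the coding orientation. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence, copied from the record.
print(translate("ATGTTCGACC GATACGATCT CTCTAAGAAG"))      # MFDRYDLSKK
print(translate("ACGGTCGTCT CCTATATCGA CACCCTGCTC TGA"))  # TVVSYIDTLL
```

Both fragments agree with the start and end of the listed protein sequence; the final TGA codon is a stop and contributes no residue.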