Gene Mpal_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2033 
Symbol 
ID7272014 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2154447 
End bp2157548 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table11 
GC content60% 
IMG OID643570645 
Producthypothetical protein 
Protein accessionYP_002467055 
Protein GI219852623 
COG category[S] Function unknown 
COG ID[COG4983] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.923118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.210395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCG GCTCTGAGAT CCGGCGGGGT CTTGAGATCC TGCTCGCCCC CGGCCAGGTC 
TTCGAGGTCC GTTCTTGGAC CGGCGACCGG ATATCCTCCG GCTACTTCGA CGACCTGACC
GCCGCCGAAA AGGCGATCGA GGCACTGGAC GCCGCCGGCC CTGATGGGAT CTACCTGACC
CCGAATCCGG TCCTGCCCGA CCTGCTGAGC AGGCGAGCGA ATCGGATTAA GGGGCCGCTC
GCAAAGAAGG ACAGCAGCAC CAGCGACGCC GACATCATCG GCCGGCGCTG GCTGCTCATC
GACATCGACC CGGCCCGGCC ATCTGGGGTC TCTTCATCCG AGGAAGAGCA CGCCGCCGCC
CTGACACGAG CCGCAGACAT CGCCGCAGCC CTGGGTGAAC GGGGATGGCC GGCCCCGGTG
GCGGGCGACT CCGGCAATGG TGCTCACCTC CTCTATCGGC TGGACCTCCC GAACGACGAT
CAGGTAACAG CACTGGTTAA GGCCGCCCTG GTGGCCCTCG ATGCGCTCTT CTCCGACCCG
TTCGCCTCAG TGGACACGGC CGTCTTCAAT GCCTCCCGTA TCTGGAAGGT ATACGGGACC
GTGGCCCGAA AGGGTGACAA TACCACGAGC AGACCGCATC GACGGAGCCG GCTGATCTCG
GTGCCGAAGA CGATCGAGAT CGTGACCCGC GAGCAACTCG CCGCCCTGGC GATCACCGAC
CCCGGAGGGG ACACCGTGGC GCCGGCTCCT TCAACGAAGG GCACCGGCCG CAAGGCTGGC
GAGAAGATCG ACCTGGCCGG CTGGCTCCGG GACCACGGCC TCGGCTTCTC CGATCGCCGG
CCCTACCAGG GCGGGGACCT GTACCGGCTC GACACCTGTC CGTTCTCTTC GGCTCACACC
GATGGTGCGT TCGCGATCCA GTTCGCGAGC GGGGCCGTCT ATGCCGGCTG CCATCATGCC
TCCTGTGGCG GCGGGTCGCA GCGGTGGCCT GAACTGCGGG AGATGTTCGA GGGACCTAAG
CGGATCCCGG CCGGCCTGGC GAACCATGAC GAGAAGGAAG TGGAGTGGCG GAAGAAGAAA
GCCGAAGCAA GGGAAGCGGC GGCCGGCCGG GATGACTCCG AACCGGTTGA AGCCGACGCA
GCTGATGAAG TGATCCTGGC AGAGGCCCGG CAGATCCTTG AGCACGGCGA CCCGCTCGCC
TACATGGTCG ACGCGTTCAA CCTGGAGCAC GTCGGCGACA GGGATTTAGC GAAATGTCTG
GTATTGTCTC TCGCATCCAG GTTAATCGCA AACTCAAAGG GTCTGCATGT GATGACATCA
GGGGACAGTG GAAAGGGGAA GAGCTGCGGA TACCAAACAA TGTTGGAACA AGTGCCGGCC
GCGTTCAAGC TCGAAGGATC GTTTTCTGAT AAATCTCTCT ACTACAGCAA CGATCTCCGC
CCACGGACCG TGTTTTTCAT CGACGACAAA GATCTTTCAG AATCCATGCA AGAGGTATTG
AAGGAATCGA CTTCCTCGTT TCAAAAACCG ATCACTCACC GCACGCTCTC CATCGATCGA
AAACTCCAGG TTTGCACAAT CCCACAGAAT TGTGTCTGGT GGATCGCGAA GGTGGAAGGG
ACAGGAGACG ACCAGGTTCT CAACCGGGTA TTGATGGTCC ACGTCAACAA CTCAGCCGAA
CAGGACCAGG CCGTACAGCA GGCCGCCATA GAACGAGAAT CACGGGACGA CCCACCAGGC
GAACCACACC AGGTCAAGGC ATGCCGGGCC ATGTGGATCC ACCTGGACGC CGCTGCTCAA
AACCCAGTGG TCGTATCTCT CTCCAGGTTC GCCAGACGGA TTGTCTTTTC ATCTGCCCGG
GATCGGCGGA ATACCGAAAT TTTCTTCGAT CTGATCCGGT CCGTAACCCT GATCCGATTC
TTTCAACGGG ATCGGGAAGA ACTGCCAGAC GGAACGATCA GAGTATACGC CACCCCGGAC
GACTTCAGAA CCGCGAACGA CGTTTTCAAC GCAATTCAAA AAGACTCAGG ATCGCAGGGT
GCGAAACTCA ACCCACGCGA GGCGGGATTG TTAGACGTGA TCGACCGGAC CGGTCGAGAT
GAATTTACCG TACAGGAACT CGCAAGACTC ACAAACCTTC CAGACTACAC AATCAGGCGA
GCGTTTACAG GAATCCCTGG AAGGACAAGC GCGGGATTGT TGGAAAAATG CCCGGCCTGC
GGGATCCTCG ATCGTTCTGT CAGCTCGCCG GACGACACAC AGGGATCAGC CCGACGCAAT
CAAACCGTGT TCACGTTCGA CCGTGGGATC TATGCCATAT GGGATCGTGG GTGTCGTGTT
CACTTGAAAC CGGCCGATGG GGAAGATGAC GTTACACCGT TACACAGACA TTACACAAAC
GTTACACAAC CCTGTGTAAA GGTAACGCCG GCACCAGAGG CGGCGGATAC ACAAAACAAC
TCTCTCCTCT CTAGTGTACA TAAGAAAGAA GAGAACGTTA CACAAAAACA GGAGAGTACC
CACCCCAGTA ACAATAATCG AGAGATCACA CCTGGAGGTG TATGTGCTCC ACCAGAATGT
GTAAAGGTGT CAGGTCAAAA TGTTGAAAGC ACACCTGTAC CCCCAAATAA CAAAACGAGT
GACGGAAACG CCGAATCAAA CCGTACACAA ACAGGTGTAC GGTTTGTGTC AAGCAGTGTA
AATGTGCAAA ACGCCCCACC CGCCCTTGAT CTCGCCCAGG TCCGCGCCGG CGATTACTCC
CCGATCGCCG ATGGTGCGAT CGCCGAGACC TGCCCGATCT GCTCCGGCCG GCTGGTGCAC
TATCAGGAAA AATTCACGGC GATGAAGAGC CGGGGCGGGG ACCACCCGCG GCGGCTCTGC
CGGTCCTGTT ACACCCAGGC GAAGGAGCGA GAGCAGACCG CGGTCCAGGC CCTCCCCGGC
GCCATCCCGA TCGACGAGGT GAAGCCGATC GCCTCCGGCC TGCTCGGTCG GTGCACGGTC
TGCGGGCTGC AGGTCGCGAC CTACTCCCAT GTCGACAGCG GGACCGCGCT CTGCTCGGTC
TGTTATGAGA AACTGGTGAG GGAGCAGGTG GAGATCCGGT AG
 
Protein sequence
MDAGSEIRRG LEILLAPGQV FEVRSWTGDR ISSGYFDDLT AAEKAIEALD AAGPDGIYLT 
PNPVLPDLLS RRANRIKGPL AKKDSSTSDA DIIGRRWLLI DIDPARPSGV SSSEEEHAAA
LTRAADIAAA LGERGWPAPV AGDSGNGAHL LYRLDLPNDD QVTALVKAAL VALDALFSDP
FASVDTAVFN ASRIWKVYGT VARKGDNTTS RPHRRSRLIS VPKTIEIVTR EQLAALAITD
PGGDTVAPAP STKGTGRKAG EKIDLAGWLR DHGLGFSDRR PYQGGDLYRL DTCPFSSAHT
DGAFAIQFAS GAVYAGCHHA SCGGGSQRWP ELREMFEGPK RIPAGLANHD EKEVEWRKKK
AEAREAAAGR DDSEPVEADA ADEVILAEAR QILEHGDPLA YMVDAFNLEH VGDRDLAKCL
VLSLASRLIA NSKGLHVMTS GDSGKGKSCG YQTMLEQVPA AFKLEGSFSD KSLYYSNDLR
PRTVFFIDDK DLSESMQEVL KESTSSFQKP ITHRTLSIDR KLQVCTIPQN CVWWIAKVEG
TGDDQVLNRV LMVHVNNSAE QDQAVQQAAI ERESRDDPPG EPHQVKACRA MWIHLDAAAQ
NPVVVSLSRF ARRIVFSSAR DRRNTEIFFD LIRSVTLIRF FQRDREELPD GTIRVYATPD
DFRTANDVFN AIQKDSGSQG AKLNPREAGL LDVIDRTGRD EFTVQELARL TNLPDYTIRR
AFTGIPGRTS AGLLEKCPAC GILDRSVSSP DDTQGSARRN QTVFTFDRGI YAIWDRGCRV
HLKPADGEDD VTPLHRHYTN VTQPCVKVTP APEAADTQNN SLLSSVHKKE ENVTQKQEST
HPSNNNREIT PGGVCAPPEC VKVSGQNVES TPVPPNNKTS DGNAESNRTQ TGVRFVSSSV
NVQNAPPALD LAQVRAGDYS PIADGAIAET CPICSGRLVH YQEKFTAMKS RGGDHPRRLC
RSCYTQAKER EQTAVQALPG AIPIDEVKPI ASGLLGRCTV CGLQVATYSH VDSGTALCSV
CYEKLVREQV EIR