Gene Mpal_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0820 
Symbol 
ID7272310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp839679 
End bp841415 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content50% 
IMG OID643569469 
Producthypothetical protein 
Protein accessionYP_002465905 
Protein GI219851473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.123117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.778649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGAC AATTACCCCT TGCGATCACG GTCGTCTCCA CAGTCTTCGT GCTCGTTCTC 
ATCATTCATT TCGCCTTTAC TCCTATCCTG TACTCCCCAT CAGGGGCAGA CAGCACCAGA
TCCCATATCA ACCCTGGTGC CATGCAGAAG TTCTCAGTTG AAAGATATGA GAAGGTCATG
CCCATGATGC AGGAGATTCT CGATCTGAGT GGCTCAGTTG TCCTCAATAT GAACCATAAC
GATACCAATG CAGCAGAACG CGATCTCAGG GAATACCTGG ATCTTTCGCA CTCACTCGAC
CAGGCAGTGG TCGACCTGAA CCTGTCTGAT AGTGAACTTG AGATCTGGCG ACAGGAGAAT
TACGAAAACC GAATTAATCT GACCCTACTC GTCAATGACA CAAAACAGTT AGAGGAGATC
AAAACGCTTG AGGCTCAGGT TCGGAGCCAG AATTCGTCTG GCGATCTGAC AACCCTGAAT
TTTGAAGAAA ATTCGCTCAA TCAGAGGATC AGACAGAATG CTCTAACCTA CAGAGGGTTA
CCAAACTCAT CATTCACCAG CCTTTCAAAA AAATTCGATC TCGATACGAC TGAATTCGAG
AAGAATGCTG GAAATCTGAC GGTGGACAAT CAGGTGGTTG AGCAAAACCC GGGGTCCATC
CCACTCAATG AGAGAGGAGT GCTCACCTTT GGTCTGTCAC AGACATCCGC TTCGTACGGT
GAGATCATCA ATGCCTCAGG AACCCTGCAG AATCTGGATG GAATCGATCA GAATATCGAG
ATTGGCGTCG ATGGAGCACC ATGGAGCATC ATTACCCCAA ACCCGGATCG CACCTACCAG
GTATCACTCG TAATGAGCAA CCTCAGCGCT GGAACTCACA TTGCGTATGC CAGAAGTAGT
ATCTTTCCGG CGAAAGAGAG TGAATTTAAT ATTATTCCAT CTGACACGGT GCTCACCATC
AGAGAGGAAG AGCAGAGAGA CAGTACGAAC ATCACGATAG CCGGGACCCT CAGGACGGTC
TCGGACATAC CGGTTACGGG AGCTCCTGTA CGAATTAGCT GGGACAGCGC CGGCAGAACC
GATGTTCTCA CCAATCCATC CGGGGAGTAT CATACAAATA TTACTCTGCC GCCGGGAGAT
CATCAGATGA AAGCACGGTT CGAAAGTCTG GATCTTCCCC TGAACCAGTC AGAGAGCGCT
GAGATATCTG TCAAAGCCCC GATATCAGCC CAGTCGATAG GAGTGGAGAT ACTGAAATAT
CTTCTGATGG GCGGGATCGT CCTGCTCGCA GTCGGTGGGG CCTCACGATA CATTCACCGA
AGAAGATTCT GGCTTCCTCA GGTGCGAGAA CTGCCATCTG GAGATCGAAT GATGGAAAAT
CCCATTCATT CAGAAGGACG TATAACCAGC CCCTGGGATG ACCTGCACCC GGATGACGTG
ATCGCAGAGT CGCAGAGACT GTTTGCAGAT GAACAATCGG ACGCGATGCA TCATCTCTAT
CAATACCTGG TATCACTGGC AGCACACGTT CATCCCAGAG TCTTTATCCC GGCACTGACT
CCCCGTGAAC TCATGCGTCT TCTCAGGCAT GATGGGGGGG GAGAGGATAT GCGTTCGTTC
ATTGACACCT ACGAAAAGAT CAGGTACGGT GGCATGCGCC TCCCGGATAA AGGGCAAGAA
CCCATCATCT CGTGGTTCCA GGCTATCCTC TCATGGTTGG GAGGTGACCA TCATTAA
 
Protein sequence
MKRQLPLAIT VVSTVFVLVL IIHFAFTPIL YSPSGADSTR SHINPGAMQK FSVERYEKVM 
PMMQEILDLS GSVVLNMNHN DTNAAERDLR EYLDLSHSLD QAVVDLNLSD SELEIWRQEN
YENRINLTLL VNDTKQLEEI KTLEAQVRSQ NSSGDLTTLN FEENSLNQRI RQNALTYRGL
PNSSFTSLSK KFDLDTTEFE KNAGNLTVDN QVVEQNPGSI PLNERGVLTF GLSQTSASYG
EIINASGTLQ NLDGIDQNIE IGVDGAPWSI ITPNPDRTYQ VSLVMSNLSA GTHIAYARSS
IFPAKESEFN IIPSDTVLTI REEEQRDSTN ITIAGTLRTV SDIPVTGAPV RISWDSAGRT
DVLTNPSGEY HTNITLPPGD HQMKARFESL DLPLNQSESA EISVKAPISA QSIGVEILKY
LLMGGIVLLA VGGASRYIHR RRFWLPQVRE LPSGDRMMEN PIHSEGRITS PWDDLHPDDV
IAESQRLFAD EQSDAMHHLY QYLVSLAAHV HPRVFIPALT PRELMRLLRH DGGGEDMRSF
IDTYEKIRYG GMRLPDKGQE PIISWFQAIL SWLGGDHH