Gene Mpal_2071 details


Gene Information

Locus tag: Mpal_2071
Symbol:
ID: 7271548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2195214
End bp: 2199356
Gene Length: 4143 bp
Protein Length: 1380 aa
Translation table: 11
GC content: 54%
IMG OID: 643570683
Product: Carbohydrate binding family 6
Protein accession: YP_002467093
Protein GI: 219852661
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
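
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `check_lengths` is hypothetical and is not part of any database API.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    span = end_bp - start_bp + 1      # inclusive span on the replicon
    codons = gene_length_bp // 3      # number of codons, stop codon included
    return {
        "span_matches": span == gene_length_bp,
        "whole_codons": gene_length_bp % 3 == 0,
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for Mpal_2071.
result = check_lengths(2195214, 2199356, 4143, 1380)
print(result)
```

Under these assumptions all three checks pass for this record: the span is 4143 bp, which is 1381 codons, leaving 1380 amino acids once the stop codon is excluded.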


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.306385
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.362656
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAATC TACTCAAAAG AATGACGTGG CTGCTGATTG GGGTGTTCAT CGTTTCGTGC 
CTGCTGATGA TCACGCCTGT GGCGGCGAAA TCTGTCTCCC TGACTGGCAT TGTTACTGAT
TCAATCACTG GTTTACCAGT GTCGGGTGCG ACAATCACCG TCCATGGCAC AGGGGTTACA
ACGAACAGTG TATCCTCAGG CAGTGACGGG CATTATTCGC TTGTTTTTTC TGTATCAAGT
TATATCTATG CTGCACTTTG TGATATTCAT AAGGATGGAT ATGTGGAGAG CAACAACAAC
CCAATTGCAT TTTTAGGATC ATCTACATCA AAAGACTACA AACTCGCCCC GGTCCTCCGG
ACCCTTACCG GTACGGTTAC TGATTCGATC ACCGGTCAGC CTGTATCGAA TGCGACAATC
TCCGTTCACG GCTATGAGAC AACCTCGAAC AATACTCAAT CGGATAGTGA TGGGCACTAT
TCGGTCGATT TTCAAACATC GGGTTCTCTC GCTGTTCCGT TCTGCGATAT TCATAAGGAT
GGATATGTGG AGAACAACAA CAACCTGATC TCGCTATCAG GGTCTGTAAT ATCAAAAGAC
TATAGACTCA CCCCGGTCCT CCGGACCCTT ACCGGTACTG TTACTGATTC GATCACCGGT
CAGCCTGTAT CGAATGCGAC AATCTCCGTT CACGGCTATG AGACAACCTC GAACAATACT
CTATCGGACA GTGACGGATA TTATTCGCTC GATTTTCAAA CATCGGGTTC TCTCGCCGTT
CCGCTCTGTG ATATTCATAA GGATGGATAT GTGGAGAACA ACAACAACCT GATCTCGTTA
TCAGGGTCTG TAATATCAAA AGACTATAGA CTCACCCCGG TCCTCCGGAC CCTTACCGGT
ACTGTTACTG ATTCGATCAC CGGTCAGCCT GTATCGAATG CGACAATCTC CGTTCACGGC
TATGAGACAA CCTCGAACAA TACTCAATCG GATAGTGATG GGCACTATTC GGTCGATTTT
CAAACATCGG GTTCTCTCGC CGTTCCGCTC TGCGATATTC ATAAGGATGG ATATGTGGAG
AACAACAACA ACCTGATCTC GCTATCAGGG TCTGTTATAT CAAAAGACTA CAAACTCACC
CCGGTCCTCC GGACCCTTAC CGGTACTGTT ACTGATTCAC AATCCGGTCA GCCTGTGTCG
AATGCGACAA TCTCCATTCA CGGCTATAAG ACAACCTCGA ACAATACGTC ATCGGACAGT
GACGGACATT ATTCGCTCGA TTTCCAAACC TCTGATTCTC TCGCAGTCCC GCTCTGTGAT
ATTCATAAGA ATGGATATGT GGAGAACAAC AACAACCTGA TCTCGCTATC AGGGTCTGTA
ATATCAAAAG ACTTCCAGCT CACCCTGACC CAGACCCTGA ACGGTACCGT CGTCGATCGA
TTCACCGGTC TGCCGGTGTC CAATGCGAAG ATCCTTATTA ACGGGAACCA GGTTACTTCC
ACGGGGAATG GAAGTTTCAC CTTGAGTAAT GTCACCCCTA CCGTAGTCGA CATAAATGGT
AATGCTCTGA ATGTCCTGGC GTCGAACACC TACTGGTCTC ATAGTAGCCT CTTCTCTTAC
TCATCTGCTG AACTTCAACG CGGCTCAAAG GATCTCGGGA CGATCGAACT CACTCCGTAT
CTGGCAGGAT ATGTAACCAA TTCTGCTAAT CAACCGGTAT CGGGGGCGCA CATCACCCTC
ACCGTGCATT CATTGGTAGT GACCTCCGGT TCGGCGACAT CGGATAATTC GGGGTACTAT
CTTTCTCAAT TGGAGATACC CGATGGTCTC GTTCTGACTA ACATCTCCTC CGTGAACCTG
ACAGTGATGA AGGATGGATA TCTGCCATAC TCCACTGAAC TTGGGGCATC TGATTTTCTC
AATCCCTCCC TCCATAATAT CACATTCACC CCGATTGCAC TCAACAACTG CACCCCTGAC
CATGCAGAGA CCAACAACAC GTCGGTCTCT TTCACGCTCA CCACCTCTGG TATCATCTCA
CCACAGGAGG TCAGGCTTGT GAAGGCCGGC CAGACGAACA TCACGGCCGG TTCGGTTGCG
ATGACACCTG ATAGAAACGC CATCACCGGA ACCTTCAACC TCAGGAATGC AGCCCTTGGG
GATTGGGGGA TTGTGGTTGT GAATGGGGAT GGGGCCGAGG TAAAATGGGC TGGCACCTTC
ACGATTCTGC AGCAACTGGT CGCGAACTTC ACCGCCAACA CGACAGCAGG AAACCGACAA
CTCCCTGTCC AGTTCACGGA CCTGTCGACG GGTTCTCCGA TCTCCTGGCT CTGGGAGTTT
GGTGACGGCA ACTCTTCAAT GGAGCAGAGC CCGGTTCACA TCTACACGGA TGCCGGGAAC
TATACGGTGA ACCTGACGGC AACCAATGCT GGCGGGAGTA ACACCACGGC GAAGACGAAC
TACATTACTG TCCGTGCCCC GGTGCCGCCC ACCACCGCTC CAACGACAAC TGCTCCAACA
ACAGCACCAG CAACGACCGT ACCGACGAGG ATTCCGACGA CGATCCCGAC GACGATACCG
GTGACCCCGA CGCCGACCGC GACGGTTCCG CCGCTGGTAG CGAACTTCAG TGCGAATGTG
ACGGCTGGCC AGACTCCGCT CGCCGTGCAG TTCACTGATG CGACCACCGG TCTGGTCAGG
CAGTACTTCT GGCAGTTCGG CGACGGCGGG GCCTCGTTCG ATAAGAACCC GGTCCACACC
TACTCGGCAG CCGGGAAGTA CACCATCTCT CTCTTTGTGA TCGATCAAAA TGGATGGCAG
GTGAAGACGA ATGAGCAGTA CATCACCGTC ACTGCACCGG GAACACCGAC GGTCTCGCCG
ACCGTGACGA CAACAGTGAC AGCGACAACG CCGGCTCCAA CCACTACAAC AACCATCACT
CCCATCGTGA CAAAGACCAT CCCGGTGACG CCGACCGTGG CCCCGCTTCC GATCGCGAAC
TTCGTGGTCA CCTACCAGAG CGGTGCCGGC TCGATGGGCA TCCAGGTCAC CGATGCCTCG
ACGAATGCGA CCGTGGTGAA ATATGATCTC GGCGACGGCA CGACCACCGC CTATAAAAAC
TTCAAGTACA CCTACTGGCA GCCTGGCACC TACACGGTCA AACTGATCGC GACGAACGAT
GCAGGGTCCT CAACGAAGAC TGTCACGGTG ACGGTGCCGG CCGGATCACC CACGACCACG
ACCGTATCGC CGACGGTGAC CGTGACGACG CCGGCCCCAA CTATGACGAC CGTCTCGCCG
ACAACGACCA TGACCACACC GGTCACGCCG ACTGTGACCT CAACCTTCAC CCCGACACCG
ACACCAACCA TTCCGAATCT CCCGGTGGCG AACTTCACGG TCACCTTCCC GGGTGGCATT
GGCTCGATGG GCATCCAGAT CACAAACACC GCGGTGAATG CGACCTCGGT CCATTATAAC
CTCGGCGACG GGGCGACCAC TGCCTACCCG AACTTCACCT ACTCATACTG GCAGCCTGGG
AACTATACCA TCAACCAGAC CGCGACCAAT GCGGCTGGGT CTTCCAATAA GACCATTATG
GTGACGGTGC CGGTTGGGTC GATCCCGACC ACAATCCCAC CGACCATCCC AACTACCACG
GTCTCACCGA CGGTGACCGT CACCGGTTCG GCCTTTGACG GCCTGCACAC GATCCCCGGC
ACCCTGCAGG CCGAGGACTA TGATCTCGGC GGTGAGGGTG TCGCCTACCA CGATACCACC
CCTGGAAACG AGGGCGGCGT CTACCGGCAT GACGACGTCG ATATCGAGCA GCTCGACACC
GACGGGTCGC CGAACGTCGG TTGGATCCGT GCCGGCGAGT GGCTTGGGTA CACCGTGAAC
GTCAGCACTG CCGGCACCTA CACGGCCGGG TTCCGTGTTG CTTCCTCCCA CACAGGTTCA
TCTGTCCAGA TGTACGTTGA CGACGGTACG ACCCCGGTCG CGACGGTGAG CGTCCCGAAC
ACTGGCGACT GGCCGGTCTT CCAGACCGTT CAGGTTCCAG TGACCCTGCC GGCCGGTACT
CACCGACTGA AGTTTTCGTT CCCGACCGAT TTCGTCAACA TCAACTGGAT CAGTTTCGCC
TGA
 
Protein sequence
MANLLKRMTW LLIGVFIVSC LLMITPVAAK SVSLTGIVTD SITGLPVSGA TITVHGTGVT 
TNSVSSGSDG HYSLVFSVSS YIYAALCDIH KDGYVESNNN PIAFLGSSTS KDYKLAPVLR
TLTGTVTDSI TGQPVSNATI SVHGYETTSN NTQSDSDGHY SVDFQTSGSL AVPFCDIHKD
GYVENNNNLI SLSGSVISKD YRLTPVLRTL TGTVTDSITG QPVSNATISV HGYETTSNNT
LSDSDGYYSL DFQTSGSLAV PLCDIHKDGY VENNNNLISL SGSVISKDYR LTPVLRTLTG
TVTDSITGQP VSNATISVHG YETTSNNTQS DSDGHYSVDF QTSGSLAVPL CDIHKDGYVE
NNNNLISLSG SVISKDYKLT PVLRTLTGTV TDSQSGQPVS NATISIHGYK TTSNNTSSDS
DGHYSLDFQT SDSLAVPLCD IHKNGYVENN NNLISLSGSV ISKDFQLTLT QTLNGTVVDR
FTGLPVSNAK ILINGNQVTS TGNGSFTLSN VTPTVVDING NALNVLASNT YWSHSSLFSY
SSAELQRGSK DLGTIELTPY LAGYVTNSAN QPVSGAHITL TVHSLVVTSG SATSDNSGYY
LSQLEIPDGL VLTNISSVNL TVMKDGYLPY STELGASDFL NPSLHNITFT PIALNNCTPD
HAETNNTSVS FTLTTSGIIS PQEVRLVKAG QTNITAGSVA MTPDRNAITG TFNLRNAALG
DWGIVVVNGD GAEVKWAGTF TILQQLVANF TANTTAGNRQ LPVQFTDLST GSPISWLWEF
GDGNSSMEQS PVHIYTDAGN YTVNLTATNA GGSNTTAKTN YITVRAPVPP TTAPTTTAPT
TAPATTVPTR IPTTIPTTIP VTPTPTATVP PLVANFSANV TAGQTPLAVQ FTDATTGLVR
QYFWQFGDGG ASFDKNPVHT YSAAGKYTIS LFVIDQNGWQ VKTNEQYITV TAPGTPTVSP
TVTTTVTATT PAPTTTTTIT PIVTKTIPVT PTVAPLPIAN FVVTYQSGAG SMGIQVTDAS
TNATVVKYDL GDGTTTAYKN FKYTYWQPGT YTVKLIATND AGSSTKTVTV TVPAGSPTTT
TVSPTVTVTT PAPTMTTVSP TTTMTTPVTP TVTSTFTPTP TPTIPNLPVA NFTVTFPGGI
GSMGIQITNT AVNATSVHYN LGDGATTAYP NFTYSYWQPG NYTINQTATN AAGSSNKTIM
VTVPVGSIPT TIPPTIPTTT VSPTVTVTGS AFDGLHTIPG TLQAEDYDLG GEGVAYHDTT
PGNEGGVYRH DDVDIEQLDT DGSPNVGWIR AGEWLGYTVN VSTAGTYTAG FRVASSHTGS
SVQMYVDDGT TPVATVSVPN TGDWPVFQTV QVPVTLPAGT HRLKFSFPTD FVNINWISFA
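
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is illustrative only: it translates the first 30 nucleotides of the gene sequence and the final three codons, and assumes the amino acid assignments of translation table 11 (which match the standard genetic code; the two tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G at each position).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace (as used in the 10-base blocks above) is ignored, and any
    trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the gene sequence, copied with their spacing.
start_of_gene = "ATGGCAAATC TACTCAAAAG AATGACGTGG"
# Last three codons of the gene sequence, including the stop codon.
end_of_gene = "TTC GCC TGA"

print(translate(start_of_gene))
print(translate(end_of_gene))
```

The first call yields the same ten residues that open the protein sequence, and the second ends in a stop symbol after the final two residues. Translating the full gene sequence the same way should give the listed protein followed by a single stop.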