Gene Mpal_1784 details


Gene Information

Locus tag: Mpal_1784
Symbol:
ID: 7270330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1872780
End bp: 1877978
Gene Length: 5199 bp
Protein Length: 1732 aa
Translation table: 11
GC content: 59%
IMG OID: 643570400
Product: Carbohydrate binding family 6
Protein accession: YP_002466814
Protein GI: 219852382
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
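
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; both are assumptions, not statements from the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1872780
end_bp = 1877978


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    codons, remainder = divmod(gene_len, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


print(gene_length(start_bp, end_bp))                   # 5199
print(protein_length(gene_length(start_bp, end_bp)))   # 1732
```

Under those assumptions the results agree with the listed Gene Length (5199 bp) and Protein Length (1732 aa).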


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.847863
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACAAA GCAAACGATT TGATTTGATT CTGATTATGA TATTACTGGC ATCTGTCCTG 
CTGCTGGCTC CGGTTACAGC ATCTTCTGTA ACTGTTGGTG CCAGCGGGTG CGACTTCACG
ACGCTGACGG ATGCGATTAA TAGTGCATCA GTTGTCGATG GCGACACAAT CTATGTCTAT
AATGGGACTT ACTCCTTCAC TGGACTAACA AAAGCCATTA CCCTCACTGG GGAGGGAGCA
GATCTCGTAA CACTCAATCT CGGAGGATCC GGAAGTACGA TTAGGGGATC TGGAACGATT
ATCGAGCAGA TGAGGTTTAC GAATGGAAGG ATTCAATTAA CCGGTGCTGC ACCCATCGCC
CAGAATATGA TCATCCGACA GTGTATCTTT GAGGGTCTGA CCTTCAATGG CCCTTCATAT
TATTCAGCTA TTCAACTCCT TGGTACAAAT AATACATTTC AAGATAATGC CTTCAGAAAT
AATGTTGCAC CCATAGTGAT CTATTTTGGA ACCGGCTCGG GTAACCGCCT GCTGAACAAT
ACCTTCGTGA ACACTGCGCC CGCGTCGATG TTAAATAGAG GAGTCATCTC AATAACTGTC
CCCGCGACTT CGGCAGTTAT TGAAAATAAT ACGTTTAAAG ACAACACGAT TTCTTGCATC
CAATTGGGGT CAACCATGGG CACAGGAAAC GTGATTGACC GCAACAATTT CATTGTGCCT
AACGGGGTTT CGCCTATTTT GGGCACTGGT ACCATCCCAA TTGCCTCATG GGTCACGTCC
GCGGCGGTTC CGTACACCTA TCAGGGATCA TCCCATATTG GCATTCTCGG CAATTACTGG
AGTACCTATT CCGGCACGGA TGCTAACGGG GATGGCATCG GGGATACCTC TTACAATACT
GGAGTTTCGA ACCAGATCGA TTCCGCTCCG CTGATGGACC AGGCCCAGTT CTACTTTGGG
GCCTCACATT CAGCCACGGC ATCGGTCAGC GTCATCCCGG CATCGGCGAC GGTCAATGTC
AGCGAGACAA AACAGTTCGC CGGGAAAGCC ACCGATAGCG ACGGCCTGAA TATCCCTGGC
CTGAACTATA CCTGGTCCTC CAGCAACGAG ACCGTCGGCA CGATCTCCCA GTCCGGTCTC
TTTACTGCAC TTACACCAGG ATCGGCAAAC ATCACCGCCT CGAATGGCGG AATCAGTAAC
ACCTCTGTTG TTACGGTTCT CTCCGCTCTG TCGCAGCCGG TCGCCGACTT CACGGCGAAT
GTGACGAACG GCACGGCACC ACTCTCGGTC CAGTTCACCG ATGCATCGAC TGGTTCTCCA
ACGGGCTGGT CGTGGGACTT CGGTGACGGG AACACCTCGA CAACGCAGAA CCCGGCCTAT
ACCTACGCGA CGGCCGGGAA CTACACCGTG AACCTGACGG CCACCAACGC CGGCGGGTCC
AACACAACGG TGAAGATGAA CTACATCACG GTCACTGAGT CTGAGACCCC GATGGCTCCG
GTCGCCGGAT TCACCGCTAA TGTAACGAAC GGCACCGCTC CACTGGCAGT CGGTTTCACC
GACCAGTCGA CCGGCACCCC GACCTCATGG TCGTGGAACT TCGGTGACGG CAACACCTCG
ATCGAACAGT CGCCGGCCCA CACCTACGCG ACAGCCGGGA ACTACACCGT GAACCTGACA
GCGACGAACG CCGGCGGCAA CAACACTACC ACGAAGACGA ACTATATCAC CGTCACCACA
GCGGTGACCC CGGTTCCGGT CGCGAACTTC AGCGCGAACA TCACGAACGG TACGGCCCCG
CTGGCTGTGC AGTTCAACGA CACCTCGCTG GGTGAGAACC TGACGGCATG GTCGTGGGCG
TTCGGCGACG GCAACACGTC GCTCGAACAG AACCCGATCT ATAACTACAC GACAGCCGGC
AGTTACACCG TGAACCTCAC GGTGACGAAC GCGACCGGCT CCGACAACGA GGTGAAGACC
AGCTACATCA CGGTCTCTGC ACCGGTGATC CCGACGCCGA CGATCACCGT TGCCCCGCTG
TCAGCCATAC GGCACATCTT CATCAATACC GCGAACGGCG TGAAGTACAA CTATGACGGT
GCGACCTACG GCGGCCCGAA CAACACCTAC TATGTCAAGG CCGACGGTGG CGGACTGAAC
GAGCTGCACC TCACCAACGA CGCGAATGTT GCCTCCGGCC AGGTCACGAC GACCGGTGCC
CAGAACGGCA CCTTCTATGT GACGAACACC GGCGGTCGTG GATTCGATGA CGATATCATC
CTCCTCGTCT CGGTGAAGGG CCCGATCCCC GACAACTTCG GCATCCATCT GACCTCGAAC
GGATATGTCT GGACCCCAGC CGCTGCCGGA GCCTACAACC CGACCGCCCC GACCGACTCC
TCCTACGTGA CCGGTATCGA CCAGACCTTC TCGAAGGCCG ACTTCATCTA CGGCCCGCAG
ACCTGGAAGC CCGGCCCCCA TCAAGGCTCA GCCGACCTGG TGACCCCATA CCTGCCGCTC
TACACCGGGC AGGACATCAA CGACGCCTCG ACGGCCCAGT ACCTGATGTT CGTCGACCTC
AAGGCCGGCA ACGTAAAGCA GGGCGTCATC CCCAGTGCCA CGCTGAACAA CGGTGCCCTG
AAGGTGGACT ACAGTATCAC CAACCTGAGC ACCCGTGCAG ATTTCAATGG ATTCGCCTGG
GCCAATGCCT CGAACCAGGG GCAGGGGATC ACCTGGACAA ACAATGTCTT TAATCCGGGC
GCCAGCGGCT ACTCGGTCAC GGCCCCTGTG GCTGCCCCCG TCGCCGGCTT CAATGCGAGC
GTGACGAACG GAACGGCCCC ACTGGCCGTG CAGTTCAACG ACCTGTCAAC CGGTACCCCG
ACCTCGTGGT CGTGGGACTT CGGTGACGGC AACACATCGA CAACGCAGAG CCCAAGCCAC
ACCTATACTG TGGCCGGGAA CTACACGGTG AACCTGACGG CGACGAACGC CGGCGGTTCC
AACACCACCA CGAAGACGAA CTACATCACC GTCACCGCGT CCGAGAGCCC GTCGGCTCCG
GTCGCGAACT TCAACGCGAA CGTCACGAAC GGCACTGCCC CCCTGACGGT CGGCTTCACC
GATGCGTCGA CCGGAGCTCC GACCTCGTGG TCGTGGTCGT TCGGTGATGG CAACACCTCG
ACGACACAGT CCCCGTCCCA CACCTATACT GTGGCTGGGA ACTACACGGT GAACCTGACG
GCGACGAATG CCAGCGGTTC CAACACCTCG GTGAGGACGA ACTATATCAC CGTCACCGGG
TCAGAAACTC CGACGGCTCC GGTCGCGAAC TTCAACGCGA ACATCACGAA CGGCACTGCC
CCACTGGCTG TGCAGTTCAC CGACCTGTCG ACCGGAGCTC CGGTGACCTG GTCGTGGGAC
TTCGGTGACG GCAACACCTC GACGGTGCAG AACCCGGCCT ACACCTACGC GGCGGCCGGA
AACTACACCG TGAACCTGAC GGCTACGAAC GCCGGCGGTT CCAACACCAC CACGAAGACG
AACTATATTA CCATCCACGC CGTGGTGCCA CCAACGACAG CACCAACCAC CACCGCTCCA
ACAACCGTAC CGACAACCAT TCCAACGACG ATAGCAACAA CCATTCCAAC GACAGTACCA
ACAAAGATCC CGACGACAGT TCCGACGACG ATCCCGGTGA CCCCGACTCC GACCATGCAG
CCGGTGACGG CCAACTTCAC GGCGAACGTG ACGGCCGGCC AGACCCCGCT CTCGGTGCAG
TTCACCGACC TGTCGACCGG CACGATCCGA CAGTACTTCT GGCAGTTCGG TGATGGTGGG
GCTTCGTTCG ATAAGAACCC GGTTCACACC TACTCGGCAG CCGGCACCTA CACCGTCTCC
CTCGTCGCCA TCGGTTCCAC CGGGGCTGAG GTGAAGACGA TCCCACAGTA CATCACCGTC
ACCGGTCCAG GGACTCCGAC AGTCTCGCCG ACCGTGACAA CAACGGTGAC AACGACAACG
CCAGCTCCAA CCACTACAAC AACCATCACT CCCACCGTGA CAACGACCAT CCCGGTCACG
ACGACCGTGA CTCCGCTGCC GACCTCGTCC GGGCATGACC TGCCGATCGC GAACTTCGTG
GTCACCTACC AGGCCGGCGC AGGCTCGATG GGCATCCAGG TCACCGACGC CACGACGAAC
GCCACCACCG TGAAGTACGA CCTCGGCGAC GGTACCACCA CCGCCTATAA GAATTTCAAG
TACACCTACT GGCAGCCCGG CACCTATACG ATCACCCTGA TCGCAACCAA TGACGCCGGG
TCTTCGACAA AGACCGTCAC GGTGACCGTG CCGGCCGGGT CACCCACGAC CATGACCGTG
ACGCCAACTC CAACGGCGAC CGTCTCGCCG ACCATCACGC CAACCCAGAC ACCAACCAAC
CCGAACCTGC CTGTGGCCAA CTTCACGATC ACATTCCAGG GAGGCTCGGG CTCGATGGGC
ATCCAGGTCA CCAACACCGC GGTGAATGCG ACCTCGGTCC ATTATAACCT CGGCGACGGG
ACGACCACCG CCTATCCGAA CTTCACCTAC ACCTACTGGC AGCCCGGCAA CTACACGATC
AACCAGACCG CAACCAACGC GGCCGGGTCC ACCACCAGGA CCATCAATGT GACCGTGCCG
GCGGTGGCAA CCCCAACGAC AATCCCAACC ACCACCGTCT CGCCGACCGT GACCGGCACC
GTTCCGGCCT ACAACGGGAC TCACACGATC CCCGGCCAGT TGCAGGCCGA GGACTACGAC
CTCGGCGGTG AAGGGATCGC CTACCACGAC ACCACCGCCG GCAACGAGGG CGGGGTCTAC
CGGCATGACG ACGTCGACAT CGAACAGATC GACACCGACA GAAGTCCGAG CGTCGGCTGG
ACCCGTGCTG GCGAATGGCT CGCCTACACT GTAAACATCA ACACGGCTGG CACCTACACC
GCCGGGTTCC GGGTCGCCTC GGCCCATGCC GGCTCGTCCG TCCAGGTGTA TCTCGACGAC
GGCACAACCC CGATCGCAAC CGTGAACGTC CCGAACACCG GCAACGATGC GACCTTCCAG
ACCCTCCAGG TCCCGGTGAC CCTGCCGGCC GGGCAGCACC GGCTGAAGCT TGCGTTCCCG
GGGAACTACG CCAACATCAA CTGGATCAAC TTCGCCTGA
 
Protein sequence
MIQSKRFDLI LIMILLASVL LLAPVTASSV TVGASGCDFT TLTDAINSAS VVDGDTIYVY 
NGTYSFTGLT KAITLTGEGA DLVTLNLGGS GSTIRGSGTI IEQMRFTNGR IQLTGAAPIA
QNMIIRQCIF EGLTFNGPSY YSAIQLLGTN NTFQDNAFRN NVAPIVIYFG TGSGNRLLNN
TFVNTAPASM LNRGVISITV PATSAVIENN TFKDNTISCI QLGSTMGTGN VIDRNNFIVP
NGVSPILGTG TIPIASWVTS AAVPYTYQGS SHIGILGNYW STYSGTDANG DGIGDTSYNT
GVSNQIDSAP LMDQAQFYFG ASHSATASVS VIPASATVNV SETKQFAGKA TDSDGLNIPG
LNYTWSSSNE TVGTISQSGL FTALTPGSAN ITASNGGISN TSVVTVLSAL SQPVADFTAN
VTNGTAPLSV QFTDASTGSP TGWSWDFGDG NTSTTQNPAY TYATAGNYTV NLTATNAGGS
NTTVKMNYIT VTESETPMAP VAGFTANVTN GTAPLAVGFT DQSTGTPTSW SWNFGDGNTS
IEQSPAHTYA TAGNYTVNLT ATNAGGNNTT TKTNYITVTT AVTPVPVANF SANITNGTAP
LAVQFNDTSL GENLTAWSWA FGDGNTSLEQ NPIYNYTTAG SYTVNLTVTN ATGSDNEVKT
SYITVSAPVI PTPTITVAPL SAIRHIFINT ANGVKYNYDG ATYGGPNNTY YVKADGGGLN
ELHLTNDANV ASGQVTTTGA QNGTFYVTNT GGRGFDDDII LLVSVKGPIP DNFGIHLTSN
GYVWTPAAAG AYNPTAPTDS SYVTGIDQTF SKADFIYGPQ TWKPGPHQGS ADLVTPYLPL
YTGQDINDAS TAQYLMFVDL KAGNVKQGVI PSATLNNGAL KVDYSITNLS TRADFNGFAW
ANASNQGQGI TWTNNVFNPG ASGYSVTAPV AAPVAGFNAS VTNGTAPLAV QFNDLSTGTP
TSWSWDFGDG NTSTTQSPSH TYTVAGNYTV NLTATNAGGS NTTTKTNYIT VTASESPSAP
VANFNANVTN GTAPLTVGFT DASTGAPTSW SWSFGDGNTS TTQSPSHTYT VAGNYTVNLT
ATNASGSNTS VRTNYITVTG SETPTAPVAN FNANITNGTA PLAVQFTDLS TGAPVTWSWD
FGDGNTSTVQ NPAYTYAAAG NYTVNLTATN AGGSNTTTKT NYITIHAVVP PTTAPTTTAP
TTVPTTIPTT IATTIPTTVP TKIPTTVPTT IPVTPTPTMQ PVTANFTANV TAGQTPLSVQ
FTDLSTGTIR QYFWQFGDGG ASFDKNPVHT YSAAGTYTVS LVAIGSTGAE VKTIPQYITV
TGPGTPTVSP TVTTTVTTTT PAPTTTTTIT PTVTTTIPVT TTVTPLPTSS GHDLPIANFV
VTYQAGAGSM GIQVTDATTN ATTVKYDLGD GTTTAYKNFK YTYWQPGTYT ITLIATNDAG
SSTKTVTVTV PAGSPTTMTV TPTPTATVSP TITPTQTPTN PNLPVANFTI TFQGGSGSMG
IQVTNTAVNA TSVHYNLGDG TTTAYPNFTY TYWQPGNYTI NQTATNAAGS TTRTINVTVP
AVATPTTIPT TTVSPTVTGT VPAYNGTHTI PGQLQAEDYD LGGEGIAYHD TTAGNEGGVY
RHDDVDIEQI DTDRSPSVGW TRAGEWLAYT VNINTAGTYT AGFRVASAHA GSSVQVYLDD
GTTPIATVNV PNTGNDATFQ TLQVPVTLPA GQHRLKLAFP GNYANINWIN FA
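
The sequences above are laid out in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the helpers below strip the whitespace, compute a GC fraction, and translate a CDS. The function names are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11 (the two differ only in permitted start codons); an internal start codon is translated as its ordinary amino acid here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
head = clean_sequence("ATGATACAAA GCAAACGATT TGATTTGATT")
print(translate(head))  # MIQSKRFDLI
```

The translated fragment matches the first block of the protein sequence. Pasting in the full gene sequence would allow the same comparison against the complete protein and the listed GC content.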