Gene Mpal_2068 details

Gene Information

Locus tag: Mpal_2068
Symbol:
ID: 7271308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2188162
End bp: 2192232
Gene Length: 4071 bp
Protein Length: 1356 aa
Translation table: 11
GC content: 58%
IMG OID: 643570680
Product: Carbohydrate binding family 6
Protein accession: YP_002467090
Protein GI: 219852658
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
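
The coordinates and lengths in this table can be cross-checked against each other. The sketch below assumes that the start and end positions are inclusive and that the protein length excludes the terminal stop codon; both are assumptions about how the record counts, not something the page states. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2188162
END_BP = 2192232
GENE_LENGTH_BP = 4071
PROTEIN_LENGTH_AA = 1356


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Each line prints True when the table values agree with each other.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values from this record both checks hold: 2192232 - 2188162 + 1 = 4071, and 4071 / 3 - 1 = 1356.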


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.684202
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAATCG AAGCGTTTAT TGGTTCTTCC CCCAATTTTC AATATACCAA TACAAAATCC 
GATCTGGTTT TAATGATTGT TACAACGGAA AAAATGAGAT GGGTGCTGCT GATGATCGTG
CTGGTCGCGA TCTGCTGCAC CCCGGTATCT GCATCGTCGC TGTTCGTTGG AGTCGCTCCC
GCCTCAACAC CGGTTCCGTC GACAAACTCC AACCTGGTGA GTGAACTCTC TGTGTCCGGT
TATTCTGCCC AGTACCAGCA AAATGTCTCC TGGCCTTCCG AAAATTTCCT CCTCAAGGGG
CTGGCTGAGG ACGCCCCCAA CGGGACGCTC TTCGTCACGG ATAGGTCGAC TACGTCCGTC
ACAATCCTCA ATGCAACGAC GATGGCAGTG CAGGGATCGC CGATCCCGCT TTCAGGTATA
AGTATAGACT ATCCCCTGAA TAGTGTGTAC GATCCCGAGC ATAGCCGCCT GTATATCTAT
GATAATGGAA ATCAAGATCT GGTCGGGTTT GCATGGGATT CAACTAATCA AACGCTGAAC
TACGACGGGA TGACGATTGT CCTTCCCGAA CGTTGTTCCG GACTGGCTCT GGATTCGGTC
GCTAATATTC TCTATGTATT GCCAGAAGAT CCTGATCCGG ATATCGGCTC GATCCATCGA
TATAGTACGG TGACGGGAGC GTCTCTGGAT GATATCTCTC TCGGAGTATG GGGGCTCAGT
CAGATCTCCT CACTTGCCGT CGACAGTGAG AACCACTGTC TCTATATGAA AGCATATTAT
TCCCTCCTTG ATTCCGAATG CATGTTCAGC TGGAATACCT CCACTTCCAA TTTCAACACC
GGGATTGCTT CGGTTTCGGA TAGTGATACC ATAATGGCAC TGGCCGTGGA TCCTGCCTCT
CATTACCTCA TTGCAGCGAC AAAATCTCAG AATTTGGGTA ATAATCTGTA TTATCCAATG
ATTGAATTTT ATGACCGTTC CATGAACCAG ATGGACTCTG CCTTGGCCTG CCCGTCGGAT
TTCCCCGGCG TCTCAAGGAG AGGGAATCTG CCGGACGACT ATTATCTGGC CGTCCTCCCT
GAAACATTAG CCCCGCCTGT GGCGAACTTC ACGGCAGACG TGACGAACGG CACCGCTCCG
CTCGAGGTCG AGTTCACCGA TCAGTCGACC GGTTCCCCGG ATTCGTGGCA CTGGGAGTTT
GGTGACGGCA CTACCTCGAC CGATTTAGGG TTCGCAGACC ATACCTACAC GGAAGCCGGG
AACTACACGG TGAATCTGAC GGTGGCGAAC CCCGGCGGGA GCAACTCCAC GACAAAGACC
AACTACATCA CGGTCCATGC CCCGATATTT GCCCCGGTTG TGAACTTCAC GGCGAACGTA
ACGAACGGCA CGTCTCCGCT GACCGTCGGT TTCACTGACC TTTCGACCAA TTCCCCAACG
GATTGGCTCT GGGAGTTTGG TGACGGTACC ACCTCTACGA TGCAGAACCC GGTCCACACC
TTTACGGATA TCGGGAACTA CACGGTGAAC CTGACGGCGG CCAATACTGG TGGGAACAAC
ACCTCGATCA GGACCGACTA TATCACGGTC ACCCAGGTAC CCGTTCCAGT GGCCAACTTC
TCGGCGAACG TTACGAATGG GACATCCCCA CTGTCGGTTG GTTTCACTGA CCTTTCGACC
AATTCTCCAA CGGATTGGCA CTGGGACTTC GGTGACGGCA CCACCTCTAC GATGCAAAAC
CCGGTCCACA CCTTTTCGGA TATCGGGACC TACACCGTCA ACCTGACGGC GACGAACGCC
GGGGGGAATG CCACCCAGAC CAAGACGGAT TATATCACGG TCACCCAGGT GCCCGTTCCA
GTGGCCAACT TCTCGGCGAA CGTTACGAAT GGGACATCCC CGCTATCGGT CGGTTTCACT
GACCTTTCGA CCAATTCTCC AACCTCCTGG CTGTGGGACT TTGGTGATGG CACCACCTCT
ACGATGCAAA ACCCGGTCCA TGCCTACATG GCGGCCGGTA ACTATTCAGT GAACCTGACG
GCGACGAACG CCGGGGGGAA TGCCTCCCAG AACAAGACGG ACTATATCAC GGTCCATGCC
CCGGTGTTTG CCCCGGTTGC GAACTTCACG GCGAACGTGA CGAACGGCAC CGCACCGCTC
TCAGTCGGTT TCACTGACCT GTCGATAGGT TCCCCGGCGA CCAGGTCGTG GAACTTCGGT
GACGACAATA CCTCGACCGA GCAGAACCCG GTCCACACCT ATACGGCGGC AGGGAACTAC
ACGGTGAACC TGACGGTGAC GAACGCCGGC GGTTCCAACA CCTCGGTAAA GACGAACTAC
ATCACCGTCC ATGCCGTGGT GCCACCGACC ACCGCACCAA CGACAACTGT TCCGACGACG
GCACCGACCA CACCACCAAC AACAGTACCA ACGACTGTGC CGACGAGGAT TCCGACGACG
ATTCCCACGA CGATACCGGT GACCCCGACG CCAACCGCGA CGGTGCCGCC GCTTGTAGCG
AATTTCAGCG CGAATGTGAC GACCGGCCAG ACTCCGCTCG CCGTGCAATT CACTGATGCG
ACCTCTGGTT CTGTCCAACA GTACTTCTGG CAGTTCGGCG ACGGCGGGGC ATCGTTCGAT
AAGAACCCGG TCCACACCTA CTCGGCAGCC GGCACCTACA CTGTCTCCCT CGTCGCCATC
GGTTCTACCG GGGCTGATGT GAAGACGATC CCACAGTACA TCACCATCAC CACACCGGGA
ACACCGACCG TCTCACCGAC TGCGACGGCG CCGACCCAAA CCGTAACGAC AACGGTGACA
GCGACAACGC CAGCTCCAAC CACTACAACA ACCGTCACCC CAACCGTGAC GTTGACGACC
GCAACGACGA CCGTGACCCC GCTGCCGACC TCGTCCGGGC ATGACCTGCC GCTCGCGAAC
TTCGTGGTCA CCTACCAGGC CGGCTCGGGC TCGATGGGCA TCCAGGTCAC CGACGCCTCG
ACGAACGCCA CCACCTTGAA GTACGACCTC GGCGACGGTA CGACTACCGC CTATAAGAAC
TTCAAGTATA CCTACTGGCA GCCCGGCACC TATACGATCA CCCTGATCGC AACCAACGAC
GCCGGGTCTT CGACGAAGAC CGTCGCGGTG ACGGTGCCGG CCGGTTCGCC GACGATCACG
ACCATTGTAC CAACCGTGAC AACGCCGGCT CCAACGATGA CTGTCTCGCC GACAACGACG
GTGACCTTAC CAGTTACGCC GACCGTGACG TCGACCGTAA CCCCGACACC AACACCAACC
AATCCGAATC TGCCGGTGGC GAACTTCACG GTCACCTTCC CGGGCGGCCC GGGTTCGATG
GGCATTCAGG TCATCAACAC TGCGGTGAAC GCGACTTCAG TCCATTATAA CCTCGGCGAC
GGGGCGACCA CCGCCTATCC GAACTTCACC TACACCTACT GGCAACCCGG CACCTACACG
ATCAACCAGA CCGCGACGAA CGCGGCCGGG TCTACCAACA AAACTCTGGT CGTGACCATA
CCCGCGGTAC TGACTCCGAC CACGACAACA ACCCCGGCCC CAACCACGAT CTCACCGACG
GTGACCGTCA CCGGTTCAGC CTACAACGGC CCGCACACGA TCCCAGGAAC ATTGCAGGCC
GAGGATTACG ACCTCGGCGG TGAGGGTGTC GCCTACCACG ACACCACTGC CGGCAATGAA
GGTGGCGTCT ACCGGCACGA CGACGTCGAT ATCGAACAGC TCGACACCGA CGGGTCGCCG
AACGTCGGCT GGATCCGTTC CGGCGAGTGG CTTGGGTACA CCGTGAACGT CAGCACGGCC
GGCACCTACA CGGCCGGGTT CCGTGTTGCT TCCTCCCACT CCGGTTCATC GATCCAGGTC
TATGTCGACG ACGGTACGAC CCCGGTCGCG ACAGTGAGCG TCCCGAACAC CGGTGACTGG
CCCGCCTTCC GGACCGTCTC GGTGCCGGTG ACCCTGCCGG CAGGGCAGCA CCGGCTGAGA
CTTGCGTTCC CGACCGACTA CGTCAACATC AACTGGATCA GTTTCGCCTG A
 
Protein sequence
MQIEAFIGSS PNFQYTNTKS DLVLMIVTTE KMRWVLLMIV LVAICCTPVS ASSLFVGVAP 
ASTPVPSTNS NLVSELSVSG YSAQYQQNVS WPSENFLLKG LAEDAPNGTL FVTDRSTTSV
TILNATTMAV QGSPIPLSGI SIDYPLNSVY DPEHSRLYIY DNGNQDLVGF AWDSTNQTLN
YDGMTIVLPE RCSGLALDSV ANILYVLPED PDPDIGSIHR YSTVTGASLD DISLGVWGLS
QISSLAVDSE NHCLYMKAYY SLLDSECMFS WNTSTSNFNT GIASVSDSDT IMALAVDPAS
HYLIAATKSQ NLGNNLYYPM IEFYDRSMNQ MDSALACPSD FPGVSRRGNL PDDYYLAVLP
ETLAPPVANF TADVTNGTAP LEVEFTDQST GSPDSWHWEF GDGTTSTDLG FADHTYTEAG
NYTVNLTVAN PGGSNSTTKT NYITVHAPIF APVVNFTANV TNGTSPLTVG FTDLSTNSPT
DWLWEFGDGT TSTMQNPVHT FTDIGNYTVN LTAANTGGNN TSIRTDYITV TQVPVPVANF
SANVTNGTSP LSVGFTDLST NSPTDWHWDF GDGTTSTMQN PVHTFSDIGT YTVNLTATNA
GGNATQTKTD YITVTQVPVP VANFSANVTN GTSPLSVGFT DLSTNSPTSW LWDFGDGTTS
TMQNPVHAYM AAGNYSVNLT ATNAGGNASQ NKTDYITVHA PVFAPVANFT ANVTNGTAPL
SVGFTDLSIG SPATRSWNFG DDNTSTEQNP VHTYTAAGNY TVNLTVTNAG GSNTSVKTNY
ITVHAVVPPT TAPTTTVPTT APTTPPTTVP TTVPTRIPTT IPTTIPVTPT PTATVPPLVA
NFSANVTTGQ TPLAVQFTDA TSGSVQQYFW QFGDGGASFD KNPVHTYSAA GTYTVSLVAI
GSTGADVKTI PQYITITTPG TPTVSPTATA PTQTVTTTVT ATTPAPTTTT TVTPTVTLTT
ATTTVTPLPT SSGHDLPLAN FVVTYQAGSG SMGIQVTDAS TNATTLKYDL GDGTTTAYKN
FKYTYWQPGT YTITLIATND AGSSTKTVAV TVPAGSPTIT TIVPTVTTPA PTMTVSPTTT
VTLPVTPTVT STVTPTPTPT NPNLPVANFT VTFPGGPGSM GIQVINTAVN ATSVHYNLGD
GATTAYPNFT YTYWQPGTYT INQTATNAAG STNKTLVVTI PAVLTPTTTT TPAPTTISPT
VTVTGSAYNG PHTIPGTLQA EDYDLGGEGV AYHDTTAGNE GGVYRHDDVD IEQLDTDGSP
NVGWIRSGEW LGYTVNVSTA GTYTAGFRVA SSHSGSSIQV YVDDGTTPVA TVSVPNTGDW
PAFRTVSVPV TLPAGQHRLR LAFPTDYVNI NWISFA
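
Both sequences above are printed in space-separated blocks of ten residues, six blocks per line. The following is a minimal sketch of how such a block could be collapsed back into a contiguous string and how a GC fraction could be computed from the result. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any tool referenced by this record.

```python
def clean_sequence(block: str) -> str:
    """Strip all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGCAAATCG AAGCGTTTAT TGGTTCTTCC "
        "CCCAATTTTC AATATACCAA TACAAAATCC"
    )
    cleaned = clean_sequence(first_line)
    print(len(cleaned))          # number of bases on that line
    print(gc_fraction(cleaned))  # GC fraction of that line only
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field.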