Gene Mpal_1838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1838 
Symbol 
ID7270384 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1953338 
End bp1956445 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content58% 
IMG OID643570453 
ProductCarbohydrate binding family 6 
Protein accessionYP_002466867 
Protein GI219852435 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3420] Nitrous oxidase accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.533489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.441295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTGGA ATATGCAATT GAGAAACGTG ATGGCGCTTG CCATCCTCCT GGTACTGTCA 
TCCGCACTGC TGGTACTTCC AGCAACCGCG GCAAGTATCC CTGTCAACGG TCCGGTGGTC
ATCACCGAGC CCGGCACCTA TGTGCTCACA CAGGATATCA CCAGCAGCAG CCAGATCGTA
TGTATAGAGA TCAAGGCCTC CAACGTCGTC TTCGACGGTC AGGGCCATCA GATCAGTGGT
GTGAACAATG AGGGATCAGC CGGTATCTTC GTTTCGAAGG ATGCCAGCAC CCCGGTCACC
GGGGTCACCA TCAAGAATGT TCGTCTGAAC AACTGGTTCT ATGGGGTCTA TCTCCTGAAT
GCACAGAACA GTGCAATCCA GGATGTCACC ACGACCGGGA ATGCCAACGC AGGGATGGTG
CTCTACTCTG GAAGCACAGG GAATACCATC TCCGGCAGCA CGCTCACCGG CAACGGACGT
GGTATCATCC TCTCCACCTC CAGTGGTTCG AACACCATCA GCGGCAACAC CCTCACCGGC
AACAGTAATC AGGGTATCTA CATCTTTGAT TCGAACGGCA ACACGGTGAA CGGGAACACC
ATCACCAATA ACACCAATGC AGGGCTGTTC ATCTACAGCG CCTCGGCAAA CTCAGTATAT
AACAACAACT TCAGCAACCT CTACAATGCT CTCTTCGGGG GAACCATTGG CTCCAACTCG
TGGAACACCA ACCAGGCCAC CGGCACCAAC ATCGTGGGCG GTCCTTCGAT CGGCGGTAAC
TTCTGGGGCA ATCCAGATGG ATCCGGGTAC TCACAGACCA CCGCCGACTC CAATGGCGAT
GGGTTCTGCG ATCAGCCCCT CGTGATCACG ACCGGAAACA CCGACAACCT CCCGTTGCAC
ACTCCGTCCG CCGTCACACC GACGGTGACC CCGACGGCCA CCACAACCCC GGGTGTTGAG
TCTCCCTACA AGGATCATAA CCTCCCGGCC CGTGTCGAGG CTGAAGACTA CGACAACGGC
GGTCAGGGTG TTGGTTACTC CGATTCCACT CCCCAGAACC TCGGTAACGC CTACCGCCTG
ACTGAAGGCG TGGATGTCGA AGCCGGAGGA AGCGGGTATG ATGTCGGCTA CATCACCGAC
GGCGAGTACC TGAAGTACAC CATCAATGTT ACGACTGCGG GCACCTACAC CGCGACCTTC
AATGTCGGGT CCTGGGAAGC CGGACGGACG ATCACCGTCA GTGATGATGA TGGAGATATC
GCCGGCACGG TCAACGTCCC GAACACCGGG AGTTCGAGCA CCTTTGTCTC TGTTCCGCTG
ACGCTGAACC TTAACGCAGG CACCCACGTG CTGAAACTGA CCTTCAACGG CAACCACCAG
AACATCGACT ACATCGACTT CAGTACCTCA GTAACACCGA CCACCACCGC CACCACAGTG
CCGACCACCA CGGTCACCGT GACCCCAACC GTGACAACGA CCGTCACCCC GGGTAATGAG
ACTCCTTACA CGCCTCACAA CCTCCCGGCC CGTGTCGAGG CTGAGGACTA CGACAACGGC
GGCGAGGGTG TCGCCTACCA TGACTCGACC GCCCAGAACC TCGGAAACGC CTACCGCCTG
ACTGAGGGTG TGGACGTCGA GGCCGGTGCC ACCGGGTATA ACGTCGGCTA CATCACCGAC
GGCGAGTACC TGAAGTACAC CGTCAATGTC GCGACCGCCG GCACCTACAC CGCGACCTTC
AATGTCGGGT CCTGGGAAGC CGGACGGACG ATCACAATCA GTGATAATGA CGGAGATGCC
GTCGGCACGG TCAACGTCCC GAACACCGGG AATGATCACA CCTACCAGTC GGTCCCAGTG
ACGCTGAACC TCGGTGCAGG CACCCACGTG CTGAAACTGA CCTTCAACGG CAACCACCAG
AACATCGACT ACATTGACTT CAGTACCTCA GTAACACCGA CCACCACCGC CACCACAGTG
CCGACCACCA CGGTCACCGT GACCCCAACC GTGACAACGA CCGTCACCCC GGGGAACGAG
ACTCCATATA AGGCATACAA CCTCCCGGCC CGTGTCGAGG CCGAGGACTA TGACAACGGC
GGCGAAGGTG TCGCCTACCA TGACTCGACT GCCCAGAACC TCGGAAACGC CTACCGCCTG
ACTGAGGGCG TGGACGTCGA AGCCGGTGCC ACCGGGTATA ACGTCGGCTA CATCACCGAC
GGCGAGTACC TGAAGTACAC CGTCAATGTC GCTACTGCGG GCACCTACAC TGCTAACTTC
AATGTCGGGT CCTGGGAAGC CGGCCGGACA ATTGCGGTCA GTGTCGATGA CACGGCTGTG
GGCACCGTCA ATGTCCCGAA CACCGGGAAT GATCATACCT ACCAGTCGGT CCCACTGACG
CTGAACCTCG GTGCAGGCAC GCACGTGCTG AAGCTCACCT TTGGTGGTAA CCACCAGAAC
ATCGACTACG TCGACTTCGG AACAGCGGCG GCTCCGACCA ACACCGTTGT TCCGATCACC
ACCATAACGG TGACGCCCAC AACGACCACC ACCCCGTCCC AGACTGTCGG GGCCTACAAG
CCTCACAGCC TCCCGGTCCG CATCGAAGCC GAGGACTATG ACAACGGCGG TGCAGGTGCT
GCGTACTATG ATACGACAGC AGGCAACCTT GGAAAGGCCT ACCGTCTGGA TCAGGACGTC
GATATCGAGG CTGGTGCCTC AGGATATGAT GTCGGCTACG TCGCCGATGG CGAATGGCTG
ACCTATACCG TTGATATTCC GTCAGCTGGT TGGTACACGG CCTTCTTCAA TGTGGCCAGC
TGGGCGGACG GACGATCGAT CACCGTTAGT GTCGACAACA CTCCAGTTGG CACGGTGCAG
GTTCCGAACA CCGGCGACTC TACCATCTTT GTAGATGTCC CGATGAACCT GAATCTCCCG
GCAGGTTCGC ATGTGCTGAA ACTGTCCTTC ACCGGAAGCA AGCAGAACAT CGATTACATC
GACTTCCCCT CAGGTCCGCA TGCCGAGATG GCTCTGACCA CCACACCAAC AGTGGTCAAG
ACAACCTCTG CAACAGCGGT GAAGAACAAC ACCACCGCGT CTGAGTGA
 
Protein sequence
MNWNMQLRNV MALAILLVLS SALLVLPATA ASIPVNGPVV ITEPGTYVLT QDITSSSQIV 
CIEIKASNVV FDGQGHQISG VNNEGSAGIF VSKDASTPVT GVTIKNVRLN NWFYGVYLLN
AQNSAIQDVT TTGNANAGMV LYSGSTGNTI SGSTLTGNGR GIILSTSSGS NTISGNTLTG
NSNQGIYIFD SNGNTVNGNT ITNNTNAGLF IYSASANSVY NNNFSNLYNA LFGGTIGSNS
WNTNQATGTN IVGGPSIGGN FWGNPDGSGY SQTTADSNGD GFCDQPLVIT TGNTDNLPLH
TPSAVTPTVT PTATTTPGVE SPYKDHNLPA RVEAEDYDNG GQGVGYSDST PQNLGNAYRL
TEGVDVEAGG SGYDVGYITD GEYLKYTINV TTAGTYTATF NVGSWEAGRT ITVSDDDGDI
AGTVNVPNTG SSSTFVSVPL TLNLNAGTHV LKLTFNGNHQ NIDYIDFSTS VTPTTTATTV
PTTTVTVTPT VTTTVTPGNE TPYTPHNLPA RVEAEDYDNG GEGVAYHDST AQNLGNAYRL
TEGVDVEAGA TGYNVGYITD GEYLKYTVNV ATAGTYTATF NVGSWEAGRT ITISDNDGDA
VGTVNVPNTG NDHTYQSVPV TLNLGAGTHV LKLTFNGNHQ NIDYIDFSTS VTPTTTATTV
PTTTVTVTPT VTTTVTPGNE TPYKAYNLPA RVEAEDYDNG GEGVAYHDST AQNLGNAYRL
TEGVDVEAGA TGYNVGYITD GEYLKYTVNV ATAGTYTANF NVGSWEAGRT IAVSVDDTAV
GTVNVPNTGN DHTYQSVPLT LNLGAGTHVL KLTFGGNHQN IDYVDFGTAA APTNTVVPIT
TITVTPTTTT TPSQTVGAYK PHSLPVRIEA EDYDNGGAGA AYYDTTAGNL GKAYRLDQDV
DIEAGASGYD VGYVADGEWL TYTVDIPSAG WYTAFFNVAS WADGRSITVS VDNTPVGTVQ
VPNTGDSTIF VDVPMNLNLP AGSHVLKLSF TGSKQNIDYI DFPSGPHAEM ALTTTPTVVK
TTSATAVKNN TTASE