Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1838 |
Symbol | |
ID | 7270384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1953338 |
End bp | 1956445 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643570453 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002466867 |
Protein GI | 219852435 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3420] Nitrous oxidase accessory protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.533489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.441295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTGGA ATATGCAATT GAGAAACGTG ATGGCGCTTG CCATCCTCCT GGTACTGTCA TCCGCACTGC TGGTACTTCC AGCAACCGCG GCAAGTATCC CTGTCAACGG TCCGGTGGTC ATCACCGAGC CCGGCACCTA TGTGCTCACA CAGGATATCA CCAGCAGCAG CCAGATCGTA TGTATAGAGA TCAAGGCCTC CAACGTCGTC TTCGACGGTC AGGGCCATCA GATCAGTGGT GTGAACAATG AGGGATCAGC CGGTATCTTC GTTTCGAAGG ATGCCAGCAC CCCGGTCACC GGGGTCACCA TCAAGAATGT TCGTCTGAAC AACTGGTTCT ATGGGGTCTA TCTCCTGAAT GCACAGAACA GTGCAATCCA GGATGTCACC ACGACCGGGA ATGCCAACGC AGGGATGGTG CTCTACTCTG GAAGCACAGG GAATACCATC TCCGGCAGCA CGCTCACCGG CAACGGACGT GGTATCATCC TCTCCACCTC CAGTGGTTCG AACACCATCA GCGGCAACAC CCTCACCGGC AACAGTAATC AGGGTATCTA CATCTTTGAT TCGAACGGCA ACACGGTGAA CGGGAACACC ATCACCAATA ACACCAATGC AGGGCTGTTC ATCTACAGCG CCTCGGCAAA CTCAGTATAT AACAACAACT TCAGCAACCT CTACAATGCT CTCTTCGGGG GAACCATTGG CTCCAACTCG TGGAACACCA ACCAGGCCAC CGGCACCAAC ATCGTGGGCG GTCCTTCGAT CGGCGGTAAC TTCTGGGGCA ATCCAGATGG ATCCGGGTAC TCACAGACCA CCGCCGACTC CAATGGCGAT GGGTTCTGCG ATCAGCCCCT CGTGATCACG ACCGGAAACA CCGACAACCT CCCGTTGCAC ACTCCGTCCG CCGTCACACC GACGGTGACC CCGACGGCCA CCACAACCCC GGGTGTTGAG TCTCCCTACA AGGATCATAA CCTCCCGGCC CGTGTCGAGG CTGAAGACTA CGACAACGGC GGTCAGGGTG TTGGTTACTC CGATTCCACT CCCCAGAACC TCGGTAACGC CTACCGCCTG ACTGAAGGCG TGGATGTCGA AGCCGGAGGA AGCGGGTATG ATGTCGGCTA CATCACCGAC GGCGAGTACC TGAAGTACAC CATCAATGTT ACGACTGCGG GCACCTACAC CGCGACCTTC AATGTCGGGT CCTGGGAAGC CGGACGGACG ATCACCGTCA GTGATGATGA TGGAGATATC GCCGGCACGG TCAACGTCCC GAACACCGGG AGTTCGAGCA CCTTTGTCTC TGTTCCGCTG ACGCTGAACC TTAACGCAGG CACCCACGTG CTGAAACTGA CCTTCAACGG CAACCACCAG AACATCGACT ACATCGACTT CAGTACCTCA GTAACACCGA CCACCACCGC CACCACAGTG CCGACCACCA CGGTCACCGT GACCCCAACC GTGACAACGA CCGTCACCCC GGGTAATGAG ACTCCTTACA CGCCTCACAA CCTCCCGGCC CGTGTCGAGG CTGAGGACTA CGACAACGGC GGCGAGGGTG TCGCCTACCA TGACTCGACC GCCCAGAACC TCGGAAACGC CTACCGCCTG ACTGAGGGTG TGGACGTCGA GGCCGGTGCC ACCGGGTATA ACGTCGGCTA CATCACCGAC GGCGAGTACC TGAAGTACAC CGTCAATGTC GCGACCGCCG GCACCTACAC CGCGACCTTC AATGTCGGGT CCTGGGAAGC CGGACGGACG ATCACAATCA GTGATAATGA CGGAGATGCC GTCGGCACGG TCAACGTCCC GAACACCGGG AATGATCACA CCTACCAGTC GGTCCCAGTG ACGCTGAACC TCGGTGCAGG CACCCACGTG CTGAAACTGA CCTTCAACGG CAACCACCAG AACATCGACT ACATTGACTT CAGTACCTCA GTAACACCGA CCACCACCGC CACCACAGTG CCGACCACCA CGGTCACCGT GACCCCAACC GTGACAACGA CCGTCACCCC GGGGAACGAG ACTCCATATA AGGCATACAA CCTCCCGGCC CGTGTCGAGG CCGAGGACTA TGACAACGGC GGCGAAGGTG TCGCCTACCA TGACTCGACT GCCCAGAACC TCGGAAACGC CTACCGCCTG ACTGAGGGCG TGGACGTCGA AGCCGGTGCC ACCGGGTATA ACGTCGGCTA CATCACCGAC GGCGAGTACC TGAAGTACAC CGTCAATGTC GCTACTGCGG GCACCTACAC TGCTAACTTC AATGTCGGGT CCTGGGAAGC CGGCCGGACA ATTGCGGTCA GTGTCGATGA CACGGCTGTG GGCACCGTCA ATGTCCCGAA CACCGGGAAT GATCATACCT ACCAGTCGGT CCCACTGACG CTGAACCTCG GTGCAGGCAC GCACGTGCTG AAGCTCACCT TTGGTGGTAA CCACCAGAAC ATCGACTACG TCGACTTCGG AACAGCGGCG GCTCCGACCA ACACCGTTGT TCCGATCACC ACCATAACGG TGACGCCCAC AACGACCACC ACCCCGTCCC AGACTGTCGG GGCCTACAAG CCTCACAGCC TCCCGGTCCG CATCGAAGCC GAGGACTATG ACAACGGCGG TGCAGGTGCT GCGTACTATG ATACGACAGC AGGCAACCTT GGAAAGGCCT ACCGTCTGGA TCAGGACGTC GATATCGAGG CTGGTGCCTC AGGATATGAT GTCGGCTACG TCGCCGATGG CGAATGGCTG ACCTATACCG TTGATATTCC GTCAGCTGGT TGGTACACGG CCTTCTTCAA TGTGGCCAGC TGGGCGGACG GACGATCGAT CACCGTTAGT GTCGACAACA CTCCAGTTGG CACGGTGCAG GTTCCGAACA CCGGCGACTC TACCATCTTT GTAGATGTCC CGATGAACCT GAATCTCCCG GCAGGTTCGC ATGTGCTGAA ACTGTCCTTC ACCGGAAGCA AGCAGAACAT CGATTACATC GACTTCCCCT CAGGTCCGCA TGCCGAGATG GCTCTGACCA CCACACCAAC AGTGGTCAAG ACAACCTCTG CAACAGCGGT GAAGAACAAC ACCACCGCGT CTGAGTGA
|
Protein sequence | MNWNMQLRNV MALAILLVLS SALLVLPATA ASIPVNGPVV ITEPGTYVLT QDITSSSQIV CIEIKASNVV FDGQGHQISG VNNEGSAGIF VSKDASTPVT GVTIKNVRLN NWFYGVYLLN AQNSAIQDVT TTGNANAGMV LYSGSTGNTI SGSTLTGNGR GIILSTSSGS NTISGNTLTG NSNQGIYIFD SNGNTVNGNT ITNNTNAGLF IYSASANSVY NNNFSNLYNA LFGGTIGSNS WNTNQATGTN IVGGPSIGGN FWGNPDGSGY SQTTADSNGD GFCDQPLVIT TGNTDNLPLH TPSAVTPTVT PTATTTPGVE SPYKDHNLPA RVEAEDYDNG GQGVGYSDST PQNLGNAYRL TEGVDVEAGG SGYDVGYITD GEYLKYTINV TTAGTYTATF NVGSWEAGRT ITVSDDDGDI AGTVNVPNTG SSSTFVSVPL TLNLNAGTHV LKLTFNGNHQ NIDYIDFSTS VTPTTTATTV PTTTVTVTPT VTTTVTPGNE TPYTPHNLPA RVEAEDYDNG GEGVAYHDST AQNLGNAYRL TEGVDVEAGA TGYNVGYITD GEYLKYTVNV ATAGTYTATF NVGSWEAGRT ITISDNDGDA VGTVNVPNTG NDHTYQSVPV TLNLGAGTHV LKLTFNGNHQ NIDYIDFSTS VTPTTTATTV PTTTVTVTPT VTTTVTPGNE TPYKAYNLPA RVEAEDYDNG GEGVAYHDST AQNLGNAYRL TEGVDVEAGA TGYNVGYITD GEYLKYTVNV ATAGTYTANF NVGSWEAGRT IAVSVDDTAV GTVNVPNTGN DHTYQSVPLT LNLGAGTHVL KLTFGGNHQN IDYVDFGTAA APTNTVVPIT TITVTPTTTT TPSQTVGAYK PHSLPVRIEA EDYDNGGAGA AYYDTTAGNL GKAYRLDQDV DIEAGASGYD VGYVADGEWL TYTVDIPSAG WYTAFFNVAS WADGRSITVS VDNTPVGTVQ VPNTGDSTIF VDVPMNLNLP AGSHVLKLSF TGSKQNIDYI DFPSGPHAEM ALTTTPTVVK TTSATAVKNN TTASE
|
| |