Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1383 |
Symbol | |
ID | 7269988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1423040 |
End bp | 1426183 |
Gene Length | 3144 bp |
Protein Length | 1047 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643570014 |
Product | hypothetical protein |
Protein accession | YP_002466436 |
Protein GI | 219852004 |
COG category | [S] Function unknown |
COG ID | [COG4983] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.161884 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.438502 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCCG GCTCTGATAT CCGGCAGGGT CTCGAGGTGC TGCTCGCTCC GGGCCAGGTC TTCGAGGTCC GGAGCTGGAC GGGCGATCGC ATCGCCTCCG GCTACTTCGA CGACCTCGAC GTCGCCGCGA AGGCGATCGA GGCGCTCGAC GCGGCGAACC CCGATGGCAT TTACCTGACC CCGAACCCGG TATTACCGGA CCTCCTGGCG AGGAGGGCGA ACCGGATCAA GGGACCGCTC GCCAAGAAGG ACTCGTCGAC GAGCGATGGC GACATCCTGA GCCGGCGCTG GTTCCTGATC GATATCGACC CGGCCCGGCC TTCGGGGGTC TCCTCCTCCG ACAAGGAGCA CCAGGCGGCG CTGGACCGAG CGGCCGGGAT TGCGGCGGCC CTGGGTGAGA TGGGCTGGCC GTCCCCGGTC GCTGGCGACT CCGGCAACGG TGCTCACCTC CTGTACCGGG TGGACCTGCC CAACGATGAC AAGGTCACGG CTTTAATCAA AGCCGCTCTT GTCGCACTCG ACGGCCTCTT CTCTGATGAG AAGGCCTCGG TCGATACGGC CGTGTTCAAT GCCTCGCGGA TCTGGAAGGT CTACGGGACC GTTGCGAGAA AGGGTGATTC GACCGGGGCC AGGCCGCACC GACGGAGCCG GCTCCTCTCG GTGCCGGACT CGATCACGAT CGTGACGCGG GAGCAGCTCG CAGCCCTGGC GATCACGGAT CCTGGTGTTG TCAATGCAGC AGCGCCGGCT CCCTCGAACA GGGGGACCGG GCGCAAGCTG GGGGAGACAC TCAACCTTGC CGGCTGGTTC CACGACCACG GCCTCGGCTT CTCCGATCGC CGGCCGTACC AGGGCGGCGA CCTCTACCGG CTCGACGCCT GTCCGTTCTC CTCGGCTCAC ACAGATGGAG CCTTCGCGAT CCAGTTCGGG AGCGGGGCGG TCTATGCCAG CTGCCACCAC GCCTCGTGCG GCGGCGGTTC GCAGCGGTGG CCGGAGCTCC GGGAGATGTA CGAGAGGCCA AAACAGACGC CGGCGGCTCT GCAGAACCGC GACGAGAAGG AGGCGGCATG GCGGAAGGAG AAAGCCATCG CGAAAGAAGA GGCAGCCGGC CGGGATGGTG CAGCTCCGAC GCCGGTGGAG GCCGACGCAG CAGATGAAGC GGTCCTGGCC GAAGCCCGGC AGATCCTCGA AACCGGCGAC CCGCTCGCCT ACATGGTCGA CGCGTTCAAC CTGGACCACG TCGGGGATCA GGTCCTCGGT CAATGTCTCG CGTTGTCGCT CGCTTCGCGG CTAATCCAGA ACTCACAAGG ACTTCATGTC ATGGTAACAG GGGAGAGCGG GAAGGGTAAG AGCCACGGAT ACCGGGCGAT GCTCCGACAG GTACCGGACG CCTATAAGAT CAAAGGCTCC TTTTCGGATA AGTCCCTCTT CTACATGGAC GACCTCAGGC CTCAGTCCGT AATTCTGGTC GATGACAAGG ACCTGAGCGA CGGCATACAG GAGACACTGA AGGAGGCGAC CTCCGACTTT CGAAAGCCGA TCGTCCATCG GACCCTGACC ACCGAGCGGA AGTATATCGA ATACACGATC CCGGCCCGGT GCCTCTGGTG GATCGCAAAG ATGGAAGGGA CCGGAGACGA GCAGGTTCAG AACCGGGTCT TGATGGTCTG GGTCGACGAG TCAGCGGAAC AGGATCGCGC TGTCCTCGTT CGGAAACTGG CGCAGGAGTC ACGGGACGAA TCAGGGCCGG CCGGCGAGCC GCGACAGGTC ACGGTCTGCC GGGCGATGTG GGAGATTCTT CAGGGTCTCG GCCTTGTCGA GGTGAACCTC TCCCGGTTCG CTCATAGGAT CTGTTTCTCA TCCGCCCGGA ACCGGAGGAA TCCCGACATG CTCGCGGACC TGATCAGGTC CGCGGCCCTC CTCCGCTTCT TTCAGCGGGA CCGGCGCACC CTCGACGACC AGACGATCCG TCTCTATGCA ACGCCGGAGG ACTTCAAGAC GGCCGCCGAC ATCTTCACGG CCCTGAACGG TGAGGCCGGC AGCCAGGACG CGAAACTGAC CAAACGAGAA TCCCAGATCC TCGACATCAT CGACCGGGCT GGCCTTACGG AATTCACGCT ACAAACGATC GTTAAACTGA CACAGATCCC ATATCAACCA ATCCGCAGAA CCTTCGCGGG TTACTTCTCC CACGGGGCGA AGTATTCAGG GTTGCTGGAG AAGTGCCCGG CGTTATCGAC CCACGACCAG ACCACCTCAT CGGCCGGCCA GGATGGTGGT TCGTCGGTCG GCCGGAAAGA AACCGCCTTC ACCTTCAACC GGGAGATCTA TTCCTCCTGG TCGAAGGGTA GCCTTGTATG GCTCCGGCCG GACGATCACG ACGGCGGCGA TGCGGCTGAT CCGGAACCTA ATTCTTTTCA TCAACTTCAG CATTTATTCA GCATTTCTTC AGCAGTTGAT GAAAAAAACT CGCCGACAGA TGCACCGGCA GATTGCGGAA AAGAAGAAGG ATCTTCTATT AAAAAAAATA AGAAAGAATT ATTTTCATCA AATGAAATAA CGAAAGAGGT CTCGCTCGAT CAGAGAGCCG GTCCGGTGCC CCCCTTACGT TATTCAGCGT TTGCTGAAAA CAATTATGGA AAGAAATCAT ATGGCGCGGA ATCGACGAAA CACGAGCACA CTGGAGGCGG TTCGTTTTCA TCAACTGCTG AAAAAAAAGG AAATGCTACT GAAGTTGATG AAAAAAATTC AATCCGCCCA CCGGTGCTGA CTCTCGCCCA GGTCCGCGCC TCCGACTATT CCCAGATCGC CAAAGGTGCG ATCGTCGAGA CCTGCCCGCT CTGCTCCGGC CGGCTGGTGC ACTATCAGGA GAAGTTCACG GCGATGAAGA ACCGGGGCGG GGACCACCCG CGCCGGCTCT GCCGGTCTTG TTACACCCAG GCGAAGGAGC GAGAGCAGGC CGCCGTCCAG GTCCTGCCCG GTGCCATTCC GTTCGATGAG GTCAGGCCGA TCACCGCCGG CCTGCTCGGC CGGTGCTCGG TCTGCGGGCT CCAAGCGGCG ACGTACGACC ACGCCGGCAG CGGGACGGCG ATCTGCTCCA GGTGTTATGA GAAACTGGTG CGGGAGCAGG TGGATGTCCG GTAG
|
Protein sequence | MPPGSDIRQG LEVLLAPGQV FEVRSWTGDR IASGYFDDLD VAAKAIEALD AANPDGIYLT PNPVLPDLLA RRANRIKGPL AKKDSSTSDG DILSRRWFLI DIDPARPSGV SSSDKEHQAA LDRAAGIAAA LGEMGWPSPV AGDSGNGAHL LYRVDLPNDD KVTALIKAAL VALDGLFSDE KASVDTAVFN ASRIWKVYGT VARKGDSTGA RPHRRSRLLS VPDSITIVTR EQLAALAITD PGVVNAAAPA PSNRGTGRKL GETLNLAGWF HDHGLGFSDR RPYQGGDLYR LDACPFSSAH TDGAFAIQFG SGAVYASCHH ASCGGGSQRW PELREMYERP KQTPAALQNR DEKEAAWRKE KAIAKEEAAG RDGAAPTPVE ADAADEAVLA EARQILETGD PLAYMVDAFN LDHVGDQVLG QCLALSLASR LIQNSQGLHV MVTGESGKGK SHGYRAMLRQ VPDAYKIKGS FSDKSLFYMD DLRPQSVILV DDKDLSDGIQ ETLKEATSDF RKPIVHRTLT TERKYIEYTI PARCLWWIAK MEGTGDEQVQ NRVLMVWVDE SAEQDRAVLV RKLAQESRDE SGPAGEPRQV TVCRAMWEIL QGLGLVEVNL SRFAHRICFS SARNRRNPDM LADLIRSAAL LRFFQRDRRT LDDQTIRLYA TPEDFKTAAD IFTALNGEAG SQDAKLTKRE SQILDIIDRA GLTEFTLQTI VKLTQIPYQP IRRTFAGYFS HGAKYSGLLE KCPALSTHDQ TTSSAGQDGG SSVGRKETAF TFNREIYSSW SKGSLVWLRP DDHDGGDAAD PEPNSFHQLQ HLFSISSAVD EKNSPTDAPA DCGKEEGSSI KKNKKELFSS NEITKEVSLD QRAGPVPPLR YSAFAENNYG KKSYGAESTK HEHTGGGSFS STAEKKGNAT EVDEKNSIRP PVLTLAQVRA SDYSQIAKGA IVETCPLCSG RLVHYQEKFT AMKNRGGDHP RRLCRSCYTQ AKEREQAAVQ VLPGAIPFDE VRPITAGLLG RCSVCGLQAA TYDHAGSGTA ICSRCYEKLV REQVDVR
|
| |