Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2033 |
Symbol | |
ID | 7272014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2154447 |
End bp | 2157548 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643570645 |
Product | hypothetical protein |
Protein accession | YP_002467055 |
Protein GI | 219852623 |
COG category | [S] Function unknown |
COG ID | [COG4983] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.923118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.210395 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCCG GCTCTGAGAT CCGGCGGGGT CTTGAGATCC TGCTCGCCCC CGGCCAGGTC TTCGAGGTCC GTTCTTGGAC CGGCGACCGG ATATCCTCCG GCTACTTCGA CGACCTGACC GCCGCCGAAA AGGCGATCGA GGCACTGGAC GCCGCCGGCC CTGATGGGAT CTACCTGACC CCGAATCCGG TCCTGCCCGA CCTGCTGAGC AGGCGAGCGA ATCGGATTAA GGGGCCGCTC GCAAAGAAGG ACAGCAGCAC CAGCGACGCC GACATCATCG GCCGGCGCTG GCTGCTCATC GACATCGACC CGGCCCGGCC ATCTGGGGTC TCTTCATCCG AGGAAGAGCA CGCCGCCGCC CTGACACGAG CCGCAGACAT CGCCGCAGCC CTGGGTGAAC GGGGATGGCC GGCCCCGGTG GCGGGCGACT CCGGCAATGG TGCTCACCTC CTCTATCGGC TGGACCTCCC GAACGACGAT CAGGTAACAG CACTGGTTAA GGCCGCCCTG GTGGCCCTCG ATGCGCTCTT CTCCGACCCG TTCGCCTCAG TGGACACGGC CGTCTTCAAT GCCTCCCGTA TCTGGAAGGT ATACGGGACC GTGGCCCGAA AGGGTGACAA TACCACGAGC AGACCGCATC GACGGAGCCG GCTGATCTCG GTGCCGAAGA CGATCGAGAT CGTGACCCGC GAGCAACTCG CCGCCCTGGC GATCACCGAC CCCGGAGGGG ACACCGTGGC GCCGGCTCCT TCAACGAAGG GCACCGGCCG CAAGGCTGGC GAGAAGATCG ACCTGGCCGG CTGGCTCCGG GACCACGGCC TCGGCTTCTC CGATCGCCGG CCCTACCAGG GCGGGGACCT GTACCGGCTC GACACCTGTC CGTTCTCTTC GGCTCACACC GATGGTGCGT TCGCGATCCA GTTCGCGAGC GGGGCCGTCT ATGCCGGCTG CCATCATGCC TCCTGTGGCG GCGGGTCGCA GCGGTGGCCT GAACTGCGGG AGATGTTCGA GGGACCTAAG CGGATCCCGG CCGGCCTGGC GAACCATGAC GAGAAGGAAG TGGAGTGGCG GAAGAAGAAA GCCGAAGCAA GGGAAGCGGC GGCCGGCCGG GATGACTCCG AACCGGTTGA AGCCGACGCA GCTGATGAAG TGATCCTGGC AGAGGCCCGG CAGATCCTTG AGCACGGCGA CCCGCTCGCC TACATGGTCG ACGCGTTCAA CCTGGAGCAC GTCGGCGACA GGGATTTAGC GAAATGTCTG GTATTGTCTC TCGCATCCAG GTTAATCGCA AACTCAAAGG GTCTGCATGT GATGACATCA GGGGACAGTG GAAAGGGGAA GAGCTGCGGA TACCAAACAA TGTTGGAACA AGTGCCGGCC GCGTTCAAGC TCGAAGGATC GTTTTCTGAT AAATCTCTCT ACTACAGCAA CGATCTCCGC CCACGGACCG TGTTTTTCAT CGACGACAAA GATCTTTCAG AATCCATGCA AGAGGTATTG AAGGAATCGA CTTCCTCGTT TCAAAAACCG ATCACTCACC GCACGCTCTC CATCGATCGA AAACTCCAGG TTTGCACAAT CCCACAGAAT TGTGTCTGGT GGATCGCGAA GGTGGAAGGG ACAGGAGACG ACCAGGTTCT CAACCGGGTA TTGATGGTCC ACGTCAACAA CTCAGCCGAA CAGGACCAGG CCGTACAGCA GGCCGCCATA GAACGAGAAT CACGGGACGA CCCACCAGGC GAACCACACC AGGTCAAGGC ATGCCGGGCC ATGTGGATCC ACCTGGACGC CGCTGCTCAA AACCCAGTGG TCGTATCTCT CTCCAGGTTC GCCAGACGGA TTGTCTTTTC ATCTGCCCGG GATCGGCGGA ATACCGAAAT TTTCTTCGAT CTGATCCGGT CCGTAACCCT GATCCGATTC TTTCAACGGG ATCGGGAAGA ACTGCCAGAC GGAACGATCA GAGTATACGC CACCCCGGAC GACTTCAGAA CCGCGAACGA CGTTTTCAAC GCAATTCAAA AAGACTCAGG ATCGCAGGGT GCGAAACTCA ACCCACGCGA GGCGGGATTG TTAGACGTGA TCGACCGGAC CGGTCGAGAT GAATTTACCG TACAGGAACT CGCAAGACTC ACAAACCTTC CAGACTACAC AATCAGGCGA GCGTTTACAG GAATCCCTGG AAGGACAAGC GCGGGATTGT TGGAAAAATG CCCGGCCTGC GGGATCCTCG ATCGTTCTGT CAGCTCGCCG GACGACACAC AGGGATCAGC CCGACGCAAT CAAACCGTGT TCACGTTCGA CCGTGGGATC TATGCCATAT GGGATCGTGG GTGTCGTGTT CACTTGAAAC CGGCCGATGG GGAAGATGAC GTTACACCGT TACACAGACA TTACACAAAC GTTACACAAC CCTGTGTAAA GGTAACGCCG GCACCAGAGG CGGCGGATAC ACAAAACAAC TCTCTCCTCT CTAGTGTACA TAAGAAAGAA GAGAACGTTA CACAAAAACA GGAGAGTACC CACCCCAGTA ACAATAATCG AGAGATCACA CCTGGAGGTG TATGTGCTCC ACCAGAATGT GTAAAGGTGT CAGGTCAAAA TGTTGAAAGC ACACCTGTAC CCCCAAATAA CAAAACGAGT GACGGAAACG CCGAATCAAA CCGTACACAA ACAGGTGTAC GGTTTGTGTC AAGCAGTGTA AATGTGCAAA ACGCCCCACC CGCCCTTGAT CTCGCCCAGG TCCGCGCCGG CGATTACTCC CCGATCGCCG ATGGTGCGAT CGCCGAGACC TGCCCGATCT GCTCCGGCCG GCTGGTGCAC TATCAGGAAA AATTCACGGC GATGAAGAGC CGGGGCGGGG ACCACCCGCG GCGGCTCTGC CGGTCCTGTT ACACCCAGGC GAAGGAGCGA GAGCAGACCG CGGTCCAGGC CCTCCCCGGC GCCATCCCGA TCGACGAGGT GAAGCCGATC GCCTCCGGCC TGCTCGGTCG GTGCACGGTC TGCGGGCTGC AGGTCGCGAC CTACTCCCAT GTCGACAGCG GGACCGCGCT CTGCTCGGTC TGTTATGAGA AACTGGTGAG GGAGCAGGTG GAGATCCGGT AG
|
Protein sequence | MDAGSEIRRG LEILLAPGQV FEVRSWTGDR ISSGYFDDLT AAEKAIEALD AAGPDGIYLT PNPVLPDLLS RRANRIKGPL AKKDSSTSDA DIIGRRWLLI DIDPARPSGV SSSEEEHAAA LTRAADIAAA LGERGWPAPV AGDSGNGAHL LYRLDLPNDD QVTALVKAAL VALDALFSDP FASVDTAVFN ASRIWKVYGT VARKGDNTTS RPHRRSRLIS VPKTIEIVTR EQLAALAITD PGGDTVAPAP STKGTGRKAG EKIDLAGWLR DHGLGFSDRR PYQGGDLYRL DTCPFSSAHT DGAFAIQFAS GAVYAGCHHA SCGGGSQRWP ELREMFEGPK RIPAGLANHD EKEVEWRKKK AEAREAAAGR DDSEPVEADA ADEVILAEAR QILEHGDPLA YMVDAFNLEH VGDRDLAKCL VLSLASRLIA NSKGLHVMTS GDSGKGKSCG YQTMLEQVPA AFKLEGSFSD KSLYYSNDLR PRTVFFIDDK DLSESMQEVL KESTSSFQKP ITHRTLSIDR KLQVCTIPQN CVWWIAKVEG TGDDQVLNRV LMVHVNNSAE QDQAVQQAAI ERESRDDPPG EPHQVKACRA MWIHLDAAAQ NPVVVSLSRF ARRIVFSSAR DRRNTEIFFD LIRSVTLIRF FQRDREELPD GTIRVYATPD DFRTANDVFN AIQKDSGSQG AKLNPREAGL LDVIDRTGRD EFTVQELARL TNLPDYTIRR AFTGIPGRTS AGLLEKCPAC GILDRSVSSP DDTQGSARRN QTVFTFDRGI YAIWDRGCRV HLKPADGEDD VTPLHRHYTN VTQPCVKVTP APEAADTQNN SLLSSVHKKE ENVTQKQEST HPSNNNREIT PGGVCAPPEC VKVSGQNVES TPVPPNNKTS DGNAESNRTQ TGVRFVSSSV NVQNAPPALD LAQVRAGDYS PIADGAIAET CPICSGRLVH YQEKFTAMKS RGGDHPRRLC RSCYTQAKER EQTAVQALPG AIPIDEVKPI ASGLLGRCTV CGLQVATYSH VDSGTALCSV CYEKLVREQV EIR
|
| |