Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0370 |
Symbol | |
ID | 7271396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 391483 |
End bp | 394659 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643569018 |
Product | hypothetical protein |
Protein accession | YP_002465470 |
Protein GI | 219851038 |
COG category | [S] Function unknown |
COG ID | [COG4983] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0956297 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCG GCTCTGAGAT CCGGCGGGGT CTCGAGGTGC TGGTCGCTCC CGGTCAGGTC TTCGAGGTCC GGAGCTGGAC GGGCGATCGC ATCGCCTCTG GTTACTTCGA CGACCTCGAT ACCGCCGGGA AGGCGATCGA AGCACTCGAC GCGGCGAACC CAGACGGGAT CTACCTGACC CCGAACCCGG TGCTGCCCGA CCTCCTGGCG AGGCGGGCGA ACCGGATCAA GGGACCGCTC GCCAAGAAGG ACTCCTCGAC GAGCGATGGC GATATCCTGA GCCGGCGCTG GTTCCTGATC GACATCGACC CGGCCCGGCC ATCCGGGGTC TCCTCATCCG ATGAGGAACA TCAGGCAGCG CTGGACCGGG CGACTGAGAT CGCGGCAGCC CTGGGCGAGA GGGGCTGGCC AGCCCCGGTC GCCGGCGATT CCGGCAACGG TGCTCACCTC CTGTACCGGG TGGACCTCCC GAACGACGAG CAGGTGACGG CGCTGATCAG GGCCGCCCTG GTCGGACTCG ACGGTCTCTT CTCCGATGCG AGGGCCTCCG TCGATACAGC CGTCTTCAAT GCGAGCCGGA TCTGGAAAGT CTACGGAACC GTCGCAAGAA AGGGTGACAA CACCAGATCC CGACCACACC GACGGAGCCG GCTGCTGTCG ATGCCGGACC CGATCGCGAT CGTGACACGG GAGCAGCTCG CAGCCCTGGC GATCACCGAT CCAGGAGGAG ACACAGCAGC GCCGGCACCC TCGAACAGGG GGACCAGCCG CCAGGTTGGC GAGAAGATCG ACCTGGCCGG CTGGCTCCGG GACCACGACC TCGGTGTCAT CGACCGCCGG CCCTACCGGG GCGGCGATCT CTACCGACTC GACGCCTGTC CGTTCTCATC AGCCCACACC GACGGGGCGT TTGCGATCCA GTTCGCGAGC GGGGCAATCC ATGCCGGCTG CCATCACGCC TCGTGCGGCG GCGGGTCGCA GCGGTGGCCC GAACTCCGGG AGATGTACGA GAAGCCGAAG GTCTCCGCCA TAACCCCTGA AGAGAAGGGG GCGGCCTACA GAAAGAAGAA GGCTACAGCG AGAGAGGCGG TGGCCGGCCG GGATGACGCA GCACCGGTTG AGACCGATGC AGCAGATGAA GCGGTCCTGG CCGAGGCCCG GCAGATCCTG GAGACCGGCG ACCCGCTCGC GTACATGGTC GACACGTTCA ACCTTGAGCA CGTCGGTGAT CGGGACCTGG CTCACTGCCT GGCTCTCTCT CTTGCGTCCC GCCTGGTCGC AACATCCGAA GGGCTGCACG TCATGACGAC CGGTGAGTCG GGCAAGGGGA AGAGCGACGG CTACCGGGTG ATGCTCAGGC AGCTCCCCGA CGCGTGGAAG ATCAAGGGCT CGTTCTCCGA CAAATCCCTC TATTACATGG GTGAGTCCCT GAAGCCCCAG ACGGTCTTTC TCATCGACGA CAAGGACCTC TCAGACTCCC TTCAGGAAGT GCTCAAGGAA GCCACTACTG ATTTCAGAAA ACCGATCGAA CATCGCACCG TCACGACAGA TAGAAAACCA CAGATTTGTT ACATCCCGGA GCGCTGCATC TGGTGGATTG CCAGGGTCGA AGGAGTCGGC GACGACCAGG TCAAAAACCG GATGCTCATG CCATGGGTCG ACGACTCCGA TGAACAGGAT CGGGCTGTCC TGGCCGGCAT TCTTGAGCGT CTGGCCCGGG ACGAAGACGA ACCGATCGGT GAGCTGCACG AGATCGCAGT CTGTAAGGCA CTGTGGACGA TCCTTCAGAG CGTCGGACTG GTCGACGTCA ACCTCTCCCG GTTTGCGCTC AGGATCCATT TCTCATCTGC CCGGAACCGG CGCAACTCCC GGATGTTGAC TGACATGATC CAGTCCGCAG CCATGCTCAG ATATTTCCAG CGGGACCGCC GGGACCTGCA GGACGGCATA ACCCGGATAT ACGCCACCGA GGACGACTTC AGGACCGCGG CGGCCATCTT CACGGCCCTG CATACGGTAT CTGGGTCGCA GGACGCGAAA CTGACCCGGC GGGAGGATGA TGTTCTGCGG TTGCTCGCGG CGACCGGGGA GACGGAGTTT ACCGTACAGA AGATCATGAA CCTGACGAAA CTCTCGTACG ATTCGGTCCG CAGAATGCTG GTCGGATACG GAGACCGGGG CATCCATCGC CCTGGACTCC TGGAGAAGTG CCCGGCGATT GCAAAGATGG ATACGTCGGT CAGCATACGG GATGAGGACG ACGACAGAAC GGTCCGGACG ATCCATCGGC GCGAGACCAC GTTTTCCTTC GACCTGCAGG TCTACCGGCG CTGGCAGACC GGCGGTCAGG TGGTCTGGCT GGACGAGCCC GGAGACCGGG ACCGGGATCC CGATCCGCGG TCGAACACTT GTCAGCAGCA TCAGCAGCAT ATACGCAGCA TAAACGCAGT TTCAACTGCG ACCATAAAAA CCGACAATCC GGCCCTTTCT GCGGAAACGG ATGATAAATC ATTATGCGTA CAAAGAGAAG GATCGTTAAA CGCAGCAGAA ATGAAGAGCA CGCACCCCGC ACCCTCACCG GAAGGAAGGG TGTGTGCGGG TGTGTGTGTT CCTGAAAAAA ACTGCGACCT AACATCAAAT ACTGGTATCC GCAACCTGAT TCAAAAACAG GGCCGGCAAT CACCTCCAAA AGTCAGCAGC AAAAACTGCA AAGATGCTGC GACCACCTGC AAAGACTGCG GAGATGCTGC GGTCACCAGT ATGGCAGGCG GTGAACCGGC ACGGCCGGCC CCACCACTGA CCCTCGCCCA GGTGAACCCG GGTGACTACT CCCTGATCCC GGGCGGTCCC CTGATCGAGC CGTGCCCGGT CTGCGGTGGC CGGGTGGTCC ACTACACCGA ACGCTACCAG GCCCGGAAGG CCCGGGGTCC AAAGGAGAAG AGCCGGAATA TCTGCCGGGG CTGCTATAAC CGGGCCCGGG CCTGTGAGCA GGCCGCCATC CAGGTCCTGC CGGGTGTGAT CCCGCTGGAC GAGGTTGAGT CGGTCGATGC TGGCCTCTTC GGCCGGTGCT CAGTCTGCGG GCTCCAGGCG GCAACATACA ACCACGCCGG CAGCGGGACC GGGATCTGCT CGGTCTGTTA TGAGAAACTG GTGAGAGAGC AGGTGGATAT TCGATAA
|
Protein sequence | MNTGSEIRRG LEVLVAPGQV FEVRSWTGDR IASGYFDDLD TAGKAIEALD AANPDGIYLT PNPVLPDLLA RRANRIKGPL AKKDSSTSDG DILSRRWFLI DIDPARPSGV SSSDEEHQAA LDRATEIAAA LGERGWPAPV AGDSGNGAHL LYRVDLPNDE QVTALIRAAL VGLDGLFSDA RASVDTAVFN ASRIWKVYGT VARKGDNTRS RPHRRSRLLS MPDPIAIVTR EQLAALAITD PGGDTAAPAP SNRGTSRQVG EKIDLAGWLR DHDLGVIDRR PYRGGDLYRL DACPFSSAHT DGAFAIQFAS GAIHAGCHHA SCGGGSQRWP ELREMYEKPK VSAITPEEKG AAYRKKKATA REAVAGRDDA APVETDAADE AVLAEARQIL ETGDPLAYMV DTFNLEHVGD RDLAHCLALS LASRLVATSE GLHVMTTGES GKGKSDGYRV MLRQLPDAWK IKGSFSDKSL YYMGESLKPQ TVFLIDDKDL SDSLQEVLKE ATTDFRKPIE HRTVTTDRKP QICYIPERCI WWIARVEGVG DDQVKNRMLM PWVDDSDEQD RAVLAGILER LARDEDEPIG ELHEIAVCKA LWTILQSVGL VDVNLSRFAL RIHFSSARNR RNSRMLTDMI QSAAMLRYFQ RDRRDLQDGI TRIYATEDDF RTAAAIFTAL HTVSGSQDAK LTRREDDVLR LLAATGETEF TVQKIMNLTK LSYDSVRRML VGYGDRGIHR PGLLEKCPAI AKMDTSVSIR DEDDDRTVRT IHRRETTFSF DLQVYRRWQT GGQVVWLDEP GDRDRDPDPR SNTCQQHQQH IRSINAVSTA TIKTDNPALS AETDDKSLCV QREGSLNAAE MKSTHPAPSP EGRVCAGVCV PEKNCDLTSN TGIRNLIQKQ GRQSPPKVSS KNCKDAATTC KDCGDAAVTS MAGGEPARPA PPLTLAQVNP GDYSLIPGGP LIEPCPVCGG RVVHYTERYQ ARKARGPKEK SRNICRGCYN RARACEQAAI QVLPGVIPLD EVESVDAGLF GRCSVCGLQA ATYNHAGSGT GICSVCYEKL VREQVDIR
|
| |