Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1317 |
Symbol | |
ID | 7271178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 1351937 |
End bp | 1353886 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643569951 |
Product | protein of unknown function DUF147 |
Protein accession | YP_002466373 |
Protein GI | 219851941 |
COG category | [S] Function unknown |
COG ID | [COG1624] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.521946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAACT GTTTGTATGA CCCAGGGAGA CTGAGACCTC ACTATGCACA GGTTATCATC TGTGCAGTGC TCTGTCTTCT CCTCTCAATG GCAATCCCTT CTCTGGCATC AGCTGCTGAC CCGCTGCTGC CCAATGAGAC AACTCCAACG GTGGTACCGG TAACGCCCAC CACAGTCCCG ACGGTGAGCA CAACGACTAC CACCGCCACA CCGTCGCCGA CCCTGACCAC AATTCCAACG ACAACAATAA CAACAACTAT TCCGACATCG ACCATTCCCA CCACGACCCA GATCCCGACG GTGACGACGA CAGTCTCAAC ACCGATGACG ACGGTCTCAA CACCGACAAC CACGATCCCA ACGCTCACCA CGATAACGAC CACACCGATC ACGACCCTGA CGACGATCTG GACTCAATCG ACCGCCACGA CCACATCCAC CTCACATACC CCGCCGGCCC TCCTGATCAA TCCGCCGAGC ATCGACCAGT TGAAGGTTAC TGTCGACGGG ACCTCGACGC CTGGGGAGTA TGGTCAGACG ATCAGCAGGA TCTCATGGAA CTGGGGTGAT GGTAGGATTG AGGACCAGCC CTTCCCTGCC TCGCACACCT ATGCACGGGC AGGTAAATAT GTGGTTGCGG TCACCAGTTA CCAGAGTGAC GGAACCTACA ACACCCGCAC CCTTGATGTG GAGTTGACGA CCCCAACAAT CACCACGACA GCAGTCACGA CGACCATGCC TCCGCCGGGC GCGCCCCGTC TGATCCTGAA CGCCCCCGAG ATCAACAACA GCACGGTCAT GGTGAATGGT ATTCCGCAGG CGGGAAGCCC GTCGATGTCC ATCCAGAAGT TAATGGTAGA TTGGGGAGAC CAGAAGAAGG ACGAATATGT GGCTTTCCCA TTTCAGCATA CCTACGATAG TCCCGGGCAG TACCAGATCT GGATCACTGG TGTCCAGAGC GATGGACAGT CAACCCTCCA GAACCTGACC GTTGAGATCA GTTCCCAGGT TCCCAGCACC CTCCCAACCT CCGGTTCACC CACGGCCGTC CCCTTCTGGG CCAAGCCCGG AATCATCCCC GGGTTGCTGG TCGTGCTGAT CGCAGCAATA GTTGGTGGTG GGGCGATGCT CTGGCGGCGG CGCGAGCTGG GGTCCGTGGT GATCCCGACC TCAGAACCAC TCGAAGAGGC CGTGGCCGCT TATACCCGGG CCCGTGAGCA TGGTGATCTG GTGCAGGCCC GGAAGAGCGC CCTTGAATGT GCCCAGTTGC TGGTGATCCT TGCTGAGGCA GAACCTCAGT ACAGCTCGGC GTACCTTGAG AAGGCGGACG ACTGGAAGGC GATCGCTCGT TCACTGACCC AGCATCTGGA GGACGAGCCA GAACAGGCAG ACCCAGGCCC GACGCTCTCC CTCGAAAAGA ACGGAGCATG GGGGGACGCC GAGAAGGGGG AGGATGTAAC CCTCGATGAG ATGTCGACCG AGGTTCTTGA GGGAACCGAC ATCGATCCGG TGGTCTTTGA AGCAGTGCTG ACGATTGCCC TTGAGATCGC TCGGGAAGGC AGAGAAGGGC AGTCTGTCGG GACCGCGTTC ATCGTTGGTG ATGCCGAACA GGTGATGAAC GCCTCGACCC AGTTCATCCT GAACCCGTTC AAGGGACATC TGGTCGAGGA ACGGTTGATC ACGGATCCGA ATCTGCATGA GAACATCAAG GAGTTCTCAC TCCTCGATGG TGCTTTTGTC ATCGCGAGCA ACGGTGTGGT CGAGGCGGCA GGTCGGTACA TCACGGTTGA CACCAGTGGT GTCTCCCTGC CGGCAGGTCT CGGTTCGCGC CATGCTTCGG CAGCAGGGAT CACCAGGGTC ACCAACTCGG TTGGTGTTGT TGTCTCACAG AGCGGAGGAT TGATCAAGGT GATCAAGGGT GGAAAGATCC TCTGGACGAT CACGCCATAA
|
Protein sequence | MGNCLYDPGR LRPHYAQVII CAVLCLLLSM AIPSLASAAD PLLPNETTPT VVPVTPTTVP TVSTTTTTAT PSPTLTTIPT TTITTTIPTS TIPTTTQIPT VTTTVSTPMT TVSTPTTTIP TLTTITTTPI TTLTTIWTQS TATTTSTSHT PPALLINPPS IDQLKVTVDG TSTPGEYGQT ISRISWNWGD GRIEDQPFPA SHTYARAGKY VVAVTSYQSD GTYNTRTLDV ELTTPTITTT AVTTTMPPPG APRLILNAPE INNSTVMVNG IPQAGSPSMS IQKLMVDWGD QKKDEYVAFP FQHTYDSPGQ YQIWITGVQS DGQSTLQNLT VEISSQVPST LPTSGSPTAV PFWAKPGIIP GLLVVLIAAI VGGGAMLWRR RELGSVVIPT SEPLEEAVAA YTRAREHGDL VQARKSALEC AQLLVILAEA EPQYSSAYLE KADDWKAIAR SLTQHLEDEP EQADPGPTLS LEKNGAWGDA EKGEDVTLDE MSTEVLEGTD IDPVVFEAVL TIALEIAREG REGQSVGTAF IVGDAEQVMN ASTQFILNPF KGHLVEERLI TDPNLHENIK EFSLLDGAFV IASNGVVEAA GRYITVDTSG VSLPAGLGSR HASAAGITRV TNSVGVVVSQ SGGLIKVIKG GKILWTITP
|
| |