Gene Information |
Locus tag | Mpal_2030 |
Symbol | |
ID | 7272011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2152296 |
End bp | 2153282 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643570642 |
Product | hypothetical protein |
Protein accession | YP_002467052 |
Protein GI | 219852620 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
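The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2152296
end_bp = 2153282

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 987 0 328
```

Under these assumptions the result agrees with the listed gene length of 987 bp and protein length of 328 aa.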
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.258009 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATCA CATCTGAGGA GATCGCCTGG AACGTGGACG GAATTCCTGT ACAGGGCACC CTGACACGGC CGGCGGGTGA GGGCCTTCAC CCGGGGATCG TCTTTGTCGC CGGGAGCGGA CCGACCGATC GGGACTGGTG TTCACCCTTA ATACCGGGAA CCAATGGCAG TGGTCGCCTG CTGGCTGAAG ACCTCACCCG AGCAGGATTC ATGACGTTGC GTTACGATAA ACGGGCTTCT GGACCGCATG GACAGGAGAA CGCCCGGCAG CTCGTGGGCA GGATCAGCAT GCAGAGTCAC CTCGACGAGC TGTCAGGTGC GGTCGACGCC CTGCTCGCCG ACGGCGCAAT GGATCCCGCC CGACTATTCG TGCTGACGAA CAGCGAGGGG ACCATCCACG CCCTGAACTA TCAGGTGCAG GCGACGGAGA GGCGCTTTTC GGGCCTCGTG CTCACCGGCG CACCTGGCCG TTCCATCGGG CAGGTTGCCC GTAGCCAGAT ACTCGCACAG GTGAGAGACC TGCCAAACGG AGATCTGATG ATGGACCAGT ACGATGCGGC CATCGCTGCA TTCGAGGCCG GCGAGTCGAT ACAGCCCGAT CCATCGCTCC CCGAAGGGCT GCGGGTCCTG CTCCTCAGTC TTACGACTCC TGCAAACCTG CCGTTCGCGC GAGAACTCTG GTCGACCAAT CCGTCCGACC TCATCACAAA GGTGAAGGAG CCCATCCTGG TCGTGATCGG CAAGAAAGAC ATCCAGGTCG ACTGGAAGAC CGACGATGGC CCGCTGGAGC AGGCGGCCTT TGGGAACGGC AACGTCGAGT TCGCGTACCC GGAGGAGGCC GACCACGTCC TGAAACATGA GGACCGGCTC CGGGAGATGC TCGCGGCCGC CGAGGTCGGG GCACACTACA ACAGCGAAGA TCGAGTGCTT GACGCTGCGG CTCTAACGGT CATCACACGC TGGATTCGTG ACCGTGCCCG GCTATAA
|
Protein sequence | MMITSEEIAW NVDGIPVQGT LTRPAGEGLH PGIVFVAGSG PTDRDWCSPL IPGTNGSGRL LAEDLTRAGF MTLRYDKRAS GPHGQENARQ LVGRISMQSH LDELSGAVDA LLADGAMDPA RLFVLTNSEG TIHALNYQVQ ATERRFSGLV LTGAPGRSIG QVARSQILAQ VRDLPNGDLM MDQYDAAIAA FEAGESIQPD PSLPEGLRVL LLSLTTPANL PFARELWSTN PSDLITKVKE PILVVIGKKD IQVDWKTDDG PLEQAAFGNG NVEFAYPEEA DHVLKHEDRL REMLAAAEVG AHYNSEDRVL DAAALTVITR WIRDRARL
|
| |
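As a consistency check between the gene and protein sequences above, the first and last few blocks of the gene sequence can be translated by hand. This is a minimal sketch using only the Python standard library; the `translate` helper and `CODON_TABLE` name are hypothetical, and the amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 27 bases of the gene sequence in the record.
first_blocks = "ATGATGATCA CATCTGAGGA GATCGCCTGG"
last_blocks = "TGGATTCGTG ACCGTGCCCG GCTATAA"

print(translate(first_blocks))  # MMITSEEIAW
print(translate(last_blocks))   # WIRDRARL*
```

The two outputs match the first ten and last eight residues of the listed protein sequence, followed by the stop codon TAA.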