Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0004 |
Symbol | |
ID | 7270116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 3462 |
End bp | 4652 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643568663 |
Product | DNA methylase N-4/N-6 domain protein |
Protein accession | YP_002465123 |
Protein GI | 219850691 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0863] DNA modification methylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0090313 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGTGCCGA CACCAGATCT GCAGAACGAT CGGTACAGGG TGATCGTGGT CCCCTTGGCT GCTCCCCTCT CACCGGAGGG TTCAGCCCTG ATAGGAAGGA TCGGTGCAGG GTTCAGCCCT CAGGATCTGC TCCTCTCCCT GACTGATACC TCCCTGACAT GTCACCTGCA TGCCCTCTCC CCCAATCTTC TCCGCGATCG GATCCAGATG ATCAGTGCCG CATCCCCGCG ACCAGGGTCT CTCGCCGGCG CCGTAGCGGT GCTCGGAGGG CTGGCCGATG CCGTGGATCG TCAGACCAGC TACACGATCC ATGATGGAAA ACGAGTCTAT GCCGGCTTTA CGGCCGAACG GAAGGTTATC GGGCGGAGCG AGAAGGTACA GCGGCGGGGG GGTTGTTATT ATGCCGGGGA CCACCAGTTC AGCAGGGAGG AGAACCGCCT CTCTGGTGTT CCAGTGGACT CGATCGTCTG CGGGGACAGC GAGGAGGTGC TCTCCCGCCT GCCTGACAAC TGCGTGGACC TGGTGCTCAC CTCGCCGCCG TACAACTTCG GTCTCTCATA CCACGAAGGA GATGACGGGC GGCACTGGGA TGCCTACTTC AGCAAACTCT TCTCGATCCT CGACCAGTGC GTCCGGGTAC TGAAGTTTGG CGGCCGATGT CTGGTCAACA TCCAGCCGCT CTTCTCCGAC AACATCCCGA CCCATCACCT GATCTCGCAG CATCTGCTGT TGCGGCGGAT GATCTGGAAG GGGGAGATCC TCTGGGAGAA GAACAACTAT AACTGCAAGT ACACGGCCTG GGGCTCCTGG AAGAGTCCGT CGGCCCCGTA CCTGAAGTAC ACCTGGGAGT TCATCGAGGT CTTCTCGAAA GGGGACCTCA AAAAGACCGG TCCCAAGGAG GGGATCGATA TCACAGCCGA TGAGTTCAAG GCCTGGGTGG TGGCCAGGTG GTCGATTGGG CCTGAACGGC AGATGAAGCG GTATAACCAC CCGGCTATGT TCCCTGAGGA GCTGGTTGAA CGGGCCCTGA AGCTCTTCTC CTATCAGGGT GATCTGGTCC TCGATCCGTT TAACGGGGTC GGCACCACCA CGCTGGTCGC CCGGCGGCTG CAGCGCAGGT TCATCGGGGT CGATCTCTCT CCGGAGTACT GTGCGACCGC CAGGGAACGG TTATCTAACC GGGGAACCTG A
|
Protein sequence | MVPTPDLQND RYRVIVVPLA APLSPEGSAL IGRIGAGFSP QDLLLSLTDT SLTCHLHALS PNLLRDRIQM ISAASPRPGS LAGAVAVLGG LADAVDRQTS YTIHDGKRVY AGFTAERKVI GRSEKVQRRG GCYYAGDHQF SREENRLSGV PVDSIVCGDS EEVLSRLPDN CVDLVLTSPP YNFGLSYHEG DDGRHWDAYF SKLFSILDQC VRVLKFGGRC LVNIQPLFSD NIPTHHLISQ HLLLRRMIWK GEILWEKNNY NCKYTAWGSW KSPSAPYLKY TWEFIEVFSK GDLKKTGPKE GIDITADEFK AWVVARWSIG PERQMKRYNH PAMFPEELVE RALKLFSYQG DLVLDPFNGV GTTTLVARRL QRRFIGVDLS PEYCATARER LSNRGT
|
| |