Gene Mpal_0004 details

Gene Information

Locus tag: Mpal_0004
Symbol:
ID: 7270116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 3462
End bp: 4652
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 60%
IMG OID: 643568663
Product: DNA methylase N-4/N-6 domain protein
Protein accession: YP_002465123
Protein GI: 219850691
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
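
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3462
END_BP = 4652


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1191, matches "Gene Length"
print(protein_length_aa(length))  # 396, matches "Protein Length"
```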


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0090313
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGTGCCGA CACCAGATCT GCAGAACGAT CGGTACAGGG TGATCGTGGT CCCCTTGGCT 
GCTCCCCTCT CACCGGAGGG TTCAGCCCTG ATAGGAAGGA TCGGTGCAGG GTTCAGCCCT
CAGGATCTGC TCCTCTCCCT GACTGATACC TCCCTGACAT GTCACCTGCA TGCCCTCTCC
CCCAATCTTC TCCGCGATCG GATCCAGATG ATCAGTGCCG CATCCCCGCG ACCAGGGTCT
CTCGCCGGCG CCGTAGCGGT GCTCGGAGGG CTGGCCGATG CCGTGGATCG TCAGACCAGC
TACACGATCC ATGATGGAAA ACGAGTCTAT GCCGGCTTTA CGGCCGAACG GAAGGTTATC
GGGCGGAGCG AGAAGGTACA GCGGCGGGGG GGTTGTTATT ATGCCGGGGA CCACCAGTTC
AGCAGGGAGG AGAACCGCCT CTCTGGTGTT CCAGTGGACT CGATCGTCTG CGGGGACAGC
GAGGAGGTGC TCTCCCGCCT GCCTGACAAC TGCGTGGACC TGGTGCTCAC CTCGCCGCCG
TACAACTTCG GTCTCTCATA CCACGAAGGA GATGACGGGC GGCACTGGGA TGCCTACTTC
AGCAAACTCT TCTCGATCCT CGACCAGTGC GTCCGGGTAC TGAAGTTTGG CGGCCGATGT
CTGGTCAACA TCCAGCCGCT CTTCTCCGAC AACATCCCGA CCCATCACCT GATCTCGCAG
CATCTGCTGT TGCGGCGGAT GATCTGGAAG GGGGAGATCC TCTGGGAGAA GAACAACTAT
AACTGCAAGT ACACGGCCTG GGGCTCCTGG AAGAGTCCGT CGGCCCCGTA CCTGAAGTAC
ACCTGGGAGT TCATCGAGGT CTTCTCGAAA GGGGACCTCA AAAAGACCGG TCCCAAGGAG
GGGATCGATA TCACAGCCGA TGAGTTCAAG GCCTGGGTGG TGGCCAGGTG GTCGATTGGG
CCTGAACGGC AGATGAAGCG GTATAACCAC CCGGCTATGT TCCCTGAGGA GCTGGTTGAA
CGGGCCCTGA AGCTCTTCTC CTATCAGGGT GATCTGGTCC TCGATCCGTT TAACGGGGTC
GGCACCACCA CGCTGGTCGC CCGGCGGCTG CAGCGCAGGT TCATCGGGGT CGATCTCTCT
CCGGAGTACT GTGCGACCGC CAGGGAACGG TTATCTAACC GGGGAACCTG A
 
Protein sequence
MVPTPDLQND RYRVIVVPLA APLSPEGSAL IGRIGAGFSP QDLLLSLTDT SLTCHLHALS 
PNLLRDRIQM ISAASPRPGS LAGAVAVLGG LADAVDRQTS YTIHDGKRVY AGFTAERKVI
GRSEKVQRRG GCYYAGDHQF SREENRLSGV PVDSIVCGDS EEVLSRLPDN CVDLVLTSPP
YNFGLSYHEG DDGRHWDAYF SKLFSILDQC VRVLKFGGRC LVNIQPLFSD NIPTHHLISQ
HLLLRRMIWK GEILWEKNNY NCKYTAWGSW KSPSAPYLKY TWEFIEVFSK GDLKKTGPKE
GIDITADEFK AWVVARWSIG PERQMKRYNH PAMFPEELVE RALKLFSYQG DLVLDPFNGV
GTTTLVARRL QRRFIGVDLS PEYCATARER LSNRGT
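
The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of how that might be done, together with a GC fraction calculation of the kind that underlies a GC content figure. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record (six blocks of ten bases).
first_line = "ATGGTGCCGA CACCAGATCT GCAGAACGAT CGGTACAGGG TGATCGTGGT CCCCTTGGCT"
seq = clean_sequence(first_line)

print(len(seq))               # 60
print(seq.startswith("ATG"))  # True: the CDS begins with a start codon
print(gc_fraction(seq))       # GC fraction of this first line only
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value that can be compared with the GC content field of the record.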