Gene Elen_1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1890 
Symbol 
ID8416194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2220218 
End bp2221255 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content70% 
IMG OID645024860 
Productmodification methylase, HemK family 
Protein accessionYP_003182243 
Protein GI257791637 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000215054 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAACGACA TCTGGACCAT TCAGGCCGCC CTTGATTGGA CGGTGGGCTA CCTCGAGCGC 
AAGGGCGACG AGAACCCGCG GCTGTCCGCG CAGTGGCTGC TGTCCGAAGC GACCGGCCTT
TCCCGCATCG AGCTGTACGC GAATTTCGAG CAGCCGCTGT CGATGGAGGA GCGCGATGTG
CTGCGCGCCT ACGTCACGCG CCGCGGCAAG GGCGAGCCTC TGCAGTACAT CACGGGCGAG
GTCGGATTTC GCCATATCAC GGTGAAGGTT CGCCCCGGCG TGCTGATCCC GCGCCCCGAG
ACCGAGGTGC TGGTCAGTGA AGCCTTGGCG CTGCTGCCGG CGGCGCCGAA GCGGGTGGCG
CAGCATGCAT GGCCGGAAGA CGACCTTCCC CCTGTCCCTT GGCCGGAAGG CGAAGCCGGG
GAGCAACGCC CCGAGCGAAC GGCCGACCAG GGCGTCGCCG AGGATGAATC GACGCCGGCC
GTCGCCTCGG GTGAACCCGC TCCCGAGCCC CCCGAGCTGC TCGTGGCCGA CCTCTGCACC
GGTTCCGGCT GCATCGCCTG CTCCGTCGCC TACGAGCATC CGCTCGCGCG CGTGGTGGCG
ACCGACATCG TGCCCGAGGC CGTCGCGCTC GCTCGCGACA ACGTGGCTGC CCTCGAGCTG
GGCGACCGCG TCGAGGTGCT GTCGTGCGAT CTGGGCGAGG GCGTCGATCC TACGCTCATG
GGCGCGTTCG ACCTCGTGGT GTCCAACCCG CCGTACGTGC CCACGGCCGT CATGGACGAT
ATCCCCCGCG AGGTGGCCGA ATTCGAGCCC GCGCTCGCGC TCGACGGCGG CGCCGACGGG
CTTGACGTGC TGCGCCGCCT GCTTCCCTGG TGCCGCCGAG CCCTCAAGGA GGGCGGCGGC
TTCGCCTTCG AGCTGCACGA GACCTGCCTG GACGAGGCCG CCCGCCTTGC CGAAGAGGCC
GGCTTCTCCG ATGTTCGCGT CACGGCCGAT CTCGCCGGGC GCCCCCGCGT TCTCACCGCC
CGAAAGCGCG CCGTGTAG
 
Protein sequence
MNDIWTIQAA LDWTVGYLER KGDENPRLSA QWLLSEATGL SRIELYANFE QPLSMEERDV 
LRAYVTRRGK GEPLQYITGE VGFRHITVKV RPGVLIPRPE TEVLVSEALA LLPAAPKRVA
QHAWPEDDLP PVPWPEGEAG EQRPERTADQ GVAEDESTPA VASGEPAPEP PELLVADLCT
GSGCIACSVA YEHPLARVVA TDIVPEAVAL ARDNVAALEL GDRVEVLSCD LGEGVDPTLM
GAFDLVVSNP PYVPTAVMDD IPREVAEFEP ALALDGGADG LDVLRRLLPW CRRALKEGGG
FAFELHETCL DEAARLAEEA GFSDVRVTAD LAGRPRVLTA RKRAV