Gene Elen_0217 details


Gene Information

Locus tag: Elen_0217
Symbol:
ID: 8414501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 300411
End bp: 301382
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 69%
IMG OID: 645023197
Product: MazG family protein
Protein accession: YP_003180600
Protein GI: 257789994
COG category: [R] General function prediction only
COG ID: [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain
TIGRFAM ID: [TIGR00444] MazG family protein
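
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record: Start bp 300411, End bp 301382.
length = gene_length_bp(300411, 301382)
print(length, protein_length_aa(length))  # prints "972 323"
```

Both results agree with the Gene Length (972 bp) and Protein Length (323 aa) fields listed in the record.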


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACTGCCT CCGAACCCAC GCCGCCCATC GCATCCCCGT CTCCTGCCGC GCCCCTCGCA 
AGCCAGCTTG CGGCCAACCC CTGCAGCCAT CCGTCGTTCG ACCAGTTCGT CGCCACCATC
GCCGCGCTGC GCGCGCCGGA CGGATGCCCG TGGGATCGCA CGCAGACGCA CCAGAGCATC
GCGCACAACA TGATCGAAGA GGCATACGAG GCGGTGGACG CCATCGAAGC CGCCGATGTC
GCGCACCTGC GCGAGGAGCT GGGCGACGTG CTGCTGCAGG TGGTGTTGCA AAGCCAGATA
GCTTCCGATG CCGGCGAGTT CGACATCAAC GACGTGTGCG CCGACGTGAA CGAGAAGATG
GTCCGCCGCC ATCCTCACGT GTTCGGCGAG GCGCAAGCCG CCAACGCCGG GGACGTGCTG
GATCTGTGGG AACGGGTGAA GATGGCGGAG AAGGGCGCCG CCGACGAGGC GGCCGACGGT
GCGGGCGAGC GGCGCGAAGG CCTGCTGGAC GGCGTGCCCA CCAGCTTCCC CGCGCTCATG
CAGGCGCAGA AGATATCTCG CAAGGCCGCG GCCGCCGGGT TCGAGTGGGA CTCGCTTGAC
GGCGTGTGGG AGAAAGTGCG CGAGGAAATC GCCGAGCTGC AAGAAGCCTA CGCCGTCGCG
CCCAAGGCGG CGAACGGCAA GGTGGACGCC GCGGCCGCTT CCGCAGGCGC GGCCGTCGAC
CCCGCCGCGG CCGAGGCGGC CGTCGCCGCC GTCGAGGACG AGCTCGGCGA CGTGCTGTTC
TCGCTGGTGA ACGTGGGCCG CCGCATGGGC GTGGACGCAG AAGGTGCGCT GCGCTCCACC
TGCCGCAAGT TCCGCGACCG ATGGGCCTGG ATGGAGCAAG CCGCCTGGCA GCAGGGTCGA
ACCATCGAAG ACCTCTCCAG CGAAGAGCGC GAAACCCTGT GGAACGAGGC GAAGAAGCGC
GAGCGATCGT AG
 
Protein sequence
MTASEPTPPI ASPSPAAPLA SQLAANPCSH PSFDQFVATI AALRAPDGCP WDRTQTHQSI 
AHNMIEEAYE AVDAIEAADV AHLREELGDV LLQVVLQSQI ASDAGEFDIN DVCADVNEKM
VRRHPHVFGE AQAANAGDVL DLWERVKMAE KGAADEAADG AGERREGLLD GVPTSFPALM
QAQKISRKAA AAGFEWDSLD GVWEKVREEI AELQEAYAVA PKAANGKVDA AAASAGAAVD
PAAAEAAVAA VEDELGDVLF SLVNVGRRMG VDAEGALRST CRKFRDRWAW MEQAAWQQGR
TIEDLSSEER ETLWNEAKKR ERS
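
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own pipeline: it uses the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons), and it ignores the whitespace that splits the sequence into blocks of ten. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Assumption: the first codon is ATG, so no alternative start codon
    handling is needed.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACTGCCT CCGAACCCAC GCCGCCCATC"))  # prints "MTASEPTPPI"
```

Applied to the first thirty bases of the gene sequence, the routine reproduces the first ten residues of the protein sequence; `gc_percent` can be run on the full gene sequence to compare against the GC content field.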