Gene Emin_0876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0876 
Symbol 
ID6262597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp964062 
End bp965105 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content45% 
IMG OID642611355 
Productdienelactone hydrolase 
Protein accessionYP_001875768 
Protein GI187251286 
COG category[R] General function prediction only 
COG ID[COG1073] Hydrolases of the alpha/beta superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000509815 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000273866 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATCTTA AACTAATACT AATAGTAGTA GCGGCGGCGG TTTTAACAAG TTCTATAGTA 
TGGGGTGTAT ATAAAATATG TAAGGGGACA ACAACGGCGG CTCATCTTAC ATTCCGGCAG
AGCGATAGTG TGCGAATGGA AAAAGTTATA TTTAAAAACA GAATTGGCAT AGAAATAGCG
GGACATATGT ATATGCCAAA AAATATTGAT AAAAATAAAA AACATCCCGC AATTGTTGTT
GGCCATACTT TTACCGGTGT TAAGGAGCAA ACGTCAGGCC TGCATGCACA AAAATTGGCG
GAAATGGGCT TTGTTACTCT TGCTTTTGAC GCTTCATTTT GGGGTGAAAG CGGCGGGCAG
CCGCGCAATA TAGAAATACC TGATATCCGC ATAGAGGACT TTATTGCGGC GGTAGATTTT
TTAAGCACCC AATCTTTAGT TGATGCAGGA CGCATCGGTC TTTTGGGTAT TTGCGGAGGC
GGCGGATATG TGGTAAGTGC GGCGGCTATT GACCATAGAG TTAAAGCTGT TGCTACAGTA
AGCATGTATG ACTTGGGCCG CGCACGCAGG CAGGGCCTTG GCGACGCTAT CTCCTACGAA
CAACGCATGA AAACGCTTGA CCTTATAGGC GATTTACGCA CAAAGGAATT TAGAGGGGAA
AAACGTACCG ATACTCTTGG CGTTCCTGCC AGTATTACTG ATAAAGATAC AGAAAACACC
CGTGAGTTTT ATGACTATTA CCGCACGCCC CGTGCGCAAC ACCCAAATAC GGATACCGCA
TACTCTTTAG TAAGCCAAGC GGCAATGATG AACTTCTTCC CGTTTATACA GATAGAAACA
ATCTCACCCC GCCCGTTGCT ATTTATTGTT GGAGAGCGGG CCGTATCTGC CTACTTTAGC
GAAGATGCTT ACAGCAAGGC AGCTCAGCCT AAGGAGCTAT ATGTAGTGCC CGGCGCGTCG
CACGTGGACC TTTACGACAG GCCGGAATAT ATGAAATTAA CTATCCCGAA ACTGGATAGT
TTCTTTAAGC AAAATCTTAA GTAA
 
Protein sequence
MNLKLILIVV AAAVLTSSIV WGVYKICKGT TTAAHLTFRQ SDSVRMEKVI FKNRIGIEIA 
GHMYMPKNID KNKKHPAIVV GHTFTGVKEQ TSGLHAQKLA EMGFVTLAFD ASFWGESGGQ
PRNIEIPDIR IEDFIAAVDF LSTQSLVDAG RIGLLGICGG GGYVVSAAAI DHRVKAVATV
SMYDLGRARR QGLGDAISYE QRMKTLDLIG DLRTKEFRGE KRTDTLGVPA SITDKDTENT
REFYDYYRTP RAQHPNTDTA YSLVSQAAMM NFFPFIQIET ISPRPLLFIV GERAVSAYFS
EDAYSKAAQP KELYVVPGAS HVDLYDRPEY MKLTIPKLDS FFKQNLK