Gene Emin_0354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0354 
Symbol 
ID6263994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp379437 
End bp380477 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content43% 
IMG OID642610820 
Productpeptidase M42 family protein 
Protein accessionYP_001875250 
Protein GI187250768 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.326213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATTG AATTATTTAG AAAAATAGCC GAATTGCCGG GTATATCCGG CAGGGAAGAA 
GCAGTTAGAG CCGCCCTTCT TAAAATGCTT AAAACATGTA CCGATGAGCA GCGTGTTGAC
GCTATGGGCA ATATTATTGC CGTTAAAAAG GGTAAAGGCG TAAGAAAGCT TATGCTTGCC
GCCCATATGG ACGAGATAGG CCTTCTTGTA AGCCATATTG AAAATAACGG TTTTTTACGT
TTTGTTCCCG TAGGCGGTAT TGACGCAAGA ACGCTTATGA GCCAGCGTGT TGTAATACAC
ACATCAAAAG GGCCGATATT CGGCGTTATA GGAACAAAAC CGGTACATTT GCTTGACGCC
GCTGAAGCTT CAAAAGCGCC TGGTATTAAA AGTTTATTTA TTGATACCGG TTTGGACGGA
TCTGAAATAA ATTCAATTGT AAGCATAGGC GACCCTGTTA CTTTAGACAG AACTACTGTT
GAATTCGGAT CTCAAATGAT TAACTCCAAA GCTATTGATG ACAGAGCAGG CGTTTACGTT
TTTATTGAAG CTTTAAAGAA AGTTAAAAAA TTTGACTGTG ATATTTACGC CGTTTTCAGC
GTGCAGGAAG AAGTTGGTTT AAGAGGAGCA GTAACTTCAA CCTTCGGGGT GGACCCGGAT
TTGGCGCTTG TTGTTGACGC TACCGCCGCT AATGATTTGC CCGCCACTCC CCCGCAGGAA
TTTAACTGCC GTTTAGGGCA AGGCGTAGCC ATAACAATTA TGGACGGCGG CTCCATTATC
AATCCGCAAA TAGTTAAAAC TTTAAAAAAG CTGGCTTCGG ATAAAAACAT TAAACACCAG
TTTAAAGTTT CGGCCCGCGG TTCTAACGAC GCTGCTGCCG TGCAAAAAAC AAAAAGCGGC
GTTCCCGTAG GGCTGCTTTC AATACCTACG CGTTATATAC ATTCAAGCAT AGAAACGGCT
TCAAAAATTG ATATAGACGC GGCGGTTGAT TTAACAGTGG CTTTTATAGA GAACGCCTGT
AAATATAATT TTGATTACTA A
 
Protein sequence
MDIELFRKIA ELPGISGREE AVRAALLKML KTCTDEQRVD AMGNIIAVKK GKGVRKLMLA 
AHMDEIGLLV SHIENNGFLR FVPVGGIDAR TLMSQRVVIH TSKGPIFGVI GTKPVHLLDA
AEASKAPGIK SLFIDTGLDG SEINSIVSIG DPVTLDRTTV EFGSQMINSK AIDDRAGVYV
FIEALKKVKK FDCDIYAVFS VQEEVGLRGA VTSTFGVDPD LALVVDATAA NDLPATPPQE
FNCRLGQGVA ITIMDGGSII NPQIVKTLKK LASDKNIKHQ FKVSARGSND AAAVQKTKSG
VPVGLLSIPT RYIHSSIETA SKIDIDAAVD LTVAFIENAC KYNFDY