Gene Information |
Locus tag | Emin_0354 |
Symbol | |
ID | 6263994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 379437 |
End bp | 380477 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642610820 |
Product | peptidase M42 family protein |
Protein accession | YP_001875250 |
Protein GI | 187250768 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
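The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(379437, 380477)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1041 346
```

Under these assumptions the values reproduce the reported gene length (1041 bp) and protein length (346 aa).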
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.326213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATTG AATTATTTAG AAAAATAGCC GAATTGCCGG GTATATCCGG CAGGGAAGAA GCAGTTAGAG CCGCCCTTCT TAAAATGCTT AAAACATGTA CCGATGAGCA GCGTGTTGAC GCTATGGGCA ATATTATTGC CGTTAAAAAG GGTAAAGGCG TAAGAAAGCT TATGCTTGCC GCCCATATGG ACGAGATAGG CCTTCTTGTA AGCCATATTG AAAATAACGG TTTTTTACGT TTTGTTCCCG TAGGCGGTAT TGACGCAAGA ACGCTTATGA GCCAGCGTGT TGTAATACAC ACATCAAAAG GGCCGATATT CGGCGTTATA GGAACAAAAC CGGTACATTT GCTTGACGCC GCTGAAGCTT CAAAAGCGCC TGGTATTAAA AGTTTATTTA TTGATACCGG TTTGGACGGA TCTGAAATAA ATTCAATTGT AAGCATAGGC GACCCTGTTA CTTTAGACAG AACTACTGTT GAATTCGGAT CTCAAATGAT TAACTCCAAA GCTATTGATG ACAGAGCAGG CGTTTACGTT TTTATTGAAG CTTTAAAGAA AGTTAAAAAA TTTGACTGTG ATATTTACGC CGTTTTCAGC GTGCAGGAAG AAGTTGGTTT AAGAGGAGCA GTAACTTCAA CCTTCGGGGT GGACCCGGAT TTGGCGCTTG TTGTTGACGC TACCGCCGCT AATGATTTGC CCGCCACTCC CCCGCAGGAA TTTAACTGCC GTTTAGGGCA AGGCGTAGCC ATAACAATTA TGGACGGCGG CTCCATTATC AATCCGCAAA TAGTTAAAAC TTTAAAAAAG CTGGCTTCGG ATAAAAACAT TAAACACCAG TTTAAAGTTT CGGCCCGCGG TTCTAACGAC GCTGCTGCCG TGCAAAAAAC AAAAAGCGGC GTTCCCGTAG GGCTGCTTTC AATACCTACG CGTTATATAC ATTCAAGCAT AGAAACGGCT TCAAAAATTG ATATAGACGC GGCGGTTGAT TTAACAGTGG CTTTTATAGA GAACGCCTGT AAATATAATT TTGATTACTA A
|
Protein sequence | MDIELFRKIA ELPGISGREE AVRAALLKML KTCTDEQRVD AMGNIIAVKK GKGVRKLMLA AHMDEIGLLV SHIENNGFLR FVPVGGIDAR TLMSQRVVIH TSKGPIFGVI GTKPVHLLDA AEASKAPGIK SLFIDTGLDG SEINSIVSIG DPVTLDRTTV EFGSQMINSK AIDDRAGVYV FIEALKKVKK FDCDIYAVFS VQEEVGLRGA VTSTFGVDPD LALVVDATAA NDLPATPPQE FNCRLGQGVA ITIMDGGSII NPQIVKTLKK LASDKNIKHQ FKVSARGSND AAAVQKTKSG VPVGLLSIPT RYIHSSIETA SKIDIDAAVD LTVAFIENAC KYNFDY
|
| |
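The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11. The sketch below is one possible way to do that in plain Python, not the method used to produce this record. It builds the codon table from the amino-acid string that NCBI lists for table 11 (codons ordered T, C, A, G), strips the spaces that separate the 10-base blocks, and stops at the first stop codon. The helper names `clean` and `translate` are hypothetical. Alternative start codons, which table 11 permits, are not handled specially. That does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino acids for NCBI translation table 11, with codons enumerated in
# T, C, A, G order at each position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 until the first stop codon or the end of input."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last in-frame 21 bases of the gene sequence above.
head = translate("ATGGACATTG AATTATTTAG AAAAATAGCC")
tail = translate("AAATATAATT TTGATTACTA A")
print(head)  # MDIELFRKIA
print(tail)  # KYNFDY
```

The two fragments match the first and last blocks of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 346 residues.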