Gene Information |
Locus tag | Emin_0539 |
Symbol | |
ID | 6262737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 590831 |
End bp | 592000 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 642611010 |
Product | TPR repeat-containing protein |
Protein accession | YP_001875431 |
Protein GI | 187250949 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
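
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon, assumptions that the listed values are consistent with but that the record itself does not state.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(590831, 592000)
print(gene_len, protein_len)  # 1170 389
```

Under these assumptions the result agrees with the Gene Length (1170 bp) and Protein Length (389 aa) fields.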
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0475117 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGT GGTTACTGGT TTTAATTGTT GTTTTGCTTT CTTCACAGGC GCTTTTTGCC CAGAAGGCGG CCGTTACCGC GTTTAACCAG GGCCGAAAAG CTAAAGATAA TACGGAAAAA CTTAAGTATT TTGACCGCGC TGTTTTACTT AAAAACAACT ATGCGGACGC TTACCATTAC CGTGGCGACG TTTATAAAGA AATGAATAAA ATAAGCCGCG CCACGGCAGA CTATACCAAG GCCATAAAAT TTGCGCCGAA AGACCCTTTT AAATATTACA GCAGAGCTGT TTTGTATATA GATCAGAAAA AATATCTGCC TGCTATTGAC GACCTTACAA AAGCGATTTC CCTTAAGCCT GATTTTTTAG ATTTTTATCT GAAGCGGGGC CAGGTGTATT TAAAGCGCGA TAATTTTGAT TTGGCGGTAA AGGATTTTGA AAAATACTCT TCCAAAAGAA AAAAGCCCAA CAGCTTTTAC CTTGAGTTAG GGCGTTCTTA TTTGGGGAAT TATAATTATG ACAAGGCCCA CAAACAATTT GAAACATTTA TAGCCTTAGA ACCGAAAAAC CATGAAGGCT ACTTTTATTT AGGAAGGGTT GAGTACGCCA GGGGAAATTA TGACGAAGCG ATTTCTCTTT TCAGTAAAGC CGTAAACCGT AACGAAAACT ACGCTCCGGC CTACAGACTC CGCGGTACGG TTTTTAAAGA TATTGGGGAT TTCGAATCCG CGGTGGAAGA TTTTACAAAA CTTATTGAAC TGCTGCCTGA TTATTCTTAT TACAACAGGC GCGGCCTTGT TTATGAAGAG CTTGGCAATC TGAAAGCCGC CGCGGAGGAT TACGGTAAGA CTATTGAACT TAACCCCAAA TGGGCCGTAG CTTATAATAA CAGGGGATTT GTATATTTAA AACTAAAAGA ATATGCTTTA GCCAGAGCAG ATTTGGAAAC AGCCATTAAG TTAGAACCGC AGATGTTTTT GCCTTATGTT AATATTGCCG GCGGCTATTG GCTTAATAAA AAAGACAAAA AGAACGCGCT TGATAATTTA GATAAAGCCG TAAAACGCGG GTTTAAAGAC TTTGAAAGCC TTTACGACGA ACATAAAAAA GGCTGGATGT TTAAGAATCT CAATAATACC TCTGAATTCA GGGCTATTAT TTATAATTGA
|
Protein sequence | MKKWLLVLIV VLLSSQALFA QKAAVTAFNQ GRKAKDNTEK LKYFDRAVLL KNNYADAYHY RGDVYKEMNK ISRATADYTK AIKFAPKDPF KYYSRAVLYI DQKKYLPAID DLTKAISLKP DFLDFYLKRG QVYLKRDNFD LAVKDFEKYS SKRKKPNSFY LELGRSYLGN YNYDKAHKQF ETFIALEPKN HEGYFYLGRV EYARGNYDEA ISLFSKAVNR NENYAPAYRL RGTVFKDIGD FESAVEDFTK LIELLPDYSY YNRRGLVYEE LGNLKAAAED YGKTIELNPK WAVAYNNRGF VYLKLKEYAL ARADLETAIK LEPQMFLPYV NIAGGYWLNK KDKKNALDNL DKAVKRGFKD FESLYDEHKK GWMFKNLNNT SEFRAIIYN
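
The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_content`, and `translate` are hypothetical. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not handle).

```python
# Codon table built from the NCBI-style layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
print(translate("ATGAAAAAGT GGTTACTGGT"))  # MKKWLL
```

Passing the full Gene sequence value to `translate` and comparing the result with the cleaned Protein sequence value, or passing it to `gc_content` and comparing with the GC content field, gives a quick consistency check on the record.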