Gene Emin_0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0539 
Symbol 
ID6262737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp590831 
End bp592000 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content39% 
IMG OID642611010 
ProductTPR repeat-containing protein 
Protein accessionYP_001875431 
Protein GI187250949 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0475117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT GGTTACTGGT TTTAATTGTT GTTTTGCTTT CTTCACAGGC GCTTTTTGCC 
CAGAAGGCGG CCGTTACCGC GTTTAACCAG GGCCGAAAAG CTAAAGATAA TACGGAAAAA
CTTAAGTATT TTGACCGCGC TGTTTTACTT AAAAACAACT ATGCGGACGC TTACCATTAC
CGTGGCGACG TTTATAAAGA AATGAATAAA ATAAGCCGCG CCACGGCAGA CTATACCAAG
GCCATAAAAT TTGCGCCGAA AGACCCTTTT AAATATTACA GCAGAGCTGT TTTGTATATA
GATCAGAAAA AATATCTGCC TGCTATTGAC GACCTTACAA AAGCGATTTC CCTTAAGCCT
GATTTTTTAG ATTTTTATCT GAAGCGGGGC CAGGTGTATT TAAAGCGCGA TAATTTTGAT
TTGGCGGTAA AGGATTTTGA AAAATACTCT TCCAAAAGAA AAAAGCCCAA CAGCTTTTAC
CTTGAGTTAG GGCGTTCTTA TTTGGGGAAT TATAATTATG ACAAGGCCCA CAAACAATTT
GAAACATTTA TAGCCTTAGA ACCGAAAAAC CATGAAGGCT ACTTTTATTT AGGAAGGGTT
GAGTACGCCA GGGGAAATTA TGACGAAGCG ATTTCTCTTT TCAGTAAAGC CGTAAACCGT
AACGAAAACT ACGCTCCGGC CTACAGACTC CGCGGTACGG TTTTTAAAGA TATTGGGGAT
TTCGAATCCG CGGTGGAAGA TTTTACAAAA CTTATTGAAC TGCTGCCTGA TTATTCTTAT
TACAACAGGC GCGGCCTTGT TTATGAAGAG CTTGGCAATC TGAAAGCCGC CGCGGAGGAT
TACGGTAAGA CTATTGAACT TAACCCCAAA TGGGCCGTAG CTTATAATAA CAGGGGATTT
GTATATTTAA AACTAAAAGA ATATGCTTTA GCCAGAGCAG ATTTGGAAAC AGCCATTAAG
TTAGAACCGC AGATGTTTTT GCCTTATGTT AATATTGCCG GCGGCTATTG GCTTAATAAA
AAAGACAAAA AGAACGCGCT TGATAATTTA GATAAAGCCG TAAAACGCGG GTTTAAAGAC
TTTGAAAGCC TTTACGACGA ACATAAAAAA GGCTGGATGT TTAAGAATCT CAATAATACC
TCTGAATTCA GGGCTATTAT TTATAATTGA
 
Protein sequence
MKKWLLVLIV VLLSSQALFA QKAAVTAFNQ GRKAKDNTEK LKYFDRAVLL KNNYADAYHY 
RGDVYKEMNK ISRATADYTK AIKFAPKDPF KYYSRAVLYI DQKKYLPAID DLTKAISLKP
DFLDFYLKRG QVYLKRDNFD LAVKDFEKYS SKRKKPNSFY LELGRSYLGN YNYDKAHKQF
ETFIALEPKN HEGYFYLGRV EYARGNYDEA ISLFSKAVNR NENYAPAYRL RGTVFKDIGD
FESAVEDFTK LIELLPDYSY YNRRGLVYEE LGNLKAAAED YGKTIELNPK WAVAYNNRGF
VYLKLKEYAL ARADLETAIK LEPQMFLPYV NIAGGYWLNK KDKKNALDNL DKAVKRGFKD
FESLYDEHKK GWMFKNLNNT SEFRAIIYN