Gene Emin_0304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0304 
Symbol 
ID6263532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp326310 
End bp327284 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content36% 
IMG OID642610769 
ProductTPR repeat-containing protein 
Protein accessionYP_001875201 
Protein GI187250719 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000296078 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00226693 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGAAATA AAAAACTTAT TATTTTTATT TTATTTTTAC TTTGCTGCGC TTTAAGCGCG 
GAGGCTGCCG TATGGCCTTT TGGAAAAAAA CAAAAAAAAC TGCTTAATGA CGCCAGACAA
GATTATTCGG AAGGAAATTA TTATTCGGCC ATAGATAAGC TCAAAGTTTT TCTTGTTGAA
GGTACCGTTA AAAGGCGCGA AAAAAGAGCC TATCTTCTCC TTGGTGAAAG TTATGAAAAA
ATAGGAGAAA TTGATTCAGC TCTTAACACA TATCTTGAAG GCGTTGAGTT AAACCCCAAA
GATAAAGACC TTTTGTTAAA GCTGGGAGCT CTTTACCAAA GAAACGATTT AATACGGGAC
AGCATAGAAA TTTATGAGCG TATTTTAGCT CTTGATAAAA ACAATTCACA GGCTTTTCTG
GGGCTTGCCA GGGCATATAC AGATGAAGGA TTTTTTTCTA AAGCGGAAGG ATATTTCCAG
CAATATTTAA GGATAACAAA GATAGAGGAT TTTGACGGTG ATATTTTTTT GGAACATGCC
GGCGCTTATT TCAGACAGAG GAAATATAAT GAAGCTCTCT TTAACGCGGC CTTATCTATT
GACAAGCTTG GCGAAAATAA AGATAACACA TTTCTTGTTG CCAAAATAAA CAGAATGCAG
GGAAATATGG AAGACGCTTA CATTTATATA GATAAAGCGA TAAATCTTGA AGGTTACGAT
AATTGTTATA CCGCTTTACT TACAAAGGCC CTGTGGCTTA CGCAAGATAA AAGATATGAA
GAAGCAAAAA TAATATCGGA CTCCGTTTTA TTGGAAAAAC CAAATAACAG ACTTGCGTTA
TACGTTAACT TCCTGGCATA CAGGGGTAAA GGAAATAAAA ATAAAGCGGA CGAGTATTTA
AAACGCATAT CCGCTTATGA GGATAACAGT TTTATATCCC GCGTTGCACG TACGCATCTT
AGTGTTGATA ATTAA
 
Protein sequence
MRNKKLIIFI LFLLCCALSA EAAVWPFGKK QKKLLNDARQ DYSEGNYYSA IDKLKVFLVE 
GTVKRREKRA YLLLGESYEK IGEIDSALNT YLEGVELNPK DKDLLLKLGA LYQRNDLIRD
SIEIYERILA LDKNNSQAFL GLARAYTDEG FFSKAEGYFQ QYLRITKIED FDGDIFLEHA
GAYFRQRKYN EALFNAALSI DKLGENKDNT FLVAKINRMQ GNMEDAYIYI DKAINLEGYD
NCYTALLTKA LWLTQDKRYE EAKIISDSVL LEKPNNRLAL YVNFLAYRGK GNKNKADEYL
KRISAYEDNS FISRVARTHL SVDN