Gene Emin_0032 details

Gene Information

Locus tag: Emin_0032
Symbol:
ID: 6262984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 34686
End bp: 35801
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 43%
IMG OID: 642610495
Product: signal peptide peptidase SppA, 36K type
Protein accession: YP_001874937
Protein GI: 187250455
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00706] signal peptide peptidase SppA, 36K type
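
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 34686
end_bp = 35801
gene_length_bp = 1116
protein_length_aa = 371

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
predicted_protein_aa = codon_count - 1

print("span (bp):", span_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("predicted protein length (aa):", predicted_protein_aa)
print("matches record:", span_bp == gene_length_bp
      and predicted_protein_aa == protein_length_aa)
```

Under these assumptions the span is 1116 bp (372 codons), which gives 371 residues once the stop codon is excluded, in agreement with the listed Gene Length and Protein Length.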


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.557774
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 102
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACAG AGAAAAACAG CGGTTTCCCT GAATTTACGG ACGGGAATGC CCAAGAAAAA 
AAAGAGGATA TTATTTTAAC TCCCTCCGCA GAGACACCCC TGCCTATAGC CCAAACAAAA
GAAACCGGGC CCAAAGAGTC TAAAAAAAGC AACTCCGGGG GCAAAAACTG GTTAAGCGTA
TTGGTTTTAT TGTTTTTTGT TTCCTCAATC AGCGGACTTT ATATAATTTT TTCGGGCGGA
ATTTGCAAAA AAGAAACAAA AACAGAATCC TCTTTACCCA GGCTATCTTC CCAGGCAAGA
CATGGTGAAA CAGGCGTAGC CGTTATAAGA ATAAGAGGCG TTATAACGGA ACCGCAGGCC
AGCTCGTGGA GAGACCAGTC CGCCAGCTCA ATAGCTAGAA GAATACGAAC AACAGCCGAT
AAAGACAATG TTAAAGCCAT TATAATTGAT ATTAACTCGC CGGGAGGAAC AGTCGCCGCG
GTGCAAGATA TTTACAACGC CATTTTATAC GCCAGGCAGG TAAAAAATAA AAAAGTTGTG
GCTTTATTTA GAGATGTTTC CGCCAGCGGC GGGTATTATA TAGCCGTGGC TTGCGACAAA
ATTGTCGCGC AGCCCGGCAC GCTTACAGGC TCTATAGGCG TAATTTTCCA AACAGGCAAT
TTTGAAGGGC TTATGAATAA AATAGGCGTT TCATTTTCGA CAATAAAATC AGGCCAGCAT
AAAGACATAG GTTCCCCTTA CAGAAAAATG ACGGAGGAAG AAAGAACTTT ACTGCAAGAG
CTTATTGACG ACTCCTACAA CCAGTTTTTA GACGTGGTTA AAACAGGAAG GCCGAACATG
AACCCCGTTG AGCTTAAAGT TTACGCGGAC GGCAGAATAT TTACGGGCAG AAAAGCATTT
AGCATAGGGC TTATTGACGC GCTTGGCGGC GAAGAAGAGG CCTTAAAAAT AGCGGGCGAA
CTTGCCGACA TAAAAGACCC AAAAATAATT TCAAACAGAC CGACAACGTT TCGTGAATGG
TTATCATCCT TAGACCCGGA AATGTCAAGC AAAACTTTAG ACAGACAAAT TGAAGCCATC
TCCTCGCCCA AAGTAGCCTA TTTGTGGACG AATTAA
 
Protein sequence
MDTEKNSGFP EFTDGNAQEK KEDIILTPSA ETPLPIAQTK ETGPKESKKS NSGGKNWLSV 
LVLLFFVSSI SGLYIIFSGG ICKKETKTES SLPRLSSQAR HGETGVAVIR IRGVITEPQA
SSWRDQSASS IARRIRTTAD KDNVKAIIID INSPGGTVAA VQDIYNAILY ARQVKNKKVV
ALFRDVSASG GYYIAVACDK IVAQPGTLTG SIGVIFQTGN FEGLMNKIGV SFSTIKSGQH
KDIGSPYRKM TEEERTLLQE LIDDSYNQFL DVVKTGRPNM NPVELKVYAD GRIFTGRKAF
SIGLIDALGG EEEALKIAGE LADIKDPKII SNRPTTFREW LSSLDPEMSS KTLDRQIEAI
SSPKVAYLWT N
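
The sequence blocks above are printed in space-separated groups of ten characters, so they have to be cleaned before any computation. The following is a minimal sketch, not the database's own pipeline, that strips the spacing, computes the GC fraction, and translates a DNA string. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1 for elongation; alternative start codons are not handled here. How the record's GC content figure was computed is not stated, so the function simply counts G and C over the given string.

```python
# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
# Translation table 11 uses these same assignments (it differs in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a pasted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from the first base; stop at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line fragment of the gene sequence, as printed in the record.
fragment = clean("ATGGATACAG AGAAAAACAG CGGTTTCCCT")
print(translate(fragment))  # prints "MDTEKNSGFP"
print(gc_fraction(fragment))
```

Pasting the full gene sequence block into `clean` and passing the result to `translate` and `gc_fraction` allows a comparison against the listed protein sequence and GC content.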