Gene Emin_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1368 
Symbol 
ID6263412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1470620 
End bp1471924 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content39% 
IMG OID642611849 
Productpeptidase M23 
Protein accessionYP_001876255 
Protein GI187251773 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAAA AAATAAAAAA TACGGTTAGC TTTTGCAAAA TCTGTTATTC TAAAGTAGAA 
AAAGGGCTTT TCTACTTAAT GCTGCTTTGT ATTGTTTTAT GTTCCATTGC TATCTTCTTA
ATAATGAGAG ATAAAAAAGC GCAAAGAAAC GCTATTATAC CGCCGCCTAA TACCTCTGTA
CGCGAACAGA ATATGCAAAA GACGCTTCTT TATAACCACC TTGAACAAAC AGGTTTAGAT
AAAATTCTTA TTTATAACAT AGTAAATAAA CTTGATACTG TAATGCATAC GCGTAAAATA
AGACCGCAAG ATTCATTTAT GCTTGTTACC GGCGATGACA ACGCCTTTAA AATGCTTGTG
GTAACACGCG ATTTAACAAG GTATTATGTA GCCGCGCTTG AAGACGGTGA ATTAATCGCG
GGAATTATAG ACATAGAAGT AAAAACAAGG CAAAAAACGG CTTACGGCAC CATATACAGT
TCTTTATTCG CCTCCATGCA GAGCGAAGGA ATGACTGTGC CTTTAATAGT GGCTTTTACG
GACATTTTTT CTTGGAATAT CGATTTTAAT TCCGAAACCA GAAAAGGCGA CACGTACAGC
ATAATTTGGG ACGAAGACTA CACCGTTACC GGAATGGTGG TTGACCAGCA TATACTTGCC
GCAAAATATG AAGGCGGTAT GGCCGGCAAA AACTACGCGT TCGGCTTTGA GGGTGATTTT
TACGACAAAG ACGGCAAAGT AACAAAAAAA ATGTTTTTAA AATCACCGAT AAGTTTTAAA
GGTGTAAGAA TAACATCGCG CTTTAATCCG CGCAGAATGC ACCCCATACT TCGCATAAGA
CGCCCGCATT TAGGCATTGA TTACGCCGCG CCTGTGGGCA CGCCGGTTGA AACAATAGCC
GACGGCGTTG TAACCTTTGT GGGCTGGAAG GGCGGTTTTG GCAATTATAT AGAAGTTAAG
CACGCTAACT CATTTGTAAC CACTTACGGG CATTTAAAAA GTTTTAATGT TAAAAAAGGG
GAAAAGGTTA AGCAGGGTAA AGTTATAGGG TATGTAGGCT CCACGGGATT AAGCACCGGA
CCGCATTTGG ATTTTAGAAT AAGCGAACAC GGCAAATTTC AGGATTTCTT GAAAATGAAA
AACAGAAATT CCGCAGTAAG CGAAATAGCT AAAGACAAAA TGAAAGAGTT TGAAGTTGCC
AGAGATAAAT ATTTGGAAAC TTTAAATAAG TTAGATGAAA AACTAAAAAC CCCTGCGCAT
ATCGAAAATG CGCCGGAAGA CACCATCCAA GAGGGCGAGC TATGA
 
Protein sequence
MKEKIKNTVS FCKICYSKVE KGLFYLMLLC IVLCSIAIFL IMRDKKAQRN AIIPPPNTSV 
REQNMQKTLL YNHLEQTGLD KILIYNIVNK LDTVMHTRKI RPQDSFMLVT GDDNAFKMLV
VTRDLTRYYV AALEDGELIA GIIDIEVKTR QKTAYGTIYS SLFASMQSEG MTVPLIVAFT
DIFSWNIDFN SETRKGDTYS IIWDEDYTVT GMVVDQHILA AKYEGGMAGK NYAFGFEGDF
YDKDGKVTKK MFLKSPISFK GVRITSRFNP RRMHPILRIR RPHLGIDYAA PVGTPVETIA
DGVVTFVGWK GGFGNYIEVK HANSFVTTYG HLKSFNVKKG EKVKQGKVIG YVGSTGLSTG
PHLDFRISEH GKFQDFLKMK NRNSAVSEIA KDKMKEFEVA RDKYLETLNK LDEKLKTPAH
IENAPEDTIQ EGEL