Gene Emin_1268 details


Gene Information

Locus tag: Emin_1268
Symbol:
ID: 6263243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1368183
End bp: 1370228
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 39%
IMG OID: 642611747
Product: thimet oligopeptidase
Protein accession: YP_001876155
Protein GI: 187251673
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
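
The listed lengths follow from the coordinates above. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(1368183, 1370228)
print(length)                     # 2046
print(protein_length_aa(length))  # 681
```

Both values match the Gene Length and Protein Length fields of this record.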


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000110784
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA CAATAATTAT ATTAAACAGT TTTTTTGCGG CTGCATTTGT TTTTGGAGAA 
AGTATGGATA AAATGTTTAA AGACGGCGCT TTAAGGTTTG ATTATACCCC GGCGGAGATT
ATGCTTTTAG AAAAGCAGGC TTCCGAACGG TTTGAAAAAA ATATCGCGGC TGTTTTATCC
GTGCCTAAAG AACAAAGAAC CTTTGAAAAC ACGCTTTTAG CTTTTGAGCG CGCTTATACG
GATTATTGGT ATGTTCCCAA AGCTTTGTCT TTGCTTACTT ATTTTCATAA AGACGAGGAC
GTGCGTGAGG CGGCGGCTAA ACTTGAATCA AAAGGAAGCC AGTTTAAAGC CTCTATTCTG
GCAAGAAAAG ATATTTATGG CGCTTTAAAA GAATTCTCTT CTTTAAATCC TAAACTTGGT
AATGAGGAGG AACGCCTTTT AGCTTTTTGG CTTGCCAAAT TTAAAAGAGC CGGAGCCGAG
CTTGAAGGGG AAGCGGCCAA AGTATATGCG GATTTAACTT CTTCGAAAAT GGAAAACATT
ACCAAGTATA ACGTTAACCT TATGCAAAAT ACGGACAGCC TTGAACTTAC AAGAGAAGAA
CTTGACGGTA TGAGCGACGT TTACATTAAC AGGCTTAATA AAACAAAAGA CGGTAAATAT
ATTGTTACTT TAAAGTATCC GGATTATAAC CCTTTTATGG CAAACGCCAA AAACGCCGAG
GCGCGCAAAG CGCTTCAAAT AAAATTTGCA AATAGAGGCG GGCTTGAAAA TGTGGGTCTT
TTAGAAACAG TTTTGGCGCA GCGTTCGATA ACTTCAAGAC TGCTTGGCTA TAAAGACCAC
CCGCAATACG TTCTTGAGGA CCGTATGGCT AAAGACGAAA AAACTTTAAA AAAGTTTTTA
TCCGGTATTG AAAAAAACCT CAGGCCCATA GGTAAAAAAG AATTAAAAGA ATATAAGGCC
TTAAAAGATA AAGAAGCCGG TTATAAGACA GACGGTTTTT ATCTTTGGGA CGCGCCGTAT
TACACAAATT TATATAAAAA GCTTTATTAT AATGTCGACC ATGAGAAAAT AAAAGAATAT
TTTATGACCG ATACGGTAAT AAAAGGAATG TTTGAAATAT TCGGAGGGCT TTTTGGCCTG
GTTTTTGAGC GCGTGGATTT GCCCGTTTGG CATGAAGACG TTTTGGTTTA CAAAATTAAA
GATGCTAAAA CAGGCGCGCA TATTTCCAAT TTTTATATGG ACCTTTACCC GCGCGACGGT
AAATACACGC ACGCCGCGTG CTGGAGTTTT ATTGACGGCT TTTTATTGGA AAACGGGCAG
TATCAAACGC CGTCTGTGGT AATAGCTTCT AATTTAAACC CGCCGGGCAA CGGCATTCCT
TCTCTATTAA CGCACAGCGA AGTGGAAACA TTATTTCACG AATTCGGACA TGTTTTGCAA
ATGTCGCTTA CGCATCCTAA ATACGCTTCC TTAGGAGGTG ACAATATTAC CTGGGATTAT
ATTGAAACGC ATTCCCAGCT TTTGGAAAAT TGGGCCTGGA ATAAAGATAC TCTTAAAAAA
ATATCTAAAC ATTATAAAAC GGGCGAGTCT CTTCCTAACG AGATGATTGA CAGTCTTATA
AAAAGTAAAC ACGCGGGCGT GGCTTTGCCT ATGCTGAGGC AAAACTTTCA AGGCCAATTG
GATTATAAAT ACCACAAATC AAATAAACAT GTAGATACTA CGGGTGTTTA TGAAAAACTT
ATAAAACAGA TTTATTTGAT TCCCATGACA GAGGGCACAT ATCCGCAGGC AAATTTTGCG
CACATAATGT CGCTTACCGA CCCATACGAT GTGGGGTATT ATGTTTACGC ATGGTCATTA
GTTATAGCGG ACGATATATT TTCCGAATTT GAAAAGCAAG GCCTTGATAA TAAAGAGCTG
GGCTTAAAAT TAAGAAAATA TATTTATACG CCGGGCCTTA CAGAGGAGCC GAACGAAATG
GTTGAAAAGT TTTTGGGAAG ACCTTATAAT AACAAAGCGT TTTTAAAGAA CTTCGGGGTT
AAATAG
 
Protein sequence
MKKTIIILNS FFAAAFVFGE SMDKMFKDGA LRFDYTPAEI MLLEKQASER FEKNIAAVLS 
VPKEQRTFEN TLLAFERAYT DYWYVPKALS LLTYFHKDED VREAAAKLES KGSQFKASIL
ARKDIYGALK EFSSLNPKLG NEEERLLAFW LAKFKRAGAE LEGEAAKVYA DLTSSKMENI
TKYNVNLMQN TDSLELTREE LDGMSDVYIN RLNKTKDGKY IVTLKYPDYN PFMANAKNAE
ARKALQIKFA NRGGLENVGL LETVLAQRSI TSRLLGYKDH PQYVLEDRMA KDEKTLKKFL
SGIEKNLRPI GKKELKEYKA LKDKEAGYKT DGFYLWDAPY YTNLYKKLYY NVDHEKIKEY
FMTDTVIKGM FEIFGGLFGL VFERVDLPVW HEDVLVYKIK DAKTGAHISN FYMDLYPRDG
KYTHAACWSF IDGFLLENGQ YQTPSVVIAS NLNPPGNGIP SLLTHSEVET LFHEFGHVLQ
MSLTHPKYAS LGGDNITWDY IETHSQLLEN WAWNKDTLKK ISKHYKTGES LPNEMIDSLI
KSKHAGVALP MLRQNFQGQL DYKYHKSNKH VDTTGVYEKL IKQIYLIPMT EGTYPQANFA
HIMSLTDPYD VGYYVYAWSL VIADDIFSEF EKQGLDNKEL GLKLRKYIYT PGLTEEPNEM
VEKFLGRPYN NKAFLKNFGV K
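
The gene and protein sequences above are printed in blocks of ten characters. The following sketch shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence. It is a sketch under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-residue assignments are the same as in translation table 11 (table 11 differs only in the alternative start codons it permits, which does not matter here because the gene starts with ATG).

```python
from itertools import product

BASES = "TCAG"
# Residues in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGAAAAAAA CAATAATTAT ATTAAACAGT TTTTTTGCGG CTGCATTTGT TTTTGGAGAA"
print(translate(first_line))  # MKKTIIILNSFFAAAFVFGE
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the listed GC content.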