Gene Emin_0918 details


Gene Information

Locus tag: Emin_0918
Symbol:
ID: 6262620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1017473
End bp: 1018624
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 39%
IMG OID: 642611397
Product: peptidase M20
Protein accession: YP_001875808
Protein GI: 187251326
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01883] peptidase T-like protein
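
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against one another. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length (neither assumption is stated in the record). The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1017473, 1018624)
print(length)                  # 1152
print(protein_length(length))  # 383
```

Under these assumptions the computed values agree with the 1152 bp and 383 aa reported in the record.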


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000400462
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 4.70833e-19
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGACAATAA AAATAAACTA TAAAAGAATG ATGGACAATT TTTTGGAACT TGTTAAAATC 
GAAAGCCCTT CCAATGAAGA ACTTAATATG CAGCTTTACG CCCAAAAAAA GCTTAAAAGC
CTTGGCTGTA AAGTAACGGT TGATAACGCG GGCAAAACCT TTCCCACAAA CGCCAAAGGC
AATGTTATAG GCTTTTTACC CGGTACGATA AAGTCCGAGC CTTTTGTTTT AGCCGGGCAT
TTAGACACCG TAAAGCCCTG CAAAAATATT AAACCCGTTA TAAAAGGAAA CAAAGTTACA
TCTAGCGGTA AAACTATTTT AGGGGCGGAC GATAGGGCAG GACTTGCCAT TATTTTTGAA
GTTCTTAATG TTTTAAAGGA AAATAAAATA CCGCATCCGC CGATATCGGT GCTTTTTACT
CTTTGTGAGG AAAACGGCAT GTACGGCGCG AAAGGTTTGG ATATAACTAA ATTAAAAGGG
CGCGAAGGTA TAATTTTAGA CAGTTCAGAC AATGATAAGC TTACCGTAAG CGCTCCTGAG
GCAAACACTA TTGACGTTGA AATCACCGGC TTTGCCGCCC ACGCGGGCGT TGAGCCTGAA
AAAGGAATTT CCGCTTTAGA AGTTGCCGCT TACGCTTTGT CAATAATGCA GCTGGGCCGT
ATAGACAAAC TTACAGTAGC TAACTTCGGC GTTGTTAACG GGGGGGAAAG CACCAACGTG
GTAATGCCGT CTCTTTTTTT AAAAGGCGAA GTGCGAAGCC GCAACCTTGC AAGCCTTAAA
AAACAAATTA AACATATGCA AGATTGTTTT GTAAAAGCTC AAAAAAAGTT TACAAAAAAA
GTTAACGGCA AAATGGTAAA ACCCGTTATA GATTTTAAAG TGGGTTTAAA ATACCCTATT
TTAGATATAG CGGTTAACTC GCCTCTTATT AAACATATTA CGGCCGAGGC TAAAAAACAC
GGCGTTAAAA TAAAACCTTA TTCAAGCGGC GGCGGTTACG ACGCTAACAT TTTGTCGGGC
AAGGGGCTTC TTACTCCTAT AATAGGCGTG GGTTATCGCC AAATGCATAC ATTAAACGAA
TGGCTTGATA TTAAAATGTT TAACCAAACG GCGGATATTA TTCTAGACAT AGTTTTAAAT
TATAAAAAGT AG
 
Protein sequence
MTIKINYKRM MDNFLELVKI ESPSNEELNM QLYAQKKLKS LGCKVTVDNA GKTFPTNAKG 
NVIGFLPGTI KSEPFVLAGH LDTVKPCKNI KPVIKGNKVT SSGKTILGAD DRAGLAIIFE
VLNVLKENKI PHPPISVLFT LCEENGMYGA KGLDITKLKG REGIILDSSD NDKLTVSAPE
ANTIDVEITG FAAHAGVEPE KGISALEVAA YALSIMQLGR IDKLTVANFG VVNGGESTNV
VMPSLFLKGE VRSRNLASLK KQIKHMQDCF VKAQKKFTKK VNGKMVKPVI DFKVGLKYPI
LDIAVNSPLI KHITAEAKKH GVKIKPYSSG GGYDANILSG KGLLTPIIGV GYRQMHTLNE
WLDIKMFNQT ADIILDIVLN YKK
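
The relation between the gene sequence, the protein sequence and the GC content field can be sketched in Python. This is an illustrative sketch, not the method used to produce the record: the helper names `clean`, `gc_content` and `translate` are hypothetical, and the codon table is written out by hand rather than taken from a library. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which codons may act as start codons), so the standard assignments are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over TCAG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGACAATAA AAATAAACTA TAAAAGAATG"))  # MTIKINYKRM
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and the GC content field to be checked in the same way.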