Gene Emin_0732 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0732 
Symbol 
ID6263113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp807531 
End bp808586 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content40% 
IMG OID642611206 
Productpeptidase M24 
Protein accessionYP_001875624 
Protein GI187251142 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000000296906 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGATT ACAAAGTAAA AATAAAAACG TTTTTAAAAA CTTTAAAACA ATGCGAAATT 
GAAGGATATA TAACCACTAA CGTTATTGAT ATGCAATATT TTTGCGCGCG CCCTTTCCAG
CCAAGCGAAA GAAGCGTGCT TTTAATAACG CCTAAACACT TCATGATATT CGCCCGCCCG
CTTGCTTTTA ATGCTATTAA AGAAAGCGTT AAGGAAGCTA AGGTTGTAAT GGCCGAAGAT
ATTTCAGCAA TAGCAGCGGC GGCTGAGTTT GTTATTAAAA ACAAAATTAA AAATATTTGT
TTTGACCAAG ATAAAGAGCT GTTTTCAGCG GGGCAGATTT TCCAAAAAGC GGGTATAAAG
CCCGAACTTG CCGTTACCAA TACGGTAAGA ATGGTAAAAA ATAAAGAAGA AATTAAAAAT
ATCCGCAAAG CCTGCCAAAT AGCTTATAAC GCTTTTCTTT ATATAAAACC CAGGATTAAA
ACGAGTATGA CAGAGCTTGA AGCGGCCTCA ATGCTTGAAA ATTATATGAA ATCACAAGGA
GCGAGCGGCG TTTCTTTTGA CACAATTATG GCTTTTGGCA AAAACAGCGC TGACCCGCAT
CACGCCACTG ATACGACTAA GCTTAAAAAT GAGGATGTGA TTTTGGTAGA TTTCGGCTGT
ATTTACAAAG GCTACTGCTC TGACATTACA AGAACCTGGT GGCACGGCAA AAAACCGGCG
GCAGAATTTA CAAAAGTTTG GAATATTGTC GAACGGGCCA GAAAAGAGGG TGTAAAAAAA
GTTCGCCCCA ACATGAGCGC GCGTAACGCC GATAAAATAT GCCGTGATAT TATTGAAACG
GCTTCTTACG GCCCGCTTAT ACATTCAACA GGGCATGGCG TGGGGATGAA TTTGCATGAG
TCGCCCTTTC TTAACCCTCC TTCACAGGAA ATACTTAAAA AGGGTAATGT TTTTACTATA
GAACCGGGCA TTTATATACC CGGCAAATTC GGAGTACGCC TTGAGGATAC CGTTGAACTT
ACGGCAAAAG GCGCGAATAT TTTAACTAAA AAATAA
 
Protein sequence
MADYKVKIKT FLKTLKQCEI EGYITTNVID MQYFCARPFQ PSERSVLLIT PKHFMIFARP 
LAFNAIKESV KEAKVVMAED ISAIAAAAEF VIKNKIKNIC FDQDKELFSA GQIFQKAGIK
PELAVTNTVR MVKNKEEIKN IRKACQIAYN AFLYIKPRIK TSMTELEAAS MLENYMKSQG
ASGVSFDTIM AFGKNSADPH HATDTTKLKN EDVILVDFGC IYKGYCSDIT RTWWHGKKPA
AEFTKVWNIV ERARKEGVKK VRPNMSARNA DKICRDIIET ASYGPLIHST GHGVGMNLHE
SPFLNPPSQE ILKKGNVFTI EPGIYIPGKF GVRLEDTVEL TAKGANILTK K