Gene Emin_1216 details


Gene Information

Locus tag: Emin_1216
Symbol:
ID: 6263717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1315581
End bp: 1316609
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 44%
IMG OID: 642611694
Product: glycoprotease family metalloendopeptidase
Protein accession: YP_001876103
Protein GI: 187251621
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
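
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
length = gene_length_bp(1315581, 1316609)
print(length, expected_protein_length_aa(length))  # prints "1029 342"
```

Both results match the Gene Length and Protein Length fields of the record.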


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0159819
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAAAACCG ATAAAGATAT AACAATTTTA GGTATAGAAA CCACATGTGA CGAAACTTCC 
GCCGCCATAC TTAAAAGCGG GCGGGATTTA GTTTCTAACG TGGTGCACAC CCAAATCGAT
ATACATAAAA AATATTGCGG CGTAGTGCCC GAACTCGCCA GCCGCGCCCA TGCGGTTAAA
GTGGCAGAAG TGGTAAAAGA AGCGCTTGGT AACCATAAAA TAGATTTAGT AGCTTTCGCA
AGCGGCCCCG GTTTGCCCGG CGGTCTTATG GTAGGCAGAG TAGCTGCGGA AGCAGTGTCC
GCTTTAAAAA ATGTTCCTAT AATAGGAGTA AACCATTTGG AAGGACATTT GTTTGCCTGT
GAATTTGACG CTAAAGAAGG GAAAATAGCA GCCGATAAAC AACTTAAATT TCCTTTAATA
GCTTTAATAG TTTCCGGCGG ACATACCGAA CTTTGGTACG TAAAAAATTA CGGCGATTAT
AAAATGCTGG GACGCACAAG GGACGACGCT GCGGGCGAAG CTTTTGACAA AGTGGCTAAA
CTTTTGGGGC TTGGTTATCC GGGCGGGCCT GTTGTCGCTA AAGAGGCTTT AAAAGGAAAC
CCAGAGGCTA TTAAATTCCC AAGACCGATG ATGAAGGGAA CTTTTGAATT TTCTTTCAGC
GGTATTAAAA CAGCTGTAAG CTATTACCTG CGCGACCATA AAGATATAAA AAAAGAAGAT
GTGTGCGCTT CTTTCCAGGC GGCGATGGTG GAAACTCTTG TGGCTAAAAC TTTCCAGGCT
GTAAAAAAAT ATAAAGTTAA AAATGTGGCT GTCGGCGGCG GCGTCGCGGC TAATGAACTT
TTAAAAGAAA GCATGGTAAA ACGCGGTCAG AAGGAAGGAG TGGATGTTTC TTTCGTACCG
AGGGCGCTCT CTTCCGATAA CGGCGCCATG ATTGCCCTTG CCGGATATAA AAAATTTATG
TTTGCCGGTA AGTTTAACGC TAATATTAGA ATCAACCCTA ACATGAGAAT TAAAAACTGG
GGGAAATAA
 
Protein sequence
MKTDKDITIL GIETTCDETS AAILKSGRDL VSNVVHTQID IHKKYCGVVP ELASRAHAVK 
VAEVVKEALG NHKIDLVAFA SGPGLPGGLM VGRVAAEAVS ALKNVPIIGV NHLEGHLFAC
EFDAKEGKIA ADKQLKFPLI ALIVSGGHTE LWYVKNYGDY KMLGRTRDDA AGEAFDKVAK
LLGLGYPGGP VVAKEALKGN PEAIKFPRPM MKGTFEFSFS GIKTAVSYYL RDHKDIKKED
VCASFQAAMV ETLVAKTFQA VKKYKVKNVA VGGGVAANEL LKESMVKRGQ KEGVDVSFVP
RALSSDNGAM IALAGYKKFM FAGKFNANIR INPNMRIKNW GK