Gene Emin_0046 details


Gene Information

Locus tag: Emin_0046
Symbol:
ID: 6262878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 47684
End bp: 49288
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 38%
IMG OID: 642610509
Product: Zn-dependent protease-like protein
Protein accession: YP_001874951
Protein GI: 187250469
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
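
The gene and protein lengths in this record follow from the start and end coordinates. The record does not state its coordinate convention; the minimal sketch below assumes 1-based inclusive coordinates and a CDS that ends in a stop codon, which is consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 47684
END_BP = 49288


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1605
print(protein_length(length_bp))  # 534
```

Both results match the Gene Length and Protein Length fields.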


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 98
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA TAATTTTTTT AATTTCCTTA GCGTTCTTTT CTTTAGCCTC CTACGCGACG 
GTTGATTATA ATGTTGTTAC AAAAGCCATG CAAGATGAAA TGAACCGGGC AAAAAAAGAA
CTCAAAATGA AGGGCTTTGA TAAACCTTAT TATATAGCTT ATTATTTTAA AGACAGCGAA
TCGGTTAAAA TAACGGCTTC TTTAGGGGCT TTAGTTTCCT CTGTAACGCC AAAATACAAA
GATATAGAAG TAAGTATAAG AGTTGGTGAC TATAAATTTG ATAACAGCAA TTTTAAAGTA
TCTTTTTTTG ATTCGGATTA TCCGGCCTAC CAAACCCCCG GCGACGGGTA CGGCACCATA
AGAGCGGCAT TGTGGCAGGA AACGGACAAT GTGTATAAGG AAGCGCTGGA AACCCTTTCC
AAGAAAAAAG GATTTATGAA GCAAAAAAAT ATTACTGATT ATTATGATGA TTTTTCCCCC
GCGGCAAAGG TAAATTTAAA GGAAGAAAAA AATACCGAAA AATTCGACAA AGAGTATTTT
GAGGCTTTAA GCAAGCAACT TTCGGCTATC GGCTTGAAAT ACCCCGAGAT TGAAAAATTT
GTGGTTAGAA TAATATATAA CAATGAAAAT AAATATTATT TGGATAATTT GGGAAGCAGC
TATTATAACA ATCCGGTAGA AATAATATTA TCCGTTGAGT CTTCCGTAAG GGCGGCGGAC
GGGTTTTTAC TGGAAAATTC TTTTGAAAAA AACTTCGCTT TGGTAACAGA TTTTCCGCCG
TCAGATAAAC TTATGGAAAC GGTAAAAGAA TTTGCCTCCG ATACGGCGGC GCAGTCAAAG
GCCGTAAAGG CCGAGCCGTA TATAGGTCCC ATATTATTGT ATAAATACGC CGCGGCCGAG
TTTTTTATGA ATTTATTTGT TTATAATATT GAAAAAATAA AACCTGAATA CTCCGATAAA
GGAACTTTTA CAAACGCCGG GGAGTTTAAA GACCGCCTGG GGCTAAAAGT TATAAGTAAT
ATTTTTGACG TTACGGACAA TCCTTTGGCT AAAGAATACA AAGGCGCGGC TTTATCCGGA
TATTACAGGG TTGACGACGA AGGCGTAAAA GCGCAAAAGG TTGATATTGT TAAACACGGC
AAGCTGGTTA ATTTTTTAAC GACAAGAAGT TTAATTAAAG GCCAAAAAGG CTCTACGGGG
CACGGCAAAA AGAGAGGGAT GGAAAAGTTT CCCTCCGCTT TAACAAGCAA TATCTTTTTT
ACCCCGTTAA AAACCATTCC TTATAATGAG TTAAAAAGCA AATTAAGGGA AGAATGCGCT
AAGCAGGAAT TGGAATATTG CCTAATGGCT AAAGGCGGCC TTGGCAATAA TTTTACTGTA
TATAAGATAT ATACAAACGA CGGAAGAGAA GAAGTAGCCT ACGGGGCTAG AATGATGAAC
AATACCCCCC GCGCTTTAAG AGACATTATT TACGCGGGCG ATGATATTGA TGTATACAAT
TTTTACAGAG GCAGCCTTAT AGCTCCGTCG GTAATTATTT CGGAAATGGA AATCAGCCCT
ATTGATGAAA AGCCCGTGCG TAAACCGCTT GTTTCTAGGC CGTAA
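
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks must be removed before any computation. As a hedged sketch, the helpers below (hypothetical names, not part of the source database) join the blocks and compute a GC fraction; the record does not say how its GC content field was calculated, so plain (G + C) / length is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence between the triple quotes.
first_blocks = """ATGAAAAAGA TAATTTTTTT AATTTCCTTA"""
print(len(clean_sequence(first_blocks)))  # 30
print(round(100 * gc_fraction(first_blocks)))
```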
 
Protein sequence
MKKIIFLISL AFFSLASYAT VDYNVVTKAM QDEMNRAKKE LKMKGFDKPY YIAYYFKDSE 
SVKITASLGA LVSSVTPKYK DIEVSIRVGD YKFDNSNFKV SFFDSDYPAY QTPGDGYGTI
RAALWQETDN VYKEALETLS KKKGFMKQKN ITDYYDDFSP AAKVNLKEEK NTEKFDKEYF
EALSKQLSAI GLKYPEIEKF VVRIIYNNEN KYYLDNLGSS YYNNPVEIIL SVESSVRAAD
GFLLENSFEK NFALVTDFPP SDKLMETVKE FASDTAAQSK AVKAEPYIGP ILLYKYAAAE
FFMNLFVYNI EKIKPEYSDK GTFTNAGEFK DRLGLKVISN IFDVTDNPLA KEYKGAALSG
YYRVDDEGVK AQKVDIVKHG KLVNFLTTRS LIKGQKGSTG HGKKRGMEKF PSALTSNIFF
TPLKTIPYNE LKSKLREECA KQELEYCLMA KGGLGNNFTV YKIYTNDGRE EVAYGARMMN
NTPRALRDII YAGDDIDVYN FYRGSLIAPS VIISEMEISP IDEKPVRKPL VSRP
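
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, so this minimal sketch uses the standard assignments and does not handle alternative start codons (the gene here starts with ATG). The `translate` function is illustrative, not the tool used to produce this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGAAAAAGA TAATTTTTTT AATTTCCTTA"))  # MKKIIFLISL
print(translate("GTTTCTAGGC CGTAA"))                  # VSRP
```

The two outputs match the first and last residues of the protein sequence shown above.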