Gene Emin_1322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1322 
Symbol 
ID6263183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1423486 
End bp1425381 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content44% 
IMG OID642611801 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_001876209 
Protein GI187251727 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.131641 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA AAATGAATCC AAACATTGTT TCGCTTAAGC AAATTCTTTT TTGGATTGCT 
GTTTTTGCCG TTACAATTTT TTTATTTAAC AGATCTGATA CATCAAAAAC AATTACGTTA
GATTATTCCG ATTTCAGGGC CAGGGTTGAA AGGACCGAGG TAAGCAACCT TGTTATCGGC
ACGGAACTTA TAAAAGGCGT TGTAAAAGAA GGAGACTCGG TTGTTAATTT CCAAACCGTT
AAAGTGGAAG ATAAAGATCT TGTATCCGAC TTGATGAAAA AGGGTATAAA GTTTAAAGCC
GAGACGGATA AAAGCTGGAT TTTCAGCGTT TTGGGAAATG TCGGTTTTAT TTTACTTTTC
TTTTTAGCCT GGTGGTTTTT GCTTATTCGC CCGCAACAAG GCGGCGGTAA AGGCAATCCT
TTAAGCTTCG GCAATACAAA AGCCAAACTC CAGGTTGGAA GCCCCGACGG CACTACATTT
AAAGACGTTG CCGGCTGCGA TGAGGCGAAA GAAGAACTTG AGGACACCAT TACATTTTTA
AAAAATCCTA AAAAATTCCA GAAATTAGGC GGAAAGCTTC CCAAAGGCGT GCTTCTTTAC
GGCGCACCGG GCACTGGTAA AACTCTGCTC GCGAAAGCCA CTGCGGGTGA AGCGGGGGTC
GCGTTTTTCT CGGCGTCAGC TTCGGAATTC GTTGAAATGT TTGTAGGCGT AGGCGCTTCA
CGCGTTAGGG ATTTATTTGA TAAAGCTAAA AAAATGGCGC CCGCTATTGT GTATATCGAC
GAGCTTGACG CTGTGGGCAG ACGCCGAGGC GCCGGTATCG GCGGCGGCCA TGACGAAAGA
GAGCAGACGT TAAACCAGCT TCTTATTGAG CTTGACGGCT TTGAGTCAAA ACAGGGCATT
ATTTTAATGG CTTCAACCAA CAGACCCGAC GTGCTTGACC CGGCGCTTAT ACGCCCCGGC
CGTTTTGATA GGCATATAAA TGTTCCCGCC CCGGACATGA AAGGAAGGGA AGAAATTTTG
GCAGTGCATT CTAAAAGAGT AAAACTTGCT CCTTCAGTTA AGCTTAAAGA TATCGCCAAA
GGGACACCCG GTTTTGTAGG GGCTGATTTG GCTAACGTAG TTAACGAAGC CGCTATTTTG
GCCGCCCGCT TTAATAAAGA AGCTGTTACT GAAAGCGATT TGGAAGAAGC CGTTGAACGT
GTTATGGCGG GCCCGCAAAG AAGAAGCAGG CTTATTTCAA ATAAAGAAAA AAGGATTATC
GCTTATCATG AAGCGGGACA CACCGTTATA GCTAAAAAAA CGGACAACAG CGACCCCGTG
CACAAAGTTT CGGTTATTCC CAGAGGGCCC GCTTTGGGTT ATACAATGCA GCTTCCTTTG
GAGGATAAAT TTTTAACCAC TAAATCCGAA ATTTTGGACA GGCTTTGCGT TTTGCTCGGC
GGCAGAGCGG CCGAAGAAAT TGTTTTTAAA GAAATTACTA CAGGCGCGCA TGACGACTTG
TCCAGAACCA CCGCTTACGC CAGGCGTATG GTTTCGGAGC TGGGCATGAG TGAAAAACTT
GGGCCCATTT CCGTTCATAC GGGTGAGGAT GAAGTTTTTC TCGGCCGTGA TATAAGCAGG
GCTAAGCATT CTGAAGAATT ATTAAGAAGC ATTGACGAGG AAGTAAGCCA GCTTGTTAAA
GGTTCTTATG AAAGAGCCAA AGATATTCTT GTTAAAAACA GAATGGCTTT AGACGTGCTT
GTTGACAGGC TTTTAGAAAT TGAGGTTGTT GAAGCTAAAG AGATTGACGA AATTTTGACA
GACCCGGCTG CTTACAAACT TAAGCTTGAA GAAATGAGAA AAGCGAAAGA AGCCCAAAAA
ATACAGCCTA TGGATGAAAG CGTACAGGAA GGTTAG
 
Protein sequence
MKKKMNPNIV SLKQILFWIA VFAVTIFLFN RSDTSKTITL DYSDFRARVE RTEVSNLVIG 
TELIKGVVKE GDSVVNFQTV KVEDKDLVSD LMKKGIKFKA ETDKSWIFSV LGNVGFILLF
FLAWWFLLIR PQQGGGKGNP LSFGNTKAKL QVGSPDGTTF KDVAGCDEAK EELEDTITFL
KNPKKFQKLG GKLPKGVLLY GAPGTGKTLL AKATAGEAGV AFFSASASEF VEMFVGVGAS
RVRDLFDKAK KMAPAIVYID ELDAVGRRRG AGIGGGHDER EQTLNQLLIE LDGFESKQGI
ILMASTNRPD VLDPALIRPG RFDRHINVPA PDMKGREEIL AVHSKRVKLA PSVKLKDIAK
GTPGFVGADL ANVVNEAAIL AARFNKEAVT ESDLEEAVER VMAGPQRRSR LISNKEKRII
AYHEAGHTVI AKKTDNSDPV HKVSVIPRGP ALGYTMQLPL EDKFLTTKSE ILDRLCVLLG
GRAAEEIVFK EITTGAHDDL SRTTAYARRM VSELGMSEKL GPISVHTGED EVFLGRDISR
AKHSEELLRS IDEEVSQLVK GSYERAKDIL VKNRMALDVL VDRLLEIEVV EAKEIDEILT
DPAAYKLKLE EMRKAKEAQK IQPMDESVQE G