Gene Emin_0225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0225 
Symbol 
ID6262999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp241259 
End bp242995 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content34% 
IMG OID642610688 
ProductATP-dependent OLD family endonuclease 
Protein accessionYP_001875124 
Protein GI187250642 
COG category[L] Replication, recombination and repair 
COG ID[COG3593] Predicted ATP-dependent endonuclease of the OLD family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.000140656 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATTA AAAATGTCCA TATTCATAAC TTTAGGTCCA TACTAGATGC CGATTTCCCC 
TTGGAATCAT ATTCCATACT TGCGGGAGAA AATAATTCAG GAAAGACAAC TTTTATTAAT
GCTTTACGAG TTTTTTATGA AGATATAAAA TTTAACAAGG CAGAAGACTT TCCTAAAATG
GAAACATCTG ATGTAGAGTC TTGGATAGAG GTTGAGTTCA TAACTGTACA AGAAGAGCAA
GACTTATTAA AAGAAGAATA TAGAAGTGCA GACAAGATAC TAAGGGTTAG AAAGTATCTA
GCATCCTCCT CTATAGAAAT AAAGTCAACT CAAAGTAATA TCTATGGGTA CGAAAATGGT
GTTTTGTCTA CCACGCTCTT CTATGGTGCT AAAAATATAT CCCAAGCAAA GTTGGGTAAT
ATTTTGTATA TCCCAGAGTT AAGTCGCACC GAAGAAGTTA TGAATTTATC AAAAGCAGCG
TCTCCATTAA AAAGCATTGT AGAATATGTT CTTGGTAAGA TATTAAAAGC AAGTTCTTCT
TTTTCTGAGC TAAATAAGGC CTTTGAGGTG TTTAACAAAG AATTCCAAGC AGAAAGTTCC
CCAGAGGGAT TATCTATTAA CAAAATGAAA GAGGATATAA ACTCTGAGCT TAAAGACTGG
GGTGTTAGTT TGGGTATAAA TATCAATGCG GTCTCTCCAG AAGTTATAAC AAAAAATCTC
CTCTCTCATT ATTTGCAGGA TTCTAAGCTG GGTGAACAGG AGATAAACAC AAGCAGTGTC
GGGCAAGGGC TTCAGAGACA CATTATTTAT TCTTTAATAA AAATAGCATC TAAATACCAA
GACCCTAAAG AAATTAAAAA GAAAGATTTT TCTCCAGATT TTACGCTCAT TTTATTTGAA
GAGCCTGAAG CCTTTTTACA TCCGTCGCAA CAACAGATTT TAAACATAGA TTTAGAGAGT
ATTGCTAAGG GAGATAATGA GCAAGTAATA GCAACTACAC ATTCCCCAAT ATTTGTTTCT
AAAAATATAA ACAATTTGCC CTCACTAATA AGATTGAGTA GGGAAGAGCG AAATAAATGG
GAAACCAAAA GTTTTAATAT TTCTAAAGAA AAACTGGAAT TATTGCTGTC AGATAATGCG
GGGCTTGAAG CCTGCTTCAA AAGCACTGTC GCCTTACCTG CTTGTCAAGA AGAACTGAAG
AAAGCTTTGG CAAAATCTCT GGCCTCGGGT ACATTGGAAG GTTTTTCTGG AAGTGATAAG
GAAATTATGA ATATTTGCAT GTGGCTTGAC ACAGAACGAG CAAATGCTTT TTTTTCAAAA
CATGTAATAA TATGTGAGGG GGCAACAGAA AAAGTTCTAT TAGAGTATTT ATTTGCAACG
CATTGGAAAG ATTTTGCTAA AAAACACATA TATTGTCTGG ATTCTTTAGG TAAGTTTAAC
ATACATAGAT TTATAAACTT ATTTGATTCC CTAGGAATAT ATCATTCTGT TGTTTATGAT
TCAGATAACA ATCGTGGTGT TCATGAAATA GTTAATAAAT TTATTCAGAG TAAAAAAAGT
GCGTACACAC AAAACCTGTT AGCCTTATCT GGCGATGTTG AGTCTGAATT TGGGATTACT
AAGCCAGCTA ATACGCATTT GAAACCTGCT AACTTATTGT TGCATTTAAT TAAAAATAAA
ATTACCCAGA AACAAATAGA CTCCTTTAAG GAGAAGTTTA ATTATCTTTC TATATAG
 
Protein sequence
MKIKNVHIHN FRSILDADFP LESYSILAGE NNSGKTTFIN ALRVFYEDIK FNKAEDFPKM 
ETSDVESWIE VEFITVQEEQ DLLKEEYRSA DKILRVRKYL ASSSIEIKST QSNIYGYENG
VLSTTLFYGA KNISQAKLGN ILYIPELSRT EEVMNLSKAA SPLKSIVEYV LGKILKASSS
FSELNKAFEV FNKEFQAESS PEGLSINKMK EDINSELKDW GVSLGININA VSPEVITKNL
LSHYLQDSKL GEQEINTSSV GQGLQRHIIY SLIKIASKYQ DPKEIKKKDF SPDFTLILFE
EPEAFLHPSQ QQILNIDLES IAKGDNEQVI ATTHSPIFVS KNINNLPSLI RLSREERNKW
ETKSFNISKE KLELLLSDNA GLEACFKSTV ALPACQEELK KALAKSLASG TLEGFSGSDK
EIMNICMWLD TERANAFFSK HVIICEGATE KVLLEYLFAT HWKDFAKKHI YCLDSLGKFN
IHRFINLFDS LGIYHSVVYD SDNNRGVHEI VNKFIQSKKS AYTQNLLALS GDVESEFGIT
KPANTHLKPA NLLLHLIKNK ITQKQIDSFK EKFNYLSI