Gene ECH74115_1038 details


Gene Information

Locus tag: ECH74115_1038
Symbol:
ID: 6968722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1050904
End bp: 1052562
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 53%
IMG OID: 643385050
Product: hypothetical protein
Protein accession: YP_002269550
Protein GI: 209396552
COG category: [L] Replication, recombination and repair
COG ID: [COG3593] Predicted ATP-dependent endonuclease of the OLD family
TIGRFAM ID:
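
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. The function names gene_length and protein_length are hypothetical helpers introduced here for illustration; they are not part of the database. The sketch assumes inclusive 1-based coordinates and a coding sequence that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the final codon is a stop codon that is not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


if __name__ == "__main__":
    length = gene_length(1050904, 1052562)
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the values listed in this record, the computed lengths agree with the Gene Length and Protein Length fields.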


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTTG AGCGCGTTGA AATTGTGGGT TTTCGCGGTA TCAACCGTTT GTCGTTGATG 
CTGGAACAAA ACAACGTCCT GATTGGGGAG AACGCGTGGG GTAAATCCAG CTTGCTGGAC
GCCTTAACCC TGCTGCTATC GCCAGAATCA GATCTCTACC ATTTTGAGCG CGACGATTTC
TGGTTCCCGC CGGGAGATAT CAACGGGCGA GAACATCATC TGCATATTAT TTTGACCTTC
CGCGAATCGC TGCCAGGTCG ACATCGGGTT CGCCGTTATC GGCCGCTGGA AGCGTGCTGG
ACGCCATGCA CCGATGGCTA TCACCGTATT TTTTATCGTC TGGAAGGGGA GAGTGCGGAA
GACGGCAGCG TGATGACACT GCGCAGTTTT CTCGATAAAG ACGGACATCC GATTGATGTT
GAGGATATTA ACGATCAGGC ACACCATCTG GTGCGTTTAA TGCCGGTGCT GCGCTTGCGT
GATGCCCGTT TTATGCGCCG TATTCGTAAC GGCACGGTGC CAAATGTCCC CAATGTGGAA
GTCACCGCGC GCCAGCTCGA TTTCCTCGCC CGTGAGTTGT CCTCACATCC GCAAAATCTC
TCTGATGGAC AGATTCGTCA GGGACTTTCC GCAATGGTGC AGTTGCTTGA GCATTATTTC
TCTGAGCAGG GGGCTGGACA GGCGCGATAT CGTTTAATGC GGCGGCGGGC CAGCAATGAG
CAACGAAGCT GGCGCTATCT GGACATCATC AACCGGATGA TTGACCGACC TGGAGGGCGC
TCGTATCGGG TTATTTTGCT CGGCCTGTTT GCTACTTTAT TGCAGGCAAA AGGCACATTG
CGACTGGATA AAGACGCCCG TCCATTGTTG CTGATCGAAG ATCCAGAAAC GCGTTTACAC
CCCATTATGC TTTCAGTTGC CTGGCATCTG TTGAATCTTT TGCCTTTACA ACGTATTGCT
ACCACCAACT CTGGCGAGCT GCTTTCGTTA ACTCCAGTGG AACATGTTTG CCGCCTGGTG
CGTGAATCTT CTCGTGTTGC CGCCTGGCGT CTGGGGCCAA GTGGCTTGAG CACCGAAGAC
AGCCGCCGCA TCTCTTTTCA CATTCGTTTT AATCGTCCGT CATCGCTGTT TGCTCGCTGC
TGGTTGCTGG TGGAAGGGGA AACGGAAACC TGGGTTATCA ATGAACTGGC GCGTCAGTGC
GGACATCATT TTGATGCCGA AGGGATCAAG GTCATTGAGT TTGCCCAGTC CGGGCTAAAG
CCACTGGTTA AATTTGCCCG CCGAATGGGG ATTGAATGGC ATGTACTGGT CGATGGCGAT
GAAGCAGGGA AGAAATATGC CGCTACGGTA CGCAGCCTGT TGAATAATGA TCGGGAAGCC
GAACGAGAAC ATTTAACGGC GTTACCGGCG CTGGATATGG AACATTTTAT GTATCGCCAG
GGATTTTCCG ATGTGTTCCA CCGCGTGGCG CAAATCCCGG AAAATGTCCC GATGAATCTG
CGCAAAATTA TCTCGAAAGC GATCCATCGC TCTTCCAAAC CCGATCTTGC CATTGAAGTG
GCAATGGAGG CCGGACGTCG TGGTGTGGAC TCCGTACCGA CGCTGCTGAA AAAAATGTTC
TCACGCGTGC TGTGGCTGGC GCGCGGTCAC GCGGATTAA
 
Protein sequence
MILERVEIVG FRGINRLSLM LEQNNVLIGE NAWGKSSLLD ALTLLLSPES DLYHFERDDF 
WFPPGDINGR EHHLHIILTF RESLPGRHRV RRYRPLEACW TPCTDGYHRI FYRLEGESAE
DGSVMTLRSF LDKDGHPIDV EDINDQAHHL VRLMPVLRLR DARFMRRIRN GTVPNVPNVE
VTARQLDFLA RELSSHPQNL SDGQIRQGLS AMVQLLEHYF SEQGAGQARY RLMRRRASNE
QRSWRYLDII NRMIDRPGGR SYRVILLGLF ATLLQAKGTL RLDKDARPLL LIEDPETRLH
PIMLSVAWHL LNLLPLQRIA TTNSGELLSL TPVEHVCRLV RESSRVAAWR LGPSGLSTED
SRRISFHIRF NRPSSLFARC WLLVEGETET WVINELARQC GHHFDAEGIK VIEFAQSGLK
PLVKFARRMG IEWHVLVDGD EAGKKYAATV RSLLNNDREA EREHLTALPA LDMEHFMYRQ
GFSDVFHRVA QIPENVPMNL RKIISKAIHR SSKPDLAIEV AMEAGRRGVD SVPTLLKKMF
SRVLWLARGH AD
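
The gene and protein sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before they can be compared. The sketch below is one possible way to do that and to translate the nucleotide sequence; clean_sequence and translate are hypothetical helper names, not part of the database. It assumes the sequence is given in the coding orientation, and it uses the amino acid assignments of translation table 11, which match the standard genetic code (alternative start codons are not handled specially).

```python
from itertools import product

# Codon table built in the conventional TCAG order. Translation table 11
# assigns the same amino acids as the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print 10-character blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    print(translate("ATGATTCTTG AGCGCGTTGA AATTGTGGGT"))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the protein sequence listed above.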