Gene Information |
Locus tag | ECH74115_1038 |
Symbol | |
ID | 6968722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1050904 |
End bp | 1052562 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643385050 |
Product | hypothetical protein |
Protein accession | YP_002269550 |
Protein GI | 209396552 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3593] Predicted ATP-dependent endonuclease of the OLD family |
TIGRFAM ID | |
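The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the 1659 bp in the record) and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_bp // 3 - 1


length = gene_length_bp(1050904, 1052562)
print(length, protein_length_aa(length))  # 1659 552
```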
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTTG AGCGCGTTGA AATTGTGGGT TTTCGCGGTA TCAACCGTTT GTCGTTGATG CTGGAACAAA ACAACGTCCT GATTGGGGAG AACGCGTGGG GTAAATCCAG CTTGCTGGAC GCCTTAACCC TGCTGCTATC GCCAGAATCA GATCTCTACC ATTTTGAGCG CGACGATTTC TGGTTCCCGC CGGGAGATAT CAACGGGCGA GAACATCATC TGCATATTAT TTTGACCTTC CGCGAATCGC TGCCAGGTCG ACATCGGGTT CGCCGTTATC GGCCGCTGGA AGCGTGCTGG ACGCCATGCA CCGATGGCTA TCACCGTATT TTTTATCGTC TGGAAGGGGA GAGTGCGGAA GACGGCAGCG TGATGACACT GCGCAGTTTT CTCGATAAAG ACGGACATCC GATTGATGTT GAGGATATTA ACGATCAGGC ACACCATCTG GTGCGTTTAA TGCCGGTGCT GCGCTTGCGT GATGCCCGTT TTATGCGCCG TATTCGTAAC GGCACGGTGC CAAATGTCCC CAATGTGGAA GTCACCGCGC GCCAGCTCGA TTTCCTCGCC CGTGAGTTGT CCTCACATCC GCAAAATCTC TCTGATGGAC AGATTCGTCA GGGACTTTCC GCAATGGTGC AGTTGCTTGA GCATTATTTC TCTGAGCAGG GGGCTGGACA GGCGCGATAT CGTTTAATGC GGCGGCGGGC CAGCAATGAG CAACGAAGCT GGCGCTATCT GGACATCATC AACCGGATGA TTGACCGACC TGGAGGGCGC TCGTATCGGG TTATTTTGCT CGGCCTGTTT GCTACTTTAT TGCAGGCAAA AGGCACATTG CGACTGGATA AAGACGCCCG TCCATTGTTG CTGATCGAAG ATCCAGAAAC GCGTTTACAC CCCATTATGC TTTCAGTTGC CTGGCATCTG TTGAATCTTT TGCCTTTACA ACGTATTGCT ACCACCAACT CTGGCGAGCT GCTTTCGTTA ACTCCAGTGG AACATGTTTG CCGCCTGGTG CGTGAATCTT CTCGTGTTGC CGCCTGGCGT CTGGGGCCAA GTGGCTTGAG CACCGAAGAC AGCCGCCGCA TCTCTTTTCA CATTCGTTTT AATCGTCCGT CATCGCTGTT TGCTCGCTGC TGGTTGCTGG TGGAAGGGGA AACGGAAACC TGGGTTATCA ATGAACTGGC GCGTCAGTGC GGACATCATT TTGATGCCGA AGGGATCAAG GTCATTGAGT TTGCCCAGTC CGGGCTAAAG CCACTGGTTA AATTTGCCCG CCGAATGGGG ATTGAATGGC ATGTACTGGT CGATGGCGAT GAAGCAGGGA AGAAATATGC CGCTACGGTA CGCAGCCTGT TGAATAATGA TCGGGAAGCC GAACGAGAAC ATTTAACGGC GTTACCGGCG CTGGATATGG AACATTTTAT GTATCGCCAG GGATTTTCCG ATGTGTTCCA CCGCGTGGCG CAAATCCCGG AAAATGTCCC GATGAATCTG CGCAAAATTA TCTCGAAAGC GATCCATCGC TCTTCCAAAC CCGATCTTGC CATTGAAGTG GCAATGGAGG CCGGACGTCG TGGTGTGGAC TCCGTACCGA CGCTGCTGAA AAAAATGTTC TCACGCGTGC TGTGGCTGGC GCGCGGTCAC GCGGATTAA
|
Protein sequence | MILERVEIVG FRGINRLSLM LEQNNVLIGE NAWGKSSLLD ALTLLLSPES DLYHFERDDF WFPPGDINGR EHHLHIILTF RESLPGRHRV RRYRPLEACW TPCTDGYHRI FYRLEGESAE DGSVMTLRSF LDKDGHPIDV EDINDQAHHL VRLMPVLRLR DARFMRRIRN GTVPNVPNVE VTARQLDFLA RELSSHPQNL SDGQIRQGLS AMVQLLEHYF SEQGAGQARY RLMRRRASNE QRSWRYLDII NRMIDRPGGR SYRVILLGLF ATLLQAKGTL RLDKDARPLL LIEDPETRLH PIMLSVAWHL LNLLPLQRIA TTNSGELLSL TPVEHVCRLV RESSRVAAWR LGPSGLSTED SRRISFHIRF NRPSSLFARC WLLVEGETET WVINELARQC GHHFDAEGIK VIEFAQSGLK PLVKFARRMG IEWHVLVDGD EAGKKYAATV RSLLNNDREA EREHLTALPA LDMEHFMYRQ GFSDVFHRVA QIPENVPMNL RKIISKAIHR SSKPDLAIEV AMEAGRRGVD SVPTLLKKMF SRVLWLARGH AD
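The gene and protein sequences above can be related to each other, and to the GC content field, with a short script. The following is a minimal sketch, not the database's own method: it builds the codon table by hand in the usual NCBI ordering, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-residue assignments shared by the standard code and table 11,
# listed in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this record's gene starts with ATG, so that does not matter for it.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGATTCTTG AGCGCGTTGA AATTGTGGGT"))  # MILERVEIVG
```

Passing the full gene sequence to `translate` should give a string that can be compared with the protein sequence once its spaces are removed, and `gc_percent` on the same input can be compared with the GC content field.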