Gene Information |
Locus tag | EcSMS35_2283 |
Symbol | |
ID | 6144478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2310775 |
End bp | 2312433 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617158 |
Product | hypothetical protein |
Protein accession | YP_001744331 |
Protein GI | 170682448 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3593] Predicted ATP-dependent endonuclease of the OLD family |
TIGRFAM ID | |
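The length fields in this table can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein. The record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2310775
end_bp = 2312433

# Assumption: coordinates are 1-based and inclusive on the replicon,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1659 552
```

Both results agree with the Gene Length (1659 bp) and Protein Length (552 aa) fields. The strand does not affect this arithmetic, because the coordinates are given on the replicon regardless of orientation.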
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0243665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.014671 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATTCTTG AGCGCGTTGA AATTGTGGGT TTTCGCGGTA TCAACCGTTT GTCGTTGATG CTGGAACAAA ACAACGTCCT GATTGGGGAG AACGCGTGGG GTAAATCCAG CTTGCTGGAC GCCTTAACCC TGCTGCTATC GCCAGAATCA GATCTCTACC ATTTTGAGCG CGACGATTTC TGGTTCCCGC CGGGAGATAT CAACGGACGG GAACATCATT TGCATATTAT TTTGACCTTC CGCGAATCGC TGCCAGGCCG ACATCGGGTA CGCCGCTATC GGCCACTGGA AGCGTGCTGG ACGCCATGCA CCGATGGCTA TCACCGTATT TTTTATCGTC TGGAAGGGGA GAGTGCGGAA GACGGCAGCG TGATGACACT CCGCAGTTTT CTCGATAAAG ACGGACATCC GATTGATGTC GAGGATATTA ACGATCAGGC ACGCCATCTG GTGCGTTTAA TGCCGGTGCT GCGCTTGCGT GATGCCCGTT TTATGCGCCG TATTCGTAAC GGCACGGTGC CAAATGTCCC TAATGTGGAA GTCACCGCGC GCCAGCTCGA TTTCCTCGCC CGTGAGTTAT CCTCACATCC GCAAAATCTC TCTGATGGGC AGATTCGTCA GGGACTTTCC GCAATGGTAC AGCTGCTTGA GCATTATTTC TCTGAGCAGG GGGCCGGACA GGCGCGATAT CGTTTAATGC GGCGGCGAGC CAGCAATGAG CAACGAAGCT GGCGCTATCT GGATATCATC AACCGGATGA TTGACCGACC TGGAGGGCGC TCGTATCGGG TTATTTTGCT CGGCCTGTTT GCTACTTTGT TGCAGGCAAA AGGCACATTG CGACTGGATA AAGACGCCCG TCCATTGTTG CTGATCGAAG ATCCAGAAAC CCGTTTACAC CCCATTATGC TTTCAGTTGC CTGGCATCTG TTGAATCTTC TGCCATTGCA GCGCATCGCT ACTACCAACT CGGGAGAGTT GCTTTCGTTA ACGCCGGTAG AGCATGTTTG CCGACTGGTA CGTGAGTCCT CGCGCGTTGC CGCCTGGCGT CTGGGGCCGA GTGGTTTGAG TACCGAAGAT AGCCGACGCA TCTCTTTCCA CATTCGTTTT AATCGTCCAT CATCGCTGTT TGCACGCTGC TGGCTGCTGG TGGAAGGGGA AACAGAAACC TGGGTTATCA ACGAACTGGC GCGCCAGTGT GGACACCATT TCGATGCCGA AGGGATTAAG GTCATTGAAT TTGCCCAGTC CGGGCTAAAA CCACTGGTGA AATTTGCCCG GCGAATGGGG ATTGAATGGC ATGTACTGGT CGATGGCGAT GAAGCAGGGA AGAAATATGC CGCTACGGTA CGCAGCCTGT TGAATAACGA TCGGGAAGCC GAACGAGAAC ATTTAACGGC GTTACCGGCG CTGGATATGG AACATTTTAT GTATCGCCAG GGATTTTCCG ATGTGTTCCA CCGCGTGGCG CAAATCCCGG AAAATGTACC GATGAATCTG CGCAAAATTA TCTCGAAAGC GATCCATCGC TCTTCCAAAC CCGATCTTGC CATTGAAGTG GCAATGGAGG CCGGACGTCG TGGTGTGGAT TCCGTACCGA CGCTGCTGAA AAAAATGTTC TCACGCGTGC TGTGGCTGGC GCGCGGTCGC GCGGATTAA
Protein sequence | MILERVEIVG FRGINRLSLM LEQNNVLIGE NAWGKSSLLD ALTLLLSPES DLYHFERDDF WFPPGDINGR EHHLHIILTF RESLPGRHRV RRYRPLEACW TPCTDGYHRI FYRLEGESAE DGSVMTLRSF LDKDGHPIDV EDINDQARHL VRLMPVLRLR DARFMRRIRN GTVPNVPNVE VTARQLDFLA RELSSHPQNL SDGQIRQGLS AMVQLLEHYF SEQGAGQARY RLMRRRASNE QRSWRYLDII NRMIDRPGGR SYRVILLGLF ATLLQAKGTL RLDKDARPLL LIEDPETRLH PIMLSVAWHL LNLLPLQRIA TTNSGELLSL TPVEHVCRLV RESSRVAAWR LGPSGLSTED SRRISFHIRF NRPSSLFARC WLLVEGETET WVINELARQC GHHFDAEGIK VIEFAQSGLK PLVKFARRMG IEWHVLVDGD EAGKKYAATV RSLLNNDREA EREHLTALPA LDMEHFMYRQ GFSDVFHRVA QIPENVPMNL RKIISKAIHR SSKPDLAIEV AMEAGRRGVD SVPTLLKKMF SRVLWLARGR AD
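The two sequence fields can be checked against each other by translating the gene sequence. The following is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and that the space-separated 10-letter blocks are display formatting only. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits. Because this gene starts with ATG, no special start-codon handling is included. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon-to-amino-acid assignments, in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the display spacing between 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGATTCTTG AGCGCGTTGA AATTGTGGGT"
print(translate(first_blocks))  # prints: MILERVEIVG
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.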