Gene EcSMS35_2283 details

Gene Information

Locus tag: EcSMS35_2283
Symbol:
ID: 6144478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2310775
End bp: 2312433
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 53%
IMG OID: 641617158
Product: hypothetical protein
Protein accession: YP_001744331
Protein GI: 170682448
COG category: [L] Replication, recombination and repair
COG ID: [COG3593] Predicted ATP-dependent endonuclease of the OLD family
TIGRFAM ID:
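
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of the database.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
start_bp = 2310775
end_bp = 2312433


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1659 552
```

Both values match the Gene Length (1659 bp) and Protein Length (552 aa) fields of the record.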


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0243665
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.014671
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTTG AGCGCGTTGA AATTGTGGGT TTTCGCGGTA TCAACCGTTT GTCGTTGATG 
CTGGAACAAA ACAACGTCCT GATTGGGGAG AACGCGTGGG GTAAATCCAG CTTGCTGGAC
GCCTTAACCC TGCTGCTATC GCCAGAATCA GATCTCTACC ATTTTGAGCG CGACGATTTC
TGGTTCCCGC CGGGAGATAT CAACGGACGG GAACATCATT TGCATATTAT TTTGACCTTC
CGCGAATCGC TGCCAGGCCG ACATCGGGTA CGCCGCTATC GGCCACTGGA AGCGTGCTGG
ACGCCATGCA CCGATGGCTA TCACCGTATT TTTTATCGTC TGGAAGGGGA GAGTGCGGAA
GACGGCAGCG TGATGACACT CCGCAGTTTT CTCGATAAAG ACGGACATCC GATTGATGTC
GAGGATATTA ACGATCAGGC ACGCCATCTG GTGCGTTTAA TGCCGGTGCT GCGCTTGCGT
GATGCCCGTT TTATGCGCCG TATTCGTAAC GGCACGGTGC CAAATGTCCC TAATGTGGAA
GTCACCGCGC GCCAGCTCGA TTTCCTCGCC CGTGAGTTAT CCTCACATCC GCAAAATCTC
TCTGATGGGC AGATTCGTCA GGGACTTTCC GCAATGGTAC AGCTGCTTGA GCATTATTTC
TCTGAGCAGG GGGCCGGACA GGCGCGATAT CGTTTAATGC GGCGGCGAGC CAGCAATGAG
CAACGAAGCT GGCGCTATCT GGATATCATC AACCGGATGA TTGACCGACC TGGAGGGCGC
TCGTATCGGG TTATTTTGCT CGGCCTGTTT GCTACTTTGT TGCAGGCAAA AGGCACATTG
CGACTGGATA AAGACGCCCG TCCATTGTTG CTGATCGAAG ATCCAGAAAC CCGTTTACAC
CCCATTATGC TTTCAGTTGC CTGGCATCTG TTGAATCTTC TGCCATTGCA GCGCATCGCT
ACTACCAACT CGGGAGAGTT GCTTTCGTTA ACGCCGGTAG AGCATGTTTG CCGACTGGTA
CGTGAGTCCT CGCGCGTTGC CGCCTGGCGT CTGGGGCCGA GTGGTTTGAG TACCGAAGAT
AGCCGACGCA TCTCTTTCCA CATTCGTTTT AATCGTCCAT CATCGCTGTT TGCACGCTGC
TGGCTGCTGG TGGAAGGGGA AACAGAAACC TGGGTTATCA ACGAACTGGC GCGCCAGTGT
GGACACCATT TCGATGCCGA AGGGATTAAG GTCATTGAAT TTGCCCAGTC CGGGCTAAAA
CCACTGGTGA AATTTGCCCG GCGAATGGGG ATTGAATGGC ATGTACTGGT CGATGGCGAT
GAAGCAGGGA AGAAATATGC CGCTACGGTA CGCAGCCTGT TGAATAACGA TCGGGAAGCC
GAACGAGAAC ATTTAACGGC GTTACCGGCG CTGGATATGG AACATTTTAT GTATCGCCAG
GGATTTTCCG ATGTGTTCCA CCGCGTGGCG CAAATCCCGG AAAATGTACC GATGAATCTG
CGCAAAATTA TCTCGAAAGC GATCCATCGC TCTTCCAAAC CCGATCTTGC CATTGAAGTG
GCAATGGAGG CCGGACGTCG TGGTGTGGAT TCCGTACCGA CGCTGCTGAA AAAAATGTTC
TCACGCGTGC TGTGGCTGGC GCGCGGTCGC GCGGATTAA
 
Protein sequence
MILERVEIVG FRGINRLSLM LEQNNVLIGE NAWGKSSLLD ALTLLLSPES DLYHFERDDF 
WFPPGDINGR EHHLHIILTF RESLPGRHRV RRYRPLEACW TPCTDGYHRI FYRLEGESAE
DGSVMTLRSF LDKDGHPIDV EDINDQARHL VRLMPVLRLR DARFMRRIRN GTVPNVPNVE
VTARQLDFLA RELSSHPQNL SDGQIRQGLS AMVQLLEHYF SEQGAGQARY RLMRRRASNE
QRSWRYLDII NRMIDRPGGR SYRVILLGLF ATLLQAKGTL RLDKDARPLL LIEDPETRLH
PIMLSVAWHL LNLLPLQRIA TTNSGELLSL TPVEHVCRLV RESSRVAAWR LGPSGLSTED
SRRISFHIRF NRPSSLFARC WLLVEGETET WVINELARQC GHHFDAEGIK VIEFAQSGLK
PLVKFARRMG IEWHVLVDGD EAGKKYAATV RSLLNNDREA EREHLTALPA LDMEHFMYRQ
GFSDVFHRVA QIPENVPMNL RKIISKAIHR SSKPDLAIEV AMEAGRRGVD SVPTLLKKMF
SRVLWLARGR AD
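
Both sequences above are printed in space-separated groups of 10 characters, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATTCTTG AGCGCGTTGA AATTGTGGGT TTTCGCGGTA TCAACCGTTT GTCGTTGATG"

seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # 60 ATG
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence block to `clean_sequence` in the same way gives a string whose length and GC fraction can be compared with the Gene Length and GC content fields.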