Gene EcolC_2720 details


Gene Information

Locus tag: EcolC_2720
Symbol:
ID: 6066620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2993113
End bp: 2994771
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 53%
IMG OID: 641602126
Product: ATP-dependent OLD family endonuclease
Protein accession: YP_001725675
Protein GI: 170020721
COG category: [L] Replication, recombination and repair
COG ID: [COG3593] Predicted ATP-dependent endonuclease of the OLD family
TIGRFAM ID:
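
The coordinates and lengths in this table are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2993113
END_BP = 2994771
GENE_LENGTH_BP = 1659
PROTEIN_LENGTH_AA = 552


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons print True for the values in the table.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 2994771 - 2993113 + 1 = 1659 bp, and 1659 / 3 = 553 codons, one of which is the stop codon, leaving 552 amino acids.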


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.560965
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTTG AGCGCGTTGA AATTGTGGGT TTTCGCGGTA TCAACCGTTT GTCGTTGATG 
CTGGAACAAA ACAACGTCCT GATTGGGGAG AACGCGTGGG GTAAATCCAG CTTGCTGGAC
GCCTTAACTC TGCTGCTATC GCCAGAATCA GATCTCTACC ATTTTGAGCG CGACGATTTC
TGGTTCCCGC CGGGAGATAT CAACGGGCGA GAACATCATC TGCATATTAT TTTGACCTTC
CGCGAATCGC TGCCAGGCCG ACATCGGGTT CGCCGTTATC GGCCGCTGGA AGCGTGCTGG
ACGCCATGCA CCGATGGCTA TCACCGTATT TTTTATCGTC TGGAAGGGGA GAGTGCGGAA
GACGGCAGCG TGATGACACT GCGCAGTTTT CTCGATAAAG ACGGACATCC GATTGATGTC
GAGGATATTA ACGATCAGGC ACGCCATCTG GTGCGTTTAA TGCCGGTGCT GCGCTTGCGT
GATGCCCGTT TTATGCGCCG TATTCGTAAC GGCACGGTGC CAAATGTCCC TAATGTGGAA
GTCACCGCGC GCCAGCTCGA TTTCCTCGCC CGTGAGTTAT CCTCACATCC GCAAAATCTC
TCTGATGGGC AGATTCGTCA GGGACTTTCC GCAATGGTAC AGCTGCTTGA GCATTATTTC
TCTGAGCAGG GGGCCGGACA GGCGCGATAT CGTTTAATGC GGCGGCGAGC CAGCAATGAG
CAACGAAGCT GGCGCTATCT GGATATCATC AACCGGATGA TTGACCGACC TGGTGGGCGC
TCGTATCGGG TTATTTTGCT CGGCCTGTTT GCTACTTTGT TGCAGGCAAA AGGCACATTG
CGACTGGATA AAGACGCCCG TCCATTGTTG CTGATCGAAG ATCCAGAAAC CCGTTTACAC
CCCATTATGC TTTCAGTTGC CTGGCATCTG TTGAATCTTC TGCCATTGCA GCGCATTGCC
ACCACCAACT CGGGTGAGTT GCTTTCGTTA ACGCCGGTAG AGCATGTTTG CCGACTGGTA
CGTGAGTCCT CGCGCGTTGC CGCCTGGCGT CTGGGGCCGA GTGGCTTGAG TACCGAAGAT
AGCCGACGCA TATCCTTTCA CATTCGTTTT AACCGTCCGT CATCGCTGTT TGCACGCTGC
TGGTTGCTGG TGGAAGGGGA AACGGAAACC TGGGTTATCA ATGAACTGGC GCGTCAGTGC
GGACATCATT TTGATGCCGA AGGGATCAAG GTCATTGAGT TTGCCCAGTC CGGGCTAAAG
CCACTGGTTA AATTTGCCCG CCGAATGGGG ATTGAATGGC ATGTACTGGT CGATGGCGAT
GAAGCAGGGA AGAAATATGC CGCTACGGTA CGCAGCCTGT TGAATAACGA TCGGGAAGCC
GAACGAGAAC ATTTAACGGC GTTACCGGCG CTGGATATGG AACATTTTAT GTATCGCCAG
GGATTTTCCG ATGTGTTCCA CCGCATGGCG CAAATCCCGG AAAATGTACC GATGAATCTA
CGCAAAATTA TCTCGAAAGC GATCCATCGC TCTTCCAAAC CCGATCTTGC CATTGAAGTG
GCAATGGAGG CAGGACGTCG TGGTGTGGAC TCCGTACCGA CGCTGCTGAA AAAAATGTTC
TCACGCGTGC TGTGGCTGGC GCGCGGTCGC GCGGATTAA
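
The GC content reported in the Gene Information table can be recomputed from a sequence laid out in blocks of ten like the one above. The following is a minimal sketch, assuming the whitespace between blocks is purely cosmetic and that the percentage is the share of G and C bases over all bases. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGATTCTTG AGCGCGTTGA AATTGTGGGT TTTCGCGGTA TCAACCGTTT GTCGTTGATG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of `first_line` gives the whole-gene value to compare against the table.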
 
Protein sequence
MILERVEIVG FRGINRLSLM LEQNNVLIGE NAWGKSSLLD ALTLLLSPES DLYHFERDDF 
WFPPGDINGR EHHLHIILTF RESLPGRHRV RRYRPLEACW TPCTDGYHRI FYRLEGESAE
DGSVMTLRSF LDKDGHPIDV EDINDQARHL VRLMPVLRLR DARFMRRIRN GTVPNVPNVE
VTARQLDFLA RELSSHPQNL SDGQIRQGLS AMVQLLEHYF SEQGAGQARY RLMRRRASNE
QRSWRYLDII NRMIDRPGGR SYRVILLGLF ATLLQAKGTL RLDKDARPLL LIEDPETRLH
PIMLSVAWHL LNLLPLQRIA TTNSGELLSL TPVEHVCRLV RESSRVAAWR LGPSGLSTED
SRRISFHIRF NRPSSLFARC WLLVEGETET WVINELARQC GHHFDAEGIK VIEFAQSGLK
PLVKFARRMG IEWHVLVDGD EAGKKYAATV RSLLNNDREA EREHLTALPA LDMEHFMYRQ
GFSDVFHRMA QIPENVPMNL RKIISKAIHR SSKPDLAIEV AMEAGRRGVD SVPTLLKKMF
SRVLWLARGR AD
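
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative: it builds the codon-to-amino-acid mapping from the compact 64-letter string used for NCBI translation table 11, which assigns the same amino acids as the standard code and differs only in the set of permitted start codons. Since this gene starts with ATG, no special handling of alternative start codons is included. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with one amino acid per codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate block-formatted DNA; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First three blocks and last nine bases of the gene sequence.
print(translate("ATGATTCTTG AGCGCGTTGA AATTGTGGGT"))  # MILERVEIVG
print(translate("GCGGATTAA"))                         # AD*
```

The first call reproduces the opening block of the protein sequence, and the second shows the final two residues followed by the TAA stop codon.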