Gene ECH74115_1745 details


Gene Information

Locus tag: ECH74115_1745
Symbol:
ID: 6969126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1680466
End bp: 1682937
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 11
GC content: 50%
IMG OID: 643385697
Product: exonuclease family protein
Protein accession: YP_002270189
Protein GI: 209396209
COG category:
COG ID:
TIGRFAM ID:
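
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1680466
end_bp = 1682937

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "2472 823"
```

Under these assumptions the result agrees with the listed Gene Length (2472 bp) and Protein Length (823 aa).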


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000113116
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0000795102
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGTAAAG TCTTTATTTG CGCCGCCATT CCGGACGAAC AGGCAATAAA GGAAGAAGGT 
GCAGTCGCTG TAGCCACTGC CATTGAAGCC GGTGATGAAC GTCGCGCCCG CGCAAAATTT
CACTGGCAAT TCCTGGAGCA TTATCCGGCT GCTCAGGACT GCGCTTATAA ATTTCTTGTT
TGCGAGGATA AACCCGGTAT ACCCCGCCCT GCCCTCGATT CCTGGGATGC TGAATATATG
CAGGAAAACC GCTGGGATGA GGGGGCTGCT TCCTTTGTCC CGGTTGAGAC TGAATCCGAT
CCGATGAACG TCGCTTTTGA CAAGCTGGCC CCTGAAGTAC AGAACGCTGT CATGGTTAAG
TTCGACACAT GTGAAAACAT CACCGTTGAT ATGGTGATTA GCGCGCAGGA ATTGTTGCAG
GAAGACATGG CAACATTCGA CGGACATATC GTTGAAGCGT TGATGAAAAT GCCAGAAGTT
AACGCCATGT ATCCGGAGCT TAAGCTGCAT GCCATCGGGT GGGTTAAGCA TAAATGTAAG
CCTGGTGCCA AATGGCCCGA AATTCAGGCA GAAATGCGCA TCTGGAAAAA ACGTCGCGAA
GGTGAACGCA AGGAAACCGG AAAATACACG TCTGTTGTTG ATCTCGCCCG CGCCAGAGTC
CACCGACAGC ACACTGAAAA CTCAGCAGAA AAAATCCCCC CTGTCACTGC AGTCATTCGT
CGCGAATATA AGCAGACATG GAAAACACTG GATGACGAAC TGGCCTACGC TCTCTGGCCT
GGTGATGTGG ATGCCGGAAA CATTGACGGC AGCATCCATC GCTGGGCAAA AAATGAAGTT
ATCGACAACG ACCGCGAAGA CTGGAAGCGT ATCTCGGCAT CAATGCGCAA ACAGCCTGAT
GCCCTTCGCT ACGACCGCCA GACTATTTTT GGCCTTGTCC GTGAACGTCC GATCGACATT
CACAAAGACC CTGTGGCACT GAACAAATAC ATTACTGAAT ACCTGACTAC AAAGGGCGTG
TTTGAAGATG AAGGAACAAA TCAGAGCGCA ACTGACACTC TCTCGTCGCC AGTACCAGAA
ACTGATGCAG TGGAAACGGC AATTCCGGAC AACGAAAAAA CCGAATGCAA AGTGGAAGTC
GAACCATCTG TAGAGCGTGA GGGGCCGTTC TACTTCCTCT TCACCGACAA GGATGGCGAA
AAATACGGTC GCGCAAACAA ACTTTCTGGT CTGGATAAGG CGCTGGCTGC CGGGGCTACT
GAAATCACAA AAGAAGAATA TTTTGCCCGA AAAAATGGCA CATACACAGG CTTACCGCAA
AATGCAAATA CCGCACAAAA TTCTGAACAA CCAGAACCGG TAAAAGTTAC CGCTGACGAA
GTAAAGAAAA TTATGCAGGC AGCCAATATC AGCCAGCCTG ACGCCAATCA GTTGCTCGCC
GCATCACGTG GTGAATTTGT TGCAGGGATT AGCGACCCGA ATGATCCGAA ATGGGTGAAG
GGGATTGAAA CCCGCGATTC TGTGAACCAG AACCAGCAAG AAACGGAACA GAACGACCAG
AAAGCGGAAC AAAACAGCCC AAATACGCAA CAAAACGAGC CAGAAACGAA ACAACCTGAG
CCAGTAGCGC AACAGGAACC GGAAAAAGTC TGCACCGCCT GCGGTCAGAC CGGCGGCGGC
AACTGCCCTG ATTGTGGCGC GGTGATGGGC GACGCAACAT ACCAGGAAAC ATTCGATGAA
GAGTATCAGG TTGAAGTTCA GGAAGATGAT CCGGAGGAAA TGGAAGGCGC TGAACATCCA
CACAAGGAGA ACACTGACGG CAATCAGCAT CACGATAGCG ATAATGAAAC TGGCGAGACG
GCAGATCACT CAATTAAGGT GAACGGTCAT CAAGAAATCA CATCCACCAG CAGGACGTGT
GACCATCTAA TGATCGACCT TGAAACCATG GGAAAAAATC CTGATGCCCC GATCATCTCA
ATAGGTGCAA TATTTTTCGA TCCGCAAACC GGAGATATGG GACCGGAATT TAGTAAGACT
ATCGATCTGG AAACTGCTGG CGGGGTCATT GATCGGGACA CCATTAAATG GTGGCTTAAG
CAATCACGCG AAGCGCAATC TGCCATTATG ACCGATGAAA TCCCGTTAGA TGATGCACTA
TTACAATTGC GGGAATTTAT CGACGAAAAC TCCGGTGAAT TTTTTGTTCA GGTCTGGGGA
AATGGAGCCA ACTTCGACAA CACGATTTTG CGCCGTTCAT ACGAACGGCA GGGGATCCCC
TGCCCGTGGC GTTACTACAA CGATCGCGAT GTACGCACAA TCGTTGAGCT GGGGAAAGCC
ATAGACTTCG ATGCCAGAAC GGCTATTCCA TTCGAAGGTG AGCGCCATAA TGCACTTGAT
GACGCCCGTT ACCAGGCAAA ATACGTTTCA GTTATCTGGC AAAAACTGAT CCCGAATCAG
GCTGATTTTT AA
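
The gene sequence is laid out in blocks of ten bases, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as formatted in the record.
first_row = "ATGAGTAAAG TCTTTATTTG CGCCGCCATT CCGGACGAAC AGGCAATAAA GGAAGAAGGT"

seq = clean_sequence(first_row)
print(len(seq))  # prints 60
print(f"GC fraction of this row: {gc_fraction(seq):.2f}")
```

Applying the same two helpers to the full gene sequence would give a value to compare against the GC content field in the record.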
 
Protein sequence
MSKVFICAAI PDEQAIKEEG AVAVATAIEA GDERRARAKF HWQFLEHYPA AQDCAYKFLV 
CEDKPGIPRP ALDSWDAEYM QENRWDEGAA SFVPVETESD PMNVAFDKLA PEVQNAVMVK
FDTCENITVD MVISAQELLQ EDMATFDGHI VEALMKMPEV NAMYPELKLH AIGWVKHKCK
PGAKWPEIQA EMRIWKKRRE GERKETGKYT SVVDLARARV HRQHTENSAE KIPPVTAVIR
REYKQTWKTL DDELAYALWP GDVDAGNIDG SIHRWAKNEV IDNDREDWKR ISASMRKQPD
ALRYDRQTIF GLVRERPIDI HKDPVALNKY ITEYLTTKGV FEDEGTNQSA TDTLSSPVPE
TDAVETAIPD NEKTECKVEV EPSVEREGPF YFLFTDKDGE KYGRANKLSG LDKALAAGAT
EITKEEYFAR KNGTYTGLPQ NANTAQNSEQ PEPVKVTADE VKKIMQAANI SQPDANQLLA
ASRGEFVAGI SDPNDPKWVK GIETRDSVNQ NQQETEQNDQ KAEQNSPNTQ QNEPETKQPE
PVAQQEPEKV CTACGQTGGG NCPDCGAVMG DATYQETFDE EYQVEVQEDD PEEMEGAEHP
HKENTDGNQH HDSDNETGET ADHSIKVNGH QEITSTSRTC DHLMIDLETM GKNPDAPIIS
IGAIFFDPQT GDMGPEFSKT IDLETAGGVI DRDTIKWWLK QSREAQSAIM TDEIPLDDAL
LQLREFIDEN SGEFFVQVWG NGANFDNTIL RRSYERQGIP CPWRYYNDRD VRTIVELGKA
IDFDARTAIP FEGERHNALD DARYQAKYVS VIWQKLIPNQ ADF
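
The protein sequence can be checked against the gene sequence by translating codons. The following is a minimal sketch using a hand-built codon table, not the pipeline that produced this record. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code and differs only in which start codons are permitted. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for all 64 codons, with bases ordered T, C, A, G ("*" marks a stop).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block formatting
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAGTAAAG TCTTTATTTG CGCCGCCATT CCGGACGAAC AGGCAATAAA GGAAGAAGGT"
last_row = "GCTGATTTTT AA"

print(translate(first_row))  # prints "MSKVFICAAIPDEQAIKEEG"
print(translate(last_row))   # prints "ADF"
```

Both results match the protein sequence above: the first 20 residues (MSKVFICAAI PDEQAIKEEG) and the final three residues (ADF), which are followed by the TAA stop codon. The last row can be translated on its own because the rows before it contain 2460 bases, a multiple of three.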