Gene ECH74115_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1139 
Symbol 
ID6967605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1165946 
End bp1168417 
Gene Length2472 bp 
Protein Length823 aa 
Translation table11 
GC content49% 
IMG OID643385142 
Productexonuclease family protein 
Protein accessionYP_002269641 
Protein GI209396180 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.53396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAA TCTTTATTTG CGCTGCTATT CCTGACGAAC AGGCCATAAA AGAAGATAGC 
GCTGTTGCGG TGGCCACTGC CATTGAAGCC GGTGATGAGC GTCGCGCACG CGCAAAATTT
CACTGGCAAT TTCTGGAGCA ATTCCCGGCA GCTCAGGACT GCGCTTATAA ATTTATTGTC
TGCGAGGATA AACCCGGCAT ACCCCGCCCT GCCCTCGATT CCTGGGATAC CGAATATATG
CAGGAAAACC GCTGGGATGA GGAGTCAGCT TCCTTTGTAC CGGTCGAACC AGAATCCGAT
CCGATGAACG TCAATTTTGA CAAGCTGTCC CCTGAAGTAC AGAACGCAGT CCTGGTTAAA
TTCGACACAT GCGAAAACAT CACCGTTGAT ATGGTGATTA GCGCACAGGA ATTACTACAG
GAAGACATGG CAACATTCGG CGGACATATC GTTGAAGCGT TGATGAAAAT GCCAGAAGTT
AACGCCATGT ATCCTGAACT TAAACTGCAT GCCATCGGGT GGGTTAAGCA TAAATGTGAG
CCTGGCGCTA AATGGCCTGA AATTCAGGCA GAGATGCGCA TCTGGAAAAA ACGTCGCGAA
GGTGAACGCA AGGAAACCGG AAAATACACG TCTGTTGTTG ATCTCACCCG CGCCAGAGTC
AACCAACAGA ACACTGAAAA CGCTGCTGAA AAAACCGGGG CTGTCACTGT TGCCATTCGC
CGCGAATACA AACAGACATG GAAAACACTC GACAATGAAC TGGCCTGCGC CCTCTGGCCC
GGTGATGTGG ATGCAGGAAA CATTGACGGT ACCATCCATC GCTGGGCGAC AAATGAGGTT
ATCGACAAGG ATCGCGAAGA CTGGAAGCGT ATCTCAGCAT CAATGCGCAA ACAGCCCGAA
GCACTTGGCT ATGACCGTCA GACTATTTTT GGCCTTGTTC GCGAACGTCC GATCGATATT
CACAAAGATC CCGTTGCACT GAACAAATAT ATCAGTGAAT ACCTGACGAC AAAGGGCGTG
TTTGAACATG AAGAAACAGA CCAGAGCTCT ACTGATGCTC TCCAGCCGTC AGCAGCACAA
ACTGCTCCAG TGGAGACGGC AGAATCCGAT ACTCAAAAAA ATGAAATCCT GGTGGAAGCT
GAACCATCTG TAGAGCGTGA AGGACCATTT TATTTCGTCT TTACCGATAA GGGCGGGGAA
AAATACGGCA GGGCAAACAA ACTTTCTGGT CTGGACAAGG CGCTGGCTGC CGGCGGTACC
GAAATCTCAA AAGAAGAATA TTTTGCCCGA AAAAATGGCA CATACACGGG CTTACCGCAA
AATGTGGATA CCGCTGAAGA TTCCGAACAA TCAGAGCCGG TAAAAGTTAC CGCTGACGAA
GTAAACAAAA TTATGCAGGC AGCCAATATC AGCCAGCCTG ACGCCGATAA ATTGCTTGCT
GCATCACGTG GTGAATTTGT TGAAGGGATT AGCGACCCGA ATGATCCGAA ATGGGTGAAA
GGGATCCAGA GTCGCGACGC TGAGGACCAG AATCAGCCCA ACGTGAAACA AAATGAGCCA
GAAGCGGAAC AAAACAGCCC GGATACGCAA CAAAACGGGC CAGAAGAACA ACAACCAGAA
CCAGCAGTGC AACAGGAACT GGAAAAAGTT TGCACCGCAT GCGGTCAGAC CGGTGGCGGC
AACTGTCCTG ACTGTGGTGC GGTGATGGGG AACGCAACCT ACCTGGAAAC ATTCGATGAA
GAGAATCAGG CTGAAGCTCA GAAAAATGAT CCGGAGGAAA TGGAAGGCAC TGAACATCTG
CACAAGGAGA ACACTGGCAG CGATCAGTAT CACGCCAGCG ATAATAAAAC TGGCGAAACA
GCAAATCCCT TAATTAAAGT GAACGGTCAT CATGAAATCT CATCCACCAG CAGGTTGTGG
CACCATCTGA TGATTGACCT TGAAACAATG GGAAAAAATC CTGATGCGCC AATAAACTCT
ATAGCCGGTA AGTTTTTTGA TCCGGCAACC GGAGAGATGG GGCCAGAATT CAGCAAAACT
ATCGATCTGG AAACCGCAGG TGGGGTCATC GATCGGGACA CCATTAAGTG GTGGCTGAAA
CAGTCACGCG AAGCACAATC CTCCATTCTG ACCGATGAAA TCACGTTGGA TGATGCACTG
CTGCAATTCC GGGAATTTAT CGACGAAAAC TCCGGTGAAT TTTTTGTTCA GGTCTGGGGT
AACGGTGCAA CTTTCGACAA CGTGATTTTA CGCCGTTCAT ATGAACGGCA GGGGATCCCC
TGCCCGTGGC GTTACACCAA TGATCGCGAT GTAAGAACGA TGGTTGCTCT GGGACTGGTG
ATGGATTTCG ACGCAAGAAC GACTATTCCA TTCGAAGGTG AACGCCATAA CGCCCTGCAC
GATGCACGTT ACCAGGCGAA ATACGTTTCA GCCATCTGGC AAAAACTGCT CCCGAGTCAG
GCTGATTTTT GA
 
Protein sequence
MSKIFICAAI PDEQAIKEDS AVAVATAIEA GDERRARAKF HWQFLEQFPA AQDCAYKFIV 
CEDKPGIPRP ALDSWDTEYM QENRWDEESA SFVPVEPESD PMNVNFDKLS PEVQNAVLVK
FDTCENITVD MVISAQELLQ EDMATFGGHI VEALMKMPEV NAMYPELKLH AIGWVKHKCE
PGAKWPEIQA EMRIWKKRRE GERKETGKYT SVVDLTRARV NQQNTENAAE KTGAVTVAIR
REYKQTWKTL DNELACALWP GDVDAGNIDG TIHRWATNEV IDKDREDWKR ISASMRKQPE
ALGYDRQTIF GLVRERPIDI HKDPVALNKY ISEYLTTKGV FEHEETDQSS TDALQPSAAQ
TAPVETAESD TQKNEILVEA EPSVEREGPF YFVFTDKGGE KYGRANKLSG LDKALAAGGT
EISKEEYFAR KNGTYTGLPQ NVDTAEDSEQ SEPVKVTADE VNKIMQAANI SQPDADKLLA
ASRGEFVEGI SDPNDPKWVK GIQSRDAEDQ NQPNVKQNEP EAEQNSPDTQ QNGPEEQQPE
PAVQQELEKV CTACGQTGGG NCPDCGAVMG NATYLETFDE ENQAEAQKND PEEMEGTEHL
HKENTGSDQY HASDNKTGET ANPLIKVNGH HEISSTSRLW HHLMIDLETM GKNPDAPINS
IAGKFFDPAT GEMGPEFSKT IDLETAGGVI DRDTIKWWLK QSREAQSSIL TDEITLDDAL
LQFREFIDEN SGEFFVQVWG NGATFDNVIL RRSYERQGIP CPWRYTNDRD VRTMVALGLV
MDFDARTTIP FEGERHNALH DARYQAKYVS AIWQKLLPSQ ADF