Gene ECH74115_2815 details


Gene Information

Locus tag: ECH74115_2815
Symbol:
ID: 6968054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2615841
End bp: 2618312
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 11
GC content: 49%
IMG OID: 643386665
Product: exonuclease family protein
Protein accession: YP_002271141
Protein GI: 209395796
COG category:
COG ID:
TIGRFAM ID:
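The start, end, gene length, and protein length values in this record are related by simple arithmetic. The sketch below checks that relationship; it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(2615841, 2618312)
print(gene_length)                           # 2472
print(expected_protein_length(gene_length))  # 823
```

Both results agree with the Gene Length (2472 bp) and Protein Length (823 aa) fields.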


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00747401
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAA TCTTTATTTG CGCTGCTATT CCTGACGAAC AGGCCATAAA AGAAGATAGC 
GCTGTTGCGG TGGCCACTGC CATTGAAGCC GGTGATGAGC GTCGCGCACG CGCAAAATTT
CACTGGCAAT TTCTGGAGCA ATTCCCGGCA GCTCAGGACT GCGCTTATAA ATTTATTGTC
TGCGAGGATA AACCCGGCAT ACCCCGCCCT GCCCTCGATT CCTGGGATAC CGAATATATG
CAGGAAAACC GCTGGGATGA GGAGTCAGCT TCCTTTGTAC CGGTCGAACC AGAATCCGAT
CCGATGAACG TCAATTTTGA CAAGCTGTCC CCTGAAGTAC AGAACGCAGT CCTGGTTAAA
TTCGACACAT GCGAAAACAT CACCGTTGAT ATGGTGATTA GCGCACAGGA ATTACTACAG
GAAGACATGG CAACATTCGG CGGACATATC GTTGAAGCGT TGATGAAAAT GCCAGAAGTT
AACGCCATGT ATCCTGAACT TAAACTGCAT GCCATCGGGT GGGTTAAGCA TAAATGTGAG
CCTGGCGCTA AATGGCCTGA AATTCAGGCA GAGATGCGCA TCTGGAAAAA ACGTCGCGAA
GGTGAACGCA AGGAAACCGG AAAATACACG TCTGTTGTTG ATCTCGCCCG CGCCAGAGTC
AACCAACAGA ACCCTGAAAA CGCTGCTGAA AAAACCGGGG CTGTCACTGT TGCCATTCGC
CGCGAATACA AACAGACATG GAAAACACTC GACAATGAAC TGGCCTGCGC CCTCTGGCCC
GGTGATGTGG ATGCAGGAAA CATTGACGGT ACCATCCATC GCTGGGCGAC AAATGAGGTT
ATCGACAAGG ATCGCGAAGA CTGGAAGCGT ATCTCAGCAT CAATGCGCAA ACAGCCCGAA
GCACTTGGCT ATGACCGTCA GACTATTTTT GGCCTTGTTC GCGAACGTCC GATCGATATT
CACAAAGATC CCGTTGCACT GAACAAATAT ATCAGTGAAT ACCTGACGAC AAAGGGCGTG
TTTGAACATG AAGAAACAGA CCAGAGCTCT ACTGATGCTC TCCAGCCGTC AGCAGCACAA
ACTGCTCCAG TGGAGACGGC AGAATCCGAT ACTCAAAAAA ATGAAATCCT GGTGGAAGCT
GAACCATCTG TAGAGCGTGA AGGACCATTT TATTTCGTCT TTACCGATAA GGGCGGGGAA
AAATACGGCA GGGCAAACAA ACTTTCTGGT CTGGACAAGG CGCTGGCTGC CGGCGGTACC
GAAATCTCAA AAGAAGAATA TTTTGCCCGA AAAAATGGCA CATACACGGG CTTACCGCAA
AATGTGGATA CCGCTGAAGA TTCCGAACAA TCAGAGCCGG TAAAAGTTAC CGCTGACGAA
GTAAACAAAA TTATGCAGGC AGCCAATATC AGCCAGCCTG ACGCCGATAA ATTGCTTGCT
GCATCACGTG GTGAATTTGT TGAAGGGATT AGCGACCCGA ATGATCCGAA ATGGGTGAAA
GGGATCCAGA GTCGCGACGC TGAGGACCAG AATCAGCCCA ACGTGAAACA AAATGAGCCA
GAAGCGGAAC AAAACAGCCC GGATACGCAA CAAAACGGGC CAGAAGAACA ACAACCAGAA
CCAGCAGTGC AACAGGAACT GGAAAAAGTT TGCACCGCAT GCGGTCAGAC CGGTGGCGGC
AACTGTCCTG ACTGTGGTGC GGTGATGGGG AACGCAACCT ACCTGGAAAC ATTCGATGAA
GAGAATCAGG CTGAAGCTCA GAAAAATGAT CCGGAGGAAA TGGAAGGCAC TGAACATCTG
CACAAGGAGA ACACTGGCAG CGATCAGTAT CACGCCAGCG ATAATAAAAC TGGCGAAACA
GCAAATCCCT TAATTAAAGT GAACGGTCAT CATGAAATCT CATCCACCAG CAGGTTGTGG
CACCATCTGA TGATTGACCT TGAAACAATG GGAAAAAATC CTGATGCGCC AATAAACTCT
ATAGCCGGTA AGTTTTTTGA TCCGGCAACC GGAGAGATGG GGCCAGAATT CAGCAAAACT
ATCGATCTGG AAACCGCAGG TGGGGTCATC GATCGGGACA CCATTAAGTG GTGGCTGAAA
CAGTCACGCG AAGCACAATC CTCCATTCTG ACCGATGAAA TCACGTTGGA TGATGCACTG
CTGCAATTCC GGGAATTTAT CGACGAAAAC TCCGGTGAAT TTTTTGTTCA GGTCTGGGGT
AACGGTGCAA CTTTCGACAA CGTGATTTTA CGCCGTTCAT ATGAACGGCA GGGGATCCCC
TGCCCGTGGC GTTACACCAA TGATCGCGAT GTAAGAACGA TGGTTGCTCT GGGACTGGTG
ATGGATTTCG ACGCAAGAAC GACTATTCCA TTCGAAGGTG AACGCCATAA CGCCCTGCAC
GATGCGCGTT ACCAGGCAAA ATACGTTTCA GCCATCTGGC AAAAACTGCT CCCGAGTCAG
GCTGATTTTT AA
 
Protein sequence
MSKIFICAAI PDEQAIKEDS AVAVATAIEA GDERRARAKF HWQFLEQFPA AQDCAYKFIV 
CEDKPGIPRP ALDSWDTEYM QENRWDEESA SFVPVEPESD PMNVNFDKLS PEVQNAVLVK
FDTCENITVD MVISAQELLQ EDMATFGGHI VEALMKMPEV NAMYPELKLH AIGWVKHKCE
PGAKWPEIQA EMRIWKKRRE GERKETGKYT SVVDLARARV NQQNPENAAE KTGAVTVAIR
REYKQTWKTL DNELACALWP GDVDAGNIDG TIHRWATNEV IDKDREDWKR ISASMRKQPE
ALGYDRQTIF GLVRERPIDI HKDPVALNKY ISEYLTTKGV FEHEETDQSS TDALQPSAAQ
TAPVETAESD TQKNEILVEA EPSVEREGPF YFVFTDKGGE KYGRANKLSG LDKALAAGGT
EISKEEYFAR KNGTYTGLPQ NVDTAEDSEQ SEPVKVTADE VNKIMQAANI SQPDADKLLA
ASRGEFVEGI SDPNDPKWVK GIQSRDAEDQ NQPNVKQNEP EAEQNSPDTQ QNGPEEQQPE
PAVQQELEKV CTACGQTGGG NCPDCGAVMG NATYLETFDE ENQAEAQKND PEEMEGTEHL
HKENTGSDQY HASDNKTGET ANPLIKVNGH HEISSTSRLW HHLMIDLETM GKNPDAPINS
IAGKFFDPAT GEMGPEFSKT IDLETAGGVI DRDTIKWWLK QSREAQSSIL TDEITLDDAL
LQFREFIDEN SGEFFVQVWG NGATFDNVIL RRSYERQGIP CPWRYTNDRD VRTMVALGLV
MDFDARTTIP FEGERHNALH DARYQAKYVS AIWQKLLPSQ ADF
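The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the code below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes the sense-codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = clean("ATGAGTAAAA TCTTTATTTG CGCTGCTATT")
print(translate(prefix))  # MSKIFICAAI
```

The ten residues produced from the first 30 bases match the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.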