Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1745 |
Symbol | |
ID | 6969126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1680466 |
End bp | 1682937 |
Gene Length | 2472 bp |
Protein Length | 823 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385697 |
Product | exonuclease family protein |
Protein accession | YP_002270189 |
Protein GI | 209396209 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000113116 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0000795102 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTAAAG TCTTTATTTG CGCCGCCATT CCGGACGAAC AGGCAATAAA GGAAGAAGGT GCAGTCGCTG TAGCCACTGC CATTGAAGCC GGTGATGAAC GTCGCGCCCG CGCAAAATTT CACTGGCAAT TCCTGGAGCA TTATCCGGCT GCTCAGGACT GCGCTTATAA ATTTCTTGTT TGCGAGGATA AACCCGGTAT ACCCCGCCCT GCCCTCGATT CCTGGGATGC TGAATATATG CAGGAAAACC GCTGGGATGA GGGGGCTGCT TCCTTTGTCC CGGTTGAGAC TGAATCCGAT CCGATGAACG TCGCTTTTGA CAAGCTGGCC CCTGAAGTAC AGAACGCTGT CATGGTTAAG TTCGACACAT GTGAAAACAT CACCGTTGAT ATGGTGATTA GCGCGCAGGA ATTGTTGCAG GAAGACATGG CAACATTCGA CGGACATATC GTTGAAGCGT TGATGAAAAT GCCAGAAGTT AACGCCATGT ATCCGGAGCT TAAGCTGCAT GCCATCGGGT GGGTTAAGCA TAAATGTAAG CCTGGTGCCA AATGGCCCGA AATTCAGGCA GAAATGCGCA TCTGGAAAAA ACGTCGCGAA GGTGAACGCA AGGAAACCGG AAAATACACG TCTGTTGTTG ATCTCGCCCG CGCCAGAGTC CACCGACAGC ACACTGAAAA CTCAGCAGAA AAAATCCCCC CTGTCACTGC AGTCATTCGT CGCGAATATA AGCAGACATG GAAAACACTG GATGACGAAC TGGCCTACGC TCTCTGGCCT GGTGATGTGG ATGCCGGAAA CATTGACGGC AGCATCCATC GCTGGGCAAA AAATGAAGTT ATCGACAACG ACCGCGAAGA CTGGAAGCGT ATCTCGGCAT CAATGCGCAA ACAGCCTGAT GCCCTTCGCT ACGACCGCCA GACTATTTTT GGCCTTGTCC GTGAACGTCC GATCGACATT CACAAAGACC CTGTGGCACT GAACAAATAC ATTACTGAAT ACCTGACTAC AAAGGGCGTG TTTGAAGATG AAGGAACAAA TCAGAGCGCA ACTGACACTC TCTCGTCGCC AGTACCAGAA ACTGATGCAG TGGAAACGGC AATTCCGGAC AACGAAAAAA CCGAATGCAA AGTGGAAGTC GAACCATCTG TAGAGCGTGA GGGGCCGTTC TACTTCCTCT TCACCGACAA GGATGGCGAA AAATACGGTC GCGCAAACAA ACTTTCTGGT CTGGATAAGG CGCTGGCTGC CGGGGCTACT GAAATCACAA AAGAAGAATA TTTTGCCCGA AAAAATGGCA CATACACAGG CTTACCGCAA AATGCAAATA CCGCACAAAA TTCTGAACAA CCAGAACCGG TAAAAGTTAC CGCTGACGAA GTAAAGAAAA TTATGCAGGC AGCCAATATC AGCCAGCCTG ACGCCAATCA GTTGCTCGCC GCATCACGTG GTGAATTTGT TGCAGGGATT AGCGACCCGA ATGATCCGAA ATGGGTGAAG GGGATTGAAA CCCGCGATTC TGTGAACCAG AACCAGCAAG AAACGGAACA GAACGACCAG AAAGCGGAAC AAAACAGCCC AAATACGCAA CAAAACGAGC CAGAAACGAA ACAACCTGAG CCAGTAGCGC AACAGGAACC GGAAAAAGTC TGCACCGCCT GCGGTCAGAC CGGCGGCGGC AACTGCCCTG ATTGTGGCGC GGTGATGGGC GACGCAACAT ACCAGGAAAC ATTCGATGAA GAGTATCAGG TTGAAGTTCA GGAAGATGAT CCGGAGGAAA TGGAAGGCGC TGAACATCCA CACAAGGAGA ACACTGACGG CAATCAGCAT CACGATAGCG ATAATGAAAC TGGCGAGACG GCAGATCACT CAATTAAGGT GAACGGTCAT CAAGAAATCA CATCCACCAG CAGGACGTGT GACCATCTAA TGATCGACCT TGAAACCATG GGAAAAAATC CTGATGCCCC GATCATCTCA ATAGGTGCAA TATTTTTCGA TCCGCAAACC GGAGATATGG GACCGGAATT TAGTAAGACT ATCGATCTGG AAACTGCTGG CGGGGTCATT GATCGGGACA CCATTAAATG GTGGCTTAAG CAATCACGCG AAGCGCAATC TGCCATTATG ACCGATGAAA TCCCGTTAGA TGATGCACTA TTACAATTGC GGGAATTTAT CGACGAAAAC TCCGGTGAAT TTTTTGTTCA GGTCTGGGGA AATGGAGCCA ACTTCGACAA CACGATTTTG CGCCGTTCAT ACGAACGGCA GGGGATCCCC TGCCCGTGGC GTTACTACAA CGATCGCGAT GTACGCACAA TCGTTGAGCT GGGGAAAGCC ATAGACTTCG ATGCCAGAAC GGCTATTCCA TTCGAAGGTG AGCGCCATAA TGCACTTGAT GACGCCCGTT ACCAGGCAAA ATACGTTTCA GTTATCTGGC AAAAACTGAT CCCGAATCAG GCTGATTTTT AA
|
Protein sequence | MSKVFICAAI PDEQAIKEEG AVAVATAIEA GDERRARAKF HWQFLEHYPA AQDCAYKFLV CEDKPGIPRP ALDSWDAEYM QENRWDEGAA SFVPVETESD PMNVAFDKLA PEVQNAVMVK FDTCENITVD MVISAQELLQ EDMATFDGHI VEALMKMPEV NAMYPELKLH AIGWVKHKCK PGAKWPEIQA EMRIWKKRRE GERKETGKYT SVVDLARARV HRQHTENSAE KIPPVTAVIR REYKQTWKTL DDELAYALWP GDVDAGNIDG SIHRWAKNEV IDNDREDWKR ISASMRKQPD ALRYDRQTIF GLVRERPIDI HKDPVALNKY ITEYLTTKGV FEDEGTNQSA TDTLSSPVPE TDAVETAIPD NEKTECKVEV EPSVEREGPF YFLFTDKDGE KYGRANKLSG LDKALAAGAT EITKEEYFAR KNGTYTGLPQ NANTAQNSEQ PEPVKVTADE VKKIMQAANI SQPDANQLLA ASRGEFVAGI SDPNDPKWVK GIETRDSVNQ NQQETEQNDQ KAEQNSPNTQ QNEPETKQPE PVAQQEPEKV CTACGQTGGG NCPDCGAVMG DATYQETFDE EYQVEVQEDD PEEMEGAEHP HKENTDGNQH HDSDNETGET ADHSIKVNGH QEITSTSRTC DHLMIDLETM GKNPDAPIIS IGAIFFDPQT GDMGPEFSKT IDLETAGGVI DRDTIKWWLK QSREAQSAIM TDEIPLDDAL LQLREFIDEN SGEFFVQVWG NGANFDNTIL RRSYERQGIP CPWRYYNDRD VRTIVELGKA IDFDARTAIP FEGERHNALD DARYQAKYVS VIWQKLIPNQ ADF
|
| |