Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3177 |
Symbol | |
ID | 6967414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2931219 |
End bp | 2933663 |
Gene Length | 2445 bp |
Protein Length | 814 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643386998 |
Product | exonuclease family protein |
Protein accession | YP_002271465 |
Protein GI | 209395771 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000796892 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00000000000000114787 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCACTG ATAAAGAAGA AATTGCACTG TATTACGAAG CCAAAAATGA CAAAGTCAGA AAACGCCTTG GGATTAAAGG CGGTTTTTAC TGGCGCACAG CAAAAAAATT ATCGGTTGCA ATATCACGGG GTGTTGTCGC AATGGACGAT GCTGGATTTG ACGAAGAGGA TTTCAAAAAA CCTGTTCGCG TGAATTTGCC CATTGTTAAT GACCTGCCGC CTGAAGGTGT GTTCGATACT GAATTCTGCA ACCGCTATGA AAAAGGCGGG GAAGATGGCA TCACAATGAT ATTTATAGCG CCTTCCCCCT CAGTTCAGGA CAAACCAGCC AGCTCTGACA ATACCAACGT CAATGGCGAA GACATGGCTG AGATTGAGGA TAATATGCTC CTGCCGATTT CCGGTCAGGA ACTGCCCATT CGCTGGCTTG CGCAACATGG CAGCGAAAAA CCGGTAACGC ACGTTTCACG GGAAGAACTT CAGGCATTAC ATATCGCACG AGCTGAAGAA CTGCCTGCTG TTACTGCCCT GGCTATTTCC CACAACACAA AGCTGCTCGA CCCGCTGGAG ATTCGCGACC TTCACAAACT GGTACGCGAC ACAGACAAAG TTTTCCCTAA TCCCGTTAAT TCCAGTCTGG GGTTAATGAC TGCTTTTTTC GAAGCATACC TGGACGCTGA CTATACCGAT CGAGGTCTGC TGACAAAAGA GTGGATGAAA GGAAATCGTG TTTTACGCAT CAGCCGCACG CCATCCGGCG CTAATGCTGG CGGAGGAATT CTTACCGATC GCGGTGAAAG TTTTGTCCAC GATGATGCGT CAGTAGAACG TGACGTTGCC GCTGGCGTTC TGGCCCGTTC AATGGACATC GATATTTACA ATCCACATCC GGCACACGCC AAACGCATTG AAGAAATCGT TTCAGAGAAT AAGCCGCCCT TTTCTGTTTT TCGTGACAAA TTCATCGCCA TGCCTGGTCA CCTGGATTAT TCCCGCGCGA TAGTGGTTGC GTCCGTGAAA GAAGCACCAA TTGGTATCGA GGCTACTCCC CACCGTGTTA CCGAATATCT GAACAAAGTA CTGACCGAAA CCGACCATGC CAACCCTGAT CCAGAAATCG TGGATATTGC CTGCGGTCGC TCCTCTGCTC CAATGCCGCA GCGTGTAACA AAAGAAGGAA AACAGGATGA TGAAGAAAAA CCGCAGCCAT CTGGCGCAAT GGCAGATGAA CAGGCAACGA CTGAAGCAGT GGAACCGGAT ACAACTGAAC ATAATCAGGA CACGCAGTCG ATGGATGCTC AGCCACAGAT AAATTCTGTT GATGCGAAAT ATCAGAAACT GCGGGCAGAA CTCCATGAAG CCCGGAAAAA CATTCCGCCC CAAAATCCTG TCGATGCAGA CAAATTACTG GCTGCCTCTC GCGGAGAATT TGTTGAAGGG ATTAGCGACC CGAATGATCC GAAATGGATT AAGGGGATCC AGACCCGCGA TTCTGTGTAC CAGAATCAGC CAGAAACGGA ACAGAACGAC CAGAAAGCGG AACAGAACAG CCCAAATACG CAACAAAACG AGCCAGAAAC GAAACAACCT GAACCAGTAG TGCAACAGGA ACCGGAAAAG ATCTGCACCG CCTGCGGTCA GAGGAGTGGC GGCAACTGCC CTGATTGTGG CGCGGTGATG GGCGACGCAA CATACCAGGA AACATTCGAT GACAAGAACC TGGTTGAAGT TCAGGAAGAC GATTCGGAGA AAATGGAAGG CGCTGAACAT CCACACAAGG AGAATGCTGG CAGCGCTCAG GACCACGCCA GCGATAGTGA AACTGGCGAG ACGGCAGATC CCTTAATTAC GGTGAACGGT CATCGCATTA TCACATCCAC CAGCAGGACG TGTGACCATC TAATGATCGA CCTTGAAACC ATGGGAAAAA ATCCTGATGC CCCGATTATC TCAATAGGTG CAATATTTTT CGATCCGCAA ACCGGAGATA TGGGACCGGA ATTTAGTAAG ACTATCGATC TGGAAACTGC TGGCGGAGTC ATTGATCGGG ACACCATTAA ATGGTGGCTT AAGCAATCAC GCGAAGCGCA ATCTGCCATT ATGACCGATG AAATCCCGTT AGATGATGCA CTGTTACAAT TGCGGGAATT TATCGACGAA AACTCCGGTG AATTTTTTGT TCAGGTCTGG GGAAATGGAG CCAACTTCGA CAACACGATT TTGCGCCGTT CATACGAACG GCAGGGGATC CCCTGCCCGT GGCGTTACTA CAACGATCGC GATGTACGCA CAATCGTTGA GCTGGGGAAA GCCATAGACT TCGATGCCAG AACGGCTATT CCATTCGAAG GTGAGCGCCA CAATGCGCTG GATGACGCTC GTTACCAGGC AAAATACGTT TCAGCTATCT GGCAAAAACT GATCCCGAAT CAGGCTGATT TTTAA
|
Protein sequence | MSTDKEEIAL YYEAKNDKVR KRLGIKGGFY WRTAKKLSVA ISRGVVAMDD AGFDEEDFKK PVRVNLPIVN DLPPEGVFDT EFCNRYEKGG EDGITMIFIA PSPSVQDKPA SSDNTNVNGE DMAEIEDNML LPISGQELPI RWLAQHGSEK PVTHVSREEL QALHIARAEE LPAVTALAIS HNTKLLDPLE IRDLHKLVRD TDKVFPNPVN SSLGLMTAFF EAYLDADYTD RGLLTKEWMK GNRVLRISRT PSGANAGGGI LTDRGESFVH DDASVERDVA AGVLARSMDI DIYNPHPAHA KRIEEIVSEN KPPFSVFRDK FIAMPGHLDY SRAIVVASVK EAPIGIEATP HRVTEYLNKV LTETDHANPD PEIVDIACGR SSAPMPQRVT KEGKQDDEEK PQPSGAMADE QATTEAVEPD TTEHNQDTQS MDAQPQINSV DAKYQKLRAE LHEARKNIPP QNPVDADKLL AASRGEFVEG ISDPNDPKWI KGIQTRDSVY QNQPETEQND QKAEQNSPNT QQNEPETKQP EPVVQQEPEK ICTACGQRSG GNCPDCGAVM GDATYQETFD DKNLVEVQED DSEKMEGAEH PHKENAGSAQ DHASDSETGE TADPLITVNG HRIITSTSRT CDHLMIDLET MGKNPDAPII SIGAIFFDPQ TGDMGPEFSK TIDLETAGGV IDRDTIKWWL KQSREAQSAI MTDEIPLDDA LLQLREFIDE NSGEFFVQVW GNGANFDNTI LRRSYERQGI PCPWRYYNDR DVRTIVELGK AIDFDARTAI PFEGERHNAL DDARYQAKYV SAIWQKLIPN QADF
|
| |