Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1638 |
Symbol | |
ID | 6969319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1582057 |
End bp | 1585455 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643385598 |
Product | hypothetical protein |
Protein accession | YP_002270092 |
Protein GI | 209397540 |
COG category | [S] Function unknown |
COG ID | [COG4733] Phage-related protein, tail component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.498718 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAAAG GCAGCAGTAA GGGGCATACC CCGCGCGAAG CGAAGGACAA CCTGAAGTCC ACGCAGCTGC TGAGTGTGAT CGATGCCATC AGCGAAGGGC CGGTTGAAGG TCCGGTGGAT GGATTAAAAA GCGTGCTGCT GAACAGTACG CCGGTGCTGG ACAGTGAGGG GAATACCAAT ATATCCGGCG TCACGGTGGT GTTCCGGGCC GGTGAGCAGG AGCAGACACC GCCGGAGGGA TTTGAATCCT CCGGCTCCGA GACGGTGCTC GGTACAGAAG TGAAATATGA CACGCCGATC ACCCGGACCA TCACGTCGGC AAACATTGAC CGTCTGCGTT TTACTTTCGG CGTGCAGGCA CTGGTGGAAA CCACCTCAAA GGGGGACAGG AATCCATCGG AAGTCCGCCT GCTGGTTCAG ATACAACGTA ACGGTGGCTG GGTGACGGAA AAAGACATCA CCATTAAGGG TAAAACCACT TCACAGTATC TGGCCTCGGT GGTGGTGGAT AACCTGCCGC CGCGCCCGTT TAATATCCGG ATGCGCAGGA TGACGCCGGA CAGCACCACA GACCAGCTGC AGAACAAAAC GCTCTGGTCG TCATACACCG AAATTATCGA TGTGAAACAG TGCTACCCGA ACACGGCACT GGTCGGCGTG CAGGTGGACT CGGAGCAGTT CGGCAGCCAG AAGGTGAGCC GTAATTATCA TCTGCGCGGG CGTATTCTGC AGGTGCCGTC GAATTATAAC CCGCAGACGC GGCAATACAG CGGTATCTGG GACGGAACGT TTAAACCGGC ATACAGCAAC AACATGGCCT GGTGTCTGTG GGATATGCTG ACCCATCCGC GCTACGGCAT GGGGAAACGT CTTGGTGCGG CGGATGTGGA TAAATGGGCG CTGTATGTCA TCGGCCAGTA CTGCGACCAG TCAGTGCCGG ACGGCTTTGG CGGCACGGAG CCGCGCATCA CCTGTAATGC CTACCTGACC ACACAGCGTA AGGCGTGGGA TGTTCTCAGC GATTTCTGCT CGGCGATGCG CTGTATGCCG GTATGGAACG GGCAGACGCT GACGTTCGTG CAGGACCGAC CGTCGGATAA GGTGTGGACC TATAACCGCA GTAATGTGGT GATGCCGGAT GATGGCGCGT CGTTCCGCTA CAGCTTCAGC GCCCTCAAGG ACCGCCATAA TGCCGTTGAG GTGAACTGGA TTGACCCGGA TAACGGCTGG GAGACGGCGA CAGAGCTTGT GGAGGACACG CAGGCCATTC TCCGTTACGG TCGTAACGTC ACGAAGATGG ATGCCTTTGG CTGTACCAGC CGGGGGCAGG CACACCGCGC CGGGCTGTGG CTGATTAAAA CGGAGCTGCT GGAGACGCAG ACCGTGGATT TCAGCGTGGG CGCAGAAGGG CTTCGCCATG TACCGGGCGA TGTCATTGAA ATCTGTGATG ATGACTATGC GGGGATCAGC ATCGGTGGGC GTGTGCTGGC GGTGAACAGC CAGACCCGGA CGCTGACGCT CGACCGTGAA ATCACGCTGC CATCCTCCGG TACCACGCTG ATAAGCCTGG TTGACGGAAG TGGCAATCCG GTCAGCGTGG AGGTCCAGTC CGTCACCGAC GGCGTGAAGG TAAAAGTGAG CCGTGTTCCT GACGGCGTTG CCGGATACAG CGTATGGGGG CTGAAGTTGC CGACGTTGCG CCAGCGCCTG TTCCGCTGCG TGAGTATCCG TGAGAACGAC GACGGCACGT ATGCCATCAC CGCCGTGCAG CATGTACCCG AAAAAGAAGC CATCGTGGAT AACGGGGCGC ACTTTGACGG CGACCTGAGC GGCACGGTGA ATGGCGTCAC GCCGCCCGCG GTGCAGCACC TGACTGCCGA AGTCACCGCA GACAGCGGGG AATATCAGGT GCTGGCGCGC TGGGACACGC CGAAGGTGGT GAAGGGGGTG AGCTTCCTGC TTCGCCTGAC CGTGGCAGCG GACGATGGCA GTGAGCGGCT GGTCAGTACG GCCAGGACGA CGGAAACCAC ATACCGCTTC ACGCAACTGG CGCTGGGGAA CTACAGGCTG ACTGTCCGGG CGGTAAATGC GTGGGGACAG CAGGGCGATC CGGCATCGGT ATCGTTCCGG ATTGCCGCAC CGGCAGCGCC GTCTCGGATT GAGCTGACAC CAGGCTATTT TCAGATAACC GCCACGCCGC ATCTTGCGGT TTATGATCCG ACGGTACAGT TTGAGTTCTG GTTCTCGGAA ACGCGGATTG CGGATATCAG GCAGGTTGAA ACCAGCGCGC GTTATCTTGG TACGGCGCTG TACTGGATAG CCGCCAGTAT CAATATCAAA CCGGGCCATG ATTATTACTT TTATATCCGC AGTGTGAACA CCGTTGGCAA ATCGGCATTC GTGGAGGCCG TCGGTCGGGC GAGCGATGAT GCGGAAGGTT ATCTGGATTT TTTCAAAGGC AAGATAACCG AATCTCATCT TGGTAAAGAG CTACTGGAAA AAGTTGACCT GACGGAGGAT AACGCCAGCA GACTGGATGA GTTTTCGAAA GAGTGGAAGG ATGCTAACGA TAAGTGGAAT GCCATGTGGG GCGTCAAAAT TGAGCAGACC AAAGACGGCA AACATTATGT CGCGGGTATT GGCCTCAGCA TGGAGGACAC GGAGGAAGGC AAGCTGAGCC AGTTTCTGGT TGCCGCCAAT CGTATCGCGT TTATTGACCC GGCAAACGGG AATGAAACGC CGATGTTTGT GGCGCAGGGC AACCAGATAT TCATGAACGA CGTGTTCCTG AAGCGCCTGA CGGCCCCCAC CATTACCAGC GGTGGAAATC CACCGGCATT TTCCCTGACA CCGGACGGAA AGCTGACTGC TAAAAATGCG GATATCAGTG GCAGTGTGAA TGCGAACTCC GGGACGCTCA ACAACGTCAC GATTAATGAG AACTGTCAGA TTAAGGGGAA ACTGTCAGCC AATCAGATTG AAGGCGATAT TGTCAAAACG GTCAGCAAGT CTTTCCCCCG CACGAGCACT TATGCCAGTG GCACCATCAC GGTAAGAATC AGTGATGATC AGAAATTTGA CCGGCAGGTC ATGATACCGC CAGTGTTATT CCGCGGTGGT AAGCATGAGA ATTTCAACAG TAATAACCAA CAGTCATACT GGTATTCAAC CTGCCGGTTA AGAGTGACCC GCAATGGCCA GGAGATTTTT AATCAGTCCA CGACGGATGC TCAGGGCGTA TTTTCCTCAG TTATAGATAT GCCTGCCGGA CAGGGGACGC TGACACTGAC ATTCACCGTA TCTTCATCAG GAGCGAATAA CTGGACACCA ACAACCAGTA TCAGCGATCT GCTGGTTGTG GTGATGAAAA AATCCACAGC AGGTATCAGT ATCAGCTGA
|
Protein sequence | MGKGSSKGHT PREAKDNLKS TQLLSVIDAI SEGPVEGPVD GLKSVLLNST PVLDSEGNTN ISGVTVVFRA GEQEQTPPEG FESSGSETVL GTEVKYDTPI TRTITSANID RLRFTFGVQA LVETTSKGDR NPSEVRLLVQ IQRNGGWVTE KDITIKGKTT SQYLASVVVD NLPPRPFNIR MRRMTPDSTT DQLQNKTLWS SYTEIIDVKQ CYPNTALVGV QVDSEQFGSQ KVSRNYHLRG RILQVPSNYN PQTRQYSGIW DGTFKPAYSN NMAWCLWDML THPRYGMGKR LGAADVDKWA LYVIGQYCDQ SVPDGFGGTE PRITCNAYLT TQRKAWDVLS DFCSAMRCMP VWNGQTLTFV QDRPSDKVWT YNRSNVVMPD DGASFRYSFS ALKDRHNAVE VNWIDPDNGW ETATELVEDT QAILRYGRNV TKMDAFGCTS RGQAHRAGLW LIKTELLETQ TVDFSVGAEG LRHVPGDVIE ICDDDYAGIS IGGRVLAVNS QTRTLTLDRE ITLPSSGTTL ISLVDGSGNP VSVEVQSVTD GVKVKVSRVP DGVAGYSVWG LKLPTLRQRL FRCVSIREND DGTYAITAVQ HVPEKEAIVD NGAHFDGDLS GTVNGVTPPA VQHLTAEVTA DSGEYQVLAR WDTPKVVKGV SFLLRLTVAA DDGSERLVST ARTTETTYRF TQLALGNYRL TVRAVNAWGQ QGDPASVSFR IAAPAAPSRI ELTPGYFQIT ATPHLAVYDP TVQFEFWFSE TRIADIRQVE TSARYLGTAL YWIAASINIK PGHDYYFYIR SVNTVGKSAF VEAVGRASDD AEGYLDFFKG KITESHLGKE LLEKVDLTED NASRLDEFSK EWKDANDKWN AMWGVKIEQT KDGKHYVAGI GLSMEDTEEG KLSQFLVAAN RIAFIDPANG NETPMFVAQG NQIFMNDVFL KRLTAPTITS GGNPPAFSLT PDGKLTAKNA DISGSVNANS GTLNNVTINE NCQIKGKLSA NQIEGDIVKT VSKSFPRTST YASGTITVRI SDDQKFDRQV MIPPVLFRGG KHENFNSNNQ QSYWYSTCRL RVTRNGQEIF NQSTTDAQGV FSSVIDMPAG QGTLTLTFTV SSSGANNWTP TTSISDLLVV VMKKSTAGIS IS
|
| |