Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3134 |
Symbol | |
ID | 6969297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2906593 |
End bp | 2910063 |
Gene Length | 3471 bp |
Protein Length | 1156 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386959 |
Product | putative phage portal protein, lambda family |
Protein accession | YP_002271427 |
Protein GI | 209397642 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0740] Protease subunit of ATP-dependent Clp proteases [COG5511] Bacteriophage capsid protein |
TIGRFAM ID | [TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP [TIGR01539] phage portal protein, lambda family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000244044 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAATTA TTGATGATGT GATCGGCGTG TTTTCCCCCG GGTGGAAAGC AGCCAGACTG CGTTCAAGGG CGTTAATCAT GGCCTATGAG GCGGTGAAAC CGACCCGGAC ACATAAAGCC CGGCGGGAAA ATCGCTCTGC TGATCAGCTC AGTAAATACG GTGCGGTTTC CCTGCGGGAG CAGGCCCGTT TTCTGGATAT CAATCATGAC CTGGTGATTG GTGTGTTTGA CAAGCTGGAA GAGCGGGTGA TTGGTGCCAG GGGAATTATT GTGGAGCCTC AGCCATTACG AAAAAACGGG GAAATGGCGG CTGAGCTGGC TGCGGATATC CGCCGTTTGT GGGCTGAATG GTCCGTGAGT CCGGATGTGA CAGGGCAGTA TACCCGTCCT GTGCTTGAAC GTTTACTGCT GCGGACCTGG CTGCGGGATG GTGAAGTGTT TGCGCAGATG GTCAGTGGTG CGGGAAACGG TCTGGAACGG ACGGCGGGAG TGCCATTCTG GCTTGAGGCG ATGGAGCCGG ATTTTGTTCC CATGCGCACT GATGAATCCG CCGGACTGAA TCAGGGGGTT TTTCTTGATG AGTGGGGAAG ACCGAAAAAA TATCTGGTTT ATAAAAATTA TCCGGTCAGA GGCCGGCAGA GTGATACGAA AGAAATCGCT GCCGGAAAAA TGATCCACCT GAAGTTCACT CGTCGTCTGC ATCAGACGCG AGGCTCATCC ATGTTATCGG GGGTGCTGAT GCGGATCAGT GCCCTTAAGG AGTATGAGGA TGCGGAACTG ACAGCGGCGC GTATTGCTGC GGCGCTGGGA CTGTATATCC GTAAAGGTGA CGGACAGGAC TATGAAGATC CGGGGAGCAA AGAGACCGAG CGGGAAGTCC ATATCACCCC GGGTATTATT TATGACGATT TGCGCAAGGG CGAGGATATC GGCATGGTCA AATCTGACCG TCCCAATCCC AACCTTGAAA CTTTCCGCAA CGGCCAGTTG CGTGCAGTGG CAGCAGGCAG TCGTCTGAGT TTTTCCAGTG CGGCGCGTAA CTATAACGGC ACCTACAGCG CCCAGCGGCA GGAGTTGGTC GAGTCCACGG ATGGTTACCT GATCCTGCAG GACTGTTTTA TTGGCGCGGT AACCCGCCCG GTGTACCGGA CATGGCTGAA TATGGTGGTT GCGGCAGGTC TGCTGAAAAT TCCGGCGGAT GTGGAGATGA AAACGCTATA TAACGCGACG TATTCCGGTC CGGTGATGCC GTGGATCGAC CCGGTTAAGG AAGCTGAAGC CTGGAGAATT CAGATCCGGG GTGTGCAGCG ACAGAATCTG ACTGGGTGCG TGCCGGGGGC GCAATCCGGG ATGAGGTCAA ACGTCGCCGC AAGGCTGAAA TTGATGAAAA CAGCAGACTG GGGCTGGTCT TTGATACTGA CCCCGTCAAC GACAAAGGAG GCAACAGTGC CGGAACTGAA CGACAGTATC AGCGCGACAC CGAAAGCCAG CATGAAGAAT AAATCCTGGT TCAGGATGCA AGCTGGGGGG CCGGGTGACG CGGATATTTA TATTTATGAC GAGATTGGTT TCTGGGGAGT TACCGCGAAG CAGTTTGTCA GCGAACTGAA TGCACTGGGT GATATCACCC ACATTAATCT CCATATCAAT TCACCGGGTG GCGATGTCTT TGAAGGCATC GCCATTTTTA ATGCCCTGAA AAATCAGGGG GCGACCATTA CCGTGTATGT GGATGGCGTT GCCGCCTCGA TGGCATCTGT GATTGCGATG GCCGGTGATA CGGTCATTAT GCCGGAAAAT GCCTTCATGA TGATCCATAA GCCATGGGGA TTCAGTGGCG GGGATGCTGA GGATATGCGC AGTTATGCCG ATTTGCTGGA TAAAGTCGAA TCGGTACTGT TGCCAGCCTA TGCGCAGAAA ACCGGAAAAA CCACCGATGA AATTGCCGCC ATGCTGGCGG ATGAAACCTG GATGTCCGGT GCCGAATGTC TGGCACACGG ATTTGCTGAC CAGGTGACAC CCGCTGTTGA GGCAATGGCA TGTATTCAGT CAAAACGTAC AGAGGAATTT AAAAAGATGC CGGAATCCAT CCGAAACATG ATTACTCCGC CACGCAACAG TGCCCCGCGT GATACCACAG TGACAATCCC TGCACCGGCG GTAACAGAAC CATCACCGGT ACCGGCAGTG TCTGATGAGG CGACCATTCG CGCCCGCGTT ATGGCTGAGC AGAAAGCCCG CATGTCAGGC ATTAACGATC TGTTTGCCAT GTTCGGCGGT CGCTATCAGA CGCTTCAGGC ACAGTGCGTG GCTGATCCTG ACTGTTCGCT GGAAATGGCC CGTGAACGTC TGCTGAATGA AATGGGCAAG GAGTCCTCGC CGACCAACAA AAATACACCG GCCCATATTT ATGCCGGAAA CGGCAATTTT GTGGGGGACG GGATCCGCCA GGCGATGCTG GCCCGTGCCG GATTTGAAAA TGTCGAGAAG GATAACGCCT ATAACGGGAT GACCCTGCGT GAATGGGCTC GCATGTCACT GACGGAGCGC GGTATTGGGG TGGCCAGTTA TAACCCCATG CAGATGGTCG GGCTGGCGCT GACGCACAGC ACCTCTGATT TTGGCAATAT TCTGCTGGAT GTGTCGAACA AGGGGCTGAT CCAGGGCTGG GAGGAATCAG AAGAAACCTT CCAGAAGTGG ACCCGTAAGG GACGCCTGTC AGACTTCAAA ACAGCGTATC GCGTGGGGAT GGGCGGTTTT GGTTCTCTGC GCCAGGTTCG TGAGGGGGCG GAGTATAAAT ACATCACCAC CTCAGATCGC AAGGAGACCA TTGCACTGGC CACTTACGGG GAGATTTTCT CCATCACCCG CCAGGCCATT ATCAATGATG ATCTGAATAT GCTGGTGGAC GTGCCGATGA AGATGGGGCG TGCGGCGAAG GCAACGATTG GTGACCTGGT CTACAAGGTG CTGACGGATA ACCCGAAACT GTCCGACGGT AAGGCGCTGT TCCATGCCGA TCACAAAAAT ATTGCCACCG GGGGGATCTC CGTTTCCGGA CTGGATGCGG CCCGTCAGAT GATGCGCCTG CAGAAAGAAG GCGATCGTGC CCTGAATATC CGTCCGGCCT TTATGCTGGT ACCGGTGGCA CTGGAGACGG TGGCGAACCA GACCATCAAA TCGGCCAGTG TGAAAGGGGC GGATGCAAAC GCCGGTGTCA TTAACCCTAT CCAGAACTTT GCTGAGGTGA TTGCAGAAGC GCGTCTTGAT GCGGCAGACC CGAAAACCTG GTATCTGGCG GCGGCACAGG GCACTGACAC CATTGAAGTG GCCTGGCTGG ATGGTGTGGA CACGCCATAC ATTGATCAGC AGGAAGGTTT CACCACTGAC GGCATTGCCA CAAAAATCCG TATTGATGCC GGAGTGGCAC CACTTGACTG GCGCGGGCTG GTGCGTTCGT CGGTGGCCTG A
|
Protein sequence | MAIIDDVIGV FSPGWKAARL RSRALIMAYE AVKPTRTHKA RRENRSADQL SKYGAVSLRE QARFLDINHD LVIGVFDKLE ERVIGARGII VEPQPLRKNG EMAAELAADI RRLWAEWSVS PDVTGQYTRP VLERLLLRTW LRDGEVFAQM VSGAGNGLER TAGVPFWLEA MEPDFVPMRT DESAGLNQGV FLDEWGRPKK YLVYKNYPVR GRQSDTKEIA AGKMIHLKFT RRLHQTRGSS MLSGVLMRIS ALKEYEDAEL TAARIAAALG LYIRKGDGQD YEDPGSKETE REVHITPGII YDDLRKGEDI GMVKSDRPNP NLETFRNGQL RAVAAGSRLS FSSAARNYNG TYSAQRQELV ESTDGYLILQ DCFIGAVTRP VYRTWLNMVV AAGLLKIPAD VEMKTLYNAT YSGPVMPWID PVKEAEAWRI QIRGVQRQNL TGCVPGAQSG MRSNVAARLK LMKTADWGWS LILTPSTTKE ATVPELNDSI SATPKASMKN KSWFRMQAGG PGDADIYIYD EIGFWGVTAK QFVSELNALG DITHINLHIN SPGGDVFEGI AIFNALKNQG ATITVYVDGV AASMASVIAM AGDTVIMPEN AFMMIHKPWG FSGGDAEDMR SYADLLDKVE SVLLPAYAQK TGKTTDEIAA MLADETWMSG AECLAHGFAD QVTPAVEAMA CIQSKRTEEF KKMPESIRNM ITPPRNSAPR DTTVTIPAPA VTEPSPVPAV SDEATIRARV MAEQKARMSG INDLFAMFGG RYQTLQAQCV ADPDCSLEMA RERLLNEMGK ESSPTNKNTP AHIYAGNGNF VGDGIRQAML ARAGFENVEK DNAYNGMTLR EWARMSLTER GIGVASYNPM QMVGLALTHS TSDFGNILLD VSNKGLIQGW EESEETFQKW TRKGRLSDFK TAYRVGMGGF GSLRQVREGA EYKYITTSDR KETIALATYG EIFSITRQAI INDDLNMLVD VPMKMGRAAK ATIGDLVYKV LTDNPKLSDG KALFHADHKN IATGGISVSG LDAARQMMRL QKEGDRALNI RPAFMLVPVA LETVANQTIK SASVKGADAN AGVINPIQNF AEVIAEARLD AADPKTWYLA AAQGTDTIEV AWLDGVDTPY IDQQEGFTTD GIATKIRIDA GVAPLDWRGL VRSSVA
|
| |