Gene ECH74115_3134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3134 
Symbol 
ID6969297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2906593 
End bp2910063 
Gene Length3471 bp 
Protein Length1156 aa 
Translation table11 
GC content55% 
IMG OID643386959 
Productputative phage portal protein, lambda family 
Protein accessionYP_002271427 
Protein GI209397642 
COG category[O] Posttranslational modification, protein turnover, chaperones
[R] General function prediction only
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0740] Protease subunit of ATP-dependent Clp proteases
[COG5511] Bacteriophage capsid protein 
TIGRFAM ID[TIGR00493] ATP-dependent Clp protease, proteolytic subunit ClpP
[TIGR01539] phage portal protein, lambda family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000244044 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAATTA TTGATGATGT GATCGGCGTG TTTTCCCCCG GGTGGAAAGC AGCCAGACTG 
CGTTCAAGGG CGTTAATCAT GGCCTATGAG GCGGTGAAAC CGACCCGGAC ACATAAAGCC
CGGCGGGAAA ATCGCTCTGC TGATCAGCTC AGTAAATACG GTGCGGTTTC CCTGCGGGAG
CAGGCCCGTT TTCTGGATAT CAATCATGAC CTGGTGATTG GTGTGTTTGA CAAGCTGGAA
GAGCGGGTGA TTGGTGCCAG GGGAATTATT GTGGAGCCTC AGCCATTACG AAAAAACGGG
GAAATGGCGG CTGAGCTGGC TGCGGATATC CGCCGTTTGT GGGCTGAATG GTCCGTGAGT
CCGGATGTGA CAGGGCAGTA TACCCGTCCT GTGCTTGAAC GTTTACTGCT GCGGACCTGG
CTGCGGGATG GTGAAGTGTT TGCGCAGATG GTCAGTGGTG CGGGAAACGG TCTGGAACGG
ACGGCGGGAG TGCCATTCTG GCTTGAGGCG ATGGAGCCGG ATTTTGTTCC CATGCGCACT
GATGAATCCG CCGGACTGAA TCAGGGGGTT TTTCTTGATG AGTGGGGAAG ACCGAAAAAA
TATCTGGTTT ATAAAAATTA TCCGGTCAGA GGCCGGCAGA GTGATACGAA AGAAATCGCT
GCCGGAAAAA TGATCCACCT GAAGTTCACT CGTCGTCTGC ATCAGACGCG AGGCTCATCC
ATGTTATCGG GGGTGCTGAT GCGGATCAGT GCCCTTAAGG AGTATGAGGA TGCGGAACTG
ACAGCGGCGC GTATTGCTGC GGCGCTGGGA CTGTATATCC GTAAAGGTGA CGGACAGGAC
TATGAAGATC CGGGGAGCAA AGAGACCGAG CGGGAAGTCC ATATCACCCC GGGTATTATT
TATGACGATT TGCGCAAGGG CGAGGATATC GGCATGGTCA AATCTGACCG TCCCAATCCC
AACCTTGAAA CTTTCCGCAA CGGCCAGTTG CGTGCAGTGG CAGCAGGCAG TCGTCTGAGT
TTTTCCAGTG CGGCGCGTAA CTATAACGGC ACCTACAGCG CCCAGCGGCA GGAGTTGGTC
GAGTCCACGG ATGGTTACCT GATCCTGCAG GACTGTTTTA TTGGCGCGGT AACCCGCCCG
GTGTACCGGA CATGGCTGAA TATGGTGGTT GCGGCAGGTC TGCTGAAAAT TCCGGCGGAT
GTGGAGATGA AAACGCTATA TAACGCGACG TATTCCGGTC CGGTGATGCC GTGGATCGAC
CCGGTTAAGG AAGCTGAAGC CTGGAGAATT CAGATCCGGG GTGTGCAGCG ACAGAATCTG
ACTGGGTGCG TGCCGGGGGC GCAATCCGGG ATGAGGTCAA ACGTCGCCGC AAGGCTGAAA
TTGATGAAAA CAGCAGACTG GGGCTGGTCT TTGATACTGA CCCCGTCAAC GACAAAGGAG
GCAACAGTGC CGGAACTGAA CGACAGTATC AGCGCGACAC CGAAAGCCAG CATGAAGAAT
AAATCCTGGT TCAGGATGCA AGCTGGGGGG CCGGGTGACG CGGATATTTA TATTTATGAC
GAGATTGGTT TCTGGGGAGT TACCGCGAAG CAGTTTGTCA GCGAACTGAA TGCACTGGGT
GATATCACCC ACATTAATCT CCATATCAAT TCACCGGGTG GCGATGTCTT TGAAGGCATC
GCCATTTTTA ATGCCCTGAA AAATCAGGGG GCGACCATTA CCGTGTATGT GGATGGCGTT
GCCGCCTCGA TGGCATCTGT GATTGCGATG GCCGGTGATA CGGTCATTAT GCCGGAAAAT
GCCTTCATGA TGATCCATAA GCCATGGGGA TTCAGTGGCG GGGATGCTGA GGATATGCGC
AGTTATGCCG ATTTGCTGGA TAAAGTCGAA TCGGTACTGT TGCCAGCCTA TGCGCAGAAA
ACCGGAAAAA CCACCGATGA AATTGCCGCC ATGCTGGCGG ATGAAACCTG GATGTCCGGT
GCCGAATGTC TGGCACACGG ATTTGCTGAC CAGGTGACAC CCGCTGTTGA GGCAATGGCA
TGTATTCAGT CAAAACGTAC AGAGGAATTT AAAAAGATGC CGGAATCCAT CCGAAACATG
ATTACTCCGC CACGCAACAG TGCCCCGCGT GATACCACAG TGACAATCCC TGCACCGGCG
GTAACAGAAC CATCACCGGT ACCGGCAGTG TCTGATGAGG CGACCATTCG CGCCCGCGTT
ATGGCTGAGC AGAAAGCCCG CATGTCAGGC ATTAACGATC TGTTTGCCAT GTTCGGCGGT
CGCTATCAGA CGCTTCAGGC ACAGTGCGTG GCTGATCCTG ACTGTTCGCT GGAAATGGCC
CGTGAACGTC TGCTGAATGA AATGGGCAAG GAGTCCTCGC CGACCAACAA AAATACACCG
GCCCATATTT ATGCCGGAAA CGGCAATTTT GTGGGGGACG GGATCCGCCA GGCGATGCTG
GCCCGTGCCG GATTTGAAAA TGTCGAGAAG GATAACGCCT ATAACGGGAT GACCCTGCGT
GAATGGGCTC GCATGTCACT GACGGAGCGC GGTATTGGGG TGGCCAGTTA TAACCCCATG
CAGATGGTCG GGCTGGCGCT GACGCACAGC ACCTCTGATT TTGGCAATAT TCTGCTGGAT
GTGTCGAACA AGGGGCTGAT CCAGGGCTGG GAGGAATCAG AAGAAACCTT CCAGAAGTGG
ACCCGTAAGG GACGCCTGTC AGACTTCAAA ACAGCGTATC GCGTGGGGAT GGGCGGTTTT
GGTTCTCTGC GCCAGGTTCG TGAGGGGGCG GAGTATAAAT ACATCACCAC CTCAGATCGC
AAGGAGACCA TTGCACTGGC CACTTACGGG GAGATTTTCT CCATCACCCG CCAGGCCATT
ATCAATGATG ATCTGAATAT GCTGGTGGAC GTGCCGATGA AGATGGGGCG TGCGGCGAAG
GCAACGATTG GTGACCTGGT CTACAAGGTG CTGACGGATA ACCCGAAACT GTCCGACGGT
AAGGCGCTGT TCCATGCCGA TCACAAAAAT ATTGCCACCG GGGGGATCTC CGTTTCCGGA
CTGGATGCGG CCCGTCAGAT GATGCGCCTG CAGAAAGAAG GCGATCGTGC CCTGAATATC
CGTCCGGCCT TTATGCTGGT ACCGGTGGCA CTGGAGACGG TGGCGAACCA GACCATCAAA
TCGGCCAGTG TGAAAGGGGC GGATGCAAAC GCCGGTGTCA TTAACCCTAT CCAGAACTTT
GCTGAGGTGA TTGCAGAAGC GCGTCTTGAT GCGGCAGACC CGAAAACCTG GTATCTGGCG
GCGGCACAGG GCACTGACAC CATTGAAGTG GCCTGGCTGG ATGGTGTGGA CACGCCATAC
ATTGATCAGC AGGAAGGTTT CACCACTGAC GGCATTGCCA CAAAAATCCG TATTGATGCC
GGAGTGGCAC CACTTGACTG GCGCGGGCTG GTGCGTTCGT CGGTGGCCTG A
 
Protein sequence
MAIIDDVIGV FSPGWKAARL RSRALIMAYE AVKPTRTHKA RRENRSADQL SKYGAVSLRE 
QARFLDINHD LVIGVFDKLE ERVIGARGII VEPQPLRKNG EMAAELAADI RRLWAEWSVS
PDVTGQYTRP VLERLLLRTW LRDGEVFAQM VSGAGNGLER TAGVPFWLEA MEPDFVPMRT
DESAGLNQGV FLDEWGRPKK YLVYKNYPVR GRQSDTKEIA AGKMIHLKFT RRLHQTRGSS
MLSGVLMRIS ALKEYEDAEL TAARIAAALG LYIRKGDGQD YEDPGSKETE REVHITPGII
YDDLRKGEDI GMVKSDRPNP NLETFRNGQL RAVAAGSRLS FSSAARNYNG TYSAQRQELV
ESTDGYLILQ DCFIGAVTRP VYRTWLNMVV AAGLLKIPAD VEMKTLYNAT YSGPVMPWID
PVKEAEAWRI QIRGVQRQNL TGCVPGAQSG MRSNVAARLK LMKTADWGWS LILTPSTTKE
ATVPELNDSI SATPKASMKN KSWFRMQAGG PGDADIYIYD EIGFWGVTAK QFVSELNALG
DITHINLHIN SPGGDVFEGI AIFNALKNQG ATITVYVDGV AASMASVIAM AGDTVIMPEN
AFMMIHKPWG FSGGDAEDMR SYADLLDKVE SVLLPAYAQK TGKTTDEIAA MLADETWMSG
AECLAHGFAD QVTPAVEAMA CIQSKRTEEF KKMPESIRNM ITPPRNSAPR DTTVTIPAPA
VTEPSPVPAV SDEATIRARV MAEQKARMSG INDLFAMFGG RYQTLQAQCV ADPDCSLEMA
RERLLNEMGK ESSPTNKNTP AHIYAGNGNF VGDGIRQAML ARAGFENVEK DNAYNGMTLR
EWARMSLTER GIGVASYNPM QMVGLALTHS TSDFGNILLD VSNKGLIQGW EESEETFQKW
TRKGRLSDFK TAYRVGMGGF GSLRQVREGA EYKYITTSDR KETIALATYG EIFSITRQAI
INDDLNMLVD VPMKMGRAAK ATIGDLVYKV LTDNPKLSDG KALFHADHKN IATGGISVSG
LDAARQMMRL QKEGDRALNI RPAFMLVPVA LETVANQTIK SASVKGADAN AGVINPIQNF
AEVIAEARLD AADPKTWYLA AAQGTDTIEV AWLDGVDTPY IDQQEGFTTD GIATKIRIDA
GVAPLDWRGL VRSSVA