Gene ECH74115_2893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2893 
Symbol 
ID6969404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2686985 
End bp2688646 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content58% 
IMG OID643386737 
Productputative phage terminase, large subunit 
Protein accessionYP_002271208 
Protein GI209400596 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00000134214 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATACCTG TGTGGAGCAC GGCCTGCCCG GACTGGGCAG AGCGCCTGAA AAAGGGGCTG 
TCGATTATTC CGGCTCCGAT TTATCCGGAG CAGGCCGCAC ATGCCCTGGC GATTTTTAAA
CAACTGCGGA TTGTGGATGC ACCGGGCAGC CCGACGTTCG GGGAGTCCTG CGCACAGTGG
GTGTTTGACC TGGTGGCGGC CCTGTTTGGC TCCTACGATG CGCAGACCGG TGTACGCCAT
ATCAAGGAAG TTTTTATCCT TATCCCCAAG AAAAACAGCA AGTCCACGCT GGCTGCCGGG
ATCATGATGA CGGCGCTGTT ACTGAACTGG CGGCAGGCGG CGGGCTACAC CATTCTGGCC
CCGACCGTGG AGGTGGCGGC TAACGCCTTC AACCCTGCCA GGGATATGGT ACGACGGGAC
GATGATCTGG ATGACCTCTG TCAGGTGCAG ACACATATCC GGACCATCAC CCACAGGGTG
ACGGACACCA CCCTGAAGGT GGTGGCTGCC GATCCGAATA CGGTATCCGG TATCAAGTCC
GTGGGGACGC TGATTGATGA ACTGTGGTTA TTTGGCAAGC AGTACAAAGC GGAGGACATG
TTACGTGAAG CCATAGGCGG CCTTGCCTCC CGCCCGGAAG GGTTTGTGGT GTATACGACC
ACCCAGTCGA ATGAGCCGCC AGCCGGGGGG TTCAGACAGA AACTGCAGTA CGCCCGGGAT
GTCCGTGACG GCAAAATTCA TGATCCGCAC TTTCTGCCGG TGATTTTTGA GCATCCTCCT
GAAATGGTGG AAAGCGGGGC TCACCTGCTG ATGGAAAACC TCGCCATGGT TAACCCGAAT
CTCGGTTATT CGGTGGATGA GGCTTTTCTG TACCGGGAGT ACCGTAAAGC CCGGGAGGCT
GGTGAGGAAG CATTTCGTGG CTTCATGTCA AAACATGCCA ATGTGGAAAT TGGTCTTGCC
CTGCGTTCTG ACCGCTGGGC GGGTGCGGAT TTCTGGGAGC AGCAGGGCAG GCGCGTCAGC
CTGGACGATA TCCTGCAGCG CGCTGATGTG GTGACGGTGG GGATTGACGG CGGGGGCCTG
GATGATCTGC TGGGAATGTA CGTGATTGGC CGTGACAGGG AAACCCGCGA ATGGCTGGGC
TGGGGCCATG CCTGGGCGCA TGAAACCGCG GTGGTCCGAC GGAAGAGCGA GGCGTCCCGG
TTTCAGGATC TTGTTGCCTG TGGAGATATG ACCATTGTCC GGCGTGTCGG GGATGACACG
GCGGAAGTGG CGGAATATGT GCGTCGCATT CATGAGGCTG AGTTACTGGA CCATATCGGT
ATTGACCCGT CAGGGGTGGG GCAGATTCTG GATTCACTGG CGGAAGCCGG GATCCCCGAC
GGAATTGTGG TGGGGATAAG CCAGGGCTGG AAACTGGGCG GGGCCATTAA AACCACCGAG
CGCAAACTGG CTGAAGGGGT GCTGGTGCAT GGTGACCAGC CCCTGATGGC CTGGTGTGTC
GGCAATGCCC GGGTGGAGCC TAAAGGTAAC GCCATTCTTA TCACCAAACA GGCCAGTGGA
CGGGGAAAAA TTGACCCGCT GATGGCGCTG TTCAATGCGG TCTCCCTGAT GTCCCTTAAC
CCGGAACCGA AAAAGAAAGA ATATGCGGTT TTTTTCATAT AA
 
Protein sequence
MIPVWSTACP DWAERLKKGL SIIPAPIYPE QAAHALAIFK QLRIVDAPGS PTFGESCAQW 
VFDLVAALFG SYDAQTGVRH IKEVFILIPK KNSKSTLAAG IMMTALLLNW RQAAGYTILA
PTVEVAANAF NPARDMVRRD DDLDDLCQVQ THIRTITHRV TDTTLKVVAA DPNTVSGIKS
VGTLIDELWL FGKQYKAEDM LREAIGGLAS RPEGFVVYTT TQSNEPPAGG FRQKLQYARD
VRDGKIHDPH FLPVIFEHPP EMVESGAHLL MENLAMVNPN LGYSVDEAFL YREYRKAREA
GEEAFRGFMS KHANVEIGLA LRSDRWAGAD FWEQQGRRVS LDDILQRADV VTVGIDGGGL
DDLLGMYVIG RDRETREWLG WGHAWAHETA VVRRKSEASR FQDLVACGDM TIVRRVGDDT
AEVAEYVRRI HEAELLDHIG IDPSGVGQIL DSLAEAGIPD GIVVGISQGW KLGGAIKTTE
RKLAEGVLVH GDQPLMAWCV GNARVEPKGN AILITKQASG RGKIDPLMAL FNAVSLMSLN
PEPKKKEYAV FFI