Gene Information |
Locus tag | ECH74115_2893 |
Symbol | |
ID | 6969404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2686985 |
End bp | 2688646 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643386737 |
Product | putative phage terminase, large subunit |
Protein accession | YP_002271208 |
Protein GI | 209400596 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | |
|
|
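The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(2686985, 2688646)
print(length, protein_length(length))  # prints "1662 553"
```

Under these assumptions the computed values match the Gene Length (1662 bp) and Protein Length (553 aa) fields.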
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00000134214 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATACCTG TGTGGAGCAC GGCCTGCCCG GACTGGGCAG AGCGCCTGAA AAAGGGGCTG TCGATTATTC CGGCTCCGAT TTATCCGGAG CAGGCCGCAC ATGCCCTGGC GATTTTTAAA CAACTGCGGA TTGTGGATGC ACCGGGCAGC CCGACGTTCG GGGAGTCCTG CGCACAGTGG GTGTTTGACC TGGTGGCGGC CCTGTTTGGC TCCTACGATG CGCAGACCGG TGTACGCCAT ATCAAGGAAG TTTTTATCCT TATCCCCAAG AAAAACAGCA AGTCCACGCT GGCTGCCGGG ATCATGATGA CGGCGCTGTT ACTGAACTGG CGGCAGGCGG CGGGCTACAC CATTCTGGCC CCGACCGTGG AGGTGGCGGC TAACGCCTTC AACCCTGCCA GGGATATGGT ACGACGGGAC GATGATCTGG ATGACCTCTG TCAGGTGCAG ACACATATCC GGACCATCAC CCACAGGGTG ACGGACACCA CCCTGAAGGT GGTGGCTGCC GATCCGAATA CGGTATCCGG TATCAAGTCC GTGGGGACGC TGATTGATGA ACTGTGGTTA TTTGGCAAGC AGTACAAAGC GGAGGACATG TTACGTGAAG CCATAGGCGG CCTTGCCTCC CGCCCGGAAG GGTTTGTGGT GTATACGACC ACCCAGTCGA ATGAGCCGCC AGCCGGGGGG TTCAGACAGA AACTGCAGTA CGCCCGGGAT GTCCGTGACG GCAAAATTCA TGATCCGCAC TTTCTGCCGG TGATTTTTGA GCATCCTCCT GAAATGGTGG AAAGCGGGGC TCACCTGCTG ATGGAAAACC TCGCCATGGT TAACCCGAAT CTCGGTTATT CGGTGGATGA GGCTTTTCTG TACCGGGAGT ACCGTAAAGC CCGGGAGGCT GGTGAGGAAG CATTTCGTGG CTTCATGTCA AAACATGCCA ATGTGGAAAT TGGTCTTGCC CTGCGTTCTG ACCGCTGGGC GGGTGCGGAT TTCTGGGAGC AGCAGGGCAG GCGCGTCAGC CTGGACGATA TCCTGCAGCG CGCTGATGTG GTGACGGTGG GGATTGACGG CGGGGGCCTG GATGATCTGC TGGGAATGTA CGTGATTGGC CGTGACAGGG AAACCCGCGA ATGGCTGGGC TGGGGCCATG CCTGGGCGCA TGAAACCGCG GTGGTCCGAC GGAAGAGCGA GGCGTCCCGG TTTCAGGATC TTGTTGCCTG TGGAGATATG ACCATTGTCC GGCGTGTCGG GGATGACACG GCGGAAGTGG CGGAATATGT GCGTCGCATT CATGAGGCTG AGTTACTGGA CCATATCGGT ATTGACCCGT CAGGGGTGGG GCAGATTCTG GATTCACTGG CGGAAGCCGG GATCCCCGAC GGAATTGTGG TGGGGATAAG CCAGGGCTGG AAACTGGGCG GGGCCATTAA AACCACCGAG CGCAAACTGG CTGAAGGGGT GCTGGTGCAT GGTGACCAGC CCCTGATGGC CTGGTGTGTC GGCAATGCCC GGGTGGAGCC TAAAGGTAAC GCCATTCTTA TCACCAAACA GGCCAGTGGA CGGGGAAAAA TTGACCCGCT GATGGCGCTG TTCAATGCGG TCTCCCTGAT GTCCCTTAAC CCGGAACCGA AAAAGAAAGA ATATGCGGTT TTTTTCATAT AA
|
Protein sequence | MIPVWSTACP DWAERLKKGL SIIPAPIYPE QAAHALAIFK QLRIVDAPGS PTFGESCAQW VFDLVAALFG SYDAQTGVRH IKEVFILIPK KNSKSTLAAG IMMTALLLNW RQAAGYTILA PTVEVAANAF NPARDMVRRD DDLDDLCQVQ THIRTITHRV TDTTLKVVAA DPNTVSGIKS VGTLIDELWL FGKQYKAEDM LREAIGGLAS RPEGFVVYTT TQSNEPPAGG FRQKLQYARD VRDGKIHDPH FLPVIFEHPP EMVESGAHLL MENLAMVNPN LGYSVDEAFL YREYRKAREA GEEAFRGFMS KHANVEIGLA LRSDRWAGAD FWEQQGRRVS LDDILQRADV VTVGIDGGGL DDLLGMYVIG RDRETREWLG WGHAWAHETA VVRRKSEASR FQDLVACGDM TIVRRVGDDT AEVAEYVRRI HEAELLDHIG IDPSGVGQIL DSLAEAGIPD GIVVGISQGW KLGGAIKTTE RKLAEGVLVH GDQPLMAWCV GNARVEPKGN AILITKQASG RGKIDPLMAL FNAVSLMSLN PEPKKKEYAV FFI
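The sequences above are printed in space-separated groups of ten characters. A minimal sketch for working with them is given below: it strips the grouping, computes a GC fraction, and runs a basic sanity check on a coding sequence. The helper names are hypothetical. The check accepts only ATG as a start codon, which is a simplifying assumption, since translation table 11 also permits alternative start codons.

```python
# Stop codons used by translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character groups and upper-case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Length divisible by 3, ATG start (assumed), and a terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two groups of the gene sequence from the record, as a short demo.
demo = clean_sequence("ATGATACCTG TGTGGAGCAC")
print(len(demo), gc_fraction(demo))  # prints "20 0.5"
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field, and `looks_like_complete_cds` can confirm that the listed sequence is already oriented in the coding direction even though the gene lies on the minus strand.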
|
| |