Gene Information |
Locus tag | ECH74115_1620 |
Symbol | |
ID | 6968568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1567274 |
End bp | 1569199 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385580 |
Product | phage terminase large subunit |
Protein accession | YP_002270074 |
Protein GI | 209398443 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
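The coordinate and length fields above agree with one another, which can be checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the final codon is an untranslated stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1567274
end_bp = 1569199

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values match the table: 1926 bp and 641 aa.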
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0047658 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAATATAT CGAACAGTCA GGTTAACAGG CTGCGGCATT TTGTCCGCGC CGGGCTTCGC TCACTGTTCA GGCCGGAGCC ACAGACCGCC GTTGAATGGG CGGATGCCAG TTACTATCTC CCGAAAGAAT CCGCATACCA GGAAGGGCGC TGGGAAACAC TGCCCTTTCA GCGGGCCATC ATGAATGCGA TGGGCAGTGA CTACATCCGC GAGGTGAATG TGGTGAAGTC TGCCCGTGTT GGTTATTCCA AAATGCTGCT GGGTGTTTAT GCCTACTTCA TAGAGCATAA GCAGCGCAAC ACCCTTATCT GGTTGCCGAC GGATGGTGAT GCCGAGAACT TTATGAAAAC CCACGTTGAG CCGACTATTC GTGATATTCC GTCGCTGCTG GCGCTGGCCC CGTGGTATGG CAAAAAGCAC CGGGATAACA CGCTCACCAT GAAGCGTTTC ACCAATGGGC GTGGCTTCTG GTGCCTGGGC GGTAAAGCGG CAAAAAACTA CCGTGAAAAG TCAGTGGATG TGGCGGGTTA TGATGAACTT GCTGCCTTTG ATGAGGATAT TGAACAGGAA GGCTCTCCGA CGTTCCTGGG CGATAAGCGT ATTGAAGGCT CGGTCTGGCC AAAGTCCATC CGTGGCTCCA CGCCCAAAGT GAGAGGCACC TGCCAGATTG AGCGTGCAGC CAGTGAATCC CCGCATTTTA TGCGTTTTCA TGTTGCCTGC CCGCACTGCG GGGAGGAGCA GTACCTTAAA TTTGGCGATA AAGAGACGCC GTTTGGCCTC AAATGGACGC CGGATGATCC CTCCAGCGTG TTTTATCTCT GCGAGCATAA TGCCTGCGTC ATCCGTCAGC AGGAGCTGGA CTTTACTGAT GCCCGTTATA TCTGCGAAAA GACCGGGATC TGGACCCGTG ATGGCATTCT CTGGTTTTCG TCATCCGGTG AAGAGATTGA ACCGCCTGAC AGTGTGACCT TTCACATCTG GACGGCGTAC AGCCCGTTCA CCACCTGGGT TCAGATTGTC AAAGACTGGA TGAAGACGAA AGGGGATACG GGAAAACGTA AAACCTTCGT GAACACCACG CTCGGTGAGA CATGGGAAGC GAAGATCGGC GAACGTCCGG ATGCTGAAGT GATGGCAGAG CGGAAAGAGC ATTATTCAGC GTCCGTTCCT GACCGTGTGG CTTACCTGAC CGCCGGTATC GACTCCCAGC TGGATCGCTA CGAAATGCGC GTATGGGGAT GGGGGCCGGG TGAGGAAAGC TGGCTGATTG ATCGGCAGAT TATTATGGGC CGCCACGACG ATGAACAGAC GCTGCTGCGT GTGGATGAGG CCATCAATAA AACCTATACC CGCCGGAATG GTGCAGAAAT GTCGGTATCC CGTATCTGCT GGGATACTGG CGGGATTGAC CCGACTATTG TGTATGAACG CTCGAAAAAG CATGGGCTGT TCCGGGTGAT CCCCATTAAA GGGGCATCCG TCTACGGAAA GCCGGTGGCC AGCATGCCAC GTAAGCGAAA CAAAAACGGG GTTTACCTTA CCGAAATCGG TACGGATACC GCGAAAGAGC AGATTTATAA CCGCTTCACA CTGACGCCGG AAGGGGATGA ACCGCTTCCC GGTGCCGTTC ACTTCCCGAA TAACCCGGAT ATTTTTGATC TGACCGAAGC GCAGCAACTG ACTGCTGAAG AGCAGGTCGA AAAATGGGTG GATGGCAGGA AAAAAATACT GTGGGACAGC AAAAAGCGAC GCAATGAGGC GCTCGACTGC TTCGTTTATG CGCTGGCGGC GCTGCGCATC 
AGTATTTCCC GCTGGCAGCT GGATCTCAGT GCACTGCTGG CGAGCCTGCA GGAAGAGGAT GGTGCAGCAA CCAACAAGAA AACACTGGCA GAATACGCCC GTGCCTTATC CGGAGAGGAT GAATGA
|
Protein sequence | MNISNSQVNR LRHFVRAGLR SLFRPEPQTA VEWADASYYL PKESAYQEGR WETLPFQRAI MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN TLIWLPTDGD AENFMKTHVE PTIRDIPSLL ALAPWYGKKH RDNTLTMKRF TNGRGFWCLG GKAAKNYREK SVDVAGYDEL AAFDEDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHFMRFHVAC PHCGEEQYLK FGDKETPFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT LGETWEAKIG ERPDAEVMAE RKEHYSASVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES WLIDRQIIMG RHDDEQTLLR VDEAINKTYT RRNGAEMSVS RICWDTGGID PTIVYERSKK HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDT AKEQIYNRFT LTPEGDEPLP GAVHFPNNPD IFDLTEAQQL TAEEQVEKWV DGRKKILWDS KKRRNEALDC FVYALAALRI SISRWQLDLS ALLASLQEED GAATNKKTLA EYARALSGED E
|
| |
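The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of the permitted alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that rule, not the pipeline used to produce this record: the `translate` helper is hypothetical, and the codon table is built from the amino acid string of the standard code, whose assignments table 11 shares.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper): a permitted start codon
    in first position is read as M; translation ends at the first stop."""
    cds = cds.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
head = translate("GTGAATATAT CGAACAGTCA GGTTAACAGG")
print(head)
```

Applied to the first 30 bases of the gene sequence, this reproduces the first ten residues of the protein sequence listed in the record (MNISNSQVNR).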