Gene Information |
Locus tag | ECH74115_1783 |
Symbol | |
ID | 6968357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1701848 |
End bp | 1703773 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385730 |
Product | phage terminase large subunit |
Protein accession | YP_002270220 |
Protein GI | 209397875 |
COG category | [R] General function prediction only |
COG ID | [COG5525] Bacteriophage tail assembly protein |
TIGRFAM ID | |
|
|
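The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based and inclusive, and if the gene span includes the stop codon. The following is a minimal sketch of that arithmetic under those two assumptions; the function names are hypothetical and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon, which encodes no residue


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1701848, 1703773)
print(length, protein_length_aa(length))  # prints "1926 641"
```

The result matches the Gene Length (1926 bp) and Protein Length (641 aa) rows.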
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0374719 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATATAT CGAACAGTCA GATTGATATT CTGCGGCGTG ATGTACGCGC CGGGCTGCGA GCCCTGTTCA GGCCGGAGCC ACAGACTGCC GTTGAATGGG CGGATGCCAG TTACTATCTC CCGAAAGAAT CCGCATACCA GGAAGGGCGC TGGGAAACAC TACCCTTTCA GCGGGCTATC ATGAATGCGA TGGGCAGCGA CTACATCCGC GAGGTGAATG TGGTGAAGTC TGCCCGTGTT GGTTATTCAA AAATGCTGCT GGGTGTTTAT GCCTACTTCA TAGAGCATAA GCAGCGTAAC CCCCTTATCT GGTTGCCGAC GGATGGTGAT GCCGAGAACT TTATGAAAAC CCACGTCGAG CCTACCATCC GCGATATTCC GTCGCTGCTG TCTCTGGCCC CGTGGTATGG CAAAAAGCAC CGGGATAACA CGCTCACTAT GAAGCGTTTC ACCAATGGTC GTGGCTTCTG GTGCCTGGGC GGTAAAGCGG CAAAAAACTA CCGTGAAAAG TCGGTTGATG TGGCGGGTTA TGATGAACTT GCTGCCTTTG ATGAGGATAT TGAACAGGAA GGCTCTCCGA CGTTCCTGGG TGACAAGCGT ATTGAAGGCT CGGTCTGGCC AAAGTCCATC CGTGGCTCCA CGCCCAAAGT GAGAGGCACC TGCCAGATTG AGCGTGCAGC CAGTGAATCC CCGCATTTTA TGCGTTTTCA TGTTGCCTGC CCGCACTGCG GGGAGGAGCA GTATCTTAAA TTTGGCGACA AAGAGACGCC GTTTGGCCTC AAATGGACGC CGGATGACCC CTCCAGCGTG TTTTATCTCT GCGAGCATAA TGCCTGCGTC ATCCGCCAGC AGGAGCTGGA CTTTACTGAT GCCCGTTATA TCTGCGAAAA GACCGGGATC TGGACCCGTG ATGGCATTCT CTGGTTTTCG TCATCCGGTG AAGAGATTGA ACCGCCTGAC AGTGTGACCT TTCACATCTG GACGGCGTAC AGCCCGTTCA CCACCTGGGT GCAGATTGTC AAAGACTGGA TGAAGACGAA AGGGGATACG GGAAAACGTA AAACCTTCGT GAACACCACG CTCGGTGAGA CGTGGGAAGC GAAAATCGGC GAACGTCCGG ATGCTGAAGT GATGGCAGAG CGGAAAGAGT ATTATTCAGC GCCCGTTCCT GATCGTGTGG CTTACCTGAC CGCCGGTATC GACTCCCAGC TGGACCGCTA CGAAATGCGC GTATGGGGAT GGGGGCCGGG TGAGGAAAGC TGGCTGATTG ACCGGCAGAT TATTATGGGC CGCCACGACG ATGAACAGAC GCTGCTGCGT GTGGATGAGG CCATCAATAA AACCTATACC CGCCGGAATG GTGCAGAAAT GTCGGTATCC CGTATCTGCT GGGATACTGG CGGGATTGAC CCGACCATTG TGTATGAACG CTCGAAAAAA CATGGGCTGT TCCGGGTGAT CCCCATTAAA GGGGCATCCG TCTACGGAAA GCCGGTGGCC AGCATGCCAC GTAAGCGAAA CAAAAACGGG GTTTACCTTA CCGAAATCGG TACGGATACC GCGAAAGAGC AGATTTATAA CCGCTTCACA CTGACGCCGG AAGGGGATGA ACCGCTTCCC GGTGCCGTTC ACTTCCCGAA TAACCCGGAT ATTTTTGATC TTACCGAAGC GCAGCAACTG ACTGCTGAAG AGCAGGTCGA AAAATGGGTG GATGGCAGGA AAAAAATACT GTGGGACAGC AAAAAGCGAC GCAATGAGGC GCTCGACTGC TTCGTTTATG CGCTGGCGGC GCTGCGCATC 
AGTATTTCCC GCTGGCAGCT GGATCTCAGT GCACTGCTGG CGAGCCTGCA GGAAGAGGAT GGTGCAGCAA CCAACAAGAA AACACTGGCA GAATACGCCC GTGCCTTATC CGGAGAGGAT GAATGA
|
Protein sequence | MNISNSQIDI LRRDVRAGLR ALFRPEPQTA VEWADASYYL PKESAYQEGR WETLPFQRAI MNAMGSDYIR EVNVVKSARV GYSKMLLGVY AYFIEHKQRN PLIWLPTDGD AENFMKTHVE PTIRDIPSLL SLAPWYGKKH RDNTLTMKRF TNGRGFWCLG GKAAKNYREK SVDVAGYDEL AAFDEDIEQE GSPTFLGDKR IEGSVWPKSI RGSTPKVRGT CQIERAASES PHFMRFHVAC PHCGEEQYLK FGDKETPFGL KWTPDDPSSV FYLCEHNACV IRQQELDFTD ARYICEKTGI WTRDGILWFS SSGEEIEPPD SVTFHIWTAY SPFTTWVQIV KDWMKTKGDT GKRKTFVNTT LGETWEAKIG ERPDAEVMAE RKEYYSAPVP DRVAYLTAGI DSQLDRYEMR VWGWGPGEES WLIDRQIIMG RHDDEQTLLR VDEAINKTYT RRNGAEMSVS RICWDTGGID PTIVYERSKK HGLFRVIPIK GASVYGKPVA SMPRKRNKNG VYLTEIGTDT AKEQIYNRFT LTPEGDEPLP GAVHFPNNPD IFDLTEAQQL TAEEQVEKWV DGRKKILWDS KKRRNEALDC FVYALAALRI SISRWQLDLS ALLASLQEED GAATNKKTLA EYARALSGED E
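The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to compute GC content and to translate the coding sequence using only the Python standard library. It makes several assumptions: the codon-to-residue assignments of translation table 11 are taken to be the same as the standard code, the first codon is treated as an initiator and rendered as M (which is how the GTG start of this gene appears in the protein sequence), and translation stops at the first stop codon. All function and variable names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at the first stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: the first codon is a valid initiator
    return "".join(residues).split("*")[0]


# First three blocks of the gene sequence in this record.
prefix = "GTGAATATAT CGAACAGTCA GATTGATATT"
print(translate_cds(prefix))  # prints "MNISNSQIDI"
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate_cds` would allow the GC content and Protein Length rows to be checked in the same way.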