Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3058 |
Symbol | |
ID | 6971479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2833462 |
End bp | 2835744 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386890 |
Product | bacteriophage replication gene A protein |
Protein accession | YP_002271358 |
Protein GI | 209398781 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00583111 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.000000000126727 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCCGTTA AAGCCTCCGG GCGTTTTGTC CCTCCGTCAG CATTTGCTGC AGGCACCGGT AAGGCGTTTA CCGGTGCTTA TGCATGGAAC GCGCCACGCG AGGCCGTCGG GCGCGAAAGA CCCCTTACAC GTGACGAGAT GCGTCAGGTG CAAGGTGTTT TATCCACGAT TAACCGCCTG CCTTACTTTT TGCGCTCGCT GTTTACTTCA CGCTATGACC ACATCCGGCG CAATAAAAGC CCGGTGCACG GGTTTTATTT CCTCACATCC ACTTTTCAGC GTCGTTTATG GCCGCGCATT GAGCGTGTGA ATCAGCGCCA TGAAATGAAC ACCGACGCGT CGTTACTGTT TCTGGCAGAG CGTGACCACT ATGCGCGCCT GCCGGGAATG AATGACAAGG AGCTGAAAAA GTTTGCTGCC CGTATCTCAT CGCAGCTTTT CATGATGTAT GGGGAACTCA GTGATGCCTG GGTGGATGCG CATGGCGAAA AAGAATCGCT GTTTACGGAT GAGGCGCAGG CTCACCTCTA TGGTCATGTT GCTGGCGCTG CACGTGCTTT CAATATTTCC CCTCTCTACT GGAAAATATA CCGTAAAGGG CAGATGACCA CGAGGCAGGC ATATTCTGCC ATTGCCCGTC TGTTTAACGA TGAGTGGTGG ACTCATCAGC TTAAAGGCCA GCGTATGCGC TGGCATGAAG CGTTACTGAT AGCTGTCGGG GAGGTCAATA AAGACCGTTC TCCTTATGCC AGTAAACACG CCATTCGTGA TGTGCGTGCG CGCCGCCAGG CAAATCTGGA ATTTCTTAAA TCGTGTGACC TTGAAAACAG GGAAACCGGC GAGCGCATCG ACCTTATCAG TAAGGTGATG GGCAGTATTT CTAATCCTGA AATTCGCCGG ATGGAGCTGA TGAACACCAT TGCCGGTATT GAGCGTTACG CCGCCGCAGA GGGTGATGTG GGGATGTTTA TCACGCTGAC CGCGCCGTCA AAGTATCACC CGACACGTCA GGTCAGAAAA GGCGAAAGTA AAACCGTTCA GCTTAATCAC GGCTGGAACG ATGAGGCATT TAATCCAAAG GATGCGCAGC GTTATCTCTG CCGCATCTGG AGCCTGATGC GCACGGCATT CAAGGATAAT GATTTACAGG TCTACGGTTT GCGTGTCGTC GAGCCACACC ACGACGGAAC GCCGCACTGG CATATGATGC TTTTTTGTAA TCCACGCCAG CGTAACCAGA TTATCGAAAT CATGCGTCGC TACGCGCTCA AAGAGGATGG AGACGAAAGA GGAGCTGCGC GAAACCGTTT TCAGGCAAAA CACCTTAACC GGGGCGGTGC TGCGGGATAT ATCGCGAAAT ACATTTCAAA AAACATCGAC GGCTATGCAC TGGATGGTCA GCTCGATAAC GATACCGGTA AGCCGCTTAA AGATACTGCC GCGGCTGTTA CCGCATGGGC GTCAACGTGG CGCATTCCGC AATTTAAAAC GGTTGGACTG CCGACAATGG GGGCTTACCG TGAACTACGC AAATTGCCTC GCGGCGTCAG TATTGCTGAT GAGTTTGACG AACGCGTCGA GGCTGCTCGC GCTGCCGCAG ACAGTGGTGA TTTTGCGTTG TATATCAGCG CGCAGGGTGG GGCAAATGTC CCGCGCGATT GTCAGACTGT CAGGGTCGCC CGTAGCCCGT CGGATGACGT TAACGAGTAC GAGGAAGAAG TCGAGAGAGT GGTCGGCATT TACGCGCCGC ATCTCGGCGC GCGTCATATT CATATCACCA GAACGACGGA CTGGCGCATT GTGCCGAAAG TTCCGGTCGT TGAGCCTTTG ACTTTAAAAA GCGGCATCGC CGCGCCTCGG AGTCCTGTCA ATAACTGTGG AAAACTCACC GGTGGTGATA CTTCGTTACC GGCTCCCACA CCGACTGAGC ATGCTGCAGC GGTGTTAAAT TTAATAGATG AAGGGATTCT AAGTTGGACC GAGCCAGGCA TTATGAAGGT ACTTAGAGAC TTATTGAGTA ATGAACTGAA ATGCAGTAAT CGTTCACAGC AGAGCTTTAC CCCTTTCAAT GGTCGAAGCT ACTTTCCTGC CCCATCCGCC CGGTTGACAA GGCAGGAGAG AAAGACTATC CCGAAAATTA AGGTTCTTCT TGCACAAAGT GACATTCAGG CTAGTTATTG GGAGCTTGAA GCTCTCGCTC GCGGAGCTGT TTTGGATTTT GGGCATAAGC GTTTTAAATT TGATAACGAT ATCGACTTTT TTGACAGACA GCGTGAGTGG TAG
|
Protein sequence | MAVKASGRFV PPSAFAAGTG KAFTGAYAWN APREAVGRER PLTRDEMRQV QGVLSTINRL PYFLRSLFTS RYDHIRRNKS PVHGFYFLTS TFQRRLWPRI ERVNQRHEMN TDASLLFLAE RDHYARLPGM NDKELKKFAA RISSQLFMMY GELSDAWVDA HGEKESLFTD EAQAHLYGHV AGAARAFNIS PLYWKIYRKG QMTTRQAYSA IARLFNDEWW THQLKGQRMR WHEALLIAVG EVNKDRSPYA SKHAIRDVRA RRQANLEFLK SCDLENRETG ERIDLISKVM GSISNPEIRR MELMNTIAGI ERYAAAEGDV GMFITLTAPS KYHPTRQVRK GESKTVQLNH GWNDEAFNPK DAQRYLCRIW SLMRTAFKDN DLQVYGLRVV EPHHDGTPHW HMMLFCNPRQ RNQIIEIMRR YALKEDGDER GAARNRFQAK HLNRGGAAGY IAKYISKNID GYALDGQLDN DTGKPLKDTA AAVTAWASTW RIPQFKTVGL PTMGAYRELR KLPRGVSIAD EFDERVEAAR AAADSGDFAL YISAQGGANV PRDCQTVRVA RSPSDDVNEY EEEVERVVGI YAPHLGARHI HITRTTDWRI VPKVPVVEPL TLKSGIAAPR SPVNNCGKLT GGDTSLPAPT PTEHAAAVLN LIDEGILSWT EPGIMKVLRD LLSNELKCSN RSQQSFTPFN GRSYFPAPSA RLTRQERKTI PKIKVLLAQS DIQASYWELE ALARGAVLDF GHKRFKFDND IDFFDRQREW
|
| |