Gene ECH74115_3058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3058 
Symbol 
ID6971479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2833462 
End bp2835744 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content52% 
IMG OID643386890 
Productbacteriophage replication gene A protein 
Protein accessionYP_002271358 
Protein GI209398781 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00583111 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000000000126727 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCCGTTA AAGCCTCCGG GCGTTTTGTC CCTCCGTCAG CATTTGCTGC AGGCACCGGT 
AAGGCGTTTA CCGGTGCTTA TGCATGGAAC GCGCCACGCG AGGCCGTCGG GCGCGAAAGA
CCCCTTACAC GTGACGAGAT GCGTCAGGTG CAAGGTGTTT TATCCACGAT TAACCGCCTG
CCTTACTTTT TGCGCTCGCT GTTTACTTCA CGCTATGACC ACATCCGGCG CAATAAAAGC
CCGGTGCACG GGTTTTATTT CCTCACATCC ACTTTTCAGC GTCGTTTATG GCCGCGCATT
GAGCGTGTGA ATCAGCGCCA TGAAATGAAC ACCGACGCGT CGTTACTGTT TCTGGCAGAG
CGTGACCACT ATGCGCGCCT GCCGGGAATG AATGACAAGG AGCTGAAAAA GTTTGCTGCC
CGTATCTCAT CGCAGCTTTT CATGATGTAT GGGGAACTCA GTGATGCCTG GGTGGATGCG
CATGGCGAAA AAGAATCGCT GTTTACGGAT GAGGCGCAGG CTCACCTCTA TGGTCATGTT
GCTGGCGCTG CACGTGCTTT CAATATTTCC CCTCTCTACT GGAAAATATA CCGTAAAGGG
CAGATGACCA CGAGGCAGGC ATATTCTGCC ATTGCCCGTC TGTTTAACGA TGAGTGGTGG
ACTCATCAGC TTAAAGGCCA GCGTATGCGC TGGCATGAAG CGTTACTGAT AGCTGTCGGG
GAGGTCAATA AAGACCGTTC TCCTTATGCC AGTAAACACG CCATTCGTGA TGTGCGTGCG
CGCCGCCAGG CAAATCTGGA ATTTCTTAAA TCGTGTGACC TTGAAAACAG GGAAACCGGC
GAGCGCATCG ACCTTATCAG TAAGGTGATG GGCAGTATTT CTAATCCTGA AATTCGCCGG
ATGGAGCTGA TGAACACCAT TGCCGGTATT GAGCGTTACG CCGCCGCAGA GGGTGATGTG
GGGATGTTTA TCACGCTGAC CGCGCCGTCA AAGTATCACC CGACACGTCA GGTCAGAAAA
GGCGAAAGTA AAACCGTTCA GCTTAATCAC GGCTGGAACG ATGAGGCATT TAATCCAAAG
GATGCGCAGC GTTATCTCTG CCGCATCTGG AGCCTGATGC GCACGGCATT CAAGGATAAT
GATTTACAGG TCTACGGTTT GCGTGTCGTC GAGCCACACC ACGACGGAAC GCCGCACTGG
CATATGATGC TTTTTTGTAA TCCACGCCAG CGTAACCAGA TTATCGAAAT CATGCGTCGC
TACGCGCTCA AAGAGGATGG AGACGAAAGA GGAGCTGCGC GAAACCGTTT TCAGGCAAAA
CACCTTAACC GGGGCGGTGC TGCGGGATAT ATCGCGAAAT ACATTTCAAA AAACATCGAC
GGCTATGCAC TGGATGGTCA GCTCGATAAC GATACCGGTA AGCCGCTTAA AGATACTGCC
GCGGCTGTTA CCGCATGGGC GTCAACGTGG CGCATTCCGC AATTTAAAAC GGTTGGACTG
CCGACAATGG GGGCTTACCG TGAACTACGC AAATTGCCTC GCGGCGTCAG TATTGCTGAT
GAGTTTGACG AACGCGTCGA GGCTGCTCGC GCTGCCGCAG ACAGTGGTGA TTTTGCGTTG
TATATCAGCG CGCAGGGTGG GGCAAATGTC CCGCGCGATT GTCAGACTGT CAGGGTCGCC
CGTAGCCCGT CGGATGACGT TAACGAGTAC GAGGAAGAAG TCGAGAGAGT GGTCGGCATT
TACGCGCCGC ATCTCGGCGC GCGTCATATT CATATCACCA GAACGACGGA CTGGCGCATT
GTGCCGAAAG TTCCGGTCGT TGAGCCTTTG ACTTTAAAAA GCGGCATCGC CGCGCCTCGG
AGTCCTGTCA ATAACTGTGG AAAACTCACC GGTGGTGATA CTTCGTTACC GGCTCCCACA
CCGACTGAGC ATGCTGCAGC GGTGTTAAAT TTAATAGATG AAGGGATTCT AAGTTGGACC
GAGCCAGGCA TTATGAAGGT ACTTAGAGAC TTATTGAGTA ATGAACTGAA ATGCAGTAAT
CGTTCACAGC AGAGCTTTAC CCCTTTCAAT GGTCGAAGCT ACTTTCCTGC CCCATCCGCC
CGGTTGACAA GGCAGGAGAG AAAGACTATC CCGAAAATTA AGGTTCTTCT TGCACAAAGT
GACATTCAGG CTAGTTATTG GGAGCTTGAA GCTCTCGCTC GCGGAGCTGT TTTGGATTTT
GGGCATAAGC GTTTTAAATT TGATAACGAT ATCGACTTTT TTGACAGACA GCGTGAGTGG
TAG
 
Protein sequence
MAVKASGRFV PPSAFAAGTG KAFTGAYAWN APREAVGRER PLTRDEMRQV QGVLSTINRL 
PYFLRSLFTS RYDHIRRNKS PVHGFYFLTS TFQRRLWPRI ERVNQRHEMN TDASLLFLAE
RDHYARLPGM NDKELKKFAA RISSQLFMMY GELSDAWVDA HGEKESLFTD EAQAHLYGHV
AGAARAFNIS PLYWKIYRKG QMTTRQAYSA IARLFNDEWW THQLKGQRMR WHEALLIAVG
EVNKDRSPYA SKHAIRDVRA RRQANLEFLK SCDLENRETG ERIDLISKVM GSISNPEIRR
MELMNTIAGI ERYAAAEGDV GMFITLTAPS KYHPTRQVRK GESKTVQLNH GWNDEAFNPK
DAQRYLCRIW SLMRTAFKDN DLQVYGLRVV EPHHDGTPHW HMMLFCNPRQ RNQIIEIMRR
YALKEDGDER GAARNRFQAK HLNRGGAAGY IAKYISKNID GYALDGQLDN DTGKPLKDTA
AAVTAWASTW RIPQFKTVGL PTMGAYRELR KLPRGVSIAD EFDERVEAAR AAADSGDFAL
YISAQGGANV PRDCQTVRVA RSPSDDVNEY EEEVERVVGI YAPHLGARHI HITRTTDWRI
VPKVPVVEPL TLKSGIAAPR SPVNNCGKLT GGDTSLPAPT PTEHAAAVLN LIDEGILSWT
EPGIMKVLRD LLSNELKCSN RSQQSFTPFN GRSYFPAPSA RLTRQERKTI PKIKVLLAQS
DIQASYWELE ALARGAVLDF GHKRFKFDND IDFFDRQREW