Gene ECH74115_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1158 
Symbol 
ID6972156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1178633 
End bp1179682 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content53% 
IMG OID643385158 
Producthypothetical protein 
Protein accessionYP_002269657 
Protein GI209396179 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.53743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGTAT TACTTCGACC TGTTCTGGTA CCGGAACTCG GTCTGGTTAT CGTTAAGCCA 
GGCCGTGAAT CAATGTCAGC ATTCCATAAC GGCAGAATAC TGGTGGAGCC GGAACCAAAA
AGCATGCGAG CTCTGCCGTC CGGGGTTGTA CCTGCCGTTC ACCAGCCGCT GGCGGAAGAT
AAATCACTAC TGCCATTTTT CAGCGATGAG CGGGTGATCC GTGCTGCGGG TGGCGCTGGT
GCACTGTCTG ACTGGTTATT ACGTCACGTG AAATCCTGCC AGTGGCCTCA TGGTGATTAT
CATCACAGCG AAACTGTCAT ACATCGTTAC GGTACCGGCG CGATGGTGTT GTGCTGGCAC
TGCGACAACC AGCTGCGCGA CCAGACATCC GAATCACTTG AGCAACTTGC TCAACAGAAT
CTGGCCGCCT GGATGATTGA CGTCATCCGC CACGCAATGA ATGGCATACA GGAACGGGAA
TTATCGCTGG CTGAATTATC CTGGTGGGCA GTCTGCAATC AGGTGGTGGA CGCATTACCT
GAGGCAGTAT CGCGTCGTTC TCTGGGATTA CCGGCGGAAA AAATCCGCTC CGTATACCGT
GAAAGCGACA TCATACCGGG AGAACAGACC GCCACCAGCA TACTGAAGCA GCGCACAAAA
AATATTGCGC TACCGCCTCA CACCCACCAG CAACAGAACC CACCACAGGA AAAGACGGTG
GTCAGCATTG CCGTTGATCC GGAGTCTCCG AAATCCTTCA TGAAACGACC TAAACGTCGC
CGCTGGGTAA ATGAGAAATA CACACGCTGG GTAAAGACAC AGCCGTGTGC GTGTTGTGGT
AAGCCAGCGG ACGATCCTCA TCATCTGATT GGTCATGGTC AGGGTGGAAT GGGAACAAAA
TCCCACGATA TTTTCACGCT ACCGCTGTGT CGGGAGCATC ACAACGAGCT TCATGCGGAT
CCGCTGGCGT TCGAAGAAAA GCATGGTTCC CAGGTTGATT TAATTTTTCG TTTTCTTGAT
CACGCTTTTG CAACCGGCGT GCTCGGGTAA
 
Protein sequence
MRVLLRPVLV PELGLVIVKP GRESMSAFHN GRILVEPEPK SMRALPSGVV PAVHQPLAED 
KSLLPFFSDE RVIRAAGGAG ALSDWLLRHV KSCQWPHGDY HHSETVIHRY GTGAMVLCWH
CDNQLRDQTS ESLEQLAQQN LAAWMIDVIR HAMNGIQERE LSLAELSWWA VCNQVVDALP
EAVSRRSLGL PAEKIRSVYR ESDIIPGEQT ATSILKQRTK NIALPPHTHQ QQNPPQEKTV
VSIAVDPESP KSFMKRPKRR RWVNEKYTRW VKTQPCACCG KPADDPHHLI GHGQGGMGTK
SHDIFTLPLC REHHNELHAD PLAFEEKHGS QVDLIFRFLD HAFATGVLG