Gene ECH74115_5122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5122 
Symbol 
ID6971662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4762150 
End bp4763400 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content54% 
IMG OID643388794 
Producthypothetical protein 
Protein accessionYP_002273220 
Protein GI209398871 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.116872 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCGG GAACCGTTTT GTATCAGGAC CGCGCCATGA AACAGATAAC CTTTGCTCCC 
CGTAATCACC TGCTCACCAA TACCAATACC TGGACGCCCG ACAGCCAGTG GCTGGTATTT
GACGTGCGTC CTTCTGGCGC GTCGTTTACC GGCGAGACCA TTGAGCGTGT GAATATCCAT
ACCGGCGAGG TCGAGGTTAT CTATCGCGTG TCACAGGGCG CGCACGTCGG CGTGGTGACC
GTTCATCCAA AGTCAGAGAA ATATGTCTTT ATCCACGGAC CTGAAAATCC TCATGAAACA
TGGCATTACG ATTTCCATCA CCGTCGCGGC GTGATTGTTG AAGGCGGCAA GATGAACAAT
CTCGATGCAA TGGATATTAC CGTTCCGTAC ACATCAGGGG CGCTGCGTGG CGGCAGCCAT
GTGCATGTCT TTAGCCCGAA CGGTGAAATG GTGAGTTTTA CCTATAACGA CCATGTAATG
CATCAGTTCG ATTCGGCGCT GGATTTGCGA AACGTCGGGG TTGCTGCACC GTTTGGCCCG
GTCAACGTAC AAAAGCAGCA TCCGCGTGAA TACAGCGGTA GCCACTGGTG CGTGCTGGTG
AGTAAAACCA CGCCCACGCC GCAGCCTGGC AGCGATGAAA TCAATCGTGC TTATGAAGAA
GGATGGGTAG GAAATCACGC GCTGGCGTTT ATTGGCGACA CACTTTCGCC AAAGGGCGAG
AAAGTGCCGG AGCTGTTTAT CGTTGAGTTA CCGCAAGATG AAGCTGGCTG GAAAGCAGCA
GGTGATGCGC CGTTAAGTGG AACGGAAACA ACCCTGCCCG CGCCACCGCG TGGCATCGTG
CAGCGACGTT TAACCTTTAC CCACCATCGG GCTTATCCGG GGTTAGTCAA CGTCCCGCGC
CACTGGGTGC GCTGTAATCC GCAGGGTACG CAAATCGCGT TTTTAATGCG AGATGATAAC
GGCATTGTGC AACTGTGGCT TATCTCGCCA CAGGGCGGCG AGCCGCGCCA GTTAACCCAT
AACAAAACGG ATATTCAGTC TGCATTTAAC TGGCATCCGT CAGGAGAATG GTTGGGCTTT
GTGCTGGGTA ATCGAATTGC TTGTGCCCAT GCGCAAAGCG GCGAGGTTGA GTATTTAACC
GAAAACCACG CCAATCCACC TTCTGCGGAC GCCGTGGTCT TCTCGCCGGA TGGTCAGTGG
CTGGCGTGGA TGGAAGGCGG CCAGCTGTGG ATCACCGAAA CTGATCGCTA A
 
Protein sequence
MMAGTVLYQD RAMKQITFAP RNHLLTNTNT WTPDSQWLVF DVRPSGASFT GETIERVNIH 
TGEVEVIYRV SQGAHVGVVT VHPKSEKYVF IHGPENPHET WHYDFHHRRG VIVEGGKMNN
LDAMDITVPY TSGALRGGSH VHVFSPNGEM VSFTYNDHVM HQFDSALDLR NVGVAAPFGP
VNVQKQHPRE YSGSHWCVLV SKTTPTPQPG SDEINRAYEE GWVGNHALAF IGDTLSPKGE
KVPELFIVEL PQDEAGWKAA GDAPLSGTET TLPAPPRGIV QRRLTFTHHR AYPGLVNVPR
HWVRCNPQGT QIAFLMRDDN GIVQLWLISP QGGEPRQLTH NKTDIQSAFN WHPSGEWLGF
VLGNRIACAH AQSGEVEYLT ENHANPPSAD AVVFSPDGQW LAWMEGGQLW ITETDR