Gene ECH74115_2324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2324 
Symbol 
ID6971978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2195948 
End bp2197456 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content49% 
IMG OID643386202 
Producthypothetical protein 
Protein accessionYP_002270686 
Protein GI209400318 
COG category[S] Function unknown 
COG ID[COG5339] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT CGCTGGTAGC GGTAGGCGTC ATTGTTGCGC TAGGCGTAGT CTGGACAGGC 
GGCGCATGGT ATACAGGCAA GAAGATTGAA ACCCATCTCG AAGACATGGT CGCGCAGGCG
AACGCGCAAC TCAAACTGAC CGCTCCTGAA TCCAACCTGG AAGTGAGTTA TCAAAACTAT
CATCGCGGCG TATTCAGCAG TCAGCTGCAA CTGTTGGTGA AACCCATTGC CGGAAAAGAA
AATCCGTGGA TTAAAAGCGG TCAGAGCGTC ATCTTCAACG AATCGGTTGA TCATGGTCCC
TTCCCCCTTG CCCAGCTTAA AAAACTGAAC CTGATCCCGT CGATGGCATC AATTCAAACC
ACGCTGGTTA ATAACGAAGT AAGCAAACCA CTGTTTGATA TGGCAAAAGG TGAAACGCCT
TTTGAGATTA ACTCGCGCAT TGGTTACAGC GGTGATTCCA GTTCCGATAT TTCGCTCAAG
CCACTAAATT ACGAGCAAAA GGATGAAAAA GTCGCCTTTA GCGGCGGCGA GTTCCAGTTA
AATGCGGACA GAGACGGCAA AGCTATCTCC CTTTCCGGGG AGGCGCAAAG TGGTCGGATA
GACGCGGTTA ACGAATACAA CCAGAAAGTA CAGTTGACCT TTAATAATCT GAAAACCGAC
GGTTCCAGCA CGCTGGCAAG TTTTGGTGAG CGCGTAGGAA ACCAAAAACT GTCACTGGTA
AAAATGACCA TTTCAGTGGA AGGCAAAGAA CTGGCACTGC TGGAAGGCAT GGAGATCAGC
GGTAAATCGG ATCTGGTCAA TGACGGTAAA ACGATCAATA GCCAACTGGA TTACTCGCTA
AACAGCCTGA AGGTACAGAA TCAGGATCTG GGCAGCGGCA AGCTGACTTT AAAAGTCGGC
CAAATTGATG GCGAAGCCTG GCATCAGTTT AGCCAGCAAT ATAACGCGCA AACTCAGGCG
CTGCTGGCAC AGCCAGAAAT TGCCAACAAT CCCGAACTTT ATCAGGAGAA AGTGACGGAA
GCCTTCTTTA GCGCCCTGCC GCTGATGTTG AAAGGCGATC CGGTGATTAC TATCGCGCCG
CTAAGCTGGA AAAACAGTCA GGGTGAAAGT GCGCTGAATC TGTCGCTGTT CCTGAAAGAT
CCGGCAACGA CTAAAGAAGC GCCGCAAACG CTGGCGCAGG AAGTAGATCG TTCGGTTAAA
TCTCTGGATG CGAAACTGAC CATTCCGGTG GATATGGCAA CTGAGTTGAT GACTCAGGTA
GCGAAGCTGG AAGGTTATCA GGAAGATCAA GCGAAAAAAC TGGCGAAACA GCAAGTTGAA
GGTGCATCAG CAATGGGGCA GATGTTCCGT CTGACCACCT TGCAGGACAA TACCATCACC
ACCAGCCTGC AATATACTAA CGGTCAGATA ACGTTAAACG GGCAGAAAAT GCCACTGGAA
GATTTCGTTG GTATGTTTGC AATGCCGGCA TTAAATGTTC CGGTCGTACC CGCTATTCCG
CAGCAGTAA
 
Protein sequence
MNKSLVAVGV IVALGVVWTG GAWYTGKKIE THLEDMVAQA NAQLKLTAPE SNLEVSYQNY 
HRGVFSSQLQ LLVKPIAGKE NPWIKSGQSV IFNESVDHGP FPLAQLKKLN LIPSMASIQT
TLVNNEVSKP LFDMAKGETP FEINSRIGYS GDSSSDISLK PLNYEQKDEK VAFSGGEFQL
NADRDGKAIS LSGEAQSGRI DAVNEYNQKV QLTFNNLKTD GSSTLASFGE RVGNQKLSLV
KMTISVEGKE LALLEGMEIS GKSDLVNDGK TINSQLDYSL NSLKVQNQDL GSGKLTLKVG
QIDGEAWHQF SQQYNAQTQA LLAQPEIANN PELYQEKVTE AFFSALPLML KGDPVITIAP
LSWKNSQGES ALNLSLFLKD PATTKEAPQT LAQEVDRSVK SLDAKLTIPV DMATELMTQV
AKLEGYQEDQ AKKLAKQQVE GASAMGQMFR LTTLQDNTIT TSLQYTNGQI TLNGQKMPLE
DFVGMFAMPA LNVPVVPAIP QQ