Gene ECH74115_5519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5519 
Symbol 
ID6967208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5168239 
End bp5169819 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content32% 
IMG OID643389162 
Producthypothetical protein 
Protein accessionYP_002273559 
Protein GI209398240 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0795524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGGTC ATATCTCAAA GTTTGACGGC AATAACTCTT TGATAAAACA TGGTGTGGTG 
CAAGGAAATA ATATAGTAGA TTTTGATTTA CTACGTAATT TTAATGGGGG GCCAGGGTTA
AATCGAGAAA ACTTTATTTA TATCAGCAAT ATTTTTTTAA ATATAAAACA ACGGAACGAA
AAAAATCATT CAATAAATAT GTTTCGTGAA GTCTCAATCA GTGGTGATAT TGTAAGCGTA
AAATTTTATA GAAATGAAAA AATAGAATGC GCTTGTGATT TTATGATGGC TAAAGATGCG
CAGGGGTATA TCGACCTGTC TGAATTGGAT TTAACAAGTT GTCATTTTAA AGGTGACGTT
ATTTCGAAGG TGTCTTTCAT ATCATCAAAT CTACAACATG TAACATTCGA ATGTAAAGAA
ATTGGGGATT GCAATTTTAC TACTGCAATA GTTGATAATG TCATATTTAA ATGTCGACGT
TTACACAATG TAATTTTTAT CAAAGCGAGT GGTGATTATG TCGATTTTAG CAAAAATATT
CTTGATACAG TTGACTTCTC GCAGAGTCAA CTTACTCATA GTAATTTTTG TGAATGTCAG
ATTAGAAATT CAAACTTCGA TCATTGTTAT CTTTATGCTT CGCACTTCAC CAGAGCAGAA
TTTCTTACTG ACAAAGAAAT ATCATTTATT AAATCGAATT TAACAGCTGT TATGTTTGAT
CATGTGCGAA TATCGACAGG GAATTTTAAA GACAGCGTTA CACAACTAAT GGTATTATCT
ATTGATTACT CAGATATATT TGGAAATGAA TATCTCGATG GTTATATCAA TAACATTATA
AAAATGATTG ATTCGTTGCC AGATGATCCA GCGATATTGA AATCCGTTCT GGCAGTAAAA
CTGGTGATGC AATTAAAAAT TCTTAATATT GTTAATAAAA ACTTTATTGA GAATATGAAG
AAAATATTTA GCCATGGTCC TTATATAAAA GATCCCATTA TACGTAGTTA TATCCATCCT
GATGAAGATA ACAAGTTCGA TAATTTTATG CGTCAAAATC GATTCAGTAA GGTGAATTTC
GATACCCAAC AGATGATCGA TTTTATTAAC AGATTTAATA TGAATAAATG GCTGATTGAT
CGAAATAACA ATTTTTTTAT CCAACTTATC GATCAGGCTC TACGATCAAC GAATGATACG
ATCAAAGAAA ATGCCTGGCA TCTTTATAAA GAGTGGATTC GTAGTGATGA TGTTTCACCT
TTATTTATAG AAATTGAAGA TAATTTAAGA ACCTTTAACA CGAATGAATT AACACGAAAC
GATAATATCT TTATCTTGTT CTCCTCTGTC GATGATGGGC CAGTTATGGT GGTAAGCTCC
CAGCGCTTAC ATGATATGTT GAATCCTACA AAAGATACCA ATTGGAATTC CACGTATATC
TATAAATCCA GACATGAGAT GTTGCCTGTT AATCTTACTC CGGAAACACT TTTCGGCTCC
AAATCTTATG ATAAACATGC GCTTTTCCCC ATTTTTACTG CGAGTTGGCG AGCTAATCGT
ATAAAGAATA AAGGTATTTA A
 
Protein sequence
MLGHISKFDG NNSLIKHGVV QGNNIVDFDL LRNFNGGPGL NRENFIYISN IFLNIKQRNE 
KNHSINMFRE VSISGDIVSV KFYRNEKIEC ACDFMMAKDA QGYIDLSELD LTSCHFKGDV
ISKVSFISSN LQHVTFECKE IGDCNFTTAI VDNVIFKCRR LHNVIFIKAS GDYVDFSKNI
LDTVDFSQSQ LTHSNFCECQ IRNSNFDHCY LYASHFTRAE FLTDKEISFI KSNLTAVMFD
HVRISTGNFK DSVTQLMVLS IDYSDIFGNE YLDGYINNII KMIDSLPDDP AILKSVLAVK
LVMQLKILNI VNKNFIENMK KIFSHGPYIK DPIIRSYIHP DEDNKFDNFM RQNRFSKVNF
DTQQMIDFIN RFNMNKWLID RNNNFFIQLI DQALRSTNDT IKENAWHLYK EWIRSDDVSP
LFIEIEDNLR TFNTNELTRN DNIFILFSSV DDGPVMVVSS QRLHDMLNPT KDTNWNSTYI
YKSRHEMLPV NLTPETLFGS KSYDKHALFP IFTASWRANR IKNKGI