Gene ECH74115_5801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5801 
Symbol 
ID6967343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5433847 
End bp5434950 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content38% 
IMG OID643389429 
Producthypothetical protein 
Protein accessionYP_002273821 
Protein GI209395990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0386752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAT CTATCGACAT TTCAGAACTT ATTCAATTAG GGAAGAAAAT GTTACCAGAA 
GGAGTCGATT TTTTTCTGGA TGAATCCCCT ATTGACTTTG ATCCTATAGA TATTGAGTTA
TCCACGGGTA AAGAAGTTAG TATCGAAGAT CTTGACCCTG GTAGCGGGCT TATCTCTTAT
CATGGCCGCC AGGTTCTTTT ATATATTCGG GACCATTCAG GGCGTTATGA TGCGGCTATC
GTAGATGGCG AAAAAGGAAA ACGTTTTCAT ATTGCCTGGT GCAGAACTCT TGATGAAATG
CGCCATAAAA ATCGATTTGA AAGGTATCAT GCAACTAACC GCATAGATGG TTTATTCGAA
ATTGATGATG GTTCAGGTCG GAGCCAGGAT GTTGATTTAC GGGTATGTAT GAATTGCCTC
GAACGACTTA ATTATAAAGG AAGTATTGAT AAACAACGAA AAAGAGAGAT TTTTAAATCA
TTCTCATTAA ATGAGTTTTT TTCAGATTAT AGTACTTGTT TTCGTCATAT GCCTAAGGGT
ATCTATGACA AAACAAATAG TGGGTATGTC GAAAACTGGA AGGAAATATC TAAAGAAATA
CGAGAAAAGG CAAATTATGT TTGTAATGAT TGTGGCGTGA ATTTATCAAC CGCCAAAAAC
TTGTGCCATG TCCATCATAA AAATGGCATC AAATATGATA ATCACCATGA AAACCTTCTT
GTTCTGTGCA AGGATTGCCA TCGAAAACAG CCCCTCCACG AAGGTATATT CGTTACCCAA
GCAGAGATGG CTATCATTCA ACGTTTACGT TCCCAACAAG GGTTATTAAA AGCAGAATCC
TGGAATGAAA TATATGACCT GACTGATCCA TCAGTGCATG GTGATATTAA TATGATGCAA
CATAAAGGCT TTCAACCTCC TGTTCCTGGG TTAGATCTTC CAAACTCAGA ACATGAAATT
ATTGCAACCG TAGAAGCTGC ATGGCCAGGC CTTAAAATTG CAGTTAACCT TACTCCCGCC
GAAGTCGAAG GATGGAGAAT ATATACCGTG GGTGAGCTGG TTAAAGAAAT ACAAACAGGA
GCCTTTACGC CAGCAAAATT GTAA
 
Protein sequence
MKLSIDISEL IQLGKKMLPE GVDFFLDESP IDFDPIDIEL STGKEVSIED LDPGSGLISY 
HGRQVLLYIR DHSGRYDAAI VDGEKGKRFH IAWCRTLDEM RHKNRFERYH ATNRIDGLFE
IDDGSGRSQD VDLRVCMNCL ERLNYKGSID KQRKREIFKS FSLNEFFSDY STCFRHMPKG
IYDKTNSGYV ENWKEISKEI REKANYVCND CGVNLSTAKN LCHVHHKNGI KYDNHHENLL
VLCKDCHRKQ PLHEGIFVTQ AEMAIIQRLR SQQGLLKAES WNEIYDLTDP SVHGDINMMQ
HKGFQPPVPG LDLPNSEHEI IATVEAAWPG LKIAVNLTPA EVEGWRIYTV GELVKEIQTG
AFTPAKL