Gene ECH74115_4059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4059 
Symbol 
ID6970975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3753105 
End bp3754469 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content51% 
IMG OID643387818 
Producthypothetical protein 
Protein accessionYP_002272261 
Protein GI209399193 
COG category[R] General function prediction only 
COG ID[COG1611] Predicted Rossmann fold nucleotide-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0267999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATTACAC ATATTAGCCC GCTTGGCTCC ATGGATATGT TGTCGCAGCT GGAAGTGGAT 
ATGCTTAAAC GCACCGCCAG CAGCGACCTC TATCAACTGT TTCGCAACTG TTCACTTGCC
GTACTGAACT CCGGTAGTTT GACCGATAAC AGCAAAGAAT TGCTGTCTCG TTTTGAAAAT
TTCGATATTA ACGTCTTGCG CCGTGAACGC GGCGTAAAGC TGGAACTGAT TAATCCCCCG
GAAGAGGCTT TTGTCGATGG GCGAATTATT CGCGCTTTGC AGGCCAACTT GTTCGCGGTT
CTGCGAGACA TTCTCTTCGT TTACGGGCAA ATCCATAATA CCGTTCGTTT TCCCAACCTG
AATCTCGACA ACTCCGTCCA CATCACTAAC CTGGTCTTTT CCATCTTGCG TAACGCTCGC
GCGCTGCATG TGGGTGAAGC GCCAAATATG GTGGTCTGCT GGGGCGGTCA CTCAATTAAC
GAAAATGAGT ATTTGTATGC CCGTCGCGTC GGAAACCAGC TGGGCCTGCG TGAGCTGAAT
ATCTGCACCG GCTGTGGTCC GGGAGCGATG GAAGCGCCGA TGAAAGGTGC TGCGGTCGGA
CACGCGCAGC AGCGTTACAA AGACAGTCGT TTTATTGGTA TGACAGAGCC GTCGATTATC
GCCGCTGAAC CGCCTAACCC GCTGGTCAAC GAATTGATCA TCATGCCGGA TATCGAAAAA
CGTCTGGAAG CGTTTGTCCG TATCGCTCAC GGCATCATTA TCTTCCCTGG CGGTGTGGGT
ACGGCAGAAG AGTTGCTGTA TTTGCTGGGA ATTTTAATGA ACCCGGCCAA CAAAGATCAG
GTTTTACCAT TGATCCTCAC CGGCCCGAAA GAGAGCGCCG ACTACTTCCG CGTACTGGAC
GAGTTTGTCG TACATACGCT GGGCGAAAAC GCGCGCCGCC ATTACCGCAT AATCATTGAT
GACGCCGCTG AAGTCGCCCG TCAGATGAAA AAATCGATGC CGCTGGTGAA AGAAAATCGC
CGTGATACAG GCGATGCCTA CAGCTTTAAC TGGTCAATGC GCATTGCGCC AGATTTGCAA
ATGCCATTTG AGCCGTCTCA CGAGAATATG GCTAATCTGA AGCTTTACCC GGATCAACCT
GTTGAAGTGC TGGCTGCCGA TCTGCGCCGT GCGTTCTCCG GTATTGTGGC GGGTAACGTA
AAAGAAGTCG GTATTCGCGC CATTGAAGAG TTTGGTCCTT ACAAAATCAA CGGCGATAAA
GAGATTATGC GTCGTATGGA CGACCTGCTA CAGGGTTTTG TTGCCCAGCA TCGTATGAAG
TTGCCAGGCT CAGCCTACAT CCCTTGCTAC GAAATCTGCA CGTAA
 
Protein sequence
MITHISPLGS MDMLSQLEVD MLKRTASSDL YQLFRNCSLA VLNSGSLTDN SKELLSRFEN 
FDINVLRRER GVKLELINPP EEAFVDGRII RALQANLFAV LRDILFVYGQ IHNTVRFPNL
NLDNSVHITN LVFSILRNAR ALHVGEAPNM VVCWGGHSIN ENEYLYARRV GNQLGLRELN
ICTGCGPGAM EAPMKGAAVG HAQQRYKDSR FIGMTEPSII AAEPPNPLVN ELIIMPDIEK
RLEAFVRIAH GIIIFPGGVG TAEELLYLLG ILMNPANKDQ VLPLILTGPK ESADYFRVLD
EFVVHTLGEN ARRHYRIIID DAAEVARQMK KSMPLVKENR RDTGDAYSFN WSMRIAPDLQ
MPFEPSHENM ANLKLYPDQP VEVLAADLRR AFSGIVAGNV KEVGIRAIEE FGPYKINGDK
EIMRRMDDLL QGFVAQHRMK LPGSAYIPCY EICT