Gene ECH74115_0079 details


Gene Information

Locus tag: ECH74115_0079
Symbol: leuC
ID: 6968726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 84052
End bp: 85452
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 56%
IMG OID: 643384157
Product: isopropylmalate isomerase large subunit
Protein accession: YP_002268680
Protein GI: 209397303
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
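
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 84052
end_bp = 85452
reported_gene_length = 1401      # bp
reported_protein_length = 466    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons. The last codon is
# assumed to be the stop codon, which does not appear in the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1401 0 466
```

Under these assumptions the derived values agree with the reported Gene Length and Protein Length.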


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGA CGTTATACGA AAAATTATTC GACGCTCACG TAGTGTACGA AGCCGAAAAT 
GAAACCCCGC TGTTATATAT CGACCGCCAC CTGGTGCATG AAGTGACCTC ACCGCAGGCG
TTCGATGGTC TGCGCGCCCA CGGTCGCCCG GTACGTCAGC CGGGCAAAAC CTTCGCCACC
ATGGATCACA ACGTCTCTAC CCAGACTAAA GACATTAATG CCTGCGGTGA AATGGCGCGC
ATCCAGATGC AGGAGCTGAT CAAAAACTGC AAAGAATTTG GCGTCGAACT GTATGACCTG
AATCACCCCT ATCAAGGGAT CGTCCACGTA ATGGGGCCGG AACAGGGCGT GACCTTGCCG
GGGATGACCA TTGTCTGCGG CGACTCGCAT ACCGCCACCC ACGGCGCGTT TGGCGCACTG
GCCTTTGGTA TCGGCACTTC CGAAGTTGAA CACGTACTGG CAACGCAAAC CCTGAAACAG
GGCCGTGCGA AGACCATGAA AATTGAAGTC CAGGGCAAAG CCGCGCCGGG CATTACCGCA
AAAGATATCG TGTTGGCAAT TATCGGTAAA ACCGGTAGCG CAGGCGGCAC CGGGCATGTG
GTGGAGTTTT GCGGCGAAGC AATCCGTGAT TTAAGCATGG AAGGTCGGAT GACCCTGTGC
AATATGGCAA TCGAAATGGG CGCGAAAGCC GGTCTGGTTG CACCTGACGA AACCACCTTT
AACTATGTCA AAGGCCGTCT GCATGCGCCG AAAGGCAAAG ATTTCGACGA CGCCGTTGCC
TACTGGAAAA CCCTGCAAAC CGACGAAGGC GCAACTTTCG ATACCGTTGT CACTTTGCAG
GCAGAAGAGA TTTCACCGCA GGTCACCTGG GGAACTAACC CAGGCCAGGT GATTTCCGTG
AACGACAATA TTCCCGATCC GGCTTCGTTT GCCGATCCGG TTGAACGCGC GTCGGCAGAA
AAAGCGTTGG CCTATATGGG GCTGAAACCG GGTATTCTGC TGACCGAAGT GGCTATCGAC
AAAGTGTTTA TCGGTTCCTG CACCAACTCA CGTATTGAAG ATTTACGCGC GGCGGCGGAA
ATCGCCAAAG GGCGGAAAGT CGCGCCAGGC GTACAGGCGC TGGTGGTTCC CGGCTCTGGT
CCGGTAAAAG CGCAGGCGGA AGCAGAAGGT CTGGATAAAA TCTTTATTGA AGCCGGTTTT
GAATGGCGCT TGCCTGGCTG CTCAATGTGT CTGGCGATGA ACAACGACCG TCTGAATCCG
GGCGAACGTT GTGCCTCCAC CAGCAACCGT AACTTTGAAG GCCGTCAGGG GCGCGGCGGG
CGCACGCATC TGGTCAGCCC GGCAATGGCT GCCGCCGCGG CTGTGACCGG CCATTTCGCC
GACATTCGCA ACATTAAATA A
 
Protein sequence
MAKTLYEKLF DAHVVYEAEN ETPLLYIDRH LVHEVTSPQA FDGLRAHGRP VRQPGKTFAT 
MDHNVSTQTK DINACGEMAR IQMQELIKNC KEFGVELYDL NHPYQGIVHV MGPEQGVTLP
GMTIVCGDSH TATHGAFGAL AFGIGTSEVE HVLATQTLKQ GRAKTMKIEV QGKAAPGITA
KDIVLAIIGK TGSAGGTGHV VEFCGEAIRD LSMEGRMTLC NMAIEMGAKA GLVAPDETTF
NYVKGRLHAP KGKDFDDAVA YWKTLQTDEG ATFDTVVTLQ AEEISPQVTW GTNPGQVISV
NDNIPDPASF ADPVERASAE KALAYMGLKP GILLTEVAID KVFIGSCTNS RIEDLRAAAE
IAKGRKVAPG VQALVVPGSG PVKAQAEAEG LDKIFIEAGF EWRLPGCSMC LAMNNDRLNP
GERCASTSNR NFEGRQGRGG RTHLVSPAMA AAAAVTGHFA DIRNIK
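
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the gene sequence. It is an illustration only: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table assumes that translation table 11 assigns sense codons the same way as the standard genetic code (the two differ in which codons may act as starts, which does not matter here because this gene begins with ATG).

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering.
# Assumption: table 11 sense-codon assignments match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Only A, C, G and T are handled; any other character raises KeyError.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCTAAGA CGTTATACGA AAAATTATTC"
print(translate(clean_sequence(first_blocks)))  # MAKTLYEKLF
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` gives values that can be compared against the Protein sequence and GC content fields of the record.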