Gene EcE24377A_0075 details

Gene Information

Locus tag: EcE24377A_0075
Symbol: leuC
ID: 5587213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 81154
End bp: 82554
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 55%
IMG OID: 640923806
Product: isopropylmalate isomerase large subunit
Protein accession: YP_001461243
Protein GI: 157157583
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
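
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(81154, 82554)
print(length)                  # 1401
print(protein_length(length))  # 466
```

Both values agree with the Gene Length and Protein Length fields listed in this record.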


Plasmid Coverage information

Num covering plasmid clones: 43
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAAA CGTTATACGA AAAATTATTC GACGCTCACG TAGTGTACGA AGCCGAAAAC 
GAAACCCCGC TGTTATATAT CGACCGCCAC CTGGTGCATG AAGTGACCTC ACCGCAGGCG
TTTGATGGTC TGCGCGCCCA CGGTCGCCCG GTACGTCAGC CGGGCAAAAC CTTCGCCACT
ATGGATCACA ACGTCTCTAC CCAGACCAAA GACATTAATG CCTGTGGTGA AATGGCGCGC
ATCCAGATGC AGGAGCTGAT TAAAAACTGC AAAGAATTTG GCGTCGAGCT GTATGACCTG
AATCACCCGT ATCAGGGGAT CGTCCACGTA ATGGGGCCGG AACAGGGCGT CACCTTGCCG
GGGATGACCA TTGTCTGCGG CGACTCGCAT ACCGCCACCC ACGGCGCGTT TGGCGCACTG
GCCTTTGGTA TCGGCACTTC CGAAGTTGAA CACGTACTGG CAACACAAAC CCTGAAACAG
GGCCGTGCGA AGACCATGAA AATTGAAGTC CAGGGCAAAG CCGCGCCGGG CATTACAGCA
AAAGATATCG TGCTGGCAAT TATCGGTAAA ACTGGCAGCG CAGGCGGCAC CGGGCATGTG
GTGGAGTTTT GCGGCGAAGC AATTCGTGAT TTAAGCATGG AAGGTCGTAT GACCCTGTGC
AATATGGCAA TCGAAATGGG CGCAAAAGCC GGTCTGGTTG CACCGGACGA AACCACCTTT
AACTATGTCA AAGGCCGTTT GCATGCGCCG AAAGGCAAAG ATTTCGACGA CGCCGTTGCC
TACTGGAAAA CCCTGCAAAC CGACGAAGGC GCAACTTTCG ATACCGTTGT CACTCTGCAA
GCAGAAGAAA TTTCACCGCA GGTCACCTGG GGCACCAATC TAGGCCAGGT GATTTCCGTG
AACGACAATA TTCCCGATCC GGCTTCGTTT GCCGATCCGG TTGAACGCGC GTCGGCAGAA
AAAGCGCTGG CCTATATGGG GCTGAAACCG GGCATTCCGC TGACCGAAGT GGCTATCGAC
AAAGTGTTTA TCGGTTCCTG TACCAACTCG CGCATTGAAG ATTTACGCGC GGCAGCGGAG
ATCGCCAAAG GGCGAAAAGT CGCGCCAGGC GTGCAGGCAC TGGTGGTTCC CGGCTCTGGC
CCGGTAAAAG CCCAGGCGGA AGCGGAAGGT CTGGATAAAA TCTTTATTGA AGCCGGTTTT
GAATGGCGCT TGCCTGGCTG CTCAATGTGT CTGGCGATGA ACAACGACCG TCTGAATCCG
GGCGAACGTT GTGCATCCAC CAGCAACCGT AACTTTGAAG GCCGCCAGGG GCGCGGCGGG
CGCACGCATC TGGTCAGCCC GGCAATGGCT GCCGCTGCTG CTGTGACCGG ACATTTCGCC
GACATTCGCA ACATTAAATA A
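
The GC content field of this record can be recomputed from the gene sequence above. The following is a minimal Python sketch, assuming the sequence is pasted as shown, in whitespace-separated blocks of ten bases. `gc_content` is an illustrative helper, not part of the database. Only the first line of the sequence is used here for brevity, so the printed value will differ from the whole-gene figure; pass the full sequence to compare against the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only (assumption: used as a short demo input).
first_line = "ATGGCTAAAA CGTTATACGA AAAATTATTC GACGCTCACG TAGTGTACGA AGCCGAAAAC"
print(f"{gc_content(first_line):.0%}")
```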
 
Protein sequence
MAKTLYEKLF DAHVVYEAEN ETPLLYIDRH LVHEVTSPQA FDGLRAHGRP VRQPGKTFAT 
MDHNVSTQTK DINACGEMAR IQMQELIKNC KEFGVELYDL NHPYQGIVHV MGPEQGVTLP
GMTIVCGDSH TATHGAFGAL AFGIGTSEVE HVLATQTLKQ GRAKTMKIEV QGKAAPGITA
KDIVLAIIGK TGSAGGTGHV VEFCGEAIRD LSMEGRMTLC NMAIEMGAKA GLVAPDETTF
NYVKGRLHAP KGKDFDDAVA YWKTLQTDEG ATFDTVVTLQ AEEISPQVTW GTNLGQVISV
NDNIPDPASF ADPVERASAE KALAYMGLKP GIPLTEVAID KVFIGSCTNS RIEDLRAAAE
IAKGRKVAPG VQALVVPGSG PVKAQAEAEG LDKIFIEAGF EWRLPGCSMC LAMNNDRLNP
GERCASTSNR NFEGRQGRGG RTHLVSPAMA AAAAVTGHFA DIRNIK
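
The protein sequence above corresponds to the gene sequence translated under translation table 11. The following is a minimal Python sketch of that translation, assuming the sense-codon assignments of the standard genetic code, which table 11 shares (the two differ in which start codons are permitted). `translate` and `CODON_TABLE` are illustrative names. The sketch does not special-case alternative start codons, which is not a problem here because the gene begins with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, last base varying fastest,
# paired with the standard-code amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCTAAAA CGTTATACGA AAAATTATTC"))  # MAKTLYEKLF
```

Passing the full gene sequence should reproduce the protein sequence listed above, without the spaces between blocks.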