Gene EcolC_3585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3585 
Symbol 
ID6066420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3918717 
End bp3920117 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content56% 
IMG OID641603002 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001726526 
Protein GI170021572 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000478946 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTAAAA CGTTATACGA AAAATTGTTC GACGCTCACG TTGTGTACGA AGCCGAAAAC 
GAAACCCCAC TGTTATATAT CGACCGCCAC CTGGTGCATG AAGTGACCTC ACCGCAGGCG
TTCGATGGTC TGCGCGCCCA CGGTCGCCCG GTACGTCAGC CGGGCAAAAC CTTCGCTACC
ATGGATCACA ACGTCTCTAC CCAGACCAAA GACATTAATG CCTGCGGTGA AATGGCGCGT
ATCCAGATGC AGGAACTGAT CAAAAACTGC AAAGAATTTG GCGTCGAACT GTATGACCTG
AATCACCCGT ATCAGGGGAT CGTCCACGTA ATGGGGCCGG AACAGGGCGT CACCTTGCCG
GGGATGACCA TTGTCTGCGG CGACTCGCAT ACCGCCACCC ACGGCGCGTT TGGCGCACTG
GCCTTTGGTA TCGGCACTTC CGAAGTTGAA CACGTACTGG CAACGCAAAC CCTGAAACAG
GGCCGCGCAA AAACCATGAA AATTGAAGTC CAGGGCAAAG CCGCGCCGGG CATTACCGCA
AAAGATATCG TGCTGGCAAT TATCGGTAAA ACCGGTAGCG CAGGCGGCAC CGGGCATGTG
GTGGAGTTTT GCGGCGAAGC AATCCGTGAT TTAAGCATGG AAGGTCGTAT GACCCTGTGC
AATATGGCAA TCGAAATGGG CGCAAAAGCC GGTCTGGTTG CACCGGACGA AACCACCTTT
AACTATGTCA AAGGCCGTCT GCATGCGCCG AAAGGCAAAG ATTTCGACGA CGCCATTGCC
TACTGGAAAA CCCTGCAAAC CGACGAAGGC GCAACTTTCG ATACCGTTGT CACTCTGCAA
GCAGAAGAAA TTTCACCGCA GGTCACCTGG GGCACCAATC CCGGCCAGGT GATTTCCGTG
AACGACAATA TTCCCGATCC GGCTTCGTTT GCCGATCCGG TTGAACGTGC GTCGGCAGAA
AAAGCGCTGG CCTATATGGG GCTGAAACCG GGTATTCCGC TGACCGAAGT GGCTATCGAC
AAAGTGTTTA TCGGTTCCTG TACCAACTCG CGCATTGAAG ATTTACGCGC GGCAGCGGAG
ATCGCCAAAG GGCGAAAAGT CGCGCCAGGC GTGCAGGCAC TGGTGGTTCC CGGCTCTGGC
CCGGTAAAAG CCCAGGCGGA AGCGGAAGGT CTGGATAAAA TCTTTATTGA AGCCGGTTTT
GAATGGCGCT TACCTGGCTG CTCAATGTGT CTGGCGATGA ACAACGACCG GCTGAATCCG
GGCGAACGTT GTGCCTCCAC CAGCAACCGT AACTTTGAAG GCCGCCAGGG GCGCGGCGGG
CGCACGCATC TGGTCAGCCC GGCAATGGCT GCCGCCGCGG CTGTGACCGG CCATTTCGCC
GACATTCGCA ACATTAAATA A
 
Protein sequence
MAKTLYEKLF DAHVVYEAEN ETPLLYIDRH LVHEVTSPQA FDGLRAHGRP VRQPGKTFAT 
MDHNVSTQTK DINACGEMAR IQMQELIKNC KEFGVELYDL NHPYQGIVHV MGPEQGVTLP
GMTIVCGDSH TATHGAFGAL AFGIGTSEVE HVLATQTLKQ GRAKTMKIEV QGKAAPGITA
KDIVLAIIGK TGSAGGTGHV VEFCGEAIRD LSMEGRMTLC NMAIEMGAKA GLVAPDETTF
NYVKGRLHAP KGKDFDDAIA YWKTLQTDEG ATFDTVVTLQ AEEISPQVTW GTNPGQVISV
NDNIPDPASF ADPVERASAE KALAYMGLKP GIPLTEVAID KVFIGSCTNS RIEDLRAAAE
IAKGRKVAPG VQALVVPGSG PVKAQAEAEG LDKIFIEAGF EWRLPGCSMC LAMNNDRLNP
GERCASTSNR NFEGRQGRGG RTHLVSPAMA AAAAVTGHFA DIRNIK