Gene EcSMS35_0077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0077 
SymbolleuC 
ID6147182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp86098 
End bp87498 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content56% 
IMG OID641614978 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001742194 
Protein GI170682290 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.387894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAA CGTTATACGA AAAATTGTTC GACGCTCACG TAGTGTACGA AGCCGAAAAC 
GAAACCCCGC TGTTATATAT CGACCGCCAC CTGGTGCATG AAGTGACCTC ACCGCAGGCG
TTCGATGGTC TGCGCGCCCA CGGTCGCCCG GTACGTCAGC CGGGCAAAAC CTTCGCCACC
ATGGATCACA ACGTCTCTAC TCAGACCAAA GACATTAATG CCTGCGGTGA AATGGCGCGC
ATCCAGATGC AGGAGCTGAT CAAAAACTGC AAAGAATTTG GCGTCGAGCT GTATGACCTG
AATCACCCGT ATCAGGGGAT CGTCCACGTA ATGGGGCCGG AACAGGGCGT CACATTGCCG
GGGATGACCA TTGTCTGCGG CGACTCACAT ACCGCCACCC ACGGCGCGTT TGGCGCACTG
GCCTTTGGTA TCGGCACTTC CGAAGTTGAA CACGTACTGG CAACGCAAAC CCTGAAACAG
GGTCGTGCGA AGACCATGAA AATTGAAGTC CAGGGCAAAG CCGCGCCGGG CATTACCGCG
AAAGATATCG TGCTGGCGAT TATCGGTAAA ACCGGTAGCG CAGGCGGCAC CGGGCATGTG
GTGGAGTTTT GCGGCGAAGC AATCCGTGAT TTAAGCATGG AAGGTCGTAT GACCCTGTGC
AATATGGCAA TCGAAATGGG CGCGAAAGCC GGTCTGGTTG CACCGGACGA AACTACCTTT
AACTATGTCA AAGGCCGTCT GCATGCGCCG AAAGGCAAAG ATTTCGACGA CGCCGTCGCC
TACTGGAAAA CCCTGCAAAC CGACGAAGGC GCAACTTTCG ATACCGTTGT CACTCTGCAA
GCAGAAGAGA TTTCGCCGCA GGTCACCTGG GGCACCAATC CAGGCCAGGT GATTTCCGTG
AACGACAATA TTCCCGATCC AGCTTCGTTT GCCGATCCGG TTGAACGCGC GTCGGCAGAA
AAAGCGCTGG CCTATATGGG GCTGAAACCG GGTATTCCGC TGACCGAAGT GGCTATCGAT
AAAGTGTTTA TCGGTTCCTG TACCAACTCG CGCATTGAAG ATTTACGCGC GGCAGCGGAA
ATCGCCAAAG GGCGAAAAGT CGCGCCAGGC GTGCAGGCAC TGGTGGTTCC CGGCTCTGGC
CCGGTAAAAG CCCAGGCGGA AGCGGAAGGT CTGGATAAAA TCTTTATTGA AGCCGGTTTT
GAATGGCGCT TGCCTGGCTG CTCAATGTGT CTGGCGATGA ATAACGACCG TCTGAATCCG
GGCGAACGTT GTGCCTCCAC CAGTAACCGT AACTTTGAAG GCCGCCAGGG GCGCGGCGGG
CGCACGCATC TGGTCAGCCC GGCTATGGCT GCCGCCGCGG CTGTGACCGG ACATTTTGCC
GACATTCGCA ACATTAAATA A
 
Protein sequence
MAKTLYEKLF DAHVVYEAEN ETPLLYIDRH LVHEVTSPQA FDGLRAHGRP VRQPGKTFAT 
MDHNVSTQTK DINACGEMAR IQMQELIKNC KEFGVELYDL NHPYQGIVHV MGPEQGVTLP
GMTIVCGDSH TATHGAFGAL AFGIGTSEVE HVLATQTLKQ GRAKTMKIEV QGKAAPGITA
KDIVLAIIGK TGSAGGTGHV VEFCGEAIRD LSMEGRMTLC NMAIEMGAKA GLVAPDETTF
NYVKGRLHAP KGKDFDDAVA YWKTLQTDEG ATFDTVVTLQ AEEISPQVTW GTNPGQVISV
NDNIPDPASF ADPVERASAE KALAYMGLKP GIPLTEVAID KVFIGSCTNS RIEDLRAAAE
IAKGRKVAPG VQALVVPGSG PVKAQAEAEG LDKIFIEAGF EWRLPGCSMC LAMNNDRLNP
GERCASTSNR NFEGRQGRGG RTHLVSPAMA AAAAVTGHFA DIRNIK