Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0078 |
Symbol | leuB |
ID | 6146489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 87501 |
End bp | 88592 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641614979 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_001742195 |
Protein GI | 170684236 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR00169] 3-isopropylmalate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.656199 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAGA ATTACCATAT TGCCGTATTG CCGGGGGACG GTATTGGTCC GGAAGTGATG ACCCAGGCGC TGAAAGTGCT GGATGCCGTG CGCAACCGCT TTGCGATGCG TATCACCACC AGCCATTACG ATGTAGGCGG CGCAGCCATT GATAACCACG GGCAGCCGCT GCCGCCTGCG ACGGTTGAAG GTTGTGAGCA AGCCGATGCC GTGCTGTTTG GCTCGGTAGG TGGTCCGAAG TGGGAACATT TGCCGCCAGA CCAGCAACCA GAACGCGGCG CACTGCTGCC TTTGCGTAAG CACTTCAAAT TATTCAGCAA TCTGCGTCCG GCAAAACTGT ATCAGGGGCT GGAAGCATTC TGTCCGCTGC GTGCTGACAT TGCCGCAAAC GGCTTTGACA TCCTGTGTGT GCGCGAACTG ACCGGCGGTA TCTATTTCGG TCAGCCAAAA GGCCGCGAAG GTAGCGGACA ATATGAAAAA GCGTTTGATA CCGAGGTGTA CCACCGTTTT GAGATCGAGC GCATTGCCCG CATCGCATTT GAATCTGCCC GCAAGCGTCG CCACAAAGTC ACCTCAATCG ACAAAGCCAA CGTGCTGCAA TCCTCTATTT TATGGCGGGA GATCGTTAAC GAGATCGCCA CGGAATACCC GGATGTCGAA CTGGCGCATA TGTACATCGA CAACGCCACC ATGCAGCTGA TTAAAGATCC ATCACAGTTT GACGTCTTGC TGTGCTCCAA CCTGTTTGGC GACATTCTGT CTGATGAGTG CGCAATGATC ACTGGCTCAA TGGGGATGTT GCCTTCCGCC AGCCTGAACG AGCAAGGTTT TGGTCTGTAT GAACCGGCGG GCGGCTCGGC ACCCGATATC GCAGGTAAAA ATATCGCCAA CCCGATTGCA CAAATTCTGT CGCTGGCGCT GCTGCTGCGT TACAGCCTGG ATGCCGATGA TGCGGCTTCC GCCATTGAAC GCGCCATTAA CCGCGCATTA GAAGAAGGCA TTCGCACCGG GGATTTAGCC CGTGGCGCTG CCGCCGTTAG TACCAATGAA ATGGGCGATA TCATTGCCCG CTATGTGGTA GAAGGGGTGT AA
|
Protein sequence | MSKNYHIAVL PGDGIGPEVM TQALKVLDAV RNRFAMRITT SHYDVGGAAI DNHGQPLPPA TVEGCEQADA VLFGSVGGPK WEHLPPDQQP ERGALLPLRK HFKLFSNLRP AKLYQGLEAF CPLRADIAAN GFDILCVREL TGGIYFGQPK GREGSGQYEK AFDTEVYHRF EIERIARIAF ESARKRRHKV TSIDKANVLQ SSILWREIVN EIATEYPDVE LAHMYIDNAT MQLIKDPSQF DVLLCSNLFG DILSDECAMI TGSMGMLPSA SLNEQGFGLY EPAGGSAPDI AGKNIANPIA QILSLALLLR YSLDADDAAS AIERAINRAL EEGIRTGDLA RGAAAVSTNE MGDIIARYVV EGV
|
| |