Gene EcHS_A3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3985 
SymbolilvG 
ID5591168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3978971 
End bp3980617 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content54% 
IMG OID640923090 
Productacetolactate synthase 2 catalytic subunit 
Protein accessionYP_001460561 
Protein GI157163243 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGCG CACAGTGGGT GGTACATGCG TTGCGGGCAC AGGGTGTGAA TACCGTTTTC 
GGTTATCCGG GTGGCGCAAT TATGCCGGTT TACGATGCAT TGTATGACGG CGGCGTGGAG
CACTTGCTGT GCCGACATGA ACAGGGTGCG GCAATGGCGG CTATCGGTTA TGCCCGTGCT
ACTGGCAAAA CTGGCGTATG TATCGCCACG TCTGGTCCGG GCGCAACCAA CCTGATAACC
GGGCTTGCGG ACGCACTGTT AGATTCCATC CCCGTTGTTG CCATCACCGG TCAAGTGTCC
GCACCGTTTA TCGGCACGGA CGCATTTCAG GAAGTGGATG TCCTGGGATT GTCGCTAGCC
TGTACCAAGC ACAGCTTCCT GGTGCAGTCG CTGGAAGAGT TGCCGCGCAT CATGGCTGAA
GCATTCGACG TTGCCAGCTC AGGTCGTCCT GGTCCGGTTC TGGTCGATAT CCCAAAAGAT
ATCCAATTAG CCAGCGGCGA CCTGGAACCG TGGTTCACCA CCGTTGAAAA CGAAGTGACT
TTCCCACATG CCGAAGTCGA GCAAGCGCGC CAGATGCTGG CAAAAGCGCA AAAACCGATG
CTGTACGTTG GTGGTGGCGT GGGTATGGCG CAGGCAGTTC CTGCTTTACG AGAATTTCTC
GCTACCACAA AAATGCCTGC CACCTGCACG CTGAAAGGGC TGGGCGCAGT TGAAGCAGAT
TATCCGTACT ATCTGGGCAT GCTGGGAATG CATGGCACCA AAGCGGCGAA CTTCGCGGTG
CAGGAGTGCG ACTTGCTGAT CGCCGTGGGT GCACGTTTTG ATGACCGGGT GACCGGCAAA
CTGAACACCT TCGCACCACA CGCCAGTGTT ATCCATATGG ATATCGACCC GGCAGAAATG
AACAAGCTGC GTCAGGCACA TGTGGCATTA CAAGGTGATT TAAATGCTCT GTTACCAGCA
TTACAGCAGC CGTTAAATAT CAATGACTGG CAGCAACACT GCGCGCAGCT GCGTGATGAA
CATGCCTGGC GTTACGACCA TCCCGGTGAC GCTATCTACG CGCCGTTGTT GTTAAAACAA
CTGTCGGATC GTAAACCTGC GGATTGCGTC GTGACCACAG ATGTGGGGCA GCACCAGATG
TGGGCTGCGC AGCACATCGC CCACACTCGC CCGGAAAATT TCATCACCTC CAGCGGCTTA
GGTACCATGG GTTTTGGTTT ACCGGCGGCG GTTGGCGCAC AAGTCGCGCG ACCGAACGAT
ACCGTTGTCT GTATCTCCGG TGACGGCTCT TTCATGATGA ATGTGCAAGA GCTGGGCACC
GTAAAACGCA AGCAGTTACC GTTGAAAATC GTCTTACTCG ATAACCAACG GTTAGGGATG
GTTCGACAAT GGCAGCAACT GTTTTTTCAG GAACGATACA GCGAAACCAC CCTTACTGAT
AACCCCGATT TCCTCATGTT AGCCAGCGCC TTCGGCATCC CTGGCCAACA CATCACCCGT
AAAGACCAGG TTGAAGCGGC ACTCAACACC ATGCTGAACA GTGATGGGCC ATACCTGCTT
CATGTCTCAA TCGACGAACT TGAGAACGTC TGGCCGCTGG TGCCGCCAGG TGCCAGTAAT
TCAGAAATGT TGGAGAAATT ATCATGA
 
Protein sequence
MNGAQWVVHA LRAQGVNTVF GYPGGAIMPV YDALYDGGVE HLLCRHEQGA AMAAIGYARA 
TGKTGVCIAT SGPGATNLIT GLADALLDSI PVVAITGQVS APFIGTDAFQ EVDVLGLSLA
CTKHSFLVQS LEELPRIMAE AFDVASSGRP GPVLVDIPKD IQLASGDLEP WFTTVENEVT
FPHAEVEQAR QMLAKAQKPM LYVGGGVGMA QAVPALREFL ATTKMPATCT LKGLGAVEAD
YPYYLGMLGM HGTKAANFAV QECDLLIAVG ARFDDRVTGK LNTFAPHASV IHMDIDPAEM
NKLRQAHVAL QGDLNALLPA LQQPLNINDW QQHCAQLRDE HAWRYDHPGD AIYAPLLLKQ
LSDRKPADCV VTTDVGQHQM WAAQHIAHTR PENFITSSGL GTMGFGLPAA VGAQVARPND
TVVCISGDGS FMMNVQELGT VKRKQLPLKI VLLDNQRLGM VRQWQQLFFQ ERYSETTLTD
NPDFLMLASA FGIPGQHITR KDQVEAALNT MLNSDGPYLL HVSIDELENV WPLVPPGASN
SEMLEKLS