Gene EcolC_4234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4234 
Symbol 
ID6067862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4677780 
End bp4679426 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content54% 
IMG OID641603665 
Productacetolactate synthase 2 catalytic subunit 
Protein accessionYP_001727157 
Protein GI170022203 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.194744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.310929 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGCG CACAGTGGGT GGTACATGCG TTGCGGGCAC AGGGTGTGAA TACCGTTTTC 
GGTTATCCGG GTGGCGCAAT TATGCCGGTT TACGATGCAT TGTATGACGG CGGCGTGGAG
CACTTGCTGT GCCGACATGA ACAGGGTGCG GCAATGGCGG CTATCGGTTA TGCCCGTGCT
ACTGGCAAAA CTGGCGTATG TATCGCCACG TCTGGTCCGG GCGCAACCAA CCTGATAACC
GGGCTTGCGG ACGCACTGTT AGATTCCATC CCCGTTGTTG CCATCACCGG TCAAGTGTCC
GCACCGTTTA TCGGCACGGA CGCATTTCAG GAAGTGGATG TCCTGGGATT GTCGCTAGCC
TGTACCAAGC ACAGCTTCCT GGTGCAGTCG CTGGAAGAGT TGCCGCGCAT CATGGCTGAA
GCATTCGACG TTGCCAGCTC AGGTCGTCCT GGTCCGGTTC TGGTCGATAT CCCAAAAGAT
ATCCAATTAG CCAGCGGCGA CCTGGAACCG TGGTTCACCA CCGTTGAAAA CGAAGTGACT
TTCCCACATG CCGAAGTCGA GCAAGCGCGC CAGATGCTGG CAAAAGCGCA AAAACCGATG
CTGTACGTTG GTGGTGGCGT GGGTATGGCG CAGGCAGTTC CTGCTTTACG AGAATTTCTC
GCTACCACAA AAATGCCTGC CACCTGCACG CTGAAAGGGC TGGGCGCAGT TGAAGCAGAT
TATCCGTACT ATCTGGGCAT GCTGGGAATG CATGGCACCA AAGCGGCGAA CTTCGCGGTG
CAGGAGTGCG ACTTGCTGAT CGCCGTGGGT GCACGTTTTG ATGACCGGGT GACCGGCAAA
CTGAACACCT TCGCACCACA CGCCAGTGTT ATCCATATGG ATATCGACCC GGCAGAAATG
AACAAGCTGC GTCAGGCACA TGTGGCATTA CAAGGTGATT TAAATGCTCT GTTACCAGCA
TTACAGCAGC CGTTAAATAT CAATGACTGG CAGCAACACT GCGCGCAGCT GCGTGATGAA
CATGCCTGGC GTTACGACCA TCCCGGTGAC GCTATCTACG CGCCGTTGTT GTTAAAACAA
CTGTCGGATC GTAAACCTGC GGATTGCGTC GTGACCACAG ATGTGGGGCA GCACCAGATG
TGGGCTGCGC AGCACATCGC CCACACTCGC CCGGAAAATT TCATCACCTC CAGCGGCTTA
GGTACCATGG GTTTTGGTTT ACCGGCGGCG GTTGGCGCAC AAGTCGCGCG ACCGAACGAT
ACCGTTGTCT GTATCTCCGG TGACGGCTCT TTCATGATGA ATGTGCAAGA GCTGGGCACC
GTAAAACGCA AGCAGTTACC GTTGAAAATC GTCTTACTCG ATAACCAACG GTTAGGGATG
GTTCGACAAT GGCAGCAACT GTTTTTTCAG GAACGATACA GCGAAACCAC CCTTACTGAT
AACCCCGATT TCCTCATGTT AGCCAGCGCC TTCGGCATCC CTGGCCAACA CATCACCCGT
AAAGACCAGG TTGAAGCGGC ACTCAACACC ATGCTGAACA GTGATGGGCC ATACCTGCTT
CATGTCTCAA TCGACGAACT TGAGAACGTC TGGCCGCTGG TGCCGCCAGG TGCCAGTAAT
TCAGAAATGT TGGAGAAATT ATCATGA
 
Protein sequence
MNGAQWVVHA LRAQGVNTVF GYPGGAIMPV YDALYDGGVE HLLCRHEQGA AMAAIGYARA 
TGKTGVCIAT SGPGATNLIT GLADALLDSI PVVAITGQVS APFIGTDAFQ EVDVLGLSLA
CTKHSFLVQS LEELPRIMAE AFDVASSGRP GPVLVDIPKD IQLASGDLEP WFTTVENEVT
FPHAEVEQAR QMLAKAQKPM LYVGGGVGMA QAVPALREFL ATTKMPATCT LKGLGAVEAD
YPYYLGMLGM HGTKAANFAV QECDLLIAVG ARFDDRVTGK LNTFAPHASV IHMDIDPAEM
NKLRQAHVAL QGDLNALLPA LQQPLNINDW QQHCAQLRDE HAWRYDHPGD AIYAPLLLKQ
LSDRKPADCV VTTDVGQHQM WAAQHIAHTR PENFITSSGL GTMGFGLPAA VGAQVARPND
TVVCISGDGS FMMNVQELGT VKRKQLPLKI VLLDNQRLGM VRQWQQLFFQ ERYSETTLTD
NPDFLMLASA FGIPGQHITR KDQVEAALNT MLNSDGPYLL HVSIDELENV WPLVPPGASN
SEMLEKLS