Gene EcHS_A0975 details


Gene Information

Locus tag: EcHS_A0975
Symbol: poxB
ID: 5593155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 969566
End bp: 971284
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 53%
IMG OID: 640920146
Product: pyruvate dehydrogenase
Protein accession: YP_001457712
Protein GI: 157160394
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
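
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 969566
end_bp = 971284

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# The codon count includes the terminal stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields.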


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAA CGGTTGCAGC TTATATCGCC AAAACACTCG AATCGGCAGG GGTGAAACGC 
ATCTGGGGAG TCACAGGCGA CTCTCTGAAC GGTCTTAGTG ACAGTCTTAA TCGCATGGGC
ACCATCGAGT GGATGTCCAC CCGCCACGAA GAAGTGGCGG CCTTTGCCGC TGGCGCTGAA
GCACAACTTA GCGGAGAACT GGCGGTCTGT GCCGGATCGT GCGGCCCCGG CAACCTGCAC
TTAATCAACG GCCTGTTCGA TTGCCACCGC AATCACGTTC CGGTACTGGC GATTGCCGCT
CATATTCCCT CCAGCGAAAT TGGCAGCGGC TATTTCCAGG AAACCCACCC ACAAGAGCTA
TTCCGCGAAT GTAGTCACTA TTGCGAGCTG GTTTCCAGCC CGGAGCAGAT CCCACAAGTG
CTGGCAATTG CTATGCGCAA AGCAGTGCTT AACCGTGGCG TTTCCGTTGT TGTGTTACCG
GGTGACGTGG CGTTAAAACC TGCGCCAGAA GGAGCAACTA CCCACTGGTA TCATGCGCCA
CAGCCAGTTG TGACGCCGGA AGAAGAAGAG TTACGCAAAC TGGCGCAACT GCTGCGTTAT
TCCAGCAATA TCGCCCTGAT GTGTGGCAGC GGCTGCGCGG GGGCGCATAA AGAGTTAGTT
GAGTTTGCCG GGAAAATTAA AGCACCTATT GTTCATGCCC TGCGCGGTAA AGAACATGTC
GAATACGATA ATCCGTATGA TGTTGGAATG ACCGGGTTAA TCGGCTTCTC GTCAGGTTTT
CACACTATGA TGAATGCCGA TACGTTAGTG CTGCTCGGCA CGCAATTTCC CTACCGCGCC
TTCTACCCGA CCGATGCCAA AATCATTCAG ATTGATATCA ATCCAGCCAG CATCGGCGCG
CATAGCAAGG TAGATATGGC GCTGGTCGGC GATATCAAAT CAACCCTGCG TGCATTGCTG
CCACTGGTGG AAGAAAAAGC CGATCGCAAA TTTCTGGATA AAGCGCTGGA AGATTACCGC
GATGCCCGCA AAGGGCTGGA CGATTTAGCT AAACCGAGCG AGAAAGCCAT TCACCCGCAA
TATCTGGCGC AGCAAATTAG TCATTTTGCC GCCGATGACG CAATCTTTAC CTGTGACGTC
GGCACGCCAA CAGTATGGGC AGCGCGTTAT CTAAAAATGA ACGGCAAGCG TCGCCTGTTA
GGTTCGTTTA ACCACGGTTC GATGGCTAAC GCCATGCCGC AGGCGCTGGG TGCGCAGGCG
ACAGAGCCAG AACGTCAGGT GGTCGCCATG TGCGGCGATG GCGGTTTTAG CATGTTGATG
GGCGATTTCC TCTCAGTAGT GCAGATGAAA CTGCCAGTGA AAATTGTCGT CTTTAACAAC
AGCGTGCTGG GCTTTGTGGC GATGGAGATG AAAGCTGGTG GCTATTTGAC TGACGGCACC
GAACTACACG ACACAAACTT TGCCCGCATT GCCGAAGCGT GCGGCATTAC GGGTATCCGT
GTAGAAAAAG CGTCTGAAGT TGATGAAGCC CTGCAACGCG CCTTCTCCAT CGACGGTCCG
GTGCTGGTGG ATGTGGTGGT CGCCAAAGAA GAGTTAGCCA TTCCACCGCA GATCAAACTC
GAACAGGCCA AAGGATTTAG CTTGTATATG TTGCGCGCAA TCATCAGCGG ACGCGGCGAT
GAAGTGATCG AACTGGCGAA AACGAACTGG CTAAGGTAA
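
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any length or composition calculation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input, so the printed GC value describes that row rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, as printed in the record.
first_row = "ATGAAACAAA CGGTTGCAGC TTATATCGCC AAAACACTCG AATCGGCAGG GGTGAAACGC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence text to the same two functions gives values that can be compared with the Gene Length and GC content fields.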
 
Protein sequence
MKQTVAAYIA KTLESAGVKR IWGVTGDSLN GLSDSLNRMG TIEWMSTRHE EVAAFAAGAE 
AQLSGELAVC AGSCGPGNLH LINGLFDCHR NHVPVLAIAA HIPSSEIGSG YFQETHPQEL
FRECSHYCEL VSSPEQIPQV LAIAMRKAVL NRGVSVVVLP GDVALKPAPE GATTHWYHAP
QPVVTPEEEE LRKLAQLLRY SSNIALMCGS GCAGAHKELV EFAGKIKAPI VHALRGKEHV
EYDNPYDVGM TGLIGFSSGF HTMMNADTLV LLGTQFPYRA FYPTDAKIIQ IDINPASIGA
HSKVDMALVG DIKSTLRALL PLVEEKADRK FLDKALEDYR DARKGLDDLA KPSEKAIHPQ
YLAQQISHFA ADDAIFTCDV GTPTVWAARY LKMNGKRRLL GSFNHGSMAN AMPQALGAQA
TEPERQVVAM CGDGGFSMLM GDFLSVVQMK LPVKIVVFNN SVLGFVAMEM KAGGYLTDGT
ELHDTNFARI AEACGITGIR VEKASEVDEA LQRAFSIDGP VLVDVVVAKE ELAIPPQIKL
EQAKGFSLYM LRAIISGRGD EVIELAKTNW LR
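
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a rough sketch, not the database's own method: it builds the codon table from the NCBI-style amino acid string for base order TCAG, which gives the same amino acid assignments for translation table 11 as for the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The function name `translate` is hypothetical, and the code assumes the input contains only A, C, G, and T.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
print(translate("ATGAAACAAA CGGTTGCAGC TTATATCGCC"))
print(translate("CTAAGGTAA"))
```

The two sample calls cover the start and end of the gene; their output can be compared with the first ten and last two residues of the protein sequence.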