Gene EcolC_4228 details


Gene Information

Locus tag: EcolC_4228
Symbol:
ID: 6067837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4670539
End bp: 4672014
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 54%
IMG OID: 641603659
Product: ketol-acid reductoisomerase
Protein accession: YP_001727151
Protein GI: 170022197
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0059] Ketol-acid reductoisomerase
TIGRFAM ID: [TIGR00465] ketol-acid reductoisomerase
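
The coordinate and length fields above are consistent with one another, and the check can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the length
    includes one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4670539, 4672014)
print(length)                           # 1476
print(expected_protein_length(length))  # 491
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of the record.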


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAACT ACTTCAATAC ACTGAATCTG CGCCAGCAGC TGGCACAGCT GGGCAAATGT 
CGCTTTATGG GCCGCGATGA ATTCGCCGAT GGCGCGAGCT ACCTTCAGGG TAAAAAAGTA
GTCATCGTCG GCTGTGGCGC ACAGGGTCTG AACCAGGGCC TGAACATGCG TGATTCTGGT
CTCGATATCT CCTACGCTCT GCGTAAAGAA GCGATTGCCG AGAAGCGCGC GTCCTGGCGT
AAAGCGACCG AAAATGGTTT TAAAGTGGGT ACTTACGAAG AACTGATCCC ACAGGCGGAT
CTGGTGATTA ACCTGACGCC GGACAAGCAG CACTCTGATG TAGTGCGCAC CGTACAGCCA
CTGATGAAAG ACGGCGCGGC GCTGGGCTAC TCGCACGGTT TCAACATCGT CGAAGTGGGC
GAGCAGATCC GTAAAGATAT CACCGTAGTG ATGGTTGCGC CGAAATGCCC AGGCACCGAA
GTGCGTGAAG AGTACAAACG TGGGTTCGGC GTACCGACGC TGATTGCCGT TCACCCGGAA
AACGATCCGA AAGGCGAAGG CATGGCGATT GCCAAAGCCT GGGCGGCTGC AACCGGTGGT
CACCGTGCGG GTGTGCTGGA ATCGTCCTTC GTTGCGGAAG TGAAATCTGA CCTGATGGGC
GAGCAAACCA TCCTGTGCGG TATGTTGCAG GCTGGCTCTC TGCTGTGCTT CGACAAGCTG
GTGGAAGAAG GTACCGATCC AGCATACGCA GAAAAACTGA TTCAGTTCGG TTGGGAAACC
ATCACCGAAG CACTGAAACA GGGCGGCATC ACCCTGATGA TGGACCGTCT CTCTAACCCG
GCGAAACTGC GTGCTTATGC GCTTTCTGAA CAGCTGAAAG AGATCATGGC ACCCCTGTTC
CAGAAACATA TGGACGACAT CATCTCCGGC GAATTCTCTT CCGGTATGAT GGCGGACTGG
GCCAACGATG ATAAGAAACT GCTGACCTGG CGTGAAGAGA CCGGCAAAAC CGCGTTTGAA
ACCGCGCCGC AGTATGAAGG CAAAATCGGC GAGCAGGAGT ACTTCGATAA AGGCGTACTG
ATGATCGCGA TGGTGAAAGC GGGCGTTGAA CTGGCGTTCG AAACCATGGT CGATTCCGGC
ATCATTGAAG AGTCTGCATA TTATGAATCA CTGCACGAGC TGCCGCTGAT TGCCAACACC
ATCGCCCGTA AGCGTCTGTA CGAAATGAAC GTGGTTATCT CTGATACCGC TGAGTACGGT
AACTATCTGT TCTCTTACGC TTGTGTGCCG TTGCTGAAAC CGTTTATGGC AGAGCTGCAA
CCGGGCGACC TGGGTAAAGC TATTCCGGAA GGCGCGGTAG ATAACGGGCA ACTGCGTGAT
GTGAACGAAG CGATTCGCAG CCATGCGATT GAGCAGGTAG GTAAGAAACT GCGCGGCTAT
ATGACAGATA TGAAACGTAT TGCTGTTGCG GGTTAA
 
Protein sequence
MANYFNTLNL RQQLAQLGKC RFMGRDEFAD GASYLQGKKV VIVGCGAQGL NQGLNMRDSG 
LDISYALRKE AIAEKRASWR KATENGFKVG TYEELIPQAD LVINLTPDKQ HSDVVRTVQP
LMKDGAALGY SHGFNIVEVG EQIRKDITVV MVAPKCPGTE VREEYKRGFG VPTLIAVHPE
NDPKGEGMAI AKAWAAATGG HRAGVLESSF VAEVKSDLMG EQTILCGMLQ AGSLLCFDKL
VEEGTDPAYA EKLIQFGWET ITEALKQGGI TLMMDRLSNP AKLRAYALSE QLKEIMAPLF
QKHMDDIISG EFSSGMMADW ANDDKKLLTW REETGKTAFE TAPQYEGKIG EQEYFDKGVL
MIAMVKAGVE LAFETMVDSG IIEESAYYES LHELPLIANT IARKRLYEMN VVISDTAEYG
NYLFSYACVP LLKPFMAELQ PGDLGKAIPE GAVDNGQLRD VNEAIRSHAI EQVGKKLRGY
MTDMKRIAVA G
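
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal Python sketch of how the blocked gene sequence could be cleaned, its GC content computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon assignments are the standard ones, which translation table 11 shares; the sketch does not handle the alternative start codons that table 11 allows, which is not a problem for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), built in
# TCAG order: the first base varies slowest, the third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first two blocks of the gene sequence give the start of the protein.
print(translate("ATGGCTAACT ACTTCAATAC"))  # MANYFN
```

Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence entries of the record to be cross-checked in the same way.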