Gene EcolC_0139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0139 
Symbol 
ID6068314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp151095 
End bp152093 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content53% 
IMG OID641599539 
Product2,3-diketo-L-gulonate reductase 
Protein accessionYP_001723148 
Protein GI170018194 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTGA CATTTGAGCA GTTAAAAGCA GCCTTTAATC GGGTCTTAAT TTCACGCGGC 
GTTGACAGCG AAACGGCTGA CGCCTGTGCA GAGATGTTCG CCCGCACCAC CGAATCCGGC
GTTTATTCTC ACGGCGTTAA TCGTTTCCCT CGTTTCATTC AACAACTGGA AAACGGCGAT
ATCATTCCTG ATGCCCAACC CAAACGTATA ACCAGCCTCG GCGCAATTGA ACAGTGGGAC
GCCCAGCGTT CGATCGGTAA CCTGACAGCG AAAAAGATGA TGGATCGCGC CATTGAACTG
GCTGCCGATC ACGGTATTGG TCTGGTGGCA CTACGTAATG CCAACCACTG GATGCGCGGC
GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATTG GCATTTGCTG GACCAACTCC
ATCGCCGTAA TGCCGCCGTG GGGCGCAAAA GAGTGTCGCA TCGGCACTAA CCCGCTGATC
GTCGCCATTC CTTCCACGCC GATCACCATG GTCGATATGT CGATGTCGAT GTTCTCTTAC
GGCATGTTAG AAGTTAACCG TCTGGCAGGT CGTCAGCTCC CGGTCGATGG TGGCTTTGAT
GATGAGGGCA ATTTGACCAA AGAACCTGGC GTTATCGAGA AGAATCGCCG CATTTTGCCG
ATGGGCTACT GGAAAGGTTC TGGCATGTCG ATTGTGCTGG ATATGATCGC TACTCTCCTT
TCCGACGGCG CATCCGTTGC CGAAGTCACC CAGGACAACA GCGACGAATA CGGCATTTCA
CAAATTTTTA TTGCCATTGA AGTGGACAAG CTTATCGACG GTCCCACCCG CGATGCCAAG
CTGCAACGCA TCATGGATTA CGTTACTAGT GCCGAGCGTG CTGACGAAAA TCAGGCCATT
CGCTTACCCG GCCATGAATT TACTACCCTG CTGGCCGAAA ACCGCCGTAA CGGCATCACT
GTTGATGACA GCGTGTGGGC CAAAATCCAG GCGTTATGA
 
Protein sequence
MKVTFEQLKA AFNRVLISRG VDSETADACA EMFARTTESG VYSHGVNRFP RFIQQLENGD 
IIPDAQPKRI TSLGAIEQWD AQRSIGNLTA KKMMDRAIEL AADHGIGLVA LRNANHWMRG
GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY
GMLEVNRLAG RQLPVDGGFD DEGNLTKEPG VIEKNRRILP MGYWKGSGMS IVLDMIATLL
SDGASVAEVT QDNSDEYGIS QIFIAIEVDK LIDGPTRDAK LQRIMDYVTS AERADENQAI
RLPGHEFTTL LAENRRNGIT VDDSVWAKIQ AL