Gene EcolC_1132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1132 
Symbol 
ID6068026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1237162 
End bp1238223 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content55% 
IMG OID641600548 
Productalcohol dehydrogenase 
Protein accessionYP_001724126 
Protein GI170019172 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGA TGCTGGCAGC TTATTTACCA GGAAATTCGA CCGTCGATCT GCGGGAAGTT 
GCGGTGCCGA CGCCGGGGAT TAACCAGGTA CTGATCAAAA TGAAATCCTC CGGGATTTGC
GGAAGCGATG TCCACTATAT CTATCATCAA CACCGTGCCA CAGCGGCGGC ACCCGATAAA
CCGTTATACC AGGGCTTTAT CAACGGTCAT GAACCGTGCG GGCAGATTGT GGCGATGGGG
CAAGGCTGCC GCCATTTTAA AGAGGGCGAC CGCGTGCTGG TGTATCACAT TTCTGGCTGT
GGTTTTTGCC CGAACTGCCG CCGCGGCTTT CCTATCTCTT GTACTGGCAA AGGAAAAGCG
GCTTACGGCT GGCAGCGTGA CGGCGGTCAT GCCGAATATC TGCTGGCGGA AGAAAAAGAT
CTGATCCTCC TGCCGGATGC GCTGAGCTAC GAAGATGGTG CGTTTATCAG TTGCGGCGTT
GGCACGGCCT ATGAAGGGAT TTTGCGCGGC GAAGTTTCCG GCAGCGATAA CGTGCTGGTG
GTCGGTCTGG GGCCGGTCGG CATGATGGCG ATGATGCTGG CGAAAGGTCG CGGTGCAAAA
CGGATCATCG GCGTTGATAT GCTGCCGGAA CGTCTGGCGA TGGCAAAACA GTTAGGGGTG
ATGGATCACG GCTATTTAGC GACCACCGAA GGTCTGCCGC AGATAATTGC TGAACTTACC
CACGGTGGCG CGGATGTTGC GCTCGATTGT TCCGGTAATG CCGCAGGTCG CTTGCTGGCA
CTGCAATCCA CTGCTGACTG GGGGCGGGTG GTTTACATTG GTGAAACCGG AAAAGTGGAA
TTCGAGGTCA GCGCCGATCT GATGCACCAT CAACGGCGGA TTATTGGCTC CTGGGTGACC
AGTCTGTTCC ATATGGAAAA ATGCGCCCAC GATTTAACGG ACTGGAAACT GTGGCCGCGT
AACGCCATTA CCCATCGCTT CTCGCTGGAA CAGGCAGGTG ATGCCTATGC GCTGATGGCG
AGCGGCAAAT GCGGGAAAGT TGTGATTAAC TTCCCGGATT AA
 
Protein sequence
MKTMLAAYLP GNSTVDLREV AVPTPGINQV LIKMKSSGIC GSDVHYIYHQ HRATAAAPDK 
PLYQGFINGH EPCGQIVAMG QGCRHFKEGD RVLVYHISGC GFCPNCRRGF PISCTGKGKA
AYGWQRDGGH AEYLLAEEKD LILLPDALSY EDGAFISCGV GTAYEGILRG EVSGSDNVLV
VGLGPVGMMA MMLAKGRGAK RIIGVDMLPE RLAMAKQLGV MDHGYLATTE GLPQIIAELT
HGGADVALDC SGNAAGRLLA LQSTADWGRV VYIGETGKVE FEVSADLMHH QRRIIGSWVT
SLFHMEKCAH DLTDWKLWPR NAITHRFSLE QAGDAYALMA SGKCGKVVIN FPD