Gene EcolC_2275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2275 
Symbol 
ID6066995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2508048 
End bp2509037 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content50% 
IMG OID641601679 
ProductD-isomer specific 2-hydroxyacid dehydrogenase NAD-binding 
Protein accessionYP_001725238 
Protein GI170020284 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1052] Lactate dehydrogenase and related dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCG CCGTTTATAG CACAAAACAG TACGACAAGA AGTACCTGCA ACAGGTGAAC 
GAGTCCTTTG GCTTTGAGCT GGAATTTTTT GACTTTCTGC TGACGGAAAA AACCGCTAAA
ACTGCCAATG GCTGCGAAGC GGTATGTATT TTCGTAAACG ATGACGGCAG CCGCCCGGTG
CTGGAAGAGC TGAAAAAGCA CGGCGTTAAA TATATCGCCC TGCGCTGTGC CGGTTTCAAT
AACGTCGACC TTGACGCGGC AAAAGAACTG GGGCTGAAAG TAGTCCGTGT TCCAGCCTAT
GATCCAGAGG CCGTTGCTGA ACACGCCATC GGTATGATGA TGACGCTGAA CCGCCGTATT
CACCGCGCGT ATCAGCGTAC CCGTGACGCT AACTTCTCTC TGGAAGGTCT GACCGGCTTT
ACTATGTATG GCAAAACGGC AGGCGTTATC GGTACCGGTA AAATCGGTGT GGCGATGCTG
CGCATTCTGA AAGGTTTTGG TATGCGTCTG CTGGCGTTCG ATCCGTATCC AAGTGCAGCG
GCGCTGGAAC TCGGTGTGGA GTATGTCGAT CTGCCAACCC TGTTCTCTGA ATCAGACGTT
ATCTCTCTGC ACTGCCCGCT GACACCGGAA AACTACCATC TGTTGAACGA AGCCGCCTTC
GATCAGATGA AAAATGGCGT GATGATCGTC AATACCAGTC GCGGTGCATT GATTGATTCT
CAGGCAGCAA TTGAAGCGCT GAAAAATCAG AAAATTGGTT CGTTGGGTAT GGACGTGTAT
GAGAACGAAC GCGATCTGTT CTTTGAAGAT AAATCCAACG ACGTGATCCA GGATGACGTA
TTCCGTCGCC TGTCTGCCTG CCACAACGTG CTGTTTACCG GGCACCAGGC ATTCCTGACA
GCAGAAGCTC TGACCAGTAT TTCTCAGACT ACGCTGCAAA ACTTAAGCAA TCTGGAAAAA
GGCGAAACCT GCCCGAACGA ACTGGTTTAA
 
Protein sequence
MKLAVYSTKQ YDKKYLQQVN ESFGFELEFF DFLLTEKTAK TANGCEAVCI FVNDDGSRPV 
LEELKKHGVK YIALRCAGFN NVDLDAAKEL GLKVVRVPAY DPEAVAEHAI GMMMTLNRRI
HRAYQRTRDA NFSLEGLTGF TMYGKTAGVI GTGKIGVAML RILKGFGMRL LAFDPYPSAA
ALELGVEYVD LPTLFSESDV ISLHCPLTPE NYHLLNEAAF DQMKNGVMIV NTSRGALIDS
QAAIEALKNQ KIGSLGMDVY ENERDLFFED KSNDVIQDDV FRRLSACHNV LFTGHQAFLT
AEALTSISQT TLQNLSNLEK GETCPNELV