Gene EcHS_A1467 details


Gene Information

Locus tag: EcHS_A1467
Symbol: idhA
ID: 5591397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1465046
End bp: 1466035
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 50%
IMG OID: 640920624
Product: D-lactate dehydrogenase
Protein accession: YP_001458180
Protein GI: 157160862
COG category: [C] Energy production and conversion
              [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1052] Lactate dehydrogenase and related dehydrogenases
TIGRFAM ID:
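
The coordinate and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; both function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1465046, 1466035)
print(length, protein_length_aa(length))  # prints: 990 329
```

Under those assumptions, 330 codons minus the stop codon gives the 329 aa reported as Protein Length.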


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value: 0.59802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTCG CCGTTTATAG CACAAAACAG TACGACAAGA AGTACCTGCA ACAGGTGAAC 
GAGTCCTTTG GCTTTGAGCT GGAATTTTTT GACTTTCTGC TGACGGAAAA AACCGCTAAA
ACTGCCAATG GCTGCGAAGC GGTATGTATT TTCGTAAACG ATGACGGCAG CCGCCCGGTG
CTGGAAGAGC TGAAAAAGCA CGGCGTTAAA TATATCGCCC TGCGCTGTGC CGGTTTCAAT
AACGTCGACC TTGACGCGGC AAAAGAACTG GGGCTGAAAG TAGTCCGTGT TCCAGCCTAT
GATCCAGAGG CCGTTGCTGA ACACGCCATC GGTATGATGA TGACGCTGAA CCGCCGTATT
CACCGCGCGT ATCAGCGTAC CCGTGACGCT AACTTCTCTC TGGAAGGTCT GACCGGCTTT
ACTATGTATG GCAAAACGGC AGGCGTTATC GGTACCGGTA AAATCGGTGT GGCGATGCTG
CGCATTCTGA AAGGTTTTGG TATGCGTCTG CTGGCGTTCG ATCCGTATCC AAGTGCAGCG
GCGCTGGAAC TCGGTGTGGA GTATGTCGAT CTGCCAACCC TGTTCTCTGA ATCAGACGTT
ATCTCTCTGC ACTGCCCGCT GACACCGGAA AACTACCATC TGTTGAACGA AGCCGCCTTC
GATCAGATGA AAAATGGCGT GATGATCGTC AATACCAGTC GCGGTGCATT GATTGATTCT
CAGGCAGCAA TTGAAGCGCT GAAAAATCAG AAAATTGGTT CGTTGGGTAT GGACGTGTAT
GAGAACGAAC GCGATCTGTT CTTTGAAGAT AAATCCAACG ACGTGATCCA GGATGACGTA
TTCCGTCGCC TGTCTGCCTG CCACAACGTG CTGTTTACCG GGCACCAGGC ATTCCTGACA
GCAGAAGCTC TGACCAGTAT TTCTCAGACT ACGCTGCAAA ACTTAAGCAA TCTGGAAAAA
GGCGAAACCT GCCCGAACGA ACTGGTTTAA
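
The GC content reported under Gene Information can be recomputed from the gene sequence above. This is a minimal sketch, assuming the 10-base groups and line breaks are display formatting only; `gc_fraction` is a hypothetical helper name, and the input shown is a toy sequence rather than the gene itself.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for display grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Toy input, not the gene above: 4 of the 8 bases are G or C.
print(gc_fraction("ATGC GCTA"))  # prints: 0.5
```

Passing the full gene sequence as a single string, spaces and newlines included, would give the fraction to compare against the rounded percentage in the record.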
 
Protein sequence
MKLAVYSTKQ YDKKYLQQVN ESFGFELEFF DFLLTEKTAK TANGCEAVCI FVNDDGSRPV 
LEELKKHGVK YIALRCAGFN NVDLDAAKEL GLKVVRVPAY DPEAVAEHAI GMMMTLNRRI
HRAYQRTRDA NFSLEGLTGF TMYGKTAGVI GTGKIGVAML RILKGFGMRL LAFDPYPSAA
ALELGVEYVD LPTLFSESDV ISLHCPLTPE NYHLLNEAAF DQMKNGVMIV NTSRGALIDS
QAAIEALKNQ KIGSLGMDVY ENERDLFFED KSNDVIQDDV FRRLSACHNV LFTGHQAFLT
AEALTSISQT TLQNLSNLEK GETCPNELV
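
The protein sequence corresponds to the gene sequence translated with translation table 11. That table assigns the same amino acids to codons as the standard code and differs from it in the permitted start codons. The sketch below is a hand-rolled illustration rather than the database's own method: `translate` is a hypothetical helper, alternative start codons are not modeled, and translation stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI genetic-code order (codons enumerated over T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAAACTCG CCGTTTATAG CACAAAACAG"))  # prints: MKLAVYSTKQ
```

The output matches the first ten residues of the protein sequence; the last 30 bases of the gene translate to GETCPNELV followed by the TAA stop codon.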