Gene Information |
Locus tag | EcHS_A1151 |
Symbol | |
ID | 5591305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1160782 |
End bp | 1161720 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640920310 |
Product | D-isomer specific 2-hydroxyacid dehydrogenase family protein |
Protein accession | YP_001457873 |
Protein GI | 157160555 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases |
TIGRFAM ID | |
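The coordinate and length fields above agree with each other, and a short check makes that explicit. The sketch below assumes 1-based, inclusive coordinates and assumes that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1160782, 1161720)
print(length)                     # 939
print(protein_length_aa(length))  # 312
```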
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.018964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGATATCA TCTTTTATCA CCCAACGTTC GATACCCAAT GGTGGATTGA GGCACTGCGC AAAGCTATTC CTCAGGCAAG AGTCAGAGCA TGGAAAAGCG GAGATAATGA CTCTGCTGAT TATGCTTTAG TCTGGCATCC TCCTGTTGAA ATGCTGGCAG GGCGCGATCT TAAAGCGGTG TTCGCACTCG GGGCCGGTGT TGATTCTATT TTGAGCAAGC TACAGGCACA CCCTGAAATG CTGAACCCTT CTGTTCCACT TTTTCGCCTG GAAGATACCG GTATGGGCGA GCAAATGCAG GAATATGCTG TCAGTCAGGT GCTGCATTGG TTTCGACGTT TTGACGATTA TCGCATCCAG CAAAATAGTT CGCATTGGCA ACCGCTGCCT GAATATCATC GGGAAGATTT TACCATCGGC ATTTTGGGCG CAGGCGTACT GGGCAGTAAA GTTGCTCAGA GTCTGCAAAC CTGGCGCTTT CCGCTGCGTT GCTGGAGTCG AACCCGTAAA TCGTGGCCTG GCGTGCAAAG CTTTGCCGGA CGGGAAGAAC TGTCTGCATT TCTGAGCCAA TGTCGGGTAT TGATTAATTT GTTACCGAAT ACCCCTGAAA CCGTCGGCAT TATTAATCAA CAATTACTCG AAAAATTACC GGATGGCGCG TATCTCCTCA ACCTGGCGCG TGGTGTTCAT GTTGTGGAAG ATGACCTGCT CGCGGCGCTG GATAGCGGGA AAGTTAAAGG CGCAATGCTG GATGTTTTTA ATCGTGAACC CTTACCGCCT GAAAGTCCGC TCTGGCAACA TCCACGCGTG ACGATAACAC CACATGTCGC CGCGATTACC CGTCCCGCTG AAGCTGTGGA GTACATTTCT CGCACTATTG CCCAGCTCGA AAAAGGGGAG AAGGTCTGCG GGCAAGTCGA CCGCGCACGC GGCTACTAA
|
Protein sequence | MDIIFYHPTF DTQWWIEALR KAIPQARVRA WKSGDNDSAD YALVWHPPVE MLAGRDLKAV FALGAGVDSI LSKLQAHPEM LNPSVPLFRL EDTGMGEQMQ EYAVSQVLHW FRRFDDYRIQ QNSSHWQPLP EYHREDFTIG ILGAGVLGSK VAQSLQTWRF PLRCWSRTRK SWPGVQSFAG REELSAFLSQ CRVLINLLPN TPETVGIINQ QLLEKLPDGA YLLNLARGVH VVEDDLLAAL DSGKVKGAML DVFNREPLPP ESPLWQHPRV TITPHVAAIT RPAEAVEYIS RTIAQLEKGE KVCGQVDRAR GY
|
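The protein sequence can be reproduced from the gene sequence by conceptual translation. The following is a minimal sketch in plain Python, not the method used to generate this record. It builds the standard codon table, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs in which start codons it permits), and it strips the spaces that group the sequence into blocks of ten. The helper name `translate` is illustrative, and only the first and last codons of the record are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block-of-ten spacing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGATATCA TCTTTTATCA CCCAACGTTC"))  # MDIIFYHPTF
print(translate("GGCTACTAA"))                         # GY
```

Passing the full gene sequence to `translate` in the same way should yield the complete protein sequence for comparison. A non-ATG start codon, which table 11 allows, would need special handling that this sketch omits.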