Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0856 |
Symbol | |
ID | 5593994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 862955 |
End bp | 864040 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640920028 |
Product | hypothetical protein |
Protein accession | YP_001457595 |
Protein GI | 157160277 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 66 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGTG GTCATCGCTT TGATGCTCAG ACGCTGCACA GTTTTATTCA GGCTGTATTT CGTCAGATGG GTAGCGAGGA ACAAGAAGCG AAATTAGTTG CCGATCATTT AATCGCGGCA AACCTGGCAG GGCATGATTC ACATGGTATT GGCATGATCC CAAGCTATGT ACGCTCCTGG AGTCAGGGGC ACCTGCAAAT TAACCATCAT GCCAAAACCG TTAAAGAGGC GGGGGCGGCG GTCACGCTCG ATGGCGATCG CGCATTTGGT CAGGTCGCGG CACATGAAGC GATGGCGCTG GGGATTGAGA AAGCGCATCA GCACGGTATT GCCGCCGTGG CGCTACATAA CTCGCATCAT ATCGGCCGTA TCGGTTACTG GGCGGAGCAG TGTGCAGCGG CGGGGTTTGT CTCTATCCAC TTTGTTAGCG TGGTCGGTAT TCCAATGGTC GCGCCGTTCC ACGGTCGCGA CAGCCGCTTT GGCACCAATC CGTTCTGTGT GGTTTTCCCT CGTAAAGATA ATTTCCCGCT GTTGCTTGAT TACGCCACCA GCGCCATTGC ATTTGGCAAA ACCCGCGTCG CCTGGCATAA AGGCGTCCCC GTGCCGCCAG GTTGCCTGAT TGACGTTAAC GGCGTGCCGA CGACCAATCC GGCGGTAATG CAGGAGTCGC CGTTGGGTTC GCTGTTGACC TTTGCCGAAC ATAAAGGCTA CGCCCTTGCA GCGATGTGTG AAATTCTTGG CGGGGCGCTT TCCGGCGGTA AAACGACGCA TCAGGAAACG TTACAAACCA GTCCCGATGC CATTCTTAAC TGCATGACCA CTATCATCAT CAACCCGGAA CTCTTCGGCG CGCCGGATTG TAACGCGCAG ACCGAAGCCT TTGCCGAGTG GGTGAAAGCC TCGCCGCATG ATGATGATAA GCCGATTTTG CTACCGGGCG AGTGGGAAGT GAACACGCGT CGCGAACGGC AGAAGCAGGG GATTCCACTG GATGCGGGAA GCTGGCAGGC CATTTGTGAT GCAGCGCGGC AGATTGGTAT GCCGGAAGAG ACGTTGCAGG CTTTCTGTCA GCAGTTAGCC AGCTAA
|
Protein sequence | MESGHRFDAQ TLHSFIQAVF RQMGSEEQEA KLVADHLIAA NLAGHDSHGI GMIPSYVRSW SQGHLQINHH AKTVKEAGAA VTLDGDRAFG QVAAHEAMAL GIEKAHQHGI AAVALHNSHH IGRIGYWAEQ CAAAGFVSIH FVSVVGIPMV APFHGRDSRF GTNPFCVVFP RKDNFPLLLD YATSAIAFGK TRVAWHKGVP VPPGCLIDVN GVPTTNPAVM QESPLGSLLT FAEHKGYALA AMCEILGGAL SGGKTTHQET LQTSPDAILN CMTTIIINPE LFGAPDCNAQ TEAFAEWVKA SPHDDDKPIL LPGEWEVNTR RERQKQGIPL DAGSWQAICD AARQIGMPEE TLQAFCQQLA S
|
| |