Gene Information |
Locus tag | EcHS_A3779 |
Symbol | dlgD |
ID | 5592987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3772036 |
End bp | 3773034 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640922893 |
Product | 2,3-diketo-L-gulonate reductase |
Protein accession | YP_001460371 |
Protein GI | 157163053 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
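The length fields in the record above follow from the coordinates. The sketch below checks that arithmetic; it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3772036
END_BP = 3773034


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 999
    print(protein_length(length))  # 332
```

Both results agree with the Gene Length and Protein Length rows.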
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTGA CATTTGAGCA GTTAAAAGCA GCCTTTAATC GGGTCTTAAT TTCACGCGGC GTTGACAGCG AAACGGCTGA CGCCTGTGCA GAGATGTTCG CCCGCACCAC CGAATCCGGC GTTTATTCTC ACGGCGTTAA TCGTTTCCCT CGTTTCATTC AACAACTGGA AAACGGCGAT ATCATTCCTG ATGCCCAACC CAAACGTATA ACCAGCCTCG GCGCAATTGA ACAGTGGGAC GCCCAGCGTT CGATCGGTAA CCTGACAGCG AAAAAGATGA TGGATCGCGC CATTGAACTG GCTGCCGATC ACGGTATTGG TCTGGTGGCA CTACGTAATG CCAACCACTG GATGCGCGGC GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATTG GCATTTGCTG GACCAACTCC ATCGCCGTAA TGCCGCCGTG GGGCGCAAAA GAGTGTCGCA TCGGCACTAA CCCGCTGATC GTCGCCATTC CTTCCACGCC GATCACCATG GTCGATATGT CGATGTCGAT GTTCTCTTAC GGCATGTTAG AAGTTAACCG TCTGGCAGGT CGTCAGCTCC CGGTCGATGG TGGCTTTGAT GATGAGGGCA ATTTGACCAA AGAACCTGGC GTTATCGAGA AGAATCGCCG CATTTTGCCG ATGGGCTACT GGAAAGGTTC TGGCATGTCG ATTGTGCTGG ATATGATCGC TACTCTCCTT TCCGACGGCG CATCCGTTGC CGAAGTCACC CAGGACAACA GCGACGAATA CGGCATTTCA CAAATTTTTA TTGCCATTGA AGTGGACAAG CTTATCGACG GTCCCACCCG CGATGCCAAG CTGCAACGCA TCATGGATTA CGTTACTAGT GCCGAGCGTG CTGACGAAAA TCAGGCCATT CGCTTACCCG GCCATGAATT TACTACCCTG CTGGCCGAAA ACCGCCGTAA CGGTATTACC GTTGATGACA GCGTGTGGGC CAAAATCCAG GCGTTATAA
|
Protein sequence | MKVTFEQLKA AFNRVLISRG VDSETADACA EMFARTTESG VYSHGVNRFP RFIQQLENGD IIPDAQPKRI TSLGAIEQWD AQRSIGNLTA KKMMDRAIEL AADHGIGLVA LRNANHWMRG GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY GMLEVNRLAG RQLPVDGGFD DEGNLTKEPG VIEKNRRILP MGYWKGSGMS IVLDMIATLL SDGASVAEVT QDNSDEYGIS QIFIAIEVDK LIDGPTRDAK LQRIMDYVTS AERADENQAI RLPGHEFTTL LAENRRNGIT VDDSVWAKIQ AL
|
| |
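The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own pipeline. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First seven codons of the gene sequence above, in the record's 10-base block layout.
    print(translate("ATGAAAGTGA CATTTGAGCA G"))  # MKVTFEQ
    # Last three codons of the gene sequence, ending in the TAA stop.
    print(translate("GCGTTATAA"))                # AL
```

Passing the full gene sequence, with or without the spaces between 10-base blocks, should return a string that can be compared against the protein sequence once its spaces are removed.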