Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4072 |
Symbol | dlgD |
ID | 5588465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4055354 |
End bp | 4056352 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640927691 |
Product | 2,3-diketo-L-gulonate reductase |
Protein accession | YP_001465051 |
Protein GI | 157156962 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTGA CATTTGAGCA GTTAAAAGCA GCCTTTAATC GGGTCTTAAT TTCACGCGGC GTTGACAGCG AAACGGCTAA CGCCTGTGCA GAGATGTTCG CCCGCACCAC CGAATCCGGC GTTTATTCTC ACGGCGTTAA TCGTTTCCCT CGTTTCATTC AACAACTGGA AAACGGCGAT ATCATTCCTG ATGCCCAACC CAAACGTATA ACCAGCCTCG GCGCAATTGA ACAGTGGGAT GCCCAGCGTT CGATCGGCAA CCTGACGGCG AAAAAGATGA TGGATCGCGC CATTGAACTG GCTGCCGATC ACGGTATTGG TCTGGTGGCA CTTCGTAATG CCAACCACTG GATGCGCGGC GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATTG GCATTTGCTG GACCAACTCC ATCGCCGTAA TGCCGCCGTG GGGCGCAAAA GAGTGTCGCA TTGGCACTAA CCCGCTGATC GTCGCCATTC CTTCCACGCC GATCACCATG GTCGATATGT CGATGTCGAT GTTCTCTTAC GGCATGTTAG AAGTTAACCG CCTGGCGGGC CGTCAGCTCC CGGTCGATGG TGGCTTTGAT GATGAGGGCA ATTTGACCAA AGAACCTGGC GTTATCGAGA AGAATCGCCG CATTTTGCCG ATGGGCTACT GGAAAGGTTC TGGCATGTCG ATTGTGCTGG ATATGATCGC TACTCTCCTT TCCGACGGCG CATCGGTTGC CGAAGTCACC CAGGACAACA GCGACGAATA CGGCATTTCA CAAATTTTTA TTGCCATTGA AGTGGACAAG CTTATCGACG GTCCCACCCG CGATGCCAAG CTGCAACGTA TCATGGATTA CGTTACTACC GCTGAACGCG CTGACGAAAA CCAGGCCATC CGCTTACCCG GCCACGAATT TACTACCCTG CTGGCCGAAA ACCGCCGTAA CGGTATTACC GTTGATGACA GCGTGTGGGC CAAAATCCAG GCGTTATAA
|
Protein sequence | MKVTFEQLKA AFNRVLISRG VDSETANACA EMFARTTESG VYSHGVNRFP RFIQQLENGD IIPDAQPKRI TSLGAIEQWD AQRSIGNLTA KKMMDRAIEL AADHGIGLVA LRNANHWMRG GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY GMLEVNRLAG RQLPVDGGFD DEGNLTKEPG VIEKNRRILP MGYWKGSGMS IVLDMIATLL SDGASVAEVT QDNSDEYGIS QIFIAIEVDK LIDGPTRDAK LQRIMDYVTT AERADENQAI RLPGHEFTTL LAENRRNGIT VDDSVWAKIQ AL
|
| |