Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3898 |
Symbol | dlgD |
ID | 6144299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3968804 |
End bp | 3969802 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618724 |
Product | 2,3-diketo-L-gulonate reductase |
Protein accession | YP_001745863 |
Protein GI | 170680670 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTGA CATTTGAGCA GTTAAAAGCA GCCTTTAATC GGGTCTTAAT TTCACGCGGC GTTGACAGCG AAACGGCTGA CGCCTGTGCA GAGATGTTCG CCCGCACCAC CGAATCCGGC GTTTATTCTC ACGGCGTTAA TCGTTTCCCT CGTTTCATTC AACAACTGGA AAACGGCGAT ATCATTCCTG ATGCCCAACC CAAACGTATA ACCAGCCTCG GCGCAATTGA ACAGTGGGAC GCCCAGCGTT CGATCGGTAA CCTGACAGCG AAAAAGATGA TGGATCGCGC CATTGAACTG GCTGCCGATC ACGGTATTGG TCTGGTGGCA CTACGTAATG CCAACCACTG GATGCGCGGC GGCAGCTACG GCTGGCAGGC GGCGGAAAAA GGCTATATTG GCATTTGCTG GACCAACTCC ATCGCCGTAA TGCCGCCGTG GGGCGCAAAA GAGTGTCGCA TCGGCACCAA CCCGCTGATC GTCGCCATTC CTTCTACCCC AATCACCATG GTCGATATGT CGATGTCGAT GTTCTCTTAC GGCATGTTAG AAGTTAACCG TCTGGCAGGC CGTCAGCTCC CGGTCGATGG TGGCTTTGAT GATGAGGGCA ATTTGACCAA AGAACCTGGC GTTATCGAGA AGAATCGCCG CATTTTGCCG ATGGGCTACT GGAAAGGTTC TGGCATGTCG ATTGTGCTGG ATATGATCGC CACTCTCCTT TCCGACGGCG CATCGGTTGC CGAAGTCACC CAGGACAACA GCGACGAATA CGGCGTTTCG CAAATCTTTA TCGCCATCGA AGTGGATAAA TTGATCGACG GCCCCACCCG CGATGCCAAG CTGCAACGCA TCATGGATTA CGTTACTACC GCTGAACGCG CTGACGAAAA CCAGGCCATC CGCTTACCCG GCCACGAATT TACTACCCTG CTGGCCGAAA ACCGCCGTAA CGGCATCACC GTTGATGACA GCGTGTGGGC AAAAATCCAG GCGTTATAA
|
Protein sequence | MKVTFEQLKA AFNRVLISRG VDSETADACA EMFARTTESG VYSHGVNRFP RFIQQLENGD IIPDAQPKRI TSLGAIEQWD AQRSIGNLTA KKMMDRAIEL AADHGIGLVA LRNANHWMRG GSYGWQAAEK GYIGICWTNS IAVMPPWGAK ECRIGTNPLI VAIPSTPITM VDMSMSMFSY GMLEVNRLAG RQLPVDGGFD DEGNLTKEPG VIEKNRRILP MGYWKGSGMS IVLDMIATLL SDGASVAEVT QDNSDEYGVS QIFIAIEVDK LIDGPTRDAK LQRIMDYVTT AERADENQAI RLPGHEFTTL LAENRRNGIT VDDSVWAKIQ AL
|
| |