Gene Information |
Locus tag | EcE24377A_2442 |
Symbol | |
ID | 5590585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2423776 |
End bp | 2425011 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640926103 |
Product | dihydropyrimidine dehydrogenase |
Protein accession | YP_001463498 |
Protein GI | 157158894 |
COG category | [C] Energy production and conversion; [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase; [COG1146] Ferredoxin |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
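The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2423776, 2425011)
print(length, protein_length_aa(length))  # prints: 1236 411
```

Both results match the Gene Length (1236 bp) and Protein Length (411 aa) fields.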
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAACGA AAGATCTTTC GATTACTTTT TGCGGCGTGA AGTTTCCCAA CCCGTTCTGC CTCTCTTCTT CGCCTGTAGG CAACTGCTAT GAGATGTGTG CCAAAGCCTA CGACACAGGT TGGGGCGGCG TGGTGTTTAA AACGATCGGC TTTTTTATCG CCAACGAAGT CTCGCCGCGT TTTGATCATC TGGTGAAAGA AGATACCGGT TTTATCGGCT TCAAAAATAT GGAGCAGATT GCTGAACATC CGTTGGAAGA GAATCTGGCC GCCCTGCGTC AGCTGAAGGA AGATTACCCG GACAAAGTAT TGATCGCTTC GATCATGGGG GAAAATGAGC AGCAATGGGA GGAGCTGGCG CGCCTGGTGC AAGAAGCTGG CGCGGATATG ATCGAGTGTA ACTTCTCCTG TCCGCAAATG ACTTCTCATG CGATGGGTAG CGATGTCGGG CAAAGCCCGG AGCTGGTAGA AAAATATTGT CGGGCAGTGA AACGGGGTTC CACGCTGCCA ATGCTGGCGA AGATGACGCC GAATATCGGT GATATGTGCG AAGTGGCGCT GGCGGCGAAG CGCGGCGGCG CAGATGGCAT TGCGGCGATT AACACCGTTA AATCCATCAC CAATATCGAT CTTAATCAGA AAATCGGTAT GCCGATCGTT AACGGAAAAT CGAGTATTTC CGGATATTCC GGTAAAGCGG TAAAACCGAT CGCCCTGCGC TTCATTCAGC AGATGCGCAC CCATCCAGAA CTGTGCGATT TCCCAATCAG CGGTATCGGC GGCATTGAAA CCTGGGAGGA TGCGGCTGAG TTTTTATTGC TCGGCGCAGC AACGTTACAG GTGACCACCG GCATCATGCA GTACGGGTAT CGGATAGTGG AAGATATGGC AAGCGGGTTG TCGCATTATC TCGCCGATCA GGGATTTGAT TCGCTGCAGG AGATGGTAGG TCTGGCGAAT AACAATATTG TCCCGGCGGA AGATTTAGAC CGCAGTTATA TTGTCTATCC CCGTATCAAT CTTGATAAAT GTGTTGGCTG TGGACGCTGT TATATTTCCT GTTACGACGG CGGTCACCAG GCGATGGAAT GGAGCGAGAA AACCCGCACA CCGCATTGTA ATACCGAGAA ATGTGTGGGT TGTCTGCTTT GTGGTCACGT CTGCCCGGTG GGTTGTATTG AGCTCGGGGA AGTGAAGTTT AAGAAAGGCG AGAAAGAACA CCCGGTAACG TTGTAA
|
Protein sequence | MLTKDLSITF CGVKFPNPFC LSSSPVGNCY EMCAKAYDTG WGGVVFKTIG FFIANEVSPR FDHLVKEDTG FIGFKNMEQI AEHPLEENLA ALRQLKEDYP DKVLIASIMG ENEQQWEELA RLVQEAGADM IECNFSCPQM TSHAMGSDVG QSPELVEKYC RAVKRGSTLP MLAKMTPNIG DMCEVALAAK RGGADGIAAI NTVKSITNID LNQKIGMPIV NGKSSISGYS GKAVKPIALR FIQQMRTHPE LCDFPISGIG GIETWEDAAE FLLLGAATLQ VTTGIMQYGY RIVEDMASGL SHYLADQGFD SLQEMVGLAN NNIVPAEDLD RSYIVYPRIN LDKCVGCGRC YISCYDGGHQ AMEWSEKTRT PHCNTEKCVG CLLCGHVCPV GCIELGEVKF KKGEKEHPVT L
|
| |
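The gene and protein sequences above can be cross-checked with a short script. The following is a sketch under stated assumptions, not the method used to produce this record: it strips the grouping spaces, translates with the standard codon assignments (which translation table 11 shares with the standard code; the tables differ in permitted start codons), and computes GC content as a plain fraction of G and C bases. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their
# amino acids; these assignments are shared by tables 1 and 11.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence read in frame from its first base."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
prefix = "ATGTTAACGA AAGATCTTTC GATTACTTTT"
print(translate(prefix))  # prints: MLTKDLSITF
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the complete protein sequence and the GC content field.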