Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1501 |
Symbol | |
ID | 6067096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1658452 |
End bp | 1659687 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641600920 |
Product | dihydropyrimidine dehydrogenase |
Protein accession | YP_001724490 |
Protein GI | 170019536 |
COG category | [C] Energy production and conversion [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase [COG1146] Ferredoxin |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAACGA AAGATCTTTC GATTACTTTT TGCGGCGTGA AGTTTCCCAA CCCGTTCTGC CTCTCTTCTT CGCCGGTAGG CAACTGCTAT GAGATGTGTG CCAAAGCCTA CGACACAGGT TGGGGCGGTG TGGTGTTTAA AACGATCGGC TTTTTTATCG CCAACGAAGT CTCGCCGCGT TTTGATCATC TGGTGAAAGA AGATACCGGT TTTATCGGCT TCAAAAATAT GGAGCAGATT GCCGAGCATC CGTTGGAAGA GAATCTGGCC GCCCTGCGTC GGCTGAAGGA AGATTACCCG GACAAAGTAT TGATCGCTTC GATCATGGGG GAAAATGAGC AGCAATGGGA GGAGCTGGCG CGCCTGGTGC AAGAAGTTGG CGCGGATATG ATCGAGTGTA ACTTCTCCTG TCCGCAAATG ACGTCTCATG CGATGGGTAG CGATGTCGGG CAAAGCCCGG AGCTGGTAGA AAAATATTGT CGGGCAGTTA AACGGGGTTC CACGCTGCCG ATGCTGGCGA AGATGACACC GAATATCGGT GATATGTGCG AAGTGGCGCT GGCGGCGAAG CGCGGCGGCG CAGATGGCAT TGCGGCAATT AACACCGTTA AATCCATCAC CAATATCGAT CTTAATCAGA AAATCGGTAT GCCGATCGTG AACGGTAAAT CGAGTATTTC CGGATATTCC GGTAAAGCGG TAAAACCGAT CGCCCTGCGC TTTATTCAGC AAATGCGCAC CCATCCAGAA CTGCGCGATT TCCCGATCAG CGGTATCGGC GGCATTGAAA CCTGGGAAGA TGCGGCGGAG TTTTTATTGC TCGGCGCAGC AACGTTACAG GTCACCACCG GCATTATGCA GTACGGCTAT CGGATAGTGG AAGATATGGC GAGCGGGTTG TCGCATTATC TCGCCGATCA AGGGTTTGAT TCGTTGCAGG AGATGGTAGG TCTGGCGAAT AACAATATTG TCCCGGCGGA AGATTTAGAC CGCAGTTATA TTGTCTATCC CCGTATCAAT CTTGATAAAT GTGTTGGCTG TGGACGCTGT TATATTTCCT GTTACGACGG CGGTCACCAG GCGATGGAAT GGAGCGAGAA AACCCGCACA CCGCATTGTA ATACCGAGAA ATGTGTGGGT TGTCTGCTTT GTGGTCACGT CTGCCCGGTG GGTTGTATTG AGCTCGGGGA AGTGAAGTTT AAGAAAGGCG AGAAAGAACA CCCGGTAACG TTGTAA
|
Protein sequence | MLTKDLSITF CGVKFPNPFC LSSSPVGNCY EMCAKAYDTG WGGVVFKTIG FFIANEVSPR FDHLVKEDTG FIGFKNMEQI AEHPLEENLA ALRRLKEDYP DKVLIASIMG ENEQQWEELA RLVQEVGADM IECNFSCPQM TSHAMGSDVG QSPELVEKYC RAVKRGSTLP MLAKMTPNIG DMCEVALAAK RGGADGIAAI NTVKSITNID LNQKIGMPIV NGKSSISGYS GKAVKPIALR FIQQMRTHPE LRDFPISGIG GIETWEDAAE FLLLGAATLQ VTTGIMQYGY RIVEDMASGL SHYLADQGFD SLQEMVGLAN NNIVPAEDLD RSYIVYPRIN LDKCVGCGRC YISCYDGGHQ AMEWSEKTRT PHCNTEKCVG CLLCGHVCPV GCIELGEVKF KKGEKEHPVT L
|
| |