Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2294 |
Symbol | |
ID | 6143286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2321151 |
End bp | 2322386 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617168 |
Product | dihydropyrimidine dehydrogenase |
Protein accession | YP_001744341 |
Protein GI | 170682869 |
COG category | [C] Energy production and conversion [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00484206 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAACGA AAGATCTTTC GATTACTTTT TGCGGCGTGA AGTTTCCCAA CCCGTTCTGC CTCTCTTCTT CGCCGGTAGG CAACTGCTAT GAGATGTGTG CCAAAGCCTA CGACACTGGC TGGGGCGGCG TGGTGTTTAA AACGATCGGC TTTTTTATCG CCAACGAAGT CTCGCCGCGT TTTGATCATC TGGTGAAAGA AGATACCGGT TTTATCGGCT TCAAAAATAT GGAGCAGATT GCTGAACATC CGTTGGAAGA GAATCTGGCC GCCCTGCGTC GGCTGAAAGA AGATTACCCG GACAAAGTAT TGATCGCTTC AATCATGGGC GAAAATGAGC AGCAATGGGA AGAGCTGGCG CGTCTGGTGC AAGAAGCTGG CGCGGATATG ATCGAGTGTA ACTTCTCCTG TCCGCAAATG ACTTCTCATG CGATGGGTAG CGATGTCGGG CAAAGCCCGG AGCTGGTAGA AAAATATTGT CGGGCAGTAA AACGTGGTTC CACGCTGCCA ATGCTGGCGA AGATGACGCC GAATATCGGT GATATGTGCG AAGTGGCGCT GGCGGCGAAG CGCGGCGGCG CAGATGGCAT TGCGGCGATT AACACCGTTA AATCCATCAC CAATATCGAT CTTAATCAGA AAATCGGTAT GCCGATCGTT AACGGTAAAT CGAGTATTTC CGGATATTCC GGTAAAGCGG TAAAACCGAT CGCCCTGCGC TTCATTCAGC AAATGCGTAC CCATCCAGAG CTGCGCGATT TCCCGATCAG CGGTATCGGC GGTATTGAAA CCTGGGAAGA TGCGGCGGAG TTTTTATTGC TCGGTGCAGC GACGTTGCAG GTGACCACCG GCATTATGCA GTACGGGTAT CGGATAGTGG AAGATATGGC GAGCGGGTTG TCGCATTATC TCACCGATCA GGGATTTGAT TCGTTGCAGG AGATGGTAGG TCTGGCGAAT AACAATATTG TCCCGGCGGA AGATTTAGAC CGCAGTTATA TTGTCTATCC CCATATCAAT CTCGATAAAT GTGTTGGCTG TGGACGCTGT TATATTTCCT GTTACGACGG CGGTCACCAG GCGATGGAAT GGAGCGAGAA AACCCGTACA CCGCATTGTA ATACCGAGAA ATGCGTGGGT TGTCTGCTTT GTGGTCACGT CTGCCCGGTG GGTTGTATTG ATCTCGGAGA AGTGAAGTTT AAGAAAGGCG AGAAAGAACA CCCGGTAACG TTGTAG
|
Protein sequence | MLTKDLSITF CGVKFPNPFC LSSSPVGNCY EMCAKAYDTG WGGVVFKTIG FFIANEVSPR FDHLVKEDTG FIGFKNMEQI AEHPLEENLA ALRRLKEDYP DKVLIASIMG ENEQQWEELA RLVQEAGADM IECNFSCPQM TSHAMGSDVG QSPELVEKYC RAVKRGSTLP MLAKMTPNIG DMCEVALAAK RGGADGIAAI NTVKSITNID LNQKIGMPIV NGKSSISGYS GKAVKPIALR FIQQMRTHPE LRDFPISGIG GIETWEDAAE FLLLGAATLQ VTTGIMQYGY RIVEDMASGL SHYLTDQGFD SLQEMVGLAN NNIVPAEDLD RSYIVYPHIN LDKCVGCGRC YISCYDGGHQ AMEWSEKTRT PHCNTEKCVG CLLCGHVCPV GCIDLGEVKF KKGEKEHPVT L
|
| |