Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2174 |
Symbol | pyrD |
ID | 6143184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2181670 |
End bp | 2182680 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641617050 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_001744224 |
Protein GI | 170682541 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000792039 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.661729 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTACC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCAGAGCG CGCTCATGAG TTTACTTTTC AGCAATTACG CCGTATTACC GGAACGCCGT TTGAAGCACT GGTGCGGCAG AAAGTGCCTG CGAAACCTGT TAACTGCATG GGCCTGACGT TTAAAAATCC GCTTGGTCTG GCAGCCGGTC TTGATAAAGA CGGGGAGTGC ATTGACGCGT TAGGCGCGAT GGGTTTTGGA TCTATCGAGA TCGGTACCGT CACGCCACGT CCACAGCCAG GTAATGACAA GCCGCGTCTC TTTCGTCTGG TAGATGCCGA AGGTTTGATC AACCGTATGG GCTTTAATAA TCTTGGCGTT GATAACCTCG TAGAGAACGT AAAAAAAGCC CATTATGATG GCGTCCTGGG TATTAACATC GGCAAAAATA AAGATACGCC GGTGGAGCAG GGTAAAGATG ACTATCTGAT TTGTATGGAA AAAATCTATG CTTATGCGGG ATATATCGCC ATCAATATTT CATCGCCGAA TACCCCAGGA CTACGCACAC TGCAATATGG CGAAGCGCTG GATGATCTCT TAACCGCGAT TAAAAATAAG CAAAATGATT TGCAAGCGAT GCACCATAAA TATGTGCCGA TCGCAGTGAA GATCGCGCCG GATCTTTCTG AAGAAGAATT GATCCAGGTT GCCGATAGTT TAGTTCGCCA TAATATTGAT GGCGTTATTG CAACCAATAC CACGCTCGAT CGTTCTCTGG TTCAGGGAAT GAAAAATTGT GATCAAACCG GTGGCTTAAG TGGTCGTCCG CTTCAGTTAA AAAGCACAGA AATTATTCGC CGCTTGTCAC AGGAATTAAA CGGTCGCTTA CCGATCATCG GTGTTGGCGG CATTGACTCG GTTATCGCTG CGCGTGAAAA GATTGCTGCG GGGGCCTCAC TGGTGCAAAT TTATTCTGGT TTTATTTTTA AAGGTCCGCC GCTGATTAAA GAAATCGTTA CCCATATCTA A
|
Protein sequence | MYYPFVRKAL FQLDPERAHE FTFQQLRRIT GTPFEALVRQ KVPAKPVNCM GLTFKNPLGL AAGLDKDGEC IDALGAMGFG SIEIGTVTPR PQPGNDKPRL FRLVDAEGLI NRMGFNNLGV DNLVENVKKA HYDGVLGINI GKNKDTPVEQ GKDDYLICME KIYAYAGYIA INISSPNTPG LRTLQYGEAL DDLLTAIKNK QNDLQAMHHK YVPIAVKIAP DLSEEELIQV ADSLVRHNID GVIATNTTLD RSLVQGMKNC DQTGGLSGRP LQLKSTEIIR RLSQELNGRL PIIGVGGIDS VIAAREKIAA GASLVQIYSG FIFKGPPLIK EIVTHI
|
| |