Gene Information |
Locus tag | ECD_00949 |
Symbol | pyrD |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21(DE3) |
Kingdom | Bacteria |
Replicon accession | CP001509 |
Strand | + |
Start bp | 1009857 |
End bp | 1010867 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | |
Product | dihydroorotate dehydrogenase |
Protein accession | ACT42844 |
Protein GI | 253977174 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
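The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1009857
end_bp = 1010867

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("leftover bases after whole codons:", remainder)
print("protein length (aa):", protein_length)
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields of the record.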
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000020271 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTACC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCAGAGCG CGCTCATGAG TTTACTTTTC AGCAATTACG CCGTATTACA GGAACGCCGT TTGAAGCACT GGTGCGGCAG AAAGTGCCTG CGAAACCTGT TAACTGCATG GGCCTGACGT TTAAAAATCC GCTTGGTCTG GCAGCCGGTC TTGATAAAGA CGGGGAGTGC ATTGACGCGT TAGGCGCGAT GGGATTTGGA TCGATCGAGA TCGGTACCGT CACGCCACGT CCACAGCCAG GTAATGACAA GCCGCGTCTC TTTCGTCTGG TAGATGCCGA AGGTTTGATC AACCGTATGG GCTTTAATAA TCTTGGCGTT GATAACCTCG TAGAGAACGT AAAAAAGGCC CATTATGACG GCGTCCTGGG TATTAACATC GGCAAAAATA AAGATACGCC AGTGGAGCAG GGCAAAGATG ACTATCTGAT TTGTATGGAA AAAATCTATG CCTATGCGGG ATATATCGCC ATCAATATTT CATCGCCGAA TACCCCAGGA TTACGCACGC TGCAATATGG TGAAGCGCTG GATGATCTCT TAACCGCGAT TAAAAATAAG CAAAATGATT TGCAAGCGAT GCACCATAAA TATGTGCCGA TCGCAGTGAA GATCGCGCCG GATCTTTCTG AAGAAGAATT GATCCAGGTT GCCGATAGTT TAGTTCGCCA TAATATTGAT GGCGTTATTG CAACCAATAC CACACTCGAT CGTTCTCTTG TTCAGGGAAT GAAAAATTGC GATCAAACCG GTGGCTTAAG TGGTCGTCCG CTTCAGTTAA AAAGCACCGA AATTATTCGC CGCTTGTCAC TGGAATTAAA CGGTCGCTTA CCGATCATCG GTGTTGGCGG CATCGACTCG GTTATCGCTG CGCGTGAAAA GATTGCTGCG GGTGCCTCAC TGGTGCAAAT TTATTCTGGT TTTATTTTTA AAGGTCCGCC GCTGATTAAA GAAATCGTTA CCCATATCTA A
|
Protein sequence | MYYPFVRKAL FQLDPERAHE FTFQQLRRIT GTPFEALVRQ KVPAKPVNCM GLTFKNPLGL AAGLDKDGEC IDALGAMGFG SIEIGTVTPR PQPGNDKPRL FRLVDAEGLI NRMGFNNLGV DNLVENVKKA HYDGVLGINI GKNKDTPVEQ GKDDYLICME KIYAYAGYIA INISSPNTPG LRTLQYGEAL DDLLTAIKNK QNDLQAMHHK YVPIAVKIAP DLSEEELIQV ADSLVRHNID GVIATNTTLD RSLVQGMKNC DQTGGLSGRP LQLKSTEIIR RLSLELNGRL PIIGVGGIDS VIAAREKIAA GASLVQIYSG FIFKGPPLIK EIVTHI
|
| |
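The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up using only the Python standard library; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record, and the sample input is just the first two blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Sample input: the first two blocks of the gene sequence in the record.
# Paste the full blocked sequence here to check the whole gene.
sample = "ATGTACTACC CCTTCGTTCG"
seq = clean_sequence(sample)

print("length:", len(seq))
print("starts with ATG:", seq.startswith("ATG"))
print("GC fraction:", round(gc_fraction(seq), 2))
```

Run on the full gene sequence, the same helpers would let the reported Gene Length and GC content fields be compared against the sequence itself.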