Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A1133 |
Symbol | pyrD |
ID | 6873276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1125440 |
End bp | 1126450 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 642784317 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_002214991 |
Protein GI | 198243233 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000040512 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTATC CCTTCGTTCG TAAAGCCCTT TTCCAGCTCG ATCCAGAGCG CGCTCATGAA TTTACATTTC AACAATTACG CCGCATTACA GGTACGCCGC TGGAAGCGCT GGTGCGCCAG AAAGTACCGA CAAAGCCGGT TACCTGCATG GGACTTACCT TTAAAAATCC ACTGGGGCTG GCTGCCGGTC TGGATAAAGA CGGGGAGTGC ATCGACGCGT TAGGCGCGAT GGGGTTTGGC TCCCTGGAAA TCGGCACCGT GACGCCGCGC CCACAGCCGG GTAACGATAA GCCGCGTCTT TTTCGTCTGG TGGATGCTGA AGGTCTGATC AATCGGATGG GCTTTAATAA TCTGGGCGTC GATAACCTGG TCGAGAATGT TAAAAAAGCC CATTTTGATG GTATTCTGGG AATTAACATC GGTAAAAATA AAGATACGCC TGTCGAAAAT GGCAAAGATG ACTACCTGAT TTGTATGGAA AAAGTCTATG CTTATGCGGG TTATATCGCC ATTAATATTT CTTCGCCGAA TACGCCAGGG CTACGTACGC TCCAGTATGG CGATGCGCTG GACGATCTGT TAACTGCCAT TAAAAATAAG CAAAACGATC TTCAGGCGAT CCACCATAAA TATGTGCCGG TGGCAGTAAA GATCGCGCCG GATCTTTGTG AAGAAGAATT GATCCAGGTT GCCGATAGCC TGCTTCGTCA TAATATTGAT GGGGTGATTG CGACAAATAC CACCCTCGAT CGTTCTCTGG TACAAGGAAT GAAAAATTGC CAGCAAACGG GGGGATTAAG TGGCCGGCCA TTACAATTAA AAAGCACAGA AATTATTCGC CGTTTATCCC AGGAGTTAAA GGGACAATTG CCTATTATCG GCGTCGGCGG CATTGACTCA GTTATCGCCG CGCGCGAGAA GATAGCGGCA GGAGCTACGC TGGTACAAAT TTATTCCGGC TTTATTTTTA AAGGCCCGCC ATTGATTAAA GAAATCGTAA CGCACATCTA A
|
Protein sequence | MYYPFVRKAL FQLDPERAHE FTFQQLRRIT GTPLEALVRQ KVPTKPVTCM GLTFKNPLGL AAGLDKDGEC IDALGAMGFG SLEIGTVTPR PQPGNDKPRL FRLVDAEGLI NRMGFNNLGV DNLVENVKKA HFDGILGINI GKNKDTPVEN GKDDYLICME KVYAYAGYIA INISSPNTPG LRTLQYGDAL DDLLTAIKNK QNDLQAIHHK YVPVAVKIAP DLCEEELIQV ADSLLRHNID GVIATNTTLD RSLVQGMKNC QQTGGLSGRP LQLKSTEIIR RLSQELKGQL PIIGVGGIDS VIAAREKIAA GATLVQIYSG FIFKGPPLIK EIVTHI
|
| |