Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1809 |
Symbol | pyrD |
ID | 4881949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 1782312 |
End bp | 1783340 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640127737 |
Product | dihydroorotate dehydrogenase 2 |
Protein accession | YP_001058846 |
Protein GI | 126439177 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0167] Dihydroorotate dehydrogenase |
TIGRFAM ID | [TIGR01036] dihydroorotate dehydrogenase, subfamily 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.505366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTCAGCT CCCTCTACCC GCTTGCCCGC GCGTCCCTCT TCAAGATGGA TGCGGAGGAC GCCCATCATC TGACCCTGCG CATGCTCGGC GCCGCGGGCC GCACGGGCCT CGCGTGCGCG CTGTCGCCCC GCGTGCCCGA CGCGCCGCGC ACCGTGATGG GGCTCTCGTT CCGCAATCCG GTCGGGCTCG CGGCCGGCCT CGACAAGGAC GGCGCGGCGA TCGACGGCTT CGCCGCGCTC GGCTTCGGCT TCATCGAGGT GGGCACCGTC ACGCCGCGCG CGCAGCCCGG CAACCCGCGC CCGCGGCTGT TCCGGCTACC CGAGGCGGAC GCGATCATCA ACCGGATGGG CTTCAACAAC AGCGGCGTCG ACCAGTTCGT GAAGAACGTG CAGGCGGCGC GCTATCGCGG CGTGCTCGGC CTGAACATCG GCAAGAACGC CGACACGCCG ATCGAGCGCG CGGCCGACGA TTACCTGTAC TGCCTCGAGC GCGTCTACCC GTTCGCGAGC TACGTGACGA TCAACATCTC GTCGCCGAAC ACGAAGAACC TGCGCCAGCT CCAGGGCGCG GGCGAGCTCG ACGCGCTGCT CGCCGCGCTG AAGGACAAGC AGCGGCGCCT CGCCGATCTG CACGGCAAGC TCGTGCCGCT CGCGCTGAAG ATCGCGCCCG ATCTCGACGA CGAACAGGTG AAGGAAATCG CCGCAACGCT GCTGCGCCAC GACATCGAAG GCGTGATCGC GACCAACACC ACGCTGTCGC GCGAAGCGGT GAAAGGCCTG CCGCACGCCG ACGAGGCGGG CGGACTGTCC GGGCGGCCGG TGTTCGATGC GTCGAACGCG GTGATCCGCA AGCTGCGCGC GGAGCTTGGC GACGCGGTGC CGATCATCGG CGTGGGCGGC ATCTTCTCCG GCGAGGACGC GCGTGCGAAA CTCGCGGCGG GCGCGGCGCT CGTCCAGCTG TACACCGGCT TCATCTATCG GGGCCCGGCG CTCGTCGCCG AATGCGTGAA GGCGATCGCC CGCGGCTAA
|
Protein sequence | MFSSLYPLAR ASLFKMDAED AHHLTLRMLG AAGRTGLACA LSPRVPDAPR TVMGLSFRNP VGLAAGLDKD GAAIDGFAAL GFGFIEVGTV TPRAQPGNPR PRLFRLPEAD AIINRMGFNN SGVDQFVKNV QAARYRGVLG LNIGKNADTP IERAADDYLY CLERVYPFAS YVTINISSPN TKNLRQLQGA GELDALLAAL KDKQRRLADL HGKLVPLALK IAPDLDDEQV KEIAATLLRH DIEGVIATNT TLSREAVKGL PHADEAGGLS GRPVFDASNA VIRKLRAELG DAVPIIGVGG IFSGEDARAK LAAGAALVQL YTGFIYRGPA LVAECVKAIA RG
|
| |