Gene BURPS1710b_1978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1978 
SymbolpyrD 
ID3689820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2154610 
End bp2155647 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content70% 
IMG OID637728434 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_333375 
Protein GI76810873 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCAGCT CCCTCTACCC GCTTGCCCGC GCGTCCCTCT TCAAGATGGA TGCGGAGGAC 
GCCCATCATC TGACCCTGCG CATGCTCGGC GCCGCGGGCC GCACGGGCCT CGCGTGCGCG
CTGTCGCCCC GCGTGCCCGA CGCGCCGCGC ACCGTGATGG GGCTCTCGTT CCGCAATCCG
GTCGGGCTCG CGGCCGGCCT CGACAAGGAC GGCGCGGCGA TCGACGGCTT CGCCGCGCTC
GGCTTCGGCT TCATCGAGGT GGGCACCGTC ACGCCGCGCG CGCAGCCCGG CAACCCGCGC
CCGCGGATGT TCCGGCTACC CGAGGCGGAC GCGATCATCA ACCGGATGGG CTTCAACAAC
AGCGGCGTCG ACCAGTTCGT GAAGAACGTG CAGGCGGCGC GCTATCGCGG CGTGCTCGGC
CTGAACATCG GCAAGAACGC CGACACGCCG ATCGAGCGCG CGGCCGACGA TTACCTGTAC
TGCCTCGAGC GCGTCTACCC GTTCGCGAGC TACGTGACGA TCAACATCTC GTCGCCGAAC
ACGAAGAACC TGCGCCAGCT CCAGGGCGCG GGCGAGCTCG ACGCGCTGCT CGCCGCGCTG
AAGGACAAGC AGCGGCGCCT CGCCGACCTG CACGGCAAGC TCGTGCCGCT CGCGCTGAAG
ATCGCGCCCG ATCTCGACGA CGAACAGGTG AAGGAAATCG CCGCAACGCT GCTGCGCCAC
GACATCGAAG GCGTGATCGC GACCAACACC ACGCTGTCGC GCGAAGCGGT GAAAGGCCTG
CCGCACGCCG ACGAGGCGGG CGGACTGTCC GGGCGGCCGG TGTTCGACGC GTCGAACGCG
GTGATCCGCA AGCTGCGCGC GGAGCTTGGC GACGCGGTGC CGATCATCGG CGTGGGCGGC
ATCTTCTCCG GCGAGGACGC GCGTGCGAAA CTCGCGGCGG GCGCGGCGCT CGTCCAGCTG
TACACCGGCT TCATCTATCG GGGCCCGGCG CTCGTCGCCG AATGCGTGAA GGCGATCGCC
CGCGGCGAAG CGCGATGA
 
Protein sequence
MFSSLYPLAR ASLFKMDAED AHHLTLRMLG AAGRTGLACA LSPRVPDAPR TVMGLSFRNP 
VGLAAGLDKD GAAIDGFAAL GFGFIEVGTV TPRAQPGNPR PRMFRLPEAD AIINRMGFNN
SGVDQFVKNV QAARYRGVLG LNIGKNADTP IERAADDYLY CLERVYPFAS YVTINISSPN
TKNLRQLQGA GELDALLAAL KDKQRRLADL HGKLVPLALK IAPDLDDEQV KEIAATLLRH
DIEGVIATNT TLSREAVKGL PHADEAGGLS GRPVFDASNA VIRKLRAELG DAVPIIGVGG
IFSGEDARAK LAAGAALVQL YTGFIYRGPA LVAECVKAIA RGEAR