Gene BURPS668_1809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1809 
SymbolpyrD 
ID4881949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1782312 
End bp1783340 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID640127737 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001058846 
Protein GI126439177 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.505366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTCAGCT CCCTCTACCC GCTTGCCCGC GCGTCCCTCT TCAAGATGGA TGCGGAGGAC 
GCCCATCATC TGACCCTGCG CATGCTCGGC GCCGCGGGCC GCACGGGCCT CGCGTGCGCG
CTGTCGCCCC GCGTGCCCGA CGCGCCGCGC ACCGTGATGG GGCTCTCGTT CCGCAATCCG
GTCGGGCTCG CGGCCGGCCT CGACAAGGAC GGCGCGGCGA TCGACGGCTT CGCCGCGCTC
GGCTTCGGCT TCATCGAGGT GGGCACCGTC ACGCCGCGCG CGCAGCCCGG CAACCCGCGC
CCGCGGCTGT TCCGGCTACC CGAGGCGGAC GCGATCATCA ACCGGATGGG CTTCAACAAC
AGCGGCGTCG ACCAGTTCGT GAAGAACGTG CAGGCGGCGC GCTATCGCGG CGTGCTCGGC
CTGAACATCG GCAAGAACGC CGACACGCCG ATCGAGCGCG CGGCCGACGA TTACCTGTAC
TGCCTCGAGC GCGTCTACCC GTTCGCGAGC TACGTGACGA TCAACATCTC GTCGCCGAAC
ACGAAGAACC TGCGCCAGCT CCAGGGCGCG GGCGAGCTCG ACGCGCTGCT CGCCGCGCTG
AAGGACAAGC AGCGGCGCCT CGCCGATCTG CACGGCAAGC TCGTGCCGCT CGCGCTGAAG
ATCGCGCCCG ATCTCGACGA CGAACAGGTG AAGGAAATCG CCGCAACGCT GCTGCGCCAC
GACATCGAAG GCGTGATCGC GACCAACACC ACGCTGTCGC GCGAAGCGGT GAAAGGCCTG
CCGCACGCCG ACGAGGCGGG CGGACTGTCC GGGCGGCCGG TGTTCGATGC GTCGAACGCG
GTGATCCGCA AGCTGCGCGC GGAGCTTGGC GACGCGGTGC CGATCATCGG CGTGGGCGGC
ATCTTCTCCG GCGAGGACGC GCGTGCGAAA CTCGCGGCGG GCGCGGCGCT CGTCCAGCTG
TACACCGGCT TCATCTATCG GGGCCCGGCG CTCGTCGCCG AATGCGTGAA GGCGATCGCC
CGCGGCTAA
 
Protein sequence
MFSSLYPLAR ASLFKMDAED AHHLTLRMLG AAGRTGLACA LSPRVPDAPR TVMGLSFRNP 
VGLAAGLDKD GAAIDGFAAL GFGFIEVGTV TPRAQPGNPR PRLFRLPEAD AIINRMGFNN
SGVDQFVKNV QAARYRGVLG LNIGKNADTP IERAADDYLY CLERVYPFAS YVTINISSPN
TKNLRQLQGA GELDALLAAL KDKQRRLADL HGKLVPLALK IAPDLDDEQV KEIAATLLRH
DIEGVIATNT TLSREAVKGL PHADEAGGLS GRPVFDASNA VIRKLRAELG DAVPIIGVGG
IFSGEDARAK LAAGAALVQL YTGFIYRGPA LVAECVKAIA RG