Gene BURPS1106A_3144 details


Gene Information

Locus tag: BURPS1106A_3144
Symbol: pyrC
ID: 4903267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3066219
End bp: 3067496
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 72%
IMG OID: 640136370
Product: dihydroorotase
Protein accession: YP_001067382
Protein GI: 126452005
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
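
The coordinates and lengths listed above are consistent with one another, which a short Python sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. Neither convention is stated on this page, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3066219
end_bp = 3067496
reported_gene_length = 1278    # bp
reported_protein_length = 425  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)  # prints: 1278 426 425
```

Under these assumptions the computed values match the reported gene length (1278 bp) and protein length (425 aa).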


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATTC ATATCAAAGG CGGCACGCTC ATCGATCCGG CGGCCGGCAC GCAGCGGCAG 
GCCGACGTGT TCGTCGCGGC CGGCAAGGTG GCCGCGATCG GCGCGGCGCC GGCCGATTTC
AACGCGGCGA AGACGATCGA CGCGACGGGG CTGATCGTCG CGCCGGGCTT CGTCGATTTG
TCGGCGCGGC TGCGCGAGCC CGGCTACGAG CATAAGGCGA CGCTCGAATC CGAGATGGCG
GCGGCGGTCG CGGGCGGCGT GACGAGCCTC GTGTGCCCGC CCGACACCGA TCCGGTGCTC
GACGAGCCGG GCCTCGTCGA AATGCTGAAG TTTCGCGCCC GCAACCGGAA TCAGGCGCAC
GTGTATCCGC TCGGCGCGCT GACGGTCGGC CTGAAAGGGC AGGTCATCAC CGAGATGGTC
GAGCTGACCG AGGCGGGCTG CATCGGCTTC ACGCAGGCGA ACGTGCCCGT CACCGATACG
CAGGTGCTGC TGCGCGCGCT GCAGTACGCG AGCACCTACG GCTACACGGT GTGGCTGCGC
CCGCTCGACG CGTTTCTCGC GAAGGGCGGC GTCGCGGCGA GCGGGCCCGT CGCGTCGCGG
CTCGGCCTGT CGGGCGTGCC GGTCGCGGCC GAGACGATCG CGCTGCATAC GCTGTTCGAG
CTGATGCGGG TGACGGGCGC GCGCGTGCAC GTCGCGCGGC TGTCGTCGGC GGCCGGCGTC
GCGCTCGTGC GCGCCGCGAA GGCCGAGGGC CTGCCCGTGA CCTGCGATGT CGGCGCGAAC
CACCTGCATC TGATCGATGT CGACATCGGC TACTTCGACG CGCAGTTCCG GCTCGATCCG
CCGCTGCGCG CCGAGCGCGA CCGCGAGGCG ATTCGCGCGG CGCTCGCCGA CGGCACGATC
GATGCGATCT GCTCGGATCA CACGCCCGTC GATGACGACG AGAAGCTGCT GCCGTTCGCC
GAGGCGACGC CCGGCGCGAC GGGCCTCGAG CTGCTGCTGT CGCTGACCGT GAAGTGGGCG
CGCGAAGCGG GCGTGCCGCT CGCGCGGGCG CTCGCGGCGA TCACCTCGGC GCCCGCCGAT
GTGCTGAAGC TGCCCGCCGG CCGTATCGGC GAAGGCGCGC CGGCCGACCT GTGCGTGTTC
GATCCGAATG CGCACTGGCG CGTCGAGCCC CGCGCGCTGA AGAGCCAGGG CCACAACACG
CCGTTCCTCG GCTATGAGCT GCCGGCGCGA GTGTGCGCGA CGCTCGTCGC GGGGCAGGTG
GCGTTCGAGC GTCGCTGA
 
Protein sequence
MKIHIKGGTL IDPAAGTQRQ ADVFVAAGKV AAIGAAPADF NAAKTIDATG LIVAPGFVDL 
SARLREPGYE HKATLESEMA AAVAGGVTSL VCPPDTDPVL DEPGLVEMLK FRARNRNQAH
VYPLGALTVG LKGQVITEMV ELTEAGCIGF TQANVPVTDT QVLLRALQYA STYGYTVWLR
PLDAFLAKGG VAASGPVASR LGLSGVPVAA ETIALHTLFE LMRVTGARVH VARLSSAAGV
ALVRAAKAEG LPVTCDVGAN HLHLIDVDIG YFDAQFRLDP PLRAERDREA IRAALADGTI
DAICSDHTPV DDDEKLLPFA EATPGATGLE LLLSLTVKWA REAGVPLARA LAAITSAPAD
VLKLPAGRIG EGAPADLCVF DPNAHWRVEP RALKSQGHNT PFLGYELPAR VCATLVAGQV
AFERR
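
The relationship between the gene sequence and the protein sequence above can be sketched in Python. This is a minimal illustration using only the standard library, not the tool that produced this page. The helper names (`clean_sequence`, `gc_content`, `translate`) are hypothetical. The sketch relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons. Since this gene starts with ATG, that difference does not affect the result here.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, as displayed on this page.
first_line = "ATGAAGATTC ATATCAAAGG CGGCACGCTC ATCGATCCGG CGGCCGGCAC GCAGCGGCAG"
dna = clean_sequence(first_line)
print(len(dna))        # prints: 60
print(translate(dna))  # prints: MKIHIKGGTLIDPAAGTQRQ
```

The 20 residues printed match the first two blocks of the protein sequence. Passing the full gene sequence to `clean_sequence` and then to `gc_content` or `translate` would allow the same comparison against the reported GC content and the complete protein.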