Gene Bcen_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen_3038 
Symbol 
ID4096165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia AU 1054 
KingdomBacteria 
Replicon accessionNC_008061 
Strand
Start bp59849 
End bp61219 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content70% 
IMG OID638016333 
Productdihydroorotase 
Protein accessionYP_622907 
Protein GI107025396 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0322135 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAGC GGCTGCGACA GGAAGCGGGC GACCGATCGA CACGGCGGCA CGCCGATCTG 
CTGGTGCATG GCGGCACGGT GATGACGCCC AACGGCGCCG AGCGGATCGA CGTCGCGTGC
ATGGGCGGCC GCGTCGTCGC GCTGGGCGCG TTGCACGGTA CGTGGAGCGC CGACGTGCTG
CTCGATGCGC GCGGCTTGCA CGTGTTGCCG GGCGTGGTCG ACAGTCAGGT GCATTTCCGT
GAACCGGGGC TCACGCACAA GGAGACCATC GAGGCCGGCA CGCGCGGCGC GGTGCTCGGC
GGCGTCACGA CGATCTTCGA GATGCCCAAT ACGCATCCGC TGACGCTGGA CGAGCAGGAC
CTGAGCGCCA AGCTCGATCT CGCGCGCGGC CGCGCGTGGT GCGACTACGC GTTCTACATC
GGCGGTTCGG CCGTGAATGC CGAACGGCTG CCGGTGCTCG AACGATTGCC CGGCTGCGCG
GGGGTGAAGG TCTTCATGGG CAGTTCGTTC GGCGATCTGC TGGCCGACGA CGAAACCGTG
TTGCGCCGGA TACTGCGCCA CGGCCGGCGG CGCATGGCCG TGCATGCGGA GGACGAGGCG
CGGCTGCGCG AACGCAAGTC GATCGCGGAA GCAAGCGGCG ACGTGCGCGA CCATCCGCGC
TGGCGCGACG CGGAAAGCGC GCTGGCCGCG ACGCGGCGCA TCGTCGGGCT GGCTGCCGAG
ACGGGCCGCC GGCTGCATGT GCTGCACGTA TCCACGGCGG ACGAAATGGC GTTGCTCGCA
CGGCACCGGC GGCGCGTGAC GGTCGAGGTC ACGCCGCATC ACCTGAGTCT GCACGCGCCG
GATTGCTACG AGCGGCTCGG CACGTTCGCG CAGATGAATC CGCCCGTGCG CGAACGGCAT
CATCGGGACG CGCTGTGGCA GGCCGTCAGC GACGGCGTGG TCGACGTGAT CGGCAGCGAT
CATGCGCCGC ATACGCGCGA CGAAAAGCGC CGCCCGTATC CGCAGTCGCC GAGCGGGATG
ACCGGTGTGC AGACGCTGCT GCCGCTGATG CTCGATCACG TGCAGGCCGG CCGTTTGAGC
GTCGAACGGC TGGTCGACCT GACCAGCGCC GGGCCGGCGC GCGTTTTCGG CATCGAAGGG
AAGGGACGCA TTGCGGCGGG CTACGACGCC GATTTCAGCA TCGTCGACCT GCGCGCGCGG
CGGATCATTC GCGACGAATG GATCGCGAGC GTGAGCGGGT GGACGCCGTA CGACGGCCGT
GCGGTCACGG GGTGGCCCGT GCATACGGTC GTGCGCGGGC AGGTCGTCGT GCGCGACGAG
GCGCTGAACG GACAACCGGC CGGGGAGGCC GTGACGTTTC TCGACCCCTA G
 
Protein sequence
MGERLRQEAG DRSTRRHADL LVHGGTVMTP NGAERIDVAC MGGRVVALGA LHGTWSADVL 
LDARGLHVLP GVVDSQVHFR EPGLTHKETI EAGTRGAVLG GVTTIFEMPN THPLTLDEQD
LSAKLDLARG RAWCDYAFYI GGSAVNAERL PVLERLPGCA GVKVFMGSSF GDLLADDETV
LRRILRHGRR RMAVHAEDEA RLRERKSIAE ASGDVRDHPR WRDAESALAA TRRIVGLAAE
TGRRLHVLHV STADEMALLA RHRRRVTVEV TPHHLSLHAP DCYERLGTFA QMNPPVRERH
HRDALWQAVS DGVVDVIGSD HAPHTRDEKR RPYPQSPSGM TGVQTLLPLM LDHVQAGRLS
VERLVDLTSA GPARVFGIEG KGRIAAGYDA DFSIVDLRAR RIIRDEWIAS VSGWTPYDGR
AVTGWPVHTV VRGQVVVRDE ALNGQPAGEA VTFLDP