Gene Bcenmc03_4940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcenmc03_4940 
Symbol 
ID6127752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia MC0-3 
KingdomBacteria 
Replicon accessionNC_010515 
Strand
Start bp1973552 
End bp1974922 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content70% 
IMG OID641652027 
Productdihydroorotase 
Protein accessionYP_001778560 
Protein GI170737300 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.536764 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.361909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAGC GGCTGCGACA GGAAGCGGGC GACCGATCGA CACGGCGGCA CGCCGATCTG 
CTGGTGCATG GCGGCACGGT GATGACGCCC AACGGCGCCG AGCGGATCGA CGTCGCGTGC
ATGGGCGGCC GCGTCGTCGC GCTGGGCGCG TTGCACGGCG TGTGGAGCGC CGACGTGCTG
CTCGATGCGC GCGGCTTGCA CGTGTTGCCG GGCGTGGTCG ACAGCCAGGT GCATTTCCGT
GAACCGGGGC TCACGCACAA GGAGACCATC GAGGCCGGCA CGCGCGGCGC GGTGCTCGGC
GGCGTCACGA CGATCTTCGA GATGCCCAAT ACGCATCCGC TGACGCTGGA CGAGCAGGAT
CTGAGCGCCA AGCTCGATCT CGCGCGCGGC CGTGCGTGGT GCGACTACGC GTTCTACATC
GGCGGCTCGG CCGTGAATGC CGAACGGCTG CCGGTGCTCG AACGGTTGCC CGGCTGCGCG
GGGGTGAAGG TTTTCATGGG CAGTTCGTTC GGCGATCTGC TGGCCGACGA CGAAACCGTG
TTGCGCCGGA TACTGCGCCA CGGCCGGCGG CGCATGGCCG TGCATGCGGA GGACGAGGCG
CGGCTGCGCG AACGCAAGTC GATCGCGGAA GCAAGCGGCG ACGTGCGCGA CCATCCGCGC
TGGCGCGACG CGGAAAGCGC GCTGGCCGCG ACACGGTGCA TCGTCGGGCT GGCTGCCGAG
ACGGGTCGCC GGCTGCATGT GCTGCACGTA TCCACGGCGG ATGAAATGGC GTTGCTTGCA
CGGCACCGGC GACGCGTGAC GGTCGAGGTC ACGCCGCATC ACCTGAGCTT GCACGCGCCG
GATTGCTACG AGCGGCTCGG CACGTTCGCG CAGATGAATC CGCCCGTGCG CGAACGGCAT
CATCGGGACG CGCTGTGGCA GGCCGTCCGC GACGGCGTGG TCGACGTGAT CGGCAGCGAT
CATGCGCCGC ATACGCGCGA CGAAAAGCGC CGCCCGTATC CGCAGTCGCC GAGCGGGATG
ACCGGTGTGC AGACGCTGCT GCCGCTGATG CTCGATCACG TGCAGGCCGG CCGTTTGAGC
GTCGAACGGC TGGTCGACCT GACCAGCGCC GGGCCGGCGC GCGTTTTCGG CATCGAAGGG
AAGGGACGCA TTGCGGCGGG CTACGACGCC GATTTCAGCA TCGTCGACCT GCGCGCGCGG
CGGATCATTC GCGACGAATG GATCGCGAGC GTGAGCGGGT GGACGCCGTA CGACGGCTGT
GCGGTCACGG GGTGGCCCGT GCATACGGTC GTGCGCGGGC AGGTCGTCGT GCGCGACGAG
GCGCTGAACG GACAACCGGC CGGGGAGGCC GTGACGTTTC TCGACCCCTA G
 
Protein sequence
MDERLRQEAG DRSTRRHADL LVHGGTVMTP NGAERIDVAC MGGRVVALGA LHGVWSADVL 
LDARGLHVLP GVVDSQVHFR EPGLTHKETI EAGTRGAVLG GVTTIFEMPN THPLTLDEQD
LSAKLDLARG RAWCDYAFYI GGSAVNAERL PVLERLPGCA GVKVFMGSSF GDLLADDETV
LRRILRHGRR RMAVHAEDEA RLRERKSIAE ASGDVRDHPR WRDAESALAA TRCIVGLAAE
TGRRLHVLHV STADEMALLA RHRRRVTVEV TPHHLSLHAP DCYERLGTFA QMNPPVRERH
HRDALWQAVR DGVVDVIGSD HAPHTRDEKR RPYPQSPSGM TGVQTLLPLM LDHVQAGRLS
VERLVDLTSA GPARVFGIEG KGRIAAGYDA DFSIVDLRAR RIIRDEWIAS VSGWTPYDGC
AVTGWPVHTV VRGQVVVRDE ALNGQPAGEA VTFLDP