Gene Bcen2424_5623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen2424_5623 
Symbol 
ID4451931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia HI2424 
KingdomBacteria 
Replicon accessionNC_008543 
Strand
Start bp2733768 
End bp2734952 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content71% 
IMG OID639697684 
Productcytosine deaminase 
Protein accessionYP_839249 
Protein GI116693716 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.280185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0436616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC TGCTGATCCG CAATGTCCGC ACGAGCGCCG ACGCGGCGCT CGATATCCTG 
ATCGAAGGCG ATCGCATCGC GCGGGTCGGC CCGGCGCTCG ATGCGCCCGC CGGCTGCGCG
ATCGAGGACG GTGCGCTCTC GTTGGCGTTG CCTGGCCTCG TCGAGGGCCA CACCCACCTC
GACAAGACCC ACTGGGGGAT GCCGTGGTAT CGCAACCAGG TCGGCCCGCG CCTCGTCGAC
CGCATCGAGA ACGAACGCCA CTATCGCGCG ACGAGCGGCC ACGACGCGGG TGCCGCGTCG
CTCGCGCTGG CGCGCGCGTT CCTCGCGGCC GGCACGACGC GCATCCGTAC GCACGTCGAC
GTCGATACCG AAGCCGGGCT GCGCCATTTG CACGGCGTGC TCGCGACGCG CGAGACGCTG
CGCGGGCAGG TCGAGATCCA GATCGTCGCG TTTCCGCAAT CGGGCGTGCT CAAGCGGCCG
GGCACCGATG TGCTGCTGTC CGACGCGCTC CGCGCTGGTG CCGACCTGCT CGGCGGGCTC
GATCCGTGCG CGATCGAAGG CGATCCGGTC GAGGCGGTGG ACGTGCTGTT CGCGATCGCC
GAACGCCACG GCCGCGGGCT CGATATCCAT CTGCACGAGC GCGGGTCGAT GGGCGCGTAC
TCGCTCGACC TGATCCTGCA GCGCACGGCG GCGCACGGGA TGCACGGCAA GGTCACGATC
AGCCATGGTT TCTGTCTTGG CGATCTCGCC GAACGCGAGC GCGACGCGCT GCTCGCGCGG
ATGGCCGAAC TTGGCGTCGC GATCGTCACG ACCGCGCCGG CCGCGGTGCC GGTGCCGCCG
GTGGCCGCAT GCCGTGCGGC GGGCGTGACC GTGATCGGCG GCAACGACGG CGTGCGCGAC
ACGTGGACGC CGTATGGCTC GCCGGACATG CTCGAACGCG CGATGCTGAT CGGCATGCGC
AATGATTTCC GCCGCGACGA TGCGCTCGAA GTCGCGCTCG ATTGCGTGAC GCACGGCGCG
GCGCGCGGCT GCGGTTTCGC GGATTACGGA CTGCAGCCGG GTAGCCGCGC GGACGTCGTG
CTCGTCGATG CGCTGACGTT CGCCGAGGCC GTCGTCGCTC GGCCGGTGCG GCGGCTCGTC
GTGTCGTCCG GAAAAATCGT CGCGCGCAAC GGCGCGCTGG TCTGA
 
Protein sequence
MTNLLIRNVR TSADAALDIL IEGDRIARVG PALDAPAGCA IEDGALSLAL PGLVEGHTHL 
DKTHWGMPWY RNQVGPRLVD RIENERHYRA TSGHDAGAAS LALARAFLAA GTTRIRTHVD
VDTEAGLRHL HGVLATRETL RGQVEIQIVA FPQSGVLKRP GTDVLLSDAL RAGADLLGGL
DPCAIEGDPV EAVDVLFAIA ERHGRGLDIH LHERGSMGAY SLDLILQRTA AHGMHGKVTI
SHGFCLGDLA ERERDALLAR MAELGVAIVT TAPAAVPVPP VAACRAAGVT VIGGNDGVRD
TWTPYGSPDM LERAMLIGMR NDFRRDDALE VALDCVTHGA ARGCGFADYG LQPGSRADVV
LVDALTFAEA VVARPVRRLV VSSGKIVARN GALV