Gene Bcen2424_5047 details


Gene Information

Locus tag: Bcen2424_5047
Symbol:
ID: 4452307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia HI2424
Kingdom: Bacteria
Replicon accession: NC_008543
Strand:
Start bp: 2072391
End bp: 2073596
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 69%
IMG OID: 639697104
Product: gamma-butyrobetaine dioxygenase
Protein accession: YP_838674
Protein GI: 116693141
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID: [TIGR02409] gamma-butyrobetaine hydroxylase
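
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
# Coordinates copied from the record above.
START_BP = 2072391
END_BP = 2073596


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1206
print(protein_length(length_bp))  # 401
```

Under these assumptions the result matches the reported gene length of 1206 bp and protein length of 401 aa.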


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.797229
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.931237
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGCAG CGGCGCAACA CCGTATCGAG GACTGGCGGA CGTTTTCGGC CGACGCGGCG 
ATCGCGGCGG CGACGATCGG CGATGACGCG GTGGAGGTCG AGTGGAGCGA CGCGCGACGC
TCGCCGTTCC ATTTCGACTG GCTGCGCGAC AACTGCGCGT GTTCCGCGTG CGTGCATGCG
ATCACGCGCG AACAGGTGTT CGAGATCGCC GATGCGCGCG AGGATCTGTC GGCGCTCACC
GTGCACGTCG AGACCGACGG CGCGCTGCAT GTGGAATGGA ACGACGGGCA CCGCAGCGCG
TGGTCGCCGG GCTGGTTGCG TGCGCATGCG TACGACGACG CGTCGCGCGC CGAGCGCCTG
GCCGCGCACG GGCGGCACGT ATGGACCGGT GACGATGCGA CGGCGATCGG CGTATTCGCG
TGGCGCGACG TGATGGAAGA CGACCGTGCG TTGCTCGCAT GGCTCGGCGC GTTGCAGCGC
ACGGGGCTCA CGCGCGTCGA AGGCGTGCCG GCCGAGCGCG GCCGCGTCGA CGAGATCGCA
CGCCGTGTCG GCCTGATCCG CGAAAGTAAT TTCGGTGTGC TGTTCGACGT CGAATCGAAG
CCGCGCCCGG ACAGCAATGC GTATACGTCG CTGAATCTTC CGCCGCACAC CGACCTGCCG
ACGCGTGAAT TGCAGCCGGG CGTGCAGTTC CTGCATTGCC TCGCGAACGA CGCGACGGGC
GGCGACAGCA TTTTCCTCGA CGGCTTCGCG CTCGCGGATG CGCTGCGGCG CGAACATCCG
GACGATTTCG AGCAGCTCGC ATCGACGCCG TTCGAGTTCT GGAACAAGAG CGCGAACAGC
GACTACCGCT GTTCGGCGCC GGTGATCGGC CTCGATGCAC GCGGCAACGT GACGGAAGTA
CGCGTCGCGA ACTTCCTGCG CGGGCCGCTC GATGCGCCGG CCGGTTCGGT CGCGGCCGTC
TATCGTGCGT ACCGGCGGTT TCTCGCGCTG GCGCGCGAGC CGCGCTTTCG CGTACAGCGC
CGGCTGCGGG CGGGCGACAT GTGGGCGTTC GACAACCGGC GCGTGCTGCA TGCGCGCACC
GGGTTCGATC CGTCGACGGG CCGTCGGCAC CTGCAGGGCT GTTACGTCGA TCGCGATGAG
TTGCTGTCGC GCTGGCGCGT GCTGTCGCGA TCGGCGCCGG CCGGGGCCGC GGCCGTGGCC
GGCTGA
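
A GC content figure like the one reported in the gene information can be recomputed from a sequence laid out as above. This is a minimal sketch, assuming GC content is defined as the percentage of G and C bases among all bases; `gc_content` is a hypothetical helper, and the database may round or count ambiguous bases differently.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)
```

Pasting the full gene sequence, including its spaces and line breaks, into `gc_content` returns the percentage for the whole gene.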
 
Protein sequence
MQAAAQHRIE DWRTFSADAA IAAATIGDDA VEVEWSDARR SPFHFDWLRD NCACSACVHA 
ITREQVFEIA DAREDLSALT VHVETDGALH VEWNDGHRSA WSPGWLRAHA YDDASRAERL
AAHGRHVWTG DDATAIGVFA WRDVMEDDRA LLAWLGALQR TGLTRVEGVP AERGRVDEIA
RRVGLIRESN FGVLFDVESK PRPDSNAYTS LNLPPHTDLP TRELQPGVQF LHCLANDATG
GDSIFLDGFA LADALRREHP DDFEQLASTP FEFWNKSANS DYRCSAPVIG LDARGNVTEV
RVANFLRGPL DAPAGSVAAV YRAYRRFLAL AREPRFRVQR RLRAGDMWAF DNRRVLHART
GFDPSTGRRH LQGCYVDRDE LLSRWRVLSR SAPAGAAAVA G