Gene Bcep18194_A5063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A5063 
Symbol 
ID3750271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp2100415 
End bp2101653 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content69% 
IMG OID637763359 
ProductUDP-glycosyltransferase, MGT 
Protein accessionYP_369301 
Protein GI78066532 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCA TCGCCCAACC TGAGTCGTTC ATGAAACGCA TCCTGTTCTG CGTGGTGCCC 
GAGAAAGGGC ACGTCAATCC GTGTATCGGC CCGGCCCAGC ATCTGCGCGA TGCCGGGTGC
GACGTCGCGT TCTATGCGCC GGCGGACATC AGCGCACAGC TCGACGGTGC CGGCGGCTTC
GAATTCGTCG GGCCGCGCGA GACGCCCGAG CGCCACGACC TGTCGCGCGG CGCGAGCTTC
GCCGCGAACA TTCGCGACGC GGACTGGCTG CGGCACTGGA TCCGCACGCT GCTGATCGAT
CTCGCGCCCG CGCAGGTGGA CGGCATCCGT GCGGTGCTGC GCGACTGGCG GCCCGACGTG
GTCGTGATCG ATCCGCTGCT CTATGCGGCG GCGATCGCCG CGGAGCTGGA AGGGCTGCCG
TGGGTCTCGA TGTCGAATTC TCTGAATCCG GTGCTGCCGG ACGAGCTCGA TTCGGAACTG
CTGCGCACGG TGCGATGGCT CGCGCCGGAA CGCACGCGCC TGTTCGCGCG CTACGGGCTC
GATGCGCGTT TTCGCGGATG CGACATCCTT TCACCGCACC TGACGCTCGC GTTCACGACC
GACGCGCTGG TCGGTGCGCC GCCGCCCGGT GTCGAGCTGG TCGGCCCGGC GCTGCCGTCC
GGCCCGCGCG GCGACGAGAC GCCGTTCCCG TGGGAACGCC TCGATGCGGA CCGTCCGCTC
GTCTACATGT CGCTCGGCAG CCAGCTTTAC TACCATCCCG ATGTGTTTGC GAAGGTCATC
GACGCGACGC GCGCGACGTC GGCGCAACTG GTGCTGTCGG TGGGCGAACT GGTCGATTCG
GATCTGCTGC CGGCCGACGA CGAACGTGTG GTCGCGGTGC GTTACGTGCC GCAACTGGCG
CTGCTGCAGC GCACGCACGC GTTCGTCAGC CACGGCGGCG CGAATTCGGT GATGGAGTCG
CTTGCGTGCG GCGTGCCGAT GCTGCTGTCG CCGTTCTGCA ACGACCAGTT CCATTCGGCG
CACTTCGTCG AGCGGGCCGG TGCGGGGTGC GTGCTGGATC TGCAGCAGGC CGGCGTGGCG
GAGATTGCCG ATGCGCTCGA ACGCCTGTTG CGGCCCGGGA CGTTGCGCGA GCGGGCGGCG
CGGATCCGCA CGAGCTATGC ATCGCGCAAC GGTTCTGCCG AGGCCGCGCG CCTGATCAGC
GCACTCGCTT CAGGAAACCG CCTGGCGACA GCGTCATGA
 
Protein sequence
MPRIAQPESF MKRILFCVVP EKGHVNPCIG PAQHLRDAGC DVAFYAPADI SAQLDGAGGF 
EFVGPRETPE RHDLSRGASF AANIRDADWL RHWIRTLLID LAPAQVDGIR AVLRDWRPDV
VVIDPLLYAA AIAAELEGLP WVSMSNSLNP VLPDELDSEL LRTVRWLAPE RTRLFARYGL
DARFRGCDIL SPHLTLAFTT DALVGAPPPG VELVGPALPS GPRGDETPFP WERLDADRPL
VYMSLGSQLY YHPDVFAKVI DATRATSAQL VLSVGELVDS DLLPADDERV VAVRYVPQLA
LLQRTHAFVS HGGANSVMES LACGVPMLLS PFCNDQFHSA HFVERAGAGC VLDLQQAGVA
EIADALERLL RPGTLRERAA RIRTSYASRN GSAEAARLIS ALASGNRLAT AS