Gene Bcen_4875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen_4875 
Symbol 
ID4095155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia AU 1054 
KingdomBacteria 
Replicon accessionNC_008061 
Strand
Start bp2136731 
End bp2137894 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content67% 
IMG OID638018159 
Productarabinogalactan endo-1,4-beta-galactosidase 
Protein accessionYP_624725 
Protein GI107027214 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3867] Arabinogalactan endo-1,4-beta-galactosidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.417928 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGAC GATCGATGCT GCGCTGGAGC ATGTCGAGCG CGGCGCTTGC CTGCCTCGAC 
CTGGCCGGTC CGCTGGCCGC CTTCGCACGA CCCGCGCCGC ACAGTGCGGC GGCGGCCGAG
TTCGCGATGG GCGCCGACAT CTCGACGTTG CCGGAACTCG AAGCGCATGG CGCCGCCTTC
TTCGACCGCG GCGGCGCGCC ACGCGACTGC CTGAAGATTC TCCGCGCGCA CGGCGTCGAT
TCGATCCGGA TCAAGGTCTG GAACGATCCC GGCAACCCGG ATTTCTTCCC AGCGAACCAG
AGCGATGCGG CGGGCTACAA CAATGCCGCG CACGTCGTCG TGCTCGCGCA GCGCGCGGCC
GCGCTCGGGA TGCGCATCCT GATCGACTTC CACTACAGCG ACTGGTGGGC CGACCCCGGC
AAGCAATATC CTCCGCATGC ATGGGCCGGC AAGAGCCTGG CCGAAACCTG CGCGCTGCTG
TCGGCGTACA CGACCGACGT GCTGCGCCGG CTGCAGCGCG CCGGCGTGAG CCCCGAGTGG
GTGCAGATCG GCAACGAGAT CACGGGCGGC ATGCTGTGGC CGCTCGGCCG CTACGACCAG
TGGGACAATC TCGCGCAGTT GCTGAAAACC GGCCACGACG CGGTGAAGGC CGTCGATCCG
CGCATCAAGG TGATGCTGCA CGTCGACAGC GGTGGCGACA ACGGCAAGAG CCGCTGGTGG
TTCGACAGCG CGACGCAGCG CGGCGTCGCA TTCGATGTGA TCGGCCTGTC GTATTACCCG
CAATGGCAAG GCTCGCTCGA CGATCTGCGC AACAACGCGA ACGACCTAGC GGTGCGCTAC
GACAAGGAGC TGATCGTCGT CGAAACCGCG TATCCGTGGA CCACCAGCGA TGGCGATTCC
GAGCCGAACG CGATGACCAA CACCGGATCG ACGACCTTTC CGCCGTCGCC GGCCGGCCAG
GCCCAATTCC TCGCAGCGGT CGTCGATATC GTGAAGGGCG TGCCGGGCAA TCGCGGCAAG
GGCGTGTTCT GGTGGGAACC GGAATGGATC CCGACGCGCG GTGTCGGCTG GAAGCTCGGC
GCGGGCGACC AGTGGGACAA CAACACGCTG TTCGATTTCC ACGGTCACGC GTTGCCGTCG
CTCGACGCGT TCCGGCAGCG CTGA
 
Protein sequence
MNRRSMLRWS MSSAALACLD LAGPLAAFAR PAPHSAAAAE FAMGADISTL PELEAHGAAF 
FDRGGAPRDC LKILRAHGVD SIRIKVWNDP GNPDFFPANQ SDAAGYNNAA HVVVLAQRAA
ALGMRILIDF HYSDWWADPG KQYPPHAWAG KSLAETCALL SAYTTDVLRR LQRAGVSPEW
VQIGNEITGG MLWPLGRYDQ WDNLAQLLKT GHDAVKAVDP RIKVMLHVDS GGDNGKSRWW
FDSATQRGVA FDVIGLSYYP QWQGSLDDLR NNANDLAVRY DKELIVVETA YPWTTSDGDS
EPNAMTNTGS TTFPPSPAGQ AQFLAAVVDI VKGVPGNRGK GVFWWEPEWI PTRGVGWKLG
AGDQWDNNTL FDFHGHALPS LDAFRQR