Gene BURPS1710b_A0909 details

Gene Information

Locus tag: BURPS1710b_A0909
Symbol: hepB
ID: 3692651
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 1166879
End bp: 1168045
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 71%
IMG OID: 637731163
Product: HepB protein
Protein accession: YP_336067
Protein GI: 76818835
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
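
The coordinates and lengths in this record agree with each other, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1166879
end_bp = 1168045

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is excluded from the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1167 388
```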


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCATA CCGAAACGTC GATCAAGTCG CTGCAGATCG GCATGCACTG GTTTCCCGAG 
CGAGCGGGCG GGCTCGATCG CATGTATTAC TCGCTCGTTG GCGCGCTGCC GAGCGCGGGC
GTCGCGGTGC GCGGCGTGGT CGCGGGCTCC GAGCGCGTCG CGGCCGACAC GGGCGGCGCG
ATCCGCGGCT TCGGGCCGGC GACGTCGTCG TTGCCGCGGC GGATGATCGC CGCGCGCCAT
GCGCTGCGCG ACGTGATGCG CATCGAGCGG CCCGACGTCG TGTCGTCGCA CTTCGCGCTG
TACACGTTCC CTGGGCTCGA CGTGACGCGC GGCATTCCGC AGGTGTCGCA TTTCCAGGGC
CCGTGGGCCG ACGAGAGCCA CGTCGAGGGC GCGGATTCGC TCGGGCAGAA GGTCAAGCAC
CGGCTCGAGC AGGCGGTCTA TGCCCGCTCG TCGCGGCTCA TCGTGCTGTC GCACGCGTTC
GGGCAGATTC TCACGTCGCG CTACAACGTC GATCCGGCGC GCGTGCGCGT CGTGCCCGGC
TGCGTCGACA CCGCGCAATT CGATTTGCCG ATGACGCCCG CCGACGCGCG CCGCAAGCTG
CAACTGCCGC AGGATCGGCC GATCGTGCTC GCGGTGCGGC GGCTCGTGCG GCGCATGGGG
CTCGAGGATC TGATCGACGC GGTGAAGACC GTGCGCCGCC GGCATCCGGA CGTGCTGCTG
CTGATCGCCG GCAAGGGGCG GCTCGAAGGC GAGCTGCGCA AACGGATCGA CGACGCCGAG
CTCGGCGAGA ACGTGAAGCT GCTCGGTTTC GTGCCCGACA ATCATCTGGC CGCGCTGTAC
CGCGCGGCGA CGCTCAGCGT CGTGCCGACC GTCGCGCTCG AGGGATTCGG GCTCATCACC
GTCGAGTCGC TCGCGTCCGG CACGCCGGTG CTCGTGACGC CCGTCGGCGG GCTGCCGGAG
GCGGTCGCGG GCCTGTCGGA GGCGCTCGTG CTGCCGGAGG TGGGCGCGGC CGCGATCGCG
GACGGGTTGG CCGCGGCGTT GTCCGGCTCG CTCGTGCTGC CGGATGCGGA CGCATGCCGG
CGATACGCGC GCGCGCATTT CGACAACACG GTGATCGCGC GCCGCGTCGC GGCGGTCTAC
GAGGAGGCGA TTCGGGCCGC CGTTTGA
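
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length; the function name `gc_content` is hypothetical, and the record does not state how its own figure was derived or rounded.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)

# Illustration only: the first 60 bases of the gene sequence, spaces included.
first_line = "ATGAAGCATA CCGAAACGTC GATCAAGTCG CTGCAGATCG GCATGCACTG GTTTCCCGAG"
print(f"{gc_content(first_line):.1f}%")
```

A single 60-base line is not representative of the whole gene. To compare against the reported 71%, pass the complete gene sequence to the function.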
 
Protein sequence
MKHTETSIKS LQIGMHWFPE RAGGLDRMYY SLVGALPSAG VAVRGVVAGS ERVAADTGGA 
IRGFGPATSS LPRRMIAARH ALRDVMRIER PDVVSSHFAL YTFPGLDVTR GIPQVSHFQG
PWADESHVEG ADSLGQKVKH RLEQAVYARS SRLIVLSHAF GQILTSRYNV DPARVRVVPG
CVDTAQFDLP MTPADARRKL QLPQDRPIVL AVRRLVRRMG LEDLIDAVKT VRRRHPDVLL
LIAGKGRLEG ELRKRIDDAE LGENVKLLGF VPDNHLAALY RAATLSVVPT VALEGFGLIT
VESLASGTPV LVTPVGGLPE AVAGLSEALV LPEVGAAAIA DGLAAALSGS LVLPDADACR
RYARAHFDNT VIARRVAAVY EEAIRAAV
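
The protein sequence can be cross-checked against the gene sequence by translating the codons. The sketch below builds the codon table in the conventional TCAG ordering; translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because this gene begins with ATG. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 60 bases of the gene sequence from this record.
first_line = "ATGAAGCATA CCGAAACGTC GATCAAGTCG CTGCAGATCG GCATGCACTG GTTTCCCGAG"
print(translate(first_line))  # MKHTETSIKSLQIGMHWFPE
```

The output matches the first 20 residues of the protein sequence listed above. Translating the full gene sequence the same way should give all 388 residues.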