Gene BURPS1106A_A3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A3038 
Symbol 
ID4903541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2952681 
End bp2953796 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content74% 
IMG OID640146141 
Productputative heptosyltransferase 
Protein accessionYP_001077067 
Protein GI126457370 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0859] ADP-heptose:LPS heptosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.148139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCG ATCCTTCCGC GCGGCCCGCG GCCGCCGAGC GGCGCGGCAC GGGCGAATAC 
GCGAGCATCG CGGTGTTTCG CGCGCTGCAG CTCGGCGACA TGCTGTGCGC GGTGCCCGCG
CTGCGCGCGC TGCGGCGCGG CGAGCCGCAG GCGCGGATCA CGCTGATCGG GCTGCCGTGG
GCGAAGGCGT TCGCCGAGCG CTTCTCCGAT TACGTCGACG ACTTCATCGA ATTCCCCGGC
GCGCCGGGGC TCGTCGAGCA GCCGCACGAC GTCGAGCGGC TCGCCGCGTT CGTCGCCGAA
TGCCGGTCGC GCCGTTTCGA TCTCGCGATC CAGCTGCATG GCAGCGGCGC GCAATCGAAC
GCGATCGTCG CGGGCCTCGG CGCGGCGTCG ACGGCGGGTT TCGCGCCCGA TGCGTTCGCG
GCCGGCGAGC ACGCCGCGCC GCGGCTCGAC CGCACGATCG CATGGCCGTC GGCGCTGCCG
GAAATCGCCC GCTACACGAA GCTGATGCGC CGGCTCGGCT ACGACGACTG GGGCGACTAT
CTGGAGTTTC CGCTCGGCGG CCTCGATTAC GCGATCTGCC GCGTGCTGTG CGAGCAGCAC
GATCTGCGGC CGCGCGAATA CGCGGTCGTG CATCCGGGCG CGCGCATGCA GTCGCGCCGC
TGGCCGGTCG CGCGCTTCGC GGGCGTCGCG CGCGCGCTCG CCGAGCGCGG GCTGCGCATC
GTGCTGACGG GCACGCGCGG CGAGGCGGCG CTCGCCGACG CGTTCGCCGC GCAACTGGGC
GCGCCGTTCG TCGATCTGTG CGGCCGCACG CCGCTCGGCG CGCTCGGCGC GCTGATCGGC
CGCAGCCGCC TCGTCGTCTG CAACGATACC GGCGTGTCGC ACGTGGCCGC CGCGCTCGGC
GCGCCGAGCG TCGTGATCGC GTGCGGCAGC GACGCCGCGC GCTGGGCGCC GCTCGATCGC
GAGCGCCATC GCGTGCTCGC CGACTATCCG CCGTGCCGCC CGTGCATGTT CGAAACCTGT
CCGTACGACC ACGCGTGCGC GAACGCGATC GGCGTCGAGG ACGTCGTCAG GCGCGCGGAC
GCACTGCTCG CCGTGGAGCC GCATCATGTC GCCTAA
 
Protein sequence
MSGDPSARPA AAERRGTGEY ASIAVFRALQ LGDMLCAVPA LRALRRGEPQ ARITLIGLPW 
AKAFAERFSD YVDDFIEFPG APGLVEQPHD VERLAAFVAE CRSRRFDLAI QLHGSGAQSN
AIVAGLGAAS TAGFAPDAFA AGEHAAPRLD RTIAWPSALP EIARYTKLMR RLGYDDWGDY
LEFPLGGLDY AICRVLCEQH DLRPREYAVV HPGARMQSRR WPVARFAGVA RALAERGLRI
VLTGTRGEAA LADAFAAQLG APFVDLCGRT PLGALGALIG RSRLVVCNDT GVSHVAAALG
APSVVIACGS DAARWAPLDR ERHRVLADYP PCRPCMFETC PYDHACANAI GVEDVVRRAD
ALLAVEPHHV A