Gene BURPS1106A_A3028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A3028 
Symbol 
ID4904574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2942002 
End bp2943498 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content73% 
IMG OID640146131 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_001077057 
Protein GI126457864 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.220926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGTTGCC GCGCGCGGCG CGCGCGGCGC CGTGCTTTTC GGCAAGCCGT TGTAACTTTT 
GCGGCCGCGT TCGCACATCC TCGCGCGCGC GGCAAACGGG CATGCGCGTT GCGCACGCCT
GCCGCAAAGG GCACATTCGC GCCCGCCGGC ACGGGAGCCA TCGTCATGCA GAAAATCGCG
TTGATCAGTG AGCACGCATC GCCGCTCGGC GTCATCGGAG GCGTCGACGC GGGCGGCCAG
AACATCTATG TCGCGAACGT CGCCAAGCAG CTCGCGCGGC TCGGCGTCGA CGTCGACGTG
TTCACGCGCT GCGACAATCC GCACCTGCCC GACGTCGCGC ACATCGGCGC GGGCATCCGC
GTGATCCACG TACCGGCCGG CCCGCCGTCG AACGTACCGA AGGAAGCGCT GCTGCCGTAC
ATGAAGGCAT TCTCGGCATT CCTCATCGAC TGGTTCCGGC GCGAGCCGAC GCCTTACGAC
GCGATGCACG CGAACTTCTT CATGTCCGGC GACGCGGCGC TGCGCGTGAA GGCGCGCCTC
GGCGTGCCGC TCGTGATGAC GTTCCATGCG CTCGGCCGCG TGCGCCGCCG GCATCAGGGC
GCGGCCGACG GCTTTCCGGA CGCGCGCTTT CCGATCGAGG ACGCGCTCGC GAAGCGCGCC
GATCGCGTGA TCGCCGAGTG CCCGCAGGAC GCGGCCGATC TGCGCGCGCT GTACCGCGCC
GATCCGGGCC GCATCGAGAT CGTGCCGTGC GGCTTCGACG AAGAAGAGTT TCGCCCGGTG
CTGCGGCGCG CCGCGCGCGC GCGGCTCGGC TGGCGCGACG ACGAATTCGC GGTGCTGCAG
CTCGGGCGCC TCGTGCCGCG CAAGGGCATC GACAACGTGA TCGAGGCGCT CGCGCGCGTG
CCGCGCGACG CGGGCGCGCG GCCGGCCCGT CTCTATGTGG TGGGCGGCAG CGACTACGAG
CCGGACCCGT CGCGCTGCGC GGAGCTCGCG CGCCTCGCCG GCATCGCGCG CGAAGCCGGC
GTGGCCGATC GCGTGACGTT CGTCGGCCGG CGCGATCGCG ACGCGCTGCA CCTCTACTAC
GGCGCGGCCG ACGTGTTCGT GACGACGCCG TGGTACGAGC CGTTCGGGAT CACGCCCGTC
GAGGCGATGG CGTGCGCGAC GCCCGTGATC GGCAGCGACG TCGGCGGCAT CCGCACGACA
GTCGAGCACG GCGTGACGGG CTATCTCGTC GCGCCGCGCG ATCCGGGCGC GCTCGCCGCG
CGGCTCGACG AACTGCGGCG CGACCCCGAG CGCGCGCAGC AGTTGGGCTG GGCCGGCTAC
CGGCGCGCGC ATCGCCATTA CACGTGGCGC GGCGTGGCCG AGCGGCTCGC GGCGATCTAT
CGCGACGTCG CCGCGTGCGC GCGGCGCGGC GCGCGCGCGG GCACGGCGGC GCACGTGCGG
CGCTCGCCCG TCGCGCCCTC GGCAACGGTT GCGAACCAGA AGGAGAACGG ATCATGA
 
Protein sequence
MCCRARRARR RAFRQAVVTF AAAFAHPRAR GKRACALRTP AAKGTFAPAG TGAIVMQKIA 
LISEHASPLG VIGGVDAGGQ NIYVANVAKQ LARLGVDVDV FTRCDNPHLP DVAHIGAGIR
VIHVPAGPPS NVPKEALLPY MKAFSAFLID WFRREPTPYD AMHANFFMSG DAALRVKARL
GVPLVMTFHA LGRVRRRHQG AADGFPDARF PIEDALAKRA DRVIAECPQD AADLRALYRA
DPGRIEIVPC GFDEEEFRPV LRRAARARLG WRDDEFAVLQ LGRLVPRKGI DNVIEALARV
PRDAGARPAR LYVVGGSDYE PDPSRCAELA RLAGIAREAG VADRVTFVGR RDRDALHLYY
GAADVFVTTP WYEPFGITPV EAMACATPVI GSDVGGIRTT VEHGVTGYLV APRDPGALAA
RLDELRRDPE RAQQLGWAGY RRAHRHYTWR GVAERLAAIY RDVAACARRG ARAGTAAHVR
RSPVAPSATV ANQKENGS