Gene BURPS1106A_A3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A3039 
Symbol 
ID4904233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2953793 
End bp2954794 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content74% 
IMG OID640146142 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_001077068 
Protein GI126456428 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.200076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAC CGCACTGGAG GCCGCCGCCC GTCGTGTCGA TCGTCGTGCC GACGTACCGG 
CGGCCGGAGC TGCTCGAACG CTGCCTCGGC GCGCTCGCGT CGCAGGTGTT CGATCCGGGC
ACCTACGAGA TCGTCGTCGT CGACGACGAT GCGGCCGGCA GCGCGCGCCC CGTCGTCGAT
GCGCTGACCG TGCGCATGGG CGGGCTGCCC GCGATCCGTT ACGTGAGCGC GCCGCGCACG
CAGGGCCCGG CCGGCGCGCG CAACGCGGGC TGGCGCGAAG CGGCGGGCCC GGTGATCGCG
TTCACCGACG ACGACACGAT CGCCGATCCG CTATGGCTGC GCAACGGCTG CTCGGCGCTG
CTCGCGCAGC CCAACGCGTC GGCCGCGGCC GGGCGCATCG AGGTGCCGCT CGCGCCGTGC
CCGACCGATT ACGAGCGCGA CGCGGGCGGG CTCGCCCACG CGGAGTTCGC GACCGCGAAC
TGTTTCGTGC GGCGCGCGGC GCTCGAGCGC GTCGGCGGCT TCGACGAGCG CTTCACGCGC
GCGTGGCGCG AGGACGCGGA CCTGATGTTC GCACTGCGCG AGCGCGCGGG GCCGATCGTC
GACGCGCGCA CGGCGACGAT CGTGCATCCG GTGCGGCCCG CGCGCTGGGG CGTGAGCATC
GCGCAGCAGT CGAAAGTGTT TTTCGACGCG CTGCTGTACA AGAAGCATCG CGACGTCTAC
CGTCGGCACA TCCGCTCCGT GCCGCCGTGG CATTACTACG CGGCGGTGCT CGCGCTGCTC
GGCGCGTGCG TCGCGCTCGC GCTCGGCCTG CATGCGGCCG CGGCCGCGTG CGCGGCGGCC
TGGGCCGGCA TCACGGCGGC GTTCTGCTGG CGGCGCCTGC GCGGCACCGC GCACACGCCG
TCGCACGTCG CGGAGATGAT CGTCACGTCG ATCGCGATTC CGCCCGTGTC GCTGTACTGG
CGGCTGCGCG GCGCGCTCCA CTTCCGGGTG CTGTTCCTAT GA
 
Protein sequence
MNAPHWRPPP VVSIVVPTYR RPELLERCLG ALASQVFDPG TYEIVVVDDD AAGSARPVVD 
ALTVRMGGLP AIRYVSAPRT QGPAGARNAG WREAAGPVIA FTDDDTIADP LWLRNGCSAL
LAQPNASAAA GRIEVPLAPC PTDYERDAGG LAHAEFATAN CFVRRAALER VGGFDERFTR
AWREDADLMF ALRERAGPIV DARTATIVHP VRPARWGVSI AQQSKVFFDA LLYKKHRDVY
RRHIRSVPPW HYYAAVLALL GACVALALGL HAAAAACAAA WAGITAAFCW RRLRGTAHTP
SHVAEMIVTS IAIPPVSLYW RLRGALHFRV LFL