Gene BURPS668_A3168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3168 
Symbol 
ID4888268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2995358 
End bp2996359 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content74% 
IMG OID640133104 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_001064159 
Protein GI126444830 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAC CGCACTGGAG GCCGCCGCCC GTCGTGTCGA TCGTCGTGCC GACGTACCGG 
CGGCCGGAGC TGCTCGAACG CTGCCTCGGC GCGCTCGCGT CGCAGGTGTT CGATCCGGGC
ACCTACGAGA TCGTCGTCGT CGACGACGAT GCGGCCGGCA GCGCGCGCCC CGTCGTCGAT
GCGCTGACCG TGCGCATGGG CGGGCTGCCC GCGATCCGTT ACGTGAGCGC GCAGCGCACG
CAGGGCCCGG CCGGCGCGCG CAACGCGGGC TGGCGCGAAG CGGCGGGCCC GGTGATCGCG
TTCACCGACG ACGACACGAT CGCCGATCCG CTATGGCTGC GCAACGGCTG CTCGGCGCTG
CTCGCGCAGC CCAACGCGTC GGCCGCGGCC GGGCGCATCG AGGTGCCGCT CGCGCCGTGC
CCGACCGATT ACGAGCGCGA CGCGGGCGGG CTCGCCCACG CGGAGTTCGC GACCGCGAAC
TGTTTCGTGC GGCGCGCGGC GCTCGAGCGC GTCGGCGGCT TCGACGAGCG CTTCACGCGC
GCGTGGCGCG AGGACGCGGA CCTGATGTTC GCGCTGCGCG AGCGCGCGGG GCCGATCGTC
GACGCGCGCA CGGCGACGAT CGTGCATCCG GTGCGGCCCG CGCGCTGGGG CGTGAGCATC
GCGCAGCAGT CGAAAGTGTT TTTCGACGCG CTGCTGTACA AGAAGCATCG CGACGTCTAC
CGTCGGCACA TCCGCTCCGT GCCGCCGTGG CATTACTACG CGGCGGTGCT CGCGCTGCTC
GGCGCGTGCG TCGCGCTCGC GCTCGGCCTG CACGCGGCCG CGGCCGCGTG CGCGGCGGCC
TGGGCCGGCA TCACGGCGGC GTTCTGCTGG CGGCGCCTGC GCGGCACCGC GCACACGCCG
TCGCACGTCG CGGAGATGAT CGTCACGTCG ATCGCGATTC CGCCCGTGTC GCTGTACTGG
CGGCTGCGCG GCGCGCTCCA CTTCCGGGTG CTGTTCCTAT GA
 
Protein sequence
MNAPHWRPPP VVSIVVPTYR RPELLERCLG ALASQVFDPG TYEIVVVDDD AAGSARPVVD 
ALTVRMGGLP AIRYVSAQRT QGPAGARNAG WREAAGPVIA FTDDDTIADP LWLRNGCSAL
LAQPNASAAA GRIEVPLAPC PTDYERDAGG LAHAEFATAN CFVRRAALER VGGFDERFTR
AWREDADLMF ALRERAGPIV DARTATIVHP VRPARWGVSI AQQSKVFFDA LLYKKHRDVY
RRHIRSVPPW HYYAAVLALL GACVALALGL HAAAAACAAA WAGITAAFCW RRLRGTAHTP
SHVAEMIVTS IAIPPVSLYW RLRGALHFRV LFL