Gene BURPS1106A_A2949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2949 
Symbol 
ID4904079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2874909 
End bp2875970 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content67% 
IMG OID640146052 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_001076978 
Protein GI126457757 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGGATC AAATAGACAT ACCGCTCATT TCGCTCGTCG TGCCGTTCTA CAACGAAGGC 
GACGCGGTCA CGCGGTTCTT TGCGGAAGTG ATGCCGCTGA TGGAGGCGAT CGAATCGATC
CGCTTCGAGA TCGTCTGCGT GAACGACGGC AGCCGCGACG ACACGCTCGA GCAACTCGTC
GCGGTCGGCG CGCGCGAGCC GCGCGTGCGC GTGATCGATC TGACGCGCAA CTTCGGCAAG
GAAGCCGCGC TGACGGCGGG CCTCGACGAA GCGAACGGCG ACGCGGTGAT CCCGATCGAC
GCGGACCTGC AGGATCCGCC GAGCCTGATT CCCGTGATGA TCGACCATTG GCGCGACGGC
GCCGAGGTCG TGGCGGCGAA GCGCAGCAAC CGCGCGTGCG ACACGTTCGC GAAGCGCACC
GCCGCCGCGC TGTATTACCG CGTGCACAAT GCGCTGTCCG AAGTGAAGCT GCCGGTCAAC
GTCGGCGATT TCCGGCTGAT GGACCGGCAG GTCGTCAACG CGTTGCGCAG CCTGCCGGAG
CGCCGGCGCT TCATGAAGGG GCTGTTCGCG TGGGTGGGCT ACCGGACCGT GATCGTCGAG
TATCAGCGCG AGGCGCGCTG CGCGGGCCAC TCGAAATTCT CCGGCTGGAA GCTCTGGAAC
TTCGCGCTCG AAGGGATCAC GAGCTTCAGC ACGGTGCCGC TGCGCAGCTG GACCTACATC
GGGCTCGGCA TCGCGGCGCT CGCGTTCCTC TACGGCGGGT TCATCGTCGC GCGCACGCTG
TGGCTGGGCA ATCCGGTGCC GGGTTACGCG TCGCTGATTT CGGTGATGCT GTTCATCGGC
GGAATCGAGC TGGTCGGCAT CGGCGTCGTC GGCGAGTACA TCGGCCGCAT CTATTACGAA
TCGAAGGAGC GGCCGATCTA TCTCGTGCGC CGCCGCTATC AGGCGCGCAC GAAGGTGAGC
GCGCTGCCCG TGGGAGCCGC CGCGACGCGC GTCGCGCATG GCGCGCGGGC GGAGTTCGCC
CGGCGCCGCG CGATGCCGCG CGCGCGTGCC GACAGCCGTT GA
 
Protein sequence
MRDQIDIPLI SLVVPFYNEG DAVTRFFAEV MPLMEAIESI RFEIVCVNDG SRDDTLEQLV 
AVGAREPRVR VIDLTRNFGK EAALTAGLDE ANGDAVIPID ADLQDPPSLI PVMIDHWRDG
AEVVAAKRSN RACDTFAKRT AAALYYRVHN ALSEVKLPVN VGDFRLMDRQ VVNALRSLPE
RRRFMKGLFA WVGYRTVIVE YQREARCAGH SKFSGWKLWN FALEGITSFS TVPLRSWTYI
GLGIAALAFL YGGFIVARTL WLGNPVPGYA SLISVMLFIG GIELVGIGVV GEYIGRIYYE
SKERPIYLVR RRYQARTKVS ALPVGAAATR VAHGARAEFA RRRAMPRARA DSR