Gene BURPS1106A_A2909 details

Gene Information

Locus tag: BURPS1106A_A2909
Symbol:
ID: 4904761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2840751
End bp: 2842442
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 74%
IMG OID: 640146012
Product: putative lipoprotein
Protein accession: YP_001076938
Protein GI: 126457654
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase
TIGRFAM ID:
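
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, which is not translated; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2840751
end_bp = 2842442

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1692
print(protein_length)  # 563
```

Both results agree with the Gene Length and Protein Length fields.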


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.733186
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGGACGA CGAAATCTCC CGGCATCTCC GTGATTCGGC AGCGGCTCGC CGTTGCGCTG 
CCGCTTGCGT TGACGCTCAC CCTCGCGAGC TGCGGCGGTG ACGATCTGAC GCCCGCCGCG
CAGCGCTGGG CGATGCCCGG CACCGAACTG CCGCTCGGGC CGCAGGGCCT CGCGCAGAGC
GTGTCGACGC AGACGCTCGC CGCAGGCGTC GCCTATTACC AGATCAAGCG CGGCGCGGCG
AGCGCGGCCG ATTTCTGGAC CGTCAACCTC GGCTTCTACG CGACGCAGGC CGCGGCGCAG
GCCGATGCGG CGAATCTCGC GGCGGCCGGC TTCGCGACGC GCGTCGACGC GTCGGCGGGC
ACCGACCTGC AGGGCAAGGT GCTCGGCTAC TGGCTGTCGG CCGGCCGCTA CGCGACGCAG
GCCGAGGCGA CGGCGGCCGC CGCACGCATC GCGCAGGCCA CGCAGAACCG CTACAAGCCG
GGCACGCGGC ATACGTCGCT CGCCGGCGCG CCGACGACGG GGCCGTGGAT CGTCAACGTG
CTCGCGATCG ACCCGTCGCG CGCCGGCGCG GCGCTGTCGC TCGCGCTGCC GGGCGGCAAC
GATCTCGGTG CGGGCGGCGA GACGGTTTCG GCCGCGCGGG CGCGTGTGAA CGCGCTCGCC
GGCGTCAACG GCGGCTTTTT CACGAACATC AATCCGTTCG GCGCGCCGCT GCCGCCGCGC
TCACCCGTCG GCGCGACGGT AGTCGACGGG CGGCTCGTCG CGGCCGCGAT CGGCAGGCGC
CCCGGCCTGC TGCTCGCGCG CGACGCGAAC GGCCGCCAAC GCGCGACGGT CGTGCGCAAT
CTCGCGACGG CGATCACGCT GACCGACGCG CAAGGCAGTG CGATCGCGGT CCAGACGCTG
AACCGGCCGA TCCTCGGCAC GGTCGTCAAT TGCGGCGCGC AGGCGCGCAC GCCGACGAGC
GAGCCGGCGC AGGACACGGT GTGCACGAAC TACGATGACC TCGTGATGTA CGACTCGCTA
TATCTGCGCG GCGGTGCGTC GAACACGCTC GTCGACGCCG GCTACCAGGG CGCGCGATAC
GAACTCGTGG TCGACGCGAA CGGCGCCGTC GTCGCCGGCC ATGCGACGCT CGGCGCGCCG
CCGCCGCCGA ACGGCTACGT GCTGCAGGGG CTCGGCGCGA GCGCCGCGTG GCTGCAGGCG
CATGCGACGC CGGGCACGCG CCTCGCGGTA TCGCGCCGGC TGTCGGCCGA CGGCGCGGAT
CTCGCGCTCG CGTCGGGCAC GTCGCTCGTC GAGGCGGGGC CGACGCTGTC CGTGCCGAAT
CTCGCGCAAA GCGCCGCGCA AGAGGGCTTC GCGCCGACGG TGGGCGGCGT CGACGCGGGC
GAAGGCGCCG CGGCGAACGG CAACTGGTAC AACGGCTGGT ATGTCGCGCG CAATGGGCGC
ACCGCGGCGG GCGTCGCGGC GGACGGCACG ATCCTGCTCG TCGAGATCGA CGGCCGGCAG
CCCGCGTTGA GCGTCGGCAC GAGCATTCCG GAGACGGCGG CGGTGATGGC ATGGCTCGGT
GCGACGTCGG CCGTCAATCT CGACGGCGGC GGCTCGAGCA ACATGGTGGT CGGCGGCAAG
ATGGTCGGAC ATCCGTCCGA CGCGGTGGGC GAGCGGGGCG TCGGCGATAC GCTGATGCTG
CTGCCGGGCT GA
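
GC content is the percentage of bases in the sequence that are G or C. A minimal sketch of that calculation follows, assuming the sequence is pasted in the blocked layout used here (groups of ten bases separated by spaces and line breaks); `gc_content` is a hypothetical helper name, not a function of any particular library.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces, line breaks) is ignored, so a sequence can be
    pasted in the 10-base blocked layout used above.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative inputs (not taken from the record).
print(gc_content("GGCC"))        # 100.0
print(gc_content("ATGC ATGC"))   # 50.0
```

Passing the full gene sequence to the same function gives the value to compare with the GC content field.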
 
Protein sequence
MRTTKSPGIS VIRQRLAVAL PLALTLTLAS CGGDDLTPAA QRWAMPGTEL PLGPQGLAQS 
VSTQTLAAGV AYYQIKRGAA SAADFWTVNL GFYATQAAAQ ADAANLAAAG FATRVDASAG
TDLQGKVLGY WLSAGRYATQ AEATAAAARI AQATQNRYKP GTRHTSLAGA PTTGPWIVNV
LAIDPSRAGA ALSLALPGGN DLGAGGETVS AARARVNALA GVNGGFFTNI NPFGAPLPPR
SPVGATVVDG RLVAAAIGRR PGLLLARDAN GRQRATVVRN LATAITLTDA QGSAIAVQTL
NRPILGTVVN CGAQARTPTS EPAQDTVCTN YDDLVMYDSL YLRGGASNTL VDAGYQGARY
ELVVDANGAV VAGHATLGAP PPPNGYVLQG LGASAAWLQA HATPGTRLAV SRRLSADGAD
LALASGTSLV EAGPTLSVPN LAQSAAQEGF APTVGGVDAG EGAAANGNWY NGWYVARNGR
TAAGVAADGT ILLVEIDGRQ PALSVGTSIP ETAAVMAWLG ATSAVNLDGG GSSNMVVGGK
MVGHPSDAVG ERGVGDTLML LPG
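
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The gene starts with TTG, which encodes leucine internally but is read as methionine when used as a start codon. The sketch below is a minimal illustration of that reading, not the database's own pipeline; `translate_cds` is a hypothetical helper, and it assumes the input begins at a valid start codon and is in frame.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same codon-to-residue assignments as the standard
# code; it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is
    emitted as M regardless of its usual meaning (e.g. TTG or GTG).
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCGGACGA CGAAATCTCC CGGCATCTCC"))  # MRTTKSPGIS
```

The output matches the first ten residues of the protein sequence.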