Gene BURPS668_A3025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3025 
Symbol 
ID4886866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2872311 
End bp2874002 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content74% 
IMG OID640132961 
Productputative lipoprotein 
Protein accessionYP_001064016 
Protein GI126444305 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.330625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGGACGA CGAAATCTCC CGGCATCTCC GTGATTCGGC AGCGGCTCGC CGTTGCGCTG 
CCGCTTGCGT TGACGCTCAC CCTCGCGAGC TGCGGCGGTG ACGATCTGAC GCCCGCCGCG
CAGCGCTGGG CGATGCCCGG CACCGAACTG CCGCTCGGGC CGCAGGGCCT CGCGCAGAGC
GTGTCGACGC AGACGCTCGC CGCAGGCGTC GCCTATTACC AGATCAAGCG CGGCGCGGCG
AGCGCGGCCG ATTTCTGGAC CGTCAACCTC GGCTTCTACG CGACGCAGGC CGCGGCGCAG
GCCGATGCGG CGAATCTCGC GGCGGCCGGC TTCGCGACGC GCGTCGACGC GTCGGCGGGC
ACCGACCTGC AGGGCAAGGT GCTCGGCTAC TGGCTGTCGG CCGGCCGCTA CGCGACGCAG
GCCGAGGCGA CGGCGGCCGC CGCACGCATC GCGCAGGCCA CGCAGAACCG CTACAAGCCG
GGCACGCGGC ATACGTCGCT CGCCGGCGCG CCGACGACGG GGCCGTGGAT CGTCAACGTG
CTCGCGATCG ACCCGTCGCG CGCCGGCGCG GCGCTGTCGC TCGCGCTGCC GGGCGGCGAC
GATCTCGGTG CGGGCGGCGA GACGGTTTCG GCCGCGCGGG CGCGTGTGAA CGCGCTCGCC
GGCGTCAACG GCGGCTTTTT CACGAACATC AATCCGTTCG GCGCGCCGCT GCCGCCGCGC
TCGCCCGTCG GCGCGACGGT AGTCGACGGG CGGCTCGTCG CGGCAGCGAT CGGCAGGCGC
CCCGGCCTGC TGCTCGCGCG CGACGCGAAC GGCCGCCAAC GCGCGACGGT CGTGCGCAAT
CTCGCGACGG CGATCACGCT GACCGACGCG CAAGGCAGTG CGATCGCGGT CCAGACGCTG
AACCGGCCGA TCCTCGGCAC GGTCGTCAAT TGCGGCGCGC AGGCGCGCAC GCCGACGAGC
GAGCCGGCGC AGGACACGGT GTGCACGAAC GACGATGACC TCGTGATGTA CGACTCGCTA
TATCTGCGCG GCGGTGCGTC GAACACGCTT GTCGACGCCG GCTACCAGGG CGCGCGATAC
GAACTCGTGG TCGACGCGAA CGGCGCCGTC GTCGCCGGCC ATGCGACGCT CGGCGCGCCG
CCGCCGCCGA ACGGCTACGT GCTGCAGGGG CTCGGCGCGA GCGCCGCGTG GCTGCAGGCG
CATGCGACGC CGGGCACGCG CCTCGCGGTA TCGCGCCGGC TGTCGGCCGA CGGCGCGGAT
CTCGCGCTCG CGTCGGGCAC GTCGCTCGTC GAGGCGGGGC CGACGCTGTC CGTGCCGAAT
CTCGCGCAAA GCGCCGCGCA AGAGGGCTTC GCGCCGACGG TGGGCGGCGT CGACGCGGGC
GAAGGCGCCG CGGCGAACGG CAACTGGTAC AACGGCTGGT ATGTCGCGCG CAATGGGCGC
ACCGCGGCGG GCGTCGCGGC GGACGGCACG ATCCTGCTCG TCGAGATCGA CGGCCGGCAG
CCCGCGTTGA GCGTCGGCAC GAGCATTCCG GAGACGGCGG CGGTGATGGC ATGGCTCGGT
GCGACGTCGG CCGTCAATCT CGACGGCGGC GGCTCGAGCA ACATGGTGGT CGGCGGCAAG
ATGGTCGGAC ATCCGTCCGA CGCCGTGGGC GAGCGGGGCG TCGGCGATAC GCTGATGCTG
CTGCCGGGCT GA
 
Protein sequence
MRTTKSPGIS VIRQRLAVAL PLALTLTLAS CGGDDLTPAA QRWAMPGTEL PLGPQGLAQS 
VSTQTLAAGV AYYQIKRGAA SAADFWTVNL GFYATQAAAQ ADAANLAAAG FATRVDASAG
TDLQGKVLGY WLSAGRYATQ AEATAAAARI AQATQNRYKP GTRHTSLAGA PTTGPWIVNV
LAIDPSRAGA ALSLALPGGD DLGAGGETVS AARARVNALA GVNGGFFTNI NPFGAPLPPR
SPVGATVVDG RLVAAAIGRR PGLLLARDAN GRQRATVVRN LATAITLTDA QGSAIAVQTL
NRPILGTVVN CGAQARTPTS EPAQDTVCTN DDDLVMYDSL YLRGGASNTL VDAGYQGARY
ELVVDANGAV VAGHATLGAP PPPNGYVLQG LGASAAWLQA HATPGTRLAV SRRLSADGAD
LALASGTSLV EAGPTLSVPN LAQSAAQEGF APTVGGVDAG EGAAANGNWY NGWYVARNGR
TAAGVAADGT ILLVEIDGRQ PALSVGTSIP ETAAVMAWLG ATSAVNLDGG GSSNMVVGGK
MVGHPSDAVG ERGVGDTLML LPG