Gene BURPS668_A3226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3226 
Symbol 
ID4886880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp3052277 
End bp3053590 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content70% 
IMG OID640133162 
Productpolysaccharide deacetylase family protein 
Protein accessionYP_001064217 
Protein GI126444723 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.107526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGGC TGGGTTGCGT TGTTACCCGC TCACGAAAAA AGCCGTACGC TCGCGCGCCG 
CGCTCTTGTG CGGCGCGCGA GCGTGACGAG CGTGACGAGC GTGACGAGCG TGACGAGCGT
GACGAGCGTG ACGAGCGTGA CGAGCGTGAC GAGCGTGACG AGCGTGACGA GCGTGACGTG
CGGGACGTGC GTGACGTGCG TGACGTGCGT GACGTGCGGG ACGTGCGGGA CGTGCGGGAC
ATGCGCCATG AGCGTGACAG GCGCGGCGCG CATGACGCGT GCGGCGCGGC GGCGCGACGC
CCGGCAACTG CCATGCGTTG CGCCGCGAAG CGCACGTCGC GCCGAGCATG GGAAAACACG
CACCGTCCGC GCCGTCGCAA CCGGTCGGCC CGAACACGTT TCGAACAACC CAAGGAACGG
CCCTTGCGAC ATCCAGGCTC CAGGCTCACC CGGTTCCGGC TCGCGGCGGC GGCCTGCTGT
CTCGCATTGG CAAGCGCCGC GCGCCCCGCG TTCGCCGGCG CGAACGCGCC CGCCGCCGCC
GAGCCGCCCA CCGATGCGCA GCGCCCCGCG ATCCTCGTCT ATCACCGCTT CTCGTCGTCC
GCGCCGCCCG ATTCGATGAC CGTGCGCGTC AGCACGTTCG GCGCGCAGCT CGCGTTCCTG
CGCGCGCACG GTTACACGTT CGTACCGCTG CGCGACGTCG TCGCGTGGGC GGCTTCGCCG
TCCGCGCCGC TGCCGGACAA GGCGATCGCG ATCACCGTCG ACGACGGCCA CCGCTCGGTG
TACGAACTGC TGCGCCCGAT CGTGCTGCGC GAGCGGCTGC CCGTCACGCT GTTCATCTAT
CCGTCCGCGA TCTCGAACGC GTCGTATGCG ATGACATGGG ACGAACTGCG CGCGTTGCGC
GCCACCGGCC GCTTCGATAT CGAATCGCAC ACGTGGTGGC ATCCGAACTT CCGCACCGAG
CGGCGGCGCC TCGCACCCGA TGCGTTCCGC CGCTTCGCCT CGACCCAGTT CACCCATTCG
CGCGCGCGGC TCGAGCGCGA AATCGGCGGG CCGGTCGATC TGCTCGCATG GCCGTTCGGC
CTCTACGACG ATGAGCTGAC GTCGCTCGCC GCGCAGGCGG GCTACGTCGC GGGCTTCACG
CTCGACGCGC GCAAGGTTCG TCGCGGCGAC GCGCCGCTCG CGCTGCCGCG CTTGCTGATC
GTCGACAGTT GCACGCCGGC CGTGCTGGCG CGGATGCTCG GCGAACGCGG CGACGCGCGT
GCCGATTCGC ACGCCGATTT ACACGTCGAT TCACACGCGG AGCAATCGCG ATGA
 
Protein sequence
MIRLGCVVTR SRKKPYARAP RSCAARERDE RDERDERDER DERDERDERD ERDERDERDV 
RDVRDVRDVR DVRDVRDVRD MRHERDRRGA HDACGAAARR PATAMRCAAK RTSRRAWENT
HRPRRRNRSA RTRFEQPKER PLRHPGSRLT RFRLAAAACC LALASAARPA FAGANAPAAA
EPPTDAQRPA ILVYHRFSSS APPDSMTVRV STFGAQLAFL RAHGYTFVPL RDVVAWAASP
SAPLPDKAIA ITVDDGHRSV YELLRPIVLR ERLPVTLFIY PSAISNASYA MTWDELRALR
ATGRFDIESH TWWHPNFRTE RRRLAPDAFR RFASTQFTHS RARLEREIGG PVDLLAWPFG
LYDDELTSLA AQAGYVAGFT LDARKVRRGD APLALPRLLI VDSCTPAVLA RMLGERGDAR
ADSHADLHVD SHAEQSR