Gene BURPS668_A2231 details


Gene Information

Locus tag: BURPS668_A2231
Symbol:
ID: 4888532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2163852
End bp: 2165372
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 77%
IMG OID: 640132168
Product: endo-1,4-D-glucanase
Protein accession: YP_001063225
Protein GI: 126443038
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3405] Endoglucanase Y
TIGRFAM ID:
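
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2163852, 2165372)
print(length_bp)                  # 1521
print(protein_length(length_bp))  # 506
```

Both values match the Gene Length (1521 bp) and Protein Length (506 aa) fields of this record.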


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.161113
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGGC AAGCGGGGCG AGACGAAGGC GTCGGGCGCA TCGGGCGTGA AGCGGGCGAC 
GCGCGCGAGG CGGCCGGGGC GGGCGAGGCG TGCGGGAGGC GCCACGCGCG GGCCGACGCG
TGGGCGCGGC GGGCGTCGCG AGCATCGCGC GGGTGGCGCA CGTTTTGGCG CGGGCGCGGG
GGCGTGCGCG CCGTCGTGGC GAGCGTCGTG GCGAGCTTCT CGGTGATGGC GTTCGCGGCG
GCGACGTTGC CGGTGTCGTG GCGCGTCGCC GCGGCGGCGG AGCGTACGCG AAGCGGCGGC
GAACGCGCGG GCGGGCTGCG CGACACGGCG GGCTTGATCG AGATCTCGGC GGCGGCGCCG
GCCTCGACGC CGATCCCGGC GGCGCCGCGG CGGTTCGCGC AGCCGTTCGC GCAGCCCGCC
CGCGCGTTCG CCGCCGCGAG CGCCTGCGCG CCGTCCTGGC CGCGCTGGGA CCGTTTCAAG
CGTGACTTCG TATCGGCCGA CGGCCGCGTG ATCGACGTCG GCTCGGCCGA CGAGCGGACC
GTATCCGAGG GGCAGGCGTA CGGCCTTTTC TTCGCGCTCG TCGCGAACGA CCGCGCGGCG
TTCGACGCGC TGCTGCGCTG GACCGAGGAC AATCTCGCGC AGGGCGATCT GAGCGCGCGT
CTGCCCGCGT GGCTGTGGGG CCGCGCGGCC GACGGCGCGT GGCGCGTGCT CGATGCGAAC
GCCGCGTCCG ACGCCGATCT GTGGCTTGCG TACGCGCTGC TCGAAGCGGG GCGCTTGTGG
CGCGAGCGCA GCTACACGGC GCGCGGCGCG TTGCTCGCGA AGCGCGTGCT CGACGAGGAG
ACCGCGACGC TGCCGGGGCT CGGTCTCGTG CTGCTGCCGG GCCCGACGGG TTTTCGGCCG
GCGCGCGACG CGTGGCGGCT GAATCCGAGC TATTCGCCGC CGCAGGCGAT TCGCGGAATC
GGCGCGCATG TGCCCGACGA CGCGCGCTGG GCGCGGCTCG CGGCGGGCGT CGGCCGCGTG
CTGACCGACA GCGCGCCGCG CGGCTTCGCG CCGGACTGGG CGCTGTATCG CGCGGGCCGC
GGCTTCGAGC CGGACGCCGA AACGCATGCG GCGAGCGCGT ACAACGCGAT TCGCGTCTAT
CTGTGGGCGG GCATGCTCGA TGCGGGCGAT CCGCTGGCGC GGCCGCTCGT CGCGCATTTC
GCGCCGTTCG CCGAGCATGT CGCCGCGCAT GGCGCGCCGC CGGAGGCGGT CGATGCGACG
ACGGGCACGG CCGCCCCGCG CGACGGCAAT GCCGGGTTTT CCGCGGCGGC CGTGCCGTTT
CTCGAGGCGC GCGGCGAGCG GGCGAGCGCC GACGCGCAGC TCGCGCGCGT CGCGCGGCTC
GAGCGCGAGA CGGCGAGCGG CTATTACGCG AACGTGCTGA CGCTGTTCGG GCTCGGCTGG
CGCGACGGGC GCTACCGGTT CGCGGCCGAC GGCACGCTGC GGGTGCGATG GAGCGAGCCG
TGCTCGACGC CCGCGCGTTG A
 
Protein sequence
MARQAGRDEG VGRIGREAGD AREAAGAGEA CGRRHARADA WARRASRASR GWRTFWRGRG 
GVRAVVASVV ASFSVMAFAA ATLPVSWRVA AAAERTRSGG ERAGGLRDTA GLIEISAAAP
ASTPIPAAPR RFAQPFAQPA RAFAAASACA PSWPRWDRFK RDFVSADGRV IDVGSADERT
VSEGQAYGLF FALVANDRAA FDALLRWTED NLAQGDLSAR LPAWLWGRAA DGAWRVLDAN
AASDADLWLA YALLEAGRLW RERSYTARGA LLAKRVLDEE TATLPGLGLV LLPGPTGFRP
ARDAWRLNPS YSPPQAIRGI GAHVPDDARW ARLAAGVGRV LTDSAPRGFA PDWALYRAGR
GFEPDAETHA ASAYNAIRVY LWAGMLDAGD PLARPLVAHF APFAEHVAAH GAPPEAVDAT
TGTAAPRDGN AGFSAAAVPF LEARGERASA DAQLARVARL ERETASGYYA NVLTLFGLGW
RDGRYRFAAD GTLRVRWSEP CSTPAR
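
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotides, and compute a GC fraction. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which start codons it permits. All function names are hypothetical, and only the first 60 bases of the gene are used as the worked input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# assumed to apply to translation table 11 as well).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGGCGCGGC AAGCGGGGCG AGACGAAGGC GTCGGGCGCA TCGGGCGTGA AGCGGGCGAC"
dna = clean(first_line)
print(translate(dna))  # MARQAGRDEGVGRIGREAGD
```

The printed residues match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.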