Gene BURPS668_A2843 details

Gene Information

Locus tag: BURPS668_A2843
Symbol:
ID: 4886270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2703800
End bp: 2704972
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 67%
IMG OID: 640132778
Product: Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
Protein accession: YP_001063834
Protein GI: 126444638
COG category:
COG ID:
TIGRFAM ID:
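
The coordinate and length fields above are related by simple arithmetic, and a short check can confirm that they agree. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 2703800
END_BP = 2704972


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1173
print(protein_length(length_bp))  # 390
```

Both values match the Gene Length and Protein Length fields reported in the record.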


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.638259
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTGA CACAATCCAT TAGTATTCCT ATCCATTACC CGGCCGCGAC GGCCGCATTG 
CTCTTGCTGC TGCTCACCGG TTGCGGCGGC GGCGGCGACC AGAGCAAGGT CAACGCCGCC
GCCTCGCCCG CGAACAACCT CGTCGTGCCG GCGCCCGGCA CGGCGTCGCC CGGCACGCCC
GCGCCCGCGC CCGCGCCCGG CGCGCCGGCG CCCGCCGAGA CGGCCTCGGT GCTGCCGTTC
TTCGGCGTGA ACGGCCATTA CGTCGACGGC GGCGTCTACG CGTCGGTCCC GCTCGCCACG
CAGGCAAGCC ACCTCGCCGG CCTCGGCATG AACGTCTACC GGCAGGACGT GTACATTCCG
GATCACGTCG ACACGCTCGC GTCGACGGTC ATTCCCGGCC TCGGTTCCGG CATCACGGTC
CTGCCGATGA TCCAGGCGCA TCCATGGGCC GATCCGTCGC TGAACGGCCA ACCGCCGACC
GAAGCCAGCG CGTATGCGTA CGCCTACAAG CTGGCCGCCT ACGCGGCGAA GAAGCTCGCC
GGCATTCCGA TGGTGGAGTT CGGCAACGAG TACGACATCG ATAGCCACAA CGCGCCGATC
CAGGGCGACG GCATCAATGT TTCGGACTAC GACAATTCCA CGTTCCCCGT CTGGCGCGGC
GCGCTCCGAG GCTCGCTCGA CGGCTGGCGC TCGGTCGACA CGAACCGCAC GACGAAGCTG
ATCGCGAACG CAACGTCGGG GGCGCTGCAT TTCGGCTTCC TCGACGGCCT GATGACGGGC
ACGCAGCCCG ACGGCACGAC CGGGCATCCG AAGATCACGC CCGACGTGAT CCAGTGGCAC
TGGTATTCGA ACGGCGGCGA TTTCGAGAAC GCGCTCGGCA AGACCGGCCG ATACAACGTG
CTTGCGCGGC TGAAGGACCG CTACAACCTG CCGATCGTCG TCACCGAGAT CGGCGTGAAC
ACGGACAACT CCGACACGCA GATCGCCGCG TACATCGCAA AGACGATCCC CGAGCTGGTG
GCGGCGAAAG CCGCGTACAA CGTCATCGGC TTCAACTGGT ATGAGCTTTA CGACGACCGC
AGCGGCGCTT ACGGCTTGCT GACGAACAGC GCACAGGAAA AGCCCCGTTA CGGACTCATG
CGCGCGGCGA TCGCCGGCGC CGTGCCGAAC TGA
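
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before it is used. The following is a minimal Python sketch of that cleanup together with a GC-content calculation that can be compared with the 67% reported in the record. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAGCGTGA CACAATCCAT TAGTATTCCT ATCCATTACC CGGCCGCGAC GGCCGCATTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives the value for the whole gene.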
 
Protein sequence
MSVTQSISIP IHYPAATAAL LLLLLTGCGG GGDQSKVNAA ASPANNLVVP APGTASPGTP 
APAPAPGAPA PAETASVLPF FGVNGHYVDG GVYASVPLAT QASHLAGLGM NVYRQDVYIP
DHVDTLASTV IPGLGSGITV LPMIQAHPWA DPSLNGQPPT EASAYAYAYK LAAYAAKKLA
GIPMVEFGNE YDIDSHNAPI QGDGINVSDY DNSTFPVWRG ALRGSLDGWR SVDTNRTTKL
IANATSGALH FGFLDGLMTG TQPDGTTGHP KITPDVIQWH WYSNGGDFEN ALGKTGRYNV
LARLKDRYNL PIVVTEIGVN TDNSDTQIAA YIAKTIPELV AAKAAYNVIG FNWYELYDDR
SGAYGLLTNS AQEKPRYGLM RAAIAGAVPN
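
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch under two assumptions: the amino acid assignments of translation table 11 are taken to be the same as those of the standard genetic code (the tables differ in their permitted start codons), and the gene sequence is read from its first base in frame. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to hold for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a blocked nucleotide listing, stopping at the first stop codon."""
    seq = "".join(blocked.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCGTGA CACAATCCAT TAGTATTCCT"))  # MSVTQSISIP
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows the whole listing to be compared.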