Gene BURPS1106A_A2358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2358 
Symbol 
ID4904687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2333815 
End bp2335503 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content66% 
IMG OID640145463 
Productsedolisin 
Protein accessionYP_001076391 
Protein GI126456903 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGGCCGC TCGCGCTCGC CGCCGGCATC GCACATGGCG CGACGGATTG GGTCGATACG 
CATACCAAAG CTTTCCTGAA TCACGCGCAG ATCGAGACGC TCGCCCGCGG CGCGAACGCC
GCATCGCTCG AGGTCGCGTC GGGCGAAGCC ACGCACGTCG TGGTCAGCCT GAAGCTGCGC
AACGCCGAGC AATTGAAAGC CGTCGCGCGC AACGTCAACG ATCCGCATAG CTCGCAGTAT
CGGCAGTACA TCACGAGCGC GCAGTTCCTC GCGAACTATG CGCCGACCGA AGCGCAGGTG
AAACAGGTCG TTGCCTATTT GCGCAAGAAC GGCTTCGTCG ACATCCACGT CGCGCCGAAT
CGCATGCTCG TCTCCGCGCG CGGCACCGCC GGCACGGTCA AGCAGGCGTT CAACACGTCG
CTCGTGCATT TCGAGTACGC GGGCCGCGCG GGCTTCGCGA ACGCGTCGAC GGCGCAAGTG
CCGCGCGCGC TCGGCGACAT CGTCGGCTCC GTGCTCGGCC TGCAGAACGT CGCGCGCGCC
CGGCCGCTCA CGAAGATCGG CGCGATCGCG AAACCGCTCG CGCTCGCGTC CGGCACGGCG
ACGGGCCACT ATCCATCCGA GTTTCCGGCG CTCTACAACG CAACGGGCGT GCCCACCGCG
GCGAACGCGA CGGTCGGCAT CATCACGATC GGCGGCGTGT CGCAAGCGCT GTCGGATCTG
CAGCAGTTCA CGAGCGCGAA CAGCTATCCG GACGTGTCGA CGCAGACCAT CCAGACCAAC
GGTTCCGGCG GCAACTACAG CGACGATCAG GAAGGCCAGG GCGAATGGGA TCTGGACAGC
CAGTCGATCG TCGGCGCCGC GGGCGGCCAG CTCGGGCAAC TGATCTTCTA CATGGCCGAT
CTCGACGCGT CGGGCAACAC CGGCCTCACG CAGGCATTCA ACCAGGCGGT GTCGGACAAC
GCGGCGAAAG TGATCAACGT CTCGCTCGGC TGGTGCGAAA CCGATGCGAA CGCGGACGGC
ACGCTTTCCG CCGAAGAGCA GATCTTCACG CAGGCGGTCG CGCAAGGTCA GACGTTCGCG
GTGTCCTCAG GCGACGAAGG CGTCTACGAG TGCAACAACC GCGGCTATCC CGATGGTTCG
AACTACACGG TATCGTGGCC GGCGTCGTCG CCGCACGTGC TCGCGATCGG CGGCACGACG
CTCTACACGA CTTCGTCGGG CGCATTCTCG AACGAAACGG TATGGAACGA AGGGCTCGAC
GGCAACGGCA AGCTGTGGGC GACGGGCGGC GGCGTCAGCA CGATCCTGCC GAACCCGTCA
TGGCAGTCGG GCAGCCATCG CAAGCTGCCG GACATATCGT TCGACGCCGC GCAAAGCACG
GGCGCGTATA TCTACAATTA CGGCCAGTTG CAGCAGATCG GCGGCACGAG CCTGTCGGCG
CCGATTTTCA CGGGCTTCTG GGCGCGGCTC CTGTCGGCGA ACGGCACGGG TCTCGGCTTC
CCGGCCGCGC GCTTCTACCA CTCGATTCCG ACCCACGCGT CGCTCGTGCG CTACGACGTC
ACGTCGGGCA ACAACGGCTA TTCGGGATAC GGCTACAAGG CATCGACCGG CTGGGACTAC
CCGACCGGCT GGGGCAGCAT CAACATCTCG AACCTGAATC AGTTGATCCA GTCGGGCGGC
TTCAATTGA
 
Protein sequence
MWPLALAAGI AHGATDWVDT HTKAFLNHAQ IETLARGANA ASLEVASGEA THVVVSLKLR 
NAEQLKAVAR NVNDPHSSQY RQYITSAQFL ANYAPTEAQV KQVVAYLRKN GFVDIHVAPN
RMLVSARGTA GTVKQAFNTS LVHFEYAGRA GFANASTAQV PRALGDIVGS VLGLQNVARA
RPLTKIGAIA KPLALASGTA TGHYPSEFPA LYNATGVPTA ANATVGIITI GGVSQALSDL
QQFTSANSYP DVSTQTIQTN GSGGNYSDDQ EGQGEWDLDS QSIVGAAGGQ LGQLIFYMAD
LDASGNTGLT QAFNQAVSDN AAKVINVSLG WCETDANADG TLSAEEQIFT QAVAQGQTFA
VSSGDEGVYE CNNRGYPDGS NYTVSWPASS PHVLAIGGTT LYTTSSGAFS NETVWNEGLD
GNGKLWATGG GVSTILPNPS WQSGSHRKLP DISFDAAQST GAYIYNYGQL QQIGGTSLSA
PIFTGFWARL LSANGTGLGF PAARFYHSIP THASLVRYDV TSGNNGYSGY GYKASTGWDY
PTGWGSINIS NLNQLIQSGG FN