Gene BURPS668_A2497 details

Gene Information

Locus tag: BURPS668_A2497
Symbol:
ID: 4888155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2411021
End bp: 2412709
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 65%
IMG OID: 640132433
Product: sedolisin
Protein accession: YP_001063490
Protein GI: 126445423
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
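
The coordinates and lengths listed in this record are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2411021, 2412709)
print(length, protein_length(length))  # 1689 562
```

Under those assumptions the result matches the 1689 bp gene length and 562 aa protein length given in the record.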


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.458922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTGGCCGC TCGCGCTCGC CGCCGGCATC GCACATGGCG CGACGGATTG GGTCGATACG 
CATACCAAAG CTTTCCTGAA TCACGCGCAG ATCGAGACGC TCGCCCGCGG CGCGAACGCC
GCATCGCTCG AGGTCGCGTC GGGCGAAGCC ACGCACGTCG TGGTCAGCCT GAAGCTGCGC
AACGCCGAGC AATTGAAAGC CGTCGCGCGC AACGTCAACG ATCCGCATAG TTCGCAGTAT
CGGCAGTACA TCACGAGCGC GCAGTTCCTC GCGAACTATG CGCCGACCGA AGCGCAGGTG
AAACAGGTCG TCGCCTATCT GCGCAAGAAC GGCTTCGTCG ACATCCACGT CGCGCCGAAT
CGCATGCTCG TCTCCGCGCG CGGCACCGCC GGCACGGTCA AGCAGGCGTT CAACACGTCG
CTCGTGCATT TCGAGTACGC GGGCCGCGCG GGCTTCGCGA ACGCGTCGAC GGCGCAAGTG
CCGCGCGCGC TCGGCGACAT CGTCGGCTCC GTGCTCGGCC TGCAGAACGT CGCGCGCGCC
CGGCCGCTCA CGAAGATCGG CGCGATCGCG AAACCGCTCG CGCTCGCGTC CGGCACGGCG
ACGGGCCACT ATCCATCCGA GTTTCCGGCG CTCTACAACG CAACGGGCGT GCCCACCGCG
GCGAACGCGA CGGTCGGCAT CATCACGATC GGCGGCGTGT CGCAAGCGCT GTCGGATCTG
CAGCAGTTCA CGAGCGCGAA CAGCTATCCG GACGTGTCGA CACAGACCAT CCAGACCAAC
GGTTCCGGCG GCAACTACAG CGACGACCAG GAAGGCCAAG GCGAATGGGA TCTGGACAGC
CAGTCGATCG TCGGCGCCGC GGGCGGCCAG CTCGGGCAAC TGATCTTCTA CATGGCCGAT
CTCGACGCGT CGGGCAACAC CGGCCTCACG CAGGCATTCA ACCAGGCGGT GTCGGACAAC
GCGGCGAAAG TGATCAACGT CTCGCTCGGC TGGTGCGAAA CCGATGCGAA CGCGGACGGC
ACGCTTTCCG CCGAAGAACA GATCTTCACG CAGGCGGTCG CGCAAGGTCA GACGTTCGCG
GTGTCCTCAG GCGACGAAGG CGTCTACGAG TGCAACAACC GCGGCTATCC CGATGGTTCG
AACTACACGG TATCGTGGCC GGCGTCGTCG CCGCACGTGC TCGCGATCGG CGGCACGACG
CTCTACACGA CTTCGTCGGG CGCCTTCTCG AACGAAACGG TATGGAACGA AGGGCTCGAC
GGCAACGGCA AGCTGTGGGC GACGGGCGGC GGCGTCAGCA CGATCCTGCC GAACCCGTCA
TGGCAGTCGG GCAGCCATCG CAAGCTGCCG GACATATCGT TCGACGCCGC GCAAAGCACG
GGCGCGTATA TCTACAATTA CGGCCAGTTG CAGCAGATCG GCGGCACGAG CCTGTCGGCG
CCGATTTTCA CGGGCTTCTG GGCGCGGCTC CTGTCGGCGA ACGGCACGGG TCTCGGCTTC
CCGGCCGCGC GCTTCTACCA CTCGATTCCG ACCCACGCGT CACTCGTGCG CTACGACGTC
ACGTCGGGCA ACAACGGCTA TTCGGGATAC GGCTACAAGG CATCGACCGG CTGGGACTAC
CCGACCGGCT GGGGCAGCAT CAACATCTCG AACCTGAATC AGTTGATCCA GTCGGGCGGC
TTCAATTGA
 
Protein sequence
MWPLALAAGI AHGATDWVDT HTKAFLNHAQ IETLARGANA ASLEVASGEA THVVVSLKLR 
NAEQLKAVAR NVNDPHSSQY RQYITSAQFL ANYAPTEAQV KQVVAYLRKN GFVDIHVAPN
RMLVSARGTA GTVKQAFNTS LVHFEYAGRA GFANASTAQV PRALGDIVGS VLGLQNVARA
RPLTKIGAIA KPLALASGTA TGHYPSEFPA LYNATGVPTA ANATVGIITI GGVSQALSDL
QQFTSANSYP DVSTQTIQTN GSGGNYSDDQ EGQGEWDLDS QSIVGAAGGQ LGQLIFYMAD
LDASGNTGLT QAFNQAVSDN AAKVINVSLG WCETDANADG TLSAEEQIFT QAVAQGQTFA
VSSGDEGVYE CNNRGYPDGS NYTVSWPASS PHVLAIGGTT LYTTSSGAFS NETVWNEGLD
GNGKLWATGG GVSTILPNPS WQSGSHRKLP DISFDAAQST GAYIYNYGQL QQIGGTSLSA
PIFTGFWARL LSANGTGLGF PAARFYHSIP THASLVRYDV TSGNNGYSGY GYKASTGWDY
PTGWGSINIS NLNQLIQSGG FN
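
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch, not the method used to generate the record: it assumes translation table 11 (as listed in the record), in which codon-to-residue assignments match the standard code and several alternative start codons, including GTG, are read as methionine at the first position. The helper names `translate_cds` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon order follows the usual TCAG layout, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons of translation table 11; at position 1 they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spacing used in the sequence listing and upper-case."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    seq = clean(dna)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = ("GTGTGGCCGC TCGCGCTCGC CGCCGGCATC "
              "GCACATGGCG CGACGGATTG GGTCGATACG")
print(translate_cds(first_line))  # MWPLALAAGIAHGATDWVDT
```

The first 60 bases of the gene sequence translate to the first 20 residues of the protein sequence, with the initial GTG read as M. Passing the full gene sequence to `gc_content` gives the fraction that the record reports, rounded, as GC content.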