Gene BURPS1710b_A1080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1080 
Symbol 
ID3692685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1347279 
End bp1349579 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content67% 
IMG OID637731334 
Productsedolisin-B 
Protein accessionYP_336238 
Protein GI76819123 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.863455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGGAAA TCGTGTTGTC CGCCGTGTTT GTCCGCCGCG TTGTCCGCAA TCGCAACGAC 
ACGCGGCCCG ATGCAACGCC GTGCGGGTCG CGGAACGGTT TGCGGCGCGT CGAACGCGGT
CTTTTCGCGC GCCATATGAA TGGGATTCCT AAGCATTTGA CGCGCGGCGG CCGTATGTGC
CGGGTCCGCT CGGCGCGCAC GCCGGGCGAA ACGGCACCGC GCCGCCGCCG TCCATCGAAG
GGAGAAAATC ATGTCACCGT GAACGCCCGC CTCGCCGCAT CGCCATGCAT CGGCTGCACG
CCGCGCGTCC CTATCCGCCG GAGCGCGTCG TCGCATCTTT CCTCATCGTT TTCCTGGGGC
TTCAAAATGC AGAGAACTCA GCATGTCAAC CGGCTATTCT CGAAACGCTT CGCGCTGTCG
CCGCTACCGC TGGCGATCGC CGTCGCGCTG TCCCCGCTTC TCGCGCACGC GGCGCCCGAC
TGGGTGCCCA CCCGCACCGA CGCGTTCCTG ATCGCGCGCG CGCCCGCCGC GGCCGCCTCG
CAGACGCTCG CGAAGACCGC GCCGAGCTAC GCGCTCAACA TGATCGGCAC GCCCGAGCTC
GCCGACACGA ATGTCACGCC GCTCGAGCTG AGCCAGCCGC TGCGCGTGAC CATCGTGCTG
AAAAGCCGCA ACGAAGCGCA GCTCGATTCG CTCGTGCGCG AAGTGAACCA GCCGGGCAGC
GCGAACTATC GCAAGTACCT CACGCCCGAG CAGTTCAAGG CGCGCTTCGC GCCGACCGAC
GCCCAGGTGC GGGCCGTCGT CGCGCATCTG AAGGCGAACG GCTTCGGCGA CATCACGGTG
TCGGCAAACA ACAAGCTGAT CTTCGCGCAA GGCAACGCGT CGAACGCGGA ACACGGCTTC
CACACGACGC TCAAGCGCTT CAGCTATCGC GGCAAGGCGG TCTATGCGAA CGACTCGGCC
GCGCTCGTTC CCGCATCGCT GAGCCCGGTC GTCGAATCGG TGCTCGGGCT GCAGAGCGCG
GCCGTTCCGC ATCGGCTCAT CCATCGCGGC ACGCCGGCCG ACGCGCGCAT CCAGACCGAC
GCGATCACGA AGAACGCGAC GGCGAGCGGC ACGCAAACCG GCCATCAGCC GACCGATTTC
GCGCAGATCT ACAATGCGAG CGGCCTGCCC GCCGCGACCA ACACGACGGT CGGGATCATC
ACGTGGGGCG ACATGACGCA GACGATCGCC GATCTGAACA CGTTCACGCG AAACGCGGGC
CTGCCGAACG TGAACACCGC GGTCGTCGCG GGCAGCTCGG GCACGCTCGC GGACGACGGC
GATCCGGGCG AATGGGATCT CGACAGCCAG ACGATCATCG GCACGTCGGG CGGCGTGAAG
CAATTGATCT TCTATGCGGC CGTCAACGGC GACAGCAACG ACAGCGGCCT CACCAACGCG
ACGCTGACCG CCGCATACAA CAAGGCCGTC ACCGACAACG TCGCGAAGGT GATCAACGTA
TCGCTCGGCG AGGACGAGGC GGCCGCGAAT TCGGACGGCT CGCTCGCCGC GAACGACGCG
GTGTTCAAGC AGGCGGTCGC GCAAGGGCAG ATCTTCTCGG TGTCGTCCGG CGACGCGGGC
GTCTATCAGT GGTCGACTTC GCCATACGGC GCGCCGGGCT ACGTCGGCAC CTACAGCGGC
GGCAAGGTGA CGACGAAGAT CAATCTGGCG AAGTACAGCG TGTCGTCGCC GGCGAGCTCG
CCGTACGTCG TCGCGGTCGG CGGCACGACG TTGTCGACGA GCGGCACGAC GACCTGGGCC
GGCGAGACCG TCTGGAACGA AGGGCTCGCC TATGCGGACG TGAGCAGCAG CGGCCAGCCG
CTCGACAACG CGGTGCGGCT ATGGGCGACG GGCGGCGGCG TGAGCGGCTA CGAAGCGGCG
CCGAGCTGGC AGACGGCCGC GCTCGGCAGC TCGGTGACGA AGCGCGTCGT GCCGGACGTC
GCGTTCGACG CCGCGCAATC GACGGGCGCG TATCTCGTGA TCAACGGCCA GCCGAACCAG
TTGGTCGGCG GCACGAGCCT CGCGTCGCCG ATCTTCGTCG GCGGCTGGGC GCGCGTGGAA
TCGGCGAACG GCAACGGTCT CGGCCTGCCG ACGTCGGCGT TCTACCAGGG CCTGCCGGGC
AATCCGTCGC TCGTCCACGA CGTGACGTCG GGTAACAACG GCTACAACAG CTACGGCTAC
AGCGCGAAAT CCGGATGGGA CGACGACACC GGCTTCGGCA GTCTCGACTT CGCGAAGGTC
AGCGCGAGCT CGGTCAAGTA G
 
Protein sequence
MKEIVLSAVF VRRVVRNRND TRPDATPCGS RNGLRRVERG LFARHMNGIP KHLTRGGRMC 
RVRSARTPGE TAPRRRRPSK GENHVTVNAR LAASPCIGCT PRVPIRRSAS SHLSSSFSWG
FKMQRTQHVN RLFSKRFALS PLPLAIAVAL SPLLAHAAPD WVPTRTDAFL IARAPAAAAS
QTLAKTAPSY ALNMIGTPEL ADTNVTPLEL SQPLRVTIVL KSRNEAQLDS LVREVNQPGS
ANYRKYLTPE QFKARFAPTD AQVRAVVAHL KANGFGDITV SANNKLIFAQ GNASNAEHGF
HTTLKRFSYR GKAVYANDSA ALVPASLSPV VESVLGLQSA AVPHRLIHRG TPADARIQTD
AITKNATASG TQTGHQPTDF AQIYNASGLP AATNTTVGII TWGDMTQTIA DLNTFTRNAG
LPNVNTAVVA GSSGTLADDG DPGEWDLDSQ TIIGTSGGVK QLIFYAAVNG DSNDSGLTNA
TLTAAYNKAV TDNVAKVINV SLGEDEAAAN SDGSLAANDA VFKQAVAQGQ IFSVSSGDAG
VYQWSTSPYG APGYVGTYSG GKVTTKINLA KYSVSSPASS PYVVAVGGTT LSTSGTTTWA
GETVWNEGLA YADVSSSGQP LDNAVRLWAT GGGVSGYEAA PSWQTAALGS SVTKRVVPDV
AFDAAQSTGA YLVINGQPNQ LVGGTSLASP IFVGGWARVE SANGNGLGLP TSAFYQGLPG
NPSLVHDVTS GNNGYNSYGY SAKSGWDDDT GFGSLDFAKV SASSVK