Gene BURPS668_A2829 details


Gene Information

Locus tag: BURPS668_A2829
Symbol:
ID: 4886010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2690883
End bp: 2693183
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 67%
IMG OID: 640132765
Product: pseudomonalisin
Protein accession: YP_001063821
Protein GI: 126445251
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
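
The coordinate and length fields above agree with each other if the coordinates are 1-based and inclusive, and if the protein length excludes the stop codon. Neither convention is stated in the record, so both are assumptions here. A minimal sketch of the check, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2690883, 2693183)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values listed above, this reproduces the reported Gene Length (2301 bp) and Protein Length (766 aa).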


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.672339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGGAAA TCGTGTTGTC CGCCGTGTTT GTCCGCCGCG TTGTCCGCAA TCGCAACGAC 
ACGCGGCCCG ATGCAACGCC GTGCGGGTCG CGGAACGGTT TGCGGCGCGT CGAACGCGGT
CTTTTCGCGC GCCATATGAA TGGGATTCCT AAGCATTTGA CGTGCGGCGG CCGTATGTGC
CGGGTCCGCT CGGCGCGCAC GCCGGGCGAA ACGGCACCGC GCCGCCGCCG TCCATCGAAG
GGAGAAAATC ATGTCACCGT GAACGCCCGC CTCGCCGCAT CGCCATGCAT CGGCTGCACG
CCGCGCGTCC CTATCCGCCG GAGCGCGTCG TCGCATCTTT CCTCATCGTT TTCCTGGGGC
TTCAAAATGC AGAGAACTCA GCATGTCAAC CGGCTATTCT CGAAACGCTT CGCGCTGTCG
CCGCTACCGC TGGCGATCGC CGTCGCGCTG TCCCCGCTTC TCGCGCACGC GGCGCCCGAC
TGGGTGCCCA CCCGCACCGA CGCGTTCCTG ATCGCGCGCG CGCCCGCCGC GGCCGCCTCG
CAGACGCTCG CGAAGACCGC GCCGAGCTAC GCGCTCAACA TGATCGGCAC GCCCGAGCTC
GCCGACACGA ATGTCACGCC GCTCGAGCTG AGCCAGCCGC TGCGCGTGAC CATCGTGCTG
AAAAGCCGCA ACGAAGCGCA GCTCGATTCG CTCGTGCGCG AAGTGAACCA GCCGGGCAGC
GCGAACTATC GCAAGTACCT CACGCCCGAG CAGTTCAAGG CGCGCTTCGC GCCGACCGAC
GCCCAGGTGC GGGCCGTCGT CGCGCATCTG AAGGCGAACG GCTTCGGCGA CATCACGGTG
TCGGCGAACA ACAAGCTGAT CTTCGCGCAA GGCAACGCGT CGAACGCGGA ACACGGCTTC
CACACGACGC TCAAGCGCTT CAGCTATCGT GGCAAGGCGG TCTATGCGAA CGACTCGGCC
GCGCTCGTTC CCGCATCGCT GAGCCCGGTC GTCGAATCGG TGCTCGGGCT GCAGAGCGCG
GCCGTTCCGC ATCGGCTCAT CCATCGCGGC ACGCCGGCCG ACGCGCGCAT CCAGACCGAC
GCGATCACGA AGAACGCGAC GGCGAGCGGC ACGCAAACCG GCCATCAGCC GACCGATTTC
GCGCAGATCT ACAATGCGAG CGGCCTGCCC GCCGCGACCA ACACGACGGT CGGGATCATC
ACGTGGGGCG ACATGACGCA GACGATCGCC GATCTGAACA CGTTCACGCG AAACGCGGGC
CTGCCGAACG TGAACACCGC GGTCGTCGCG GGCAGCTCGG GCACGCTCGC GGACGACGGC
GATCCGGGCG AATGGGATCT CGACAGCCAG ACGATCATCG GCACGTCGGG CGGCGTGAAG
CAATTGATCT TCTATGCGGC CGTCAACGGC GACAGCAACG ACAGCGGCCT CACCAACGCG
ACGCTGACCG CCGCATACAA CAAGGCCGTC ACCGACAACG TCGCGAAGGT GATCAACGTA
TCGCTCGGCG AGGACGAGGC GGCCGCGAAT TCGGACGGCT CGCTCGCCGC GAACGACGCG
GTGTTCAAGC AGGCGGTCGC GCAAGGGCAG ATCTTCTCGG TGTCGTCCGG CGACGCGGGC
GTCTATCAGT GGTCGACTTC GCCATACGGC GCGCCGGGCT ACGTCGGCAC CTACAGCGGC
GGCAAGGTGA CGACGAAGAT CAGTCTGGCG AAGTACAGCG TGTCGTCGCC GGCGAGCTCG
CCGTACGTCG TCGCGGTCGG CGGCACGACG TTGTCGACGA GCGGCACGAC GACCTGGGCC
GGCGAGACCG TCTGGAACGA AGGGCTCGCC TATGCGGACG TGAGCAGCAG CGGCCAGCCG
CTCGACAACG CGGTGCGGCT ATGGGCGACG GGCGGCGGCG TGAGCGGCTA CGAAGCGGCG
CCGAGCTGGC AGACGGCCGC GCTCGGCAGC TCGGTGACGA AGCGCGTCGT GCCGGACGTC
GCGTTCGACG CCGCGCAATC GACGGGCGCG TATCTCGTGA TCAACGGCCA GCCGAACCAG
TTGGTCGGCG GCACGAGCCT CGCGTCGCCG ATCTTCGTCG GCGGCTGGGC GCGCGTGGAA
TCGGCGAACG GCAACGGTCT CGGCCTGCCG ACGTCGGCGT TCTACCAGGG CCTGCCGGGC
AATCCGTCGC TCGTCCACGA CGTGACGTCG GGTAACAACG GCTACAACAG CTACGGCTAC
AGCGCGAAAT CCGGATGGGA CGACGACACC GGCTTCGGCA GTCTCGACTT CGCGAAGGTC
AGCGCGAGCT CGGTCAAGTA G
 
Protein sequence
MKEIVLSAVF VRRVVRNRND TRPDATPCGS RNGLRRVERG LFARHMNGIP KHLTCGGRMC 
RVRSARTPGE TAPRRRRPSK GENHVTVNAR LAASPCIGCT PRVPIRRSAS SHLSSSFSWG
FKMQRTQHVN RLFSKRFALS PLPLAIAVAL SPLLAHAAPD WVPTRTDAFL IARAPAAAAS
QTLAKTAPSY ALNMIGTPEL ADTNVTPLEL SQPLRVTIVL KSRNEAQLDS LVREVNQPGS
ANYRKYLTPE QFKARFAPTD AQVRAVVAHL KANGFGDITV SANNKLIFAQ GNASNAEHGF
HTTLKRFSYR GKAVYANDSA ALVPASLSPV VESVLGLQSA AVPHRLIHRG TPADARIQTD
AITKNATASG TQTGHQPTDF AQIYNASGLP AATNTTVGII TWGDMTQTIA DLNTFTRNAG
LPNVNTAVVA GSSGTLADDG DPGEWDLDSQ TIIGTSGGVK QLIFYAAVNG DSNDSGLTNA
TLTAAYNKAV TDNVAKVINV SLGEDEAAAN SDGSLAANDA VFKQAVAQGQ IFSVSSGDAG
VYQWSTSPYG APGYVGTYSG GKVTTKISLA KYSVSSPASS PYVVAVGGTT LSTSGTTTWA
GETVWNEGLA YADVSSSGQP LDNAVRLWAT GGGVSGYEAA PSWQTAALGS SVTKRVVPDV
AFDAAQSTGA YLVINGQPNQ LVGGTSLASP IFVGGWARVE SANGNGLGLP TSAFYQGLPG
NPSLVHDVTS GNNGYNSYGY SAKSGWDDDT GFGSLDFAKV SASSVK
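
The gene sequence begins with GTG while the protein begins with M. This fits translation table 11, which uses the standard codon assignments but accepts alternative initiation codons that are read as methionine at the start position. The sketch below illustrates this on short fragments of the sequences above. The function name, the `first_is_start` flag, and the reduced start-codon set (ATG, GTG, TTG only) are assumptions made for illustration, not part of the record.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 shares them.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Illustrative subset of the initiation codons allowed by table 11 (assumption).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence shown above.
head = translate("GTGAAGGAAA TCGTGTTGTC CGCCGTGTTT")
tail = translate("AGCGCGAGCT CGGTCAAGTA G", first_is_start=False)
print(head, tail)
```

The two fragments translate to MKEIVLSAVF and SASSVK, matching the first ten and last six residues of the protein sequence.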