Gene BURPS1106A_2473 details


Gene Information

Locus tag: BURPS1106A_2473
Symbol:
ID: 4901398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2426984
End bp: 2428870
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 71%
IMG OID: 640135700
Product: serine protease
Protein accession: YP_001066732
Protein GI: 126454247
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
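
The length fields above are consistent with the coordinates if the start and end positions are read as inclusive, 1-based values; that convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic, with illustrative function names that are not part of any database tool:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2426984, 2428870)
print(length)                  # 1887
print(protein_length(length))  # 628
```

Both results match the Gene Length and Protein Length fields listed in the record.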


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCAA GAAAATGGGC CAGGCCTCGC GCTTCGCAGG CGAAGCATGC GATTTACGCG 
GCGACGTTCT TTGCCGCGGC GGCATTGAGC GCGCACGCGG CGGCGGCATG GGTCGACACG
CAAACCGGCG CGTATCCGGC GCTTGCGCAG CAGGCGCTTG CCGCGTCGCA AGCTTCGGCG
GCGGCAACCG CGGCCGGAAA GGCGATCGAC ACGGCGCCCG GCGAGCCGGT GCGGGTCGTC
GTCAGTCTCA ATCTCAACGA CGAAGCGAGG CTCGATCGCT TCCTGCGCGA TCTGCATACG
CCCGGCAGTG CCGCCTACGG CCGGCATCTG ACGCCCGCCG AATTCGCCGC GCAGTATGCG
CCGACGCCGC AGCAGGTCGC GCTCGTCGAA GCGCATCTGC GCCGGGCCGG ATTCCGCGAC
ATCGAGGTGG CGCCGAACCG GCTGTTGATC TCGGCGACGG GCACCGCCGC CGCGGTCAAG
ACGGCGTTCA ACACGCGGCT CAAGCGCTTT ACGCTCGAAG GCCGGCGCGT GTATGCGAAC
CAGGACGCGG CGCAGGTGCC CGCCGAGCTC GGCCGAATCG TTGGCGCGGT GCTCGGGCTC
GATAATGCGA CGCTCGCGCG CACGTACAAC CGCCAGGCGG CGGTGACGGG CGCGGTCGGC
GGCGCGAAGG CGTCGCTGGC CGCGCGCGCG AGCGACGCGT CGGCCGCCGC GAGCGGCGCG
CCCGTGCTGA CGGGACACGA TCCGCTCGAA TTCTCGCGAA TCTATCGCGC GGGCTCGACG
CCGACGGCGT CGCAGACGAC GGTCGGCGTG ATCATGGCGG GCGATGCGGC GCCCGTGCTG
CGGGATCTCG ATACGTTCGC GGCGAAGGCC GGGCTCGCGC GCGTCGCGGC GACGGTCACG
CGTACCGGGC CGCCCGGCAG CGATTACAGC GACAATTCGG GCCTGAGCGA ATGGGATATG
GACAGCCAGG CGATCGTCGG CGCGGCGGGC GGCGCGGTGA AGGGGCTTGT GCTCTACGCG
GCGCCCTCGA TGCTGCTCTC CGACATCACC TCGGCGTACA ACCGCGCGGT CGTCGACAAC
GTCGCGAAGG TGATCAACGT GTCGCTCGGC GTGTGCGAGG CGGACGCACG CGCGTCCGGC
ACGCAGGCGG CGGACGACCG CATCTTCAAG AGCGCGGTCG CGCAGGGGCA GACGTTCGTC
GTCGCGGCGG GCGACGCGGG CGCGTACGAA TGCAGCGTGA GCCGCGTGTC GGGCGGGCAG
GGCGTGCCGG CGCGATCGAA CTATTCGGTG AGCGAGCCTG CGACGTCGCC GTACGTCGTC
GCGGTCGGCG GCACGACGCT GTCGACCGAC AGGACGACGC TCGCCTACGC GGGCGAAGTC
GCATGGAACG AGGGTTTGCA GCCGATCGGC GTGTACGACG CGTACGGCAG CTACGACGGC
ACGAGGCGTC TGTGGGCGAC GGGCGGCGGC TACAGCCGAA GCGAAGCGGC ACCGGCGTGG
CAGCGAAGCG TGCTCGGCGC GTCGGCGAAA GCGCGCGCGC TGCCCGATGT CGCATTCGAT
GCGGACGGCC GCAGCGGCGC GCACGTCTAC GTGAACGGCC GGACCGAGCA ATGGGGCGGC
ACGAGCCTCG CGGCGCCGAT CTTCACGGGC ATCTGGGCGC GCGTGCAATC CGACAACGGC
AACCGGCTCG GTTTTCCGCT CGCGAGCCTC TATCGCTACG CGCCGGCCAA CGGCGCGTTC
GCGCATGACG TGAAATCCGG CAACAACGGC TCGGGCGGCT ATGGCTACAA GGCCGGTGCG
GGCTGGGATC CGGTGACGGG CTTCGGCAGC CTCGACATCG CGAACTTCGC CGCGTTCGTC
AAGCAGACGG CCGATTTCGC GCGATAA
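
The gene sequence is displayed in blocks of 10 bases across several lines, so the whitespace has to be removed before any computation. A small sketch, using hypothetical helper names, that strips the formatting and computes the GC fraction; the string in the usage lines is a short toy input, not the full gene:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


toy = clean_sequence("ATGACGTCAA GAAAATGGGC\nCAGG")
print(len(toy))          # 24
print(gc_fraction(toy))  # 0.5
```

Applied to the full gene sequence, the cleaned length should equal the Gene Length field (1887 bp), and the GC fraction should be close to the listed 71%.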
 
Protein sequence
MTSRKWARPR ASQAKHAIYA ATFFAAAALS AHAAAAWVDT QTGAYPALAQ QALAASQASA 
AATAAGKAID TAPGEPVRVV VSLNLNDEAR LDRFLRDLHT PGSAAYGRHL TPAEFAAQYA
PTPQQVALVE AHLRRAGFRD IEVAPNRLLI SATGTAAAVK TAFNTRLKRF TLEGRRVYAN
QDAAQVPAEL GRIVGAVLGL DNATLARTYN RQAAVTGAVG GAKASLAARA SDASAAASGA
PVLTGHDPLE FSRIYRAGST PTASQTTVGV IMAGDAAPVL RDLDTFAAKA GLARVAATVT
RTGPPGSDYS DNSGLSEWDM DSQAIVGAAG GAVKGLVLYA APSMLLSDIT SAYNRAVVDN
VAKVINVSLG VCEADARASG TQAADDRIFK SAVAQGQTFV VAAGDAGAYE CSVSRVSGGQ
GVPARSNYSV SEPATSPYVV AVGGTTLSTD RTTLAYAGEV AWNEGLQPIG VYDAYGSYDG
TRRLWATGGG YSRSEAAPAW QRSVLGASAK ARALPDVAFD ADGRSGAHVY VNGRTEQWGG
TSLAAPIFTG IWARVQSDNG NRLGFPLASL YRYAPANGAF AHDVKSGNNG SGGYGYKAGA
GWDPVTGFGS LDIANFAAFV KQTADFAR