Gene BURPS1106A_A1343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1343 
Symbol 
ID4903781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1271453 
End bp1273144 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content71% 
IMG OID640144449 
Productputative serine metalloprotease 
Protein accessionYP_001075378 
Protein GI126457627 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA AACGTAGTAA TCCTGAACAG ACGCGGCTTG GCCAGGTCCG TGCCGTCGCC 
GGCATTCTCT CCATGTCCGT CCTCGTTCCG CTCGCCGGTT GCGGCGGCGG TGGCGACGGA
GGCGGAAGCG GCACGCCGTC GGCCGCCGCG CAGCCGACCC CCGCGCCGGC ACCGGCGCCG
GCCCCGGCAC CCGCGCCGAG CTCGGGTTCG TCGCAATCCA CCAATTCGTC GACCTCGACG
GCGGCCTGCC CCGTCACGCA GGCCGCCTCG ACCGCCGCCG GCGAAACGCT CGTCACCCGC
ACCGTTTCGC ACGAAGCACC CGTCGACCAT CTGATCGTCA AGCTGCAACG CACGGCGGCG
GCGAGCGCAT CCGGCGCGCG CATCATGGCC GCGGCGAACG ACGCGGCCCG ACTCGATTCG
GTGATCCAGC GCGTGATGTC GCAATGGAGC GCGAAGAGCG GCGCCGTTCG CTCGTATGCG
CAGAACATCG CGCCGACGAA CGCGGTGCAG GTGGAACGGA CGATGTCGGA CGGTGCCGCG
CTGCTCGCGC TCGGACAAAA GATGAGCGCG GATAATGCCG GCGCTCTCGC GCAAACGTTC
GCGGCCGATC CGGACGTCGC CTATGCGGAG CCCGACCGGC GCGTGTTCGC CCGCACGGTG
GCGACCGACC CGGACTACGC GCAGCAGTGG AACTACTTCG ATCCGGCGGC CGGCATCAAT
CTGCCGGACG CATGGAACGT GACGAACGGC CTGCCGAGCG TCGTCACCGC GGTGCTCGAC
ACCGGCTATC GCCCGCATCC GGACATCATC GCGAACCTGC TGCCGGGCTA CGATTTCATC
TCCGACATCA ACACCGGCAA CAACGGCCAC GGCCGCGGCC CGGACGCGAC CGACCCGGGC
GACTGGGTCA CGCAGCAGGA ACTGACCGAT CCGTCGAGCC CGTTCTACCA ATGCGCGAGC
GCGCCGTCGA ACAGCAGCTG GCACGGCACG CAGGTCGCCG GCATCATCGG CGCCGCCGCG
AACAACGGCA TCGGCATCGC GGGCGTCAGC TGGTACGGCA AGATCCTGCC CGTGCGCGTG
CTCGGCAAGT GCGGCGGCAC GACGAGCGAC ATCGCCGACG CGATGCGCTG GGCGGCGGGC
ATTCCCGTCG CGGGCGCGCC GACGAACCTC ACGCCGGCGA AGGTGATCAA TCTGAGCCTC
GGCGGCACCG GCCCGTGCGG CGACACGTTC CAGCAGGCGA TCAACGACGT GATCGCGCGC
GGCACGACCG TCGTCGTCTC GGCCGGCAAC GACGGCCAGG CGACGACGCT GGACCGCCCA
GCCAACTGCA AGGGCGTGAT CTCGGTCGGC GCGACCGACA GCACCGGCCA GCGCGCGTGG
TACAGCAACT TCGGCTCGGA CATCACGCTG AGCGCGCCGG GCTCGAACAT CCTGTCGACG
AGCAATGCGG GCACCACGGT GCCGACCACC GACGCATACG GCACGCACAG CGGCACGAGC
CTTGCCGCGC CGCAGGTGGC GGGCGTCGCC TCGCTGATGC TCGCGGTCAA CCCGAACCTC
ACGCCCGCGC AGATCGCGCA GAAGCTCGCG AGCACCGCGC GGCCGTCGCC GGCCACCGCA
TCCTGCCTCG CGCGCGCGCC GGGCGCGGGC ATCGTCGACG CCGGCACGGT GGTTGCGTCC
GCAACGAAAT AG
 
Protein sequence
MNKKRSNPEQ TRLGQVRAVA GILSMSVLVP LAGCGGGGDG GGSGTPSAAA QPTPAPAPAP 
APAPAPSSGS SQSTNSSTST AACPVTQAAS TAAGETLVTR TVSHEAPVDH LIVKLQRTAA
ASASGARIMA AANDAARLDS VIQRVMSQWS AKSGAVRSYA QNIAPTNAVQ VERTMSDGAA
LLALGQKMSA DNAGALAQTF AADPDVAYAE PDRRVFARTV ATDPDYAQQW NYFDPAAGIN
LPDAWNVTNG LPSVVTAVLD TGYRPHPDII ANLLPGYDFI SDINTGNNGH GRGPDATDPG
DWVTQQELTD PSSPFYQCAS APSNSSWHGT QVAGIIGAAA NNGIGIAGVS WYGKILPVRV
LGKCGGTTSD IADAMRWAAG IPVAGAPTNL TPAKVINLSL GGTGPCGDTF QQAINDVIAR
GTTVVVSAGN DGQATTLDRP ANCKGVISVG ATDSTGQRAW YSNFGSDITL SAPGSNILST
SNAGTTVPTT DAYGTHSGTS LAAPQVAGVA SLMLAVNPNL TPAQIAQKLA STARPSPATA
SCLARAPGAG IVDAGTVVAS ATK