Gene BURPS668_A1426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1426 
Symbol 
ID4887218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1336962 
End bp1338653 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content71% 
IMG OID640131365 
Productputative serine metalloprotease 
Protein accessionYP_001062423 
Protein GI126445517 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.23258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA AACGTAGTAA TCCTGAACAG ACGCGGCTTG GCCAGGTCCG TGCCGTCGCC 
GGCATTCTCT CCATGTCCGT CCTCGTTCCG CTCGCCGGTT GCGGCGGCGG TGGCGACGGA
GGCGGAAGCG GCACGCCGTC GGCCGCCGCG CAGCCGACCC CCGCGCCGGC GCCGGCGCCG
GCCCCGGCAC CCGCGCCGAG CTCGGGTTCG TCGCAATCCA CCAATTCGTC GACCTCGACG
GCGGCCTGCC CCGTCACGCA GGCCGCCTCG ACCGCCGCCA GCGAAACGCT CGTCACCCGC
ACCGTTTCGC ACGAAGCGCC CGTCGATCAT CTGATCGTCA AGCTGCAACG CACGGCGGCG
GCGAGCGCAT CCGGCGCGCG CATCATGGCC GCGGCGAACG ACGCGGCCCG ACTCGATTCG
GTGATCCAGC GCGTGATGTC GCAATGGAGC GCGAAGAGCG GCGCCGTTCG CTCGTATGCG
CAGAACATCG CGCCGACGAA CGCGGTGCAG GTGGAACGGA CGATGTCGGA CGGTGCCGCG
CTGCTCGCGC TCGGACAAAA GATGAGCGCG GATAATGCCG GCGCTCTCGC GCAAACGTTC
GCGGCCGATC CGGACGTCGC CTATGCGGAG CCCGACCGGC GCGTGTTCGC CCGCACGGTG
GCGACCGACC CGGACTACGC GCAGCAGTGG AACTACTTCG ATCCGGCGGC CGGCATCAAT
CTGCCGGACG CATGGAACGT GACGAACGGC CTGCCGAGCG TCGTCACCGC GGTGCTCGAC
ACCGGCTATC GCCCGCATCC GGACATCATC GCGAACCTGC TGCCGGGCTA CGATTTCATC
TCCGACATCA ACACCGGCAA CAACGGCCAC GGCCGCGGCC CGGATGCGAC CGACCCGGGC
GACTGGGTCA CGCAGCAGGA ACTGACCGAT CCGTCGAGCC CGTTCTACCA ATGCGCGAGC
GCGCCGTCGA ACAGCAGCTG GCACGGCACG CAGGTCGCCG GCATCATCGG CGCCGCCGCG
AACAACGGCA TCGGCATCGC GGGCGTCAGC TGGTACGGCA AGATCCTGCC CGTGCGCGTG
CTCGGCAAGT GCGGCGGCAC GACGAGCGAC ATCGCCGACG CGATGCGCTG GGCGGCGGGC
ATTCCCGTCG CGGGCGCGCC GACGAACCTC ACGCCGGCGA AGGTGATCAA CCTGAGCCTC
GGCGGCACCG GCCCGTGCGG CGACACGTTC CAGCAGGCGA TCAACGACGT GATCGCGCGC
GGCACGACCG TCGTCGTCTC GGCCGGCAAC GACGGCCAGG CGACGACGCT GGACCGCCCG
GCCAACTGCA AGGGCGTGAT CTCGGTCGGC GCGACCGACA GCACCGGCCA GCGCGCGTGG
TACAGCAACT TCGGCTCGGA CATCACGCTG AGCGCGCCGG GCTCGAACAT CCTGTCGACG
AGCAATGCGG GCACCACGGT GCCGACCACC GACGCGTACG GCACGCACAG CGGCACGAGC
CTTGCCGCGC CGCAGGTGGC GGGCGTCGCC TCGCTGATGC TCGCGGTCAA CCCGAACCTC
ACGCCCGCGC AGATCGCGCA GAAGCTCGCG AGCACCGCGC GGCCGTCGCC GGCCACCGCA
TCCTGCCTCG CGCGCGCGCC GGGCGCGGGC ATCGTCGACG CCGGCACGGT GGTTGCGTCC
GCAACGAAAT AG
 
Protein sequence
MNKKRSNPEQ TRLGQVRAVA GILSMSVLVP LAGCGGGGDG GGSGTPSAAA QPTPAPAPAP 
APAPAPSSGS SQSTNSSTST AACPVTQAAS TAASETLVTR TVSHEAPVDH LIVKLQRTAA
ASASGARIMA AANDAARLDS VIQRVMSQWS AKSGAVRSYA QNIAPTNAVQ VERTMSDGAA
LLALGQKMSA DNAGALAQTF AADPDVAYAE PDRRVFARTV ATDPDYAQQW NYFDPAAGIN
LPDAWNVTNG LPSVVTAVLD TGYRPHPDII ANLLPGYDFI SDINTGNNGH GRGPDATDPG
DWVTQQELTD PSSPFYQCAS APSNSSWHGT QVAGIIGAAA NNGIGIAGVS WYGKILPVRV
LGKCGGTTSD IADAMRWAAG IPVAGAPTNL TPAKVINLSL GGTGPCGDTF QQAINDVIAR
GTTVVVSAGN DGQATTLDRP ANCKGVISVG ATDSTGQRAW YSNFGSDITL SAPGSNILST
SNAGTTVPTT DAYGTHSGTS LAAPQVAGVA SLMLAVNPNL TPAQIAQKLA STARPSPATA
SCLARAPGAG IVDAGTVVAS ATK