Gene Information |
Locus tag | BURPS668_A1911 |
Symbol | |
ID | 4887843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1856009 |
End bp | 1858339 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640131849 |
Product | serine protease |
Protein accession | YP_001062906 |
Protein GI | 126445312 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.438045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGGC GCGCGTGGGC GATAGTCGGG ATCGCGCATG GATACGGGCG AAGCGGAAAA CGGCACGGCG TCGGCACCGG CTTCGGACAG TCCCGGCACG ACGAAGGCAC GCTTGCGACG CGCGCGCCGC GAATGCCCGC GGCGGACGCG TCGCAATCGA CGAGCCGGCA GCCAAGCGCG AGCGCCGCTC GTCTTACGCT ACCATTACAC GCCATGCCCC TTGCAAAGAA CGCCATGCCT CATGCTTCCT GGCCCGAACA GGCCGATCCG TATCAATTCC TCGAAGAACT GGACAGCGCC GCGAGCGTCG GCTGGGTCGA CGCGCAAAAC GCCCGCACGC ACGATGCGCC CTGGCTCGAC GAAGCGCACT ATCGCGCGCT GGTCGAGCGC TTCACCCGGG CGCTGCTGCC GCGCGAGCGC CCGGTGATTC CGCAGCGCTG GCAGGACTGG GCGTACGACG TCTGGCAGGA CGAACAGCAT CCGAAGGGCC TGTGGCGGCG CACGCGATGG ACGAGCTGGC GCAGCGGCCA CGCGGACTGG CAAACGTTGA TCGACCTCGA TGCGCTTGGT GAAGCGCAAG GCGTGCAGTG GGTATTCGAC GATCAGCTCA TCCTCGAGCC TGACGGCGAT CGTGCGCTGA TCGTGCTGTC CGACGGCGGC GCCGACGCGG TGGTCGTCCG CGAGTTCGAC ATCGCGCAAT GCCGGTTCGT CGACGACGGC TTCTCGATCG AAGCGGCCGG CAAGCATTCG GTCGAATGGA TCGATCGCGA CACGATCTAC GTCGGCTGGG ACGACGGCGG CGCCACCGTC ACGCGCTCCG GCTATCCGCG CGAAGTCCGG CGCTGGACGC GCGGCACGCC GCTGTCCAGC GCGCCCGTGG TGTTTCGCGG CGCGCGCGGC GACATCTCGG TCGATGCGCA ATACGATCCG CTCGACCGGC ATCACGCGAT CGAGCAGGCG ATCAATTTCT ACGACGCGAA CACGTATCGC CTCGCCGAGG ACGGCGCGTG GGCGCGCTAC GACGTGCCAC CGCACGTCGA AGTCGGTTAC TGGAGCGGGT GGCTGCTGCT TCAGCCGCGG CTCGACTGGA CTTGCGGCGG CGCGCGCTAC GCGGGCGGCA GCCTGCTCGC GATCCGCGAG GACGCGTTCG TCGCCGGTGA GCGCGCGTTC GCCGCGCTGT TCGAGCCGAA CGAGCGCACG TCCGCATGCG GCTGGACGCA CACGCGCCGC TACGTGCTGG TGTCGTGGCT CGACGACGTG CTCACGCGCA CGATGCTCTG GCTTCCCGAA CGTCAGGATG ACGGAGCATG GCGCTGGCAT GCTCGTCCGT TCCCCGTGCG AGGGCTCGCG CAAGTGGACG TGTCGCCCGT CGAGCCCACG TTCGACGACG AGGTGTACGT GAGCGTCGAC GATTACCTGA AGCCGCCCGA GTATTCGCTC GCGAATCTCG CCAGCGACGA CCTGTCCGCC TGGACGCTGC TCGACCGCTG GCCGACGCAG TTCGACGCGT CCGAACTGAC GGTGCGGCGC GAACACGCGC GCTCGCGCGA CGGCACGCTC GTGCCTTATA CGCTGGTCGG GCCGCGCGAC GTGCTGGACA ATGCGGCGCG CGCGCCGCGC CCCTGCCTGT TGAACGGCTA CGGCGGCTTC GCGATTGCGC TCACGCCCGA TTACGATCCG CTGCTCGGCA TCGGCTGGCT CGAGAAAGGC GGCATCGCGG TGTTCGCCCA TATTCGCGGC GGCGGCGAGT TCGGCACGCA GTGGCACGAA TCGGCGCGGC AAACGCAACG GCAGCGATCG 
TTCGACGATT TCATCGCGGT CGCCGAAAAA CTCGTCGCGG ACGGCGTGAC GAGCGCCGCG CAACTGGGTA TTCGCGGCGG CAGCAACGGC GGGCTGCTGG TCGCGGCATG CATGATTCAG CGCCCGGACC TGTTCGGCGC GGTGGTGAGC GACGTGCCGC TTCTCGACAT GCAGCGCTAT GCGCTGCTGC ACGCGGGCGC ATCGTGGCTG GACGAATTCG GCGATCCCGA CGATCCGGCG CATGCGTCGG CGCTCGCGGC CTACTCGCCG TATCACCGGG TCGCGCGCGA CATCGCGTAT CCGCCCGCGC TGTTCACGAC ATCGACGAGC GACGACCGCG TGCATCCCGC CCATGCGAGA AAAATGGTCG CGCGCATGCA GGCGCAAGGG CACCGGAACG TATGGCTGAT CGAGAAAACC GATGGCGGCC ACGGCAGCGC GGACGCGATC GATACCGCCG AGCACGAAGC GATCGGCTAT GTGTTTCTGT GGACTCACTT GTCCCGCGGC GCGCATGACG CGCGCGAGTG A
|
Protein sequence | MTRRAWAIVG IAHGYGRSGK RHGVGTGFGQ SRHDEGTLAT RAPRMPAADA SQSTSRQPSA SAARLTLPLH AMPLAKNAMP HASWPEQADP YQFLEELDSA ASVGWVDAQN ARTHDAPWLD EAHYRALVER FTRALLPRER PVIPQRWQDW AYDVWQDEQH PKGLWRRTRW TSWRSGHADW QTLIDLDALG EAQGVQWVFD DQLILEPDGD RALIVLSDGG ADAVVVREFD IAQCRFVDDG FSIEAAGKHS VEWIDRDTIY VGWDDGGATV TRSGYPREVR RWTRGTPLSS APVVFRGARG DISVDAQYDP LDRHHAIEQA INFYDANTYR LAEDGAWARY DVPPHVEVGY WSGWLLLQPR LDWTCGGARY AGGSLLAIRE DAFVAGERAF AALFEPNERT SACGWTHTRR YVLVSWLDDV LTRTMLWLPE RQDDGAWRWH ARPFPVRGLA QVDVSPVEPT FDDEVYVSVD DYLKPPEYSL ANLASDDLSA WTLLDRWPTQ FDASELTVRR EHARSRDGTL VPYTLVGPRD VLDNAARAPR PCLLNGYGGF AIALTPDYDP LLGIGWLEKG GIAVFAHIRG GGEFGTQWHE SARQTQRQRS FDDFIAVAEK LVADGVTSAA QLGIRGGSNG GLLVAACMIQ RPDLFGAVVS DVPLLDMQRY ALLHAGASWL DEFGDPDDPA HASALAAYSP YHRVARDIAY PPALFTTSTS DDRVHPAHAR KMVARMQAQG HRNVWLIEKT DGGHGSADAI DTAEHEAIGY VFLWTHLSRG AHDARE
|