Gene Information |
Locus tag | BURPS1710b_A1101 |
Symbol | |
ID | 3693577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | - |
Start bp | 1378015 |
End bp | 1379517 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637731355 |
Product | serine metalloprotease |
Protein accession | YP_336259 |
Protein GI | 76818474 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
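The coordinate and length fields above are consistent with one another, and a short check can confirm this. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for BURPS1710b_A1101.
length = gene_length_bp(1378015, 1379517)
print(length, protein_length_aa(length))  # prints: 1503 500
```

Both results match the Gene Length and Protein Length rows of the record.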
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0573282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATAT TGATTCGTAC TGCTTCGTTC AAGGCGACCG TCCTGTGTGC CGCGCTGGCC GGCCTCGTTT CGGCCGCGCA AGCGGAAACC GCGGCCGCGC CCCAGGTGCC GGGGCCCGCC GACGCGGTCA ATCAGTTGAT CGTCAAGTTG CGCGCGGTGA AGACGCCGCC CGGTGCGACG GCCGCGAAGG CCGAGCGCGC GGACGTTCAG GCCGTCATCG ATCGCGTGCT CGCCGCGCGC AATGCGCGGG CGGCGGGGCG TGCGTTCGGC GCGGCCGCCG CATCCGCGCC CGGCAATCCG GACGACCCCG CCGCGGGCAT TCGCATCAAG CGCGACATGT CGGGCGGCGC GACCGTGCTG TCGCTGCAGC GCCACGTGTC GCTCGCGCAG GCCGAGGCGC TCGCGCGCGA CTTCGCGGCG GACGGCGCGA TCGAATATGC GGAGCCCGAT GCGCGGATGC ATCCGTTCGT CGTGCCGAAC GATACGCGCT ATTCGGAGCA ATGGGGCTAC TTCAATCCGA CCGCCGGCGC GAATCTGCCG AAGGCTTGGG ATCGCACGAC CGGCTCCGCG CGCGTCGTCG TCGCCGTCAT CGATACCGGC TACCGTCCGC ATGCGGATCT CGCCGCGAAC CTGCTGCCGG GCTACGACTT CATCTCCGAT ATCCCGAGCG CGAACGACGG CAATGGCCGC GACAGCGACG CATCGGATCC CGGCGACTGG GTGAGCGCGC AGGAAGACGG CGATCCGAGC GGCCCATTCT ATGGCTGCGG CGCGAGCGAC AGCTCATGGC ACGGCACGCA CGTCGCGGGC ACGATCGGCG CGGTGACGAA CAACGGCGTC GGCGTGGCGG GCATCTCGTG GGTCGGCAAG GTGCTGCCCG TGCGCGTGCT CGGCAAGTGC GGCGGGATGC TGAGCGACAT CGCCGACGGC ATGCGCTGGG CGGCGGGCCT GCCGGTGCCG GGCGCGCCGT CGAATCCGAA CCCGGCGAAG GTGCTGAACC TGAGCCTCGG CGGATACGGC CGCACATGCA GCTCGACGTA CCAGAACGCG ATCAACGAAA TCACGTCGCG CGGCGCGAAC GTCGTTGTCG CCGCGGGCAA TAACGGCGGC TCGGTGTCGA CGACTCAGCC GGCGAATTGC CGGGGCGTGA TCGCGGTTGG CGCGATCGAC AGCCGCGGTG TGCGCGCGAG CTTCAGCAAC ACCGGCGCCG CGGTGAAGAT CTCCGCGCCG GGCGTCGGCA TTCTGTCGAC GCTCAATGCG GGCAAGACCT CGCCGGGCGC GGACAGCTAC GCGAGCTATA GCGGCACGAG CATGGCAACG CCGCATGTCG CGGGCACGGT CGCGCTGATG CTCGCCGTCA ACTCGACGCT GTCGCCTTCG CAGATCTTGC AGCGGCTGCA ATCGAGCGCG CGGCCGTTCT CGAGCGGATC GAGCTGCTCG ACGAGCACGT GCGGCGCAGG GCTGCTCGAC GCAGGCAACG CGGTCGACGC CGCCGCGCAG TGA
|
Protein sequence | MSILIRTASF KATVLCAALA GLVSAAQAET AAAPQVPGPA DAVNQLIVKL RAVKTPPGAT AAKAERADVQ AVIDRVLAAR NARAAGRAFG AAAASAPGNP DDPAAGIRIK RDMSGGATVL SLQRHVSLAQ AEALARDFAA DGAIEYAEPD ARMHPFVVPN DTRYSEQWGY FNPTAGANLP KAWDRTTGSA RVVVAVIDTG YRPHADLAAN LLPGYDFISD IPSANDGNGR DSDASDPGDW VSAQEDGDPS GPFYGCGASD SSWHGTHVAG TIGAVTNNGV GVAGISWVGK VLPVRVLGKC GGMLSDIADG MRWAAGLPVP GAPSNPNPAK VLNLSLGGYG RTCSSTYQNA INEITSRGAN VVVAAGNNGG SVSTTQPANC RGVIAVGAID SRGVRASFSN TGAAVKISAP GVGILSTLNA GKTSPGADSY ASYSGTSMAT PHVAGTVALM LAVNSTLSPS QILQRLQSSA RPFSSGSSCS TSTCGAGLLD AGNAVDAAAQ
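The sequences above are printed in space-separated groups of ten characters, so they need to be normalized before any computation. The following is a minimal Python sketch, assuming plain-string input copied from the record; the helper names `normalize` and `gc_percent` are hypothetical. Run on the full gene sequence, the result can be compared against the GC content row.

```python
def normalize(seq: str) -> str:
    """Remove the whitespace grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base groups of the gene sequence, used as an illustration.
fragment = "ATGTCAATAT TGATTCGTAC TGCTTCGTTC"
print(len(normalize(fragment)))  # prints: 30
print(round(gc_percent(fragment), 1))
```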