Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2861 |
Symbol | |
ID | 4887023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2721808 |
End bp | 2723301 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640132797 |
Product | serine metalloprotease |
Protein accession | YP_001063853 |
Protein GI | 126442962 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0448584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATTCGTA CTGCTTCGTT CAAGGCGACC GTCCTGTGTG CCGCGCTGGC CGGCCTCGTT CCGGCCGCGC AAGCGGAAAC CGCGGCCGCG CCCCAGGTGC CGGGGCCCGC CGACGCGGTC AATCAGTTGA TCGTCAAGTT GCGCGCGGTG AAGACGCCGC CCGGTGCGAC GGCCGCGAAG GCCGAGCGCG CGGACGTTCA GGCCGTCATC GATCGCGTGC TCGCCGCGCG CAATGCGCGG GCGGCGGGGC GTGCGTTCGG CGCGGCCGCC GCATCCGCGC CCGGCAATCC GGACGACCCC GCCGCGGGCA TTCGCATCAA GCGCGACATG TCGGGCGGCG CGACCGTGCT GTCGATGCAG CGCCACGTGT CGCTCGCGCA AGCCGAGGCG CTCGCGCGCG ACTTCGCGGC GGACGGCGCG ATCGAATATG CGGAGCCCGA TGCGCGGATG CATCCGTTCG TCGTGCCGAA CGATACGCGC TATTCGGAGC AATGGGGCTA CTTCAATCCG ACCGCCGGCG CGAATCTGCC GAAGGCTTGG GATCGCACGA CCGGCTCCGC GCGCGTCGTC GTCGCCGTCA TCGATACCGG CTACCGTCCG CATGCGGATC TCGCCGCGAA CCTGCTGCCG GGCTACGACT TCATCTCCGA TATCCCGAGC GCGAACGACG GCAATGGCCG CGACAGCGAC GCATCGGATC CCGGCGACTG GGTGAGCGCG CAGGAAGACG GCGATCCGAG CGGCCCGTTC TACGGCTGCG GCGCGAGCGA CAGCTCATGG CACGGCACGC ACGTCGCGGG CACGATCGGC GCGGTGACGG ACAACGGCGT CGGCGTGGCG GGCATCTCGT GGGTCGGCAA GGTGCTGCCC GTGCGCGTGC TCGGCAAGTG CGGCGGGATG CTGAGCGACA TCGCCGACGG CATGCGCTGG GCGGCGGGCC TGCCGGTGCC GGGCGCGCCG TCGAATCCGA ACCCGGCGAA GGTGCTGAAC CTGAGCCTCG GCGGATACGG CCGCACATGC AGCTCGACGT ACCAGAACGC GATCAACGAA ATCACGTCGC GCGGCGCGAA CGTCGTTGTC GCCGCGGGCA ACAACGGCGG CTCGGTGTCG ACGACTCAGC CGGCGAACTG CCGGGGCGTG ATCGCGGTCG GCGCGATCGA CAGCCGCGGT GTGCGCGCGA GCTTCAGCAA CACCGGCGCC GCGGTGAAGA TCTCCGCGCC GGGCGTCGGC ATTCTGTCGA CGCTCAATGC GGGCAAGACC TCGCCGGGCG CGGACAGCTA CGCGAGCTAC AGCGGCACGA GCATGGCAAC GCCGCATGTC GCGGGCACGG TCGCGCTGAT GCTCGCCGTC AACTCGACGC TGTCGCCCTC GCAGGTCTTG CAGCGGCTGC AATCGAGCGC GCGGCCGTTC TCGAGCGGAT CGAGCTGCTC GACGAGCACG TGCGGCGCAG GGCTGCTCGA CGCAGGCAAC GCGGTCGACG CCGCCGCGCA GTGA
|
Protein sequence | MIRTASFKAT VLCAALAGLV PAAQAETAAA PQVPGPADAV NQLIVKLRAV KTPPGATAAK AERADVQAVI DRVLAARNAR AAGRAFGAAA ASAPGNPDDP AAGIRIKRDM SGGATVLSMQ RHVSLAQAEA LARDFAADGA IEYAEPDARM HPFVVPNDTR YSEQWGYFNP TAGANLPKAW DRTTGSARVV VAVIDTGYRP HADLAANLLP GYDFISDIPS ANDGNGRDSD ASDPGDWVSA QEDGDPSGPF YGCGASDSSW HGTHVAGTIG AVTDNGVGVA GISWVGKVLP VRVLGKCGGM LSDIADGMRW AAGLPVPGAP SNPNPAKVLN LSLGGYGRTC SSTYQNAINE ITSRGANVVV AAGNNGGSVS TTQPANCRGV IAVGAIDSRG VRASFSNTGA AVKISAPGVG ILSTLNAGKT SPGADSYASY SGTSMATPHV AGTVALMLAV NSTLSPSQVL QRLQSSARPF SSGSSCSTST CGAGLLDAGN AVDAAAQ
|
| |