Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0988 |
Symbol | |
ID | 4887219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 957620 |
End bp | 959605 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640130928 |
Product | collagenase |
Protein accession | YP_001061987 |
Protein GI | 126442357 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.244986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACTATAC ATGATCATCA ATCATATTCA CGACAACAAA GGGCAAATAT GAAAAATTCC CGCAATGTCG TCAATCGTTT TATTGTCGCC GCTTCTATTA TCATTGGAGT CGTTCTTTAC AGTTCCGCTT GGGCAAATCC GCAGCCCATG CATACGAAGC AGGCACGTAT GCCGCGTATC CCGCAGAATC TCCCGCTTTC ACCAGACCAA GCCAAATACG ACCTGCCGCT CAGCAAGTAT GACCGCGCAA CGCTGATGGA GCCGTTGCGG CGGAAGCAAT CAGCGAAACC CGACAGGCGC ACCCGGCCTG GAGCAGATTG CCGCGACATG TCAATAATGA CGCAATATCA CGGCACGGCG CTTGCTGATT ACATAGCAAA CCTCCCGGAT TATGAGTGCC ACTACGGACT ATTCTCGATT GACAGGGCGA TGGCCGCGCA GATTTTCAAT TCTGAAAACG TGTGGGCTGT TGCCAGCCGT CTCACTCAAG AAATCAATCG TTACGACGCA ACAAATATTA CATTGGTAAA TTTGCTTATT TATCTGAGAG CCGCTTATTT CCAATATGAC GCAGCCCAGC TTGCTGATCC GGTTCCCGGT CTCGTAGTCT GGCTGCGTCC GTATATTTTG CAGAGCCTCT CTGGCGACGC GCTTTACCTC GAGAATTCAC GCGCGCCGAG CACCGCCAAC GAGCTGATGA TCCTAATCAC AAACATGAAG GACGAGGCGT ACTACCTGCC AACGCTGAAG GACCGAATCG CGTTCTACAC CGCGAGCGCG ACCAACCCTC AGGCTGCGGC GCCGCTACTG CAGCGAAGCG CGGCGGGTGG CTTCACCGGC TTGCTCACGG TGTTCTTCTA CGCGCATCAG CGCAGCGGCG CTCAGCCGAT GCTCGATAGC GATGCGACTC TGCCGGAGAC GCTCAACCGC TTCGTCACGG CGAACCGCGC ATACCTGTCG AACACCAGTG CCGCCTATCA GCTCGCCGAT GCGGCGCGCG AAACGTACCG CTTTCTCCGC TATCCGTCGC AGAAGCCGCG GGTGAAGAAA ATGATTCAGG ATATGCTCGC GTCGACTACC ATGACGGGCC CGGACAACGA CCTGTGGCTC GCGGCAGCGG AAGCAGCCGA TTACGGCGAT CCCGGCAACT GCGCAGATTA CGGCACGTGC GACTATCAGA AGCGGCTCAT CGAGGCAGTG CTCACGCATC GGTACTCATG CAATGCGAAC GTACGAATTC TCGCGCAGGA CATGACGGTG CCGCAATTCC AGTCGGCATG CCAATCGGTC GCCCAGGAGG AGGACTATTT CCACAGGATG ATGAAGACAG GGCACGTACC GGTCGCGAAC GATCACAATG ACACGATCGA AATAGTCGTA TTCGGCGACT ACGACAATTA TCGGAAGTAC GCTTCGGTGA TCTACGGAAT TAGCACCGAT AACGGCGGCA TGTACGTTGA AGGCGATCCG TCGGCACCCG GCAATCAGGC GCGCTTCATC GCGCACGAGG CTTCGTGGCT ACGGCCGGAG TTCAAGGTCT GGAACCTTGA GCACGAGTTT ACGCACTATC TCGACGGCCG TTACGACATG GCGGGCGACT TCGCGGCGAG CACGGCGAAG CCCACCGTGT GGTGGATCGA GGGTCTTGCC GAATATATCT CCAGAAAGAA CGATGACCAG GAATCGATCG ACGCGGTGCG CACGAACGCA TATCGGCTCT CGGACGTGCT TCAGACGACT TATTCGTCCG GCGACTATGT CACGCGCGCG TATCGATGGG GTTATATGGC GACGCGCTTC ATGTTTGAAC GTCATCGCAC GGACGTCGAC GCGATCGTGT CACGTTTTCG CGTGGGCGAT TACGACGGTT ACGCGGACTA TGTCGCGTAC ATGGGCAACC GCTATGACAG CGAGTTTGTT GACTGGGCAC GCGGCGCGAC AACAACCGGT GAGCCGCCGT TGCCGCCAAC GAAAGCGGGG CATTGA
|
Protein sequence | MTIHDHQSYS RQQRANMKNS RNVVNRFIVA ASIIIGVVLY SSAWANPQPM HTKQARMPRI PQNLPLSPDQ AKYDLPLSKY DRATLMEPLR RKQSAKPDRR TRPGADCRDM SIMTQYHGTA LADYIANLPD YECHYGLFSI DRAMAAQIFN SENVWAVASR LTQEINRYDA TNITLVNLLI YLRAAYFQYD AAQLADPVPG LVVWLRPYIL QSLSGDALYL ENSRAPSTAN ELMILITNMK DEAYYLPTLK DRIAFYTASA TNPQAAAPLL QRSAAGGFTG LLTVFFYAHQ RSGAQPMLDS DATLPETLNR FVTANRAYLS NTSAAYQLAD AARETYRFLR YPSQKPRVKK MIQDMLASTT MTGPDNDLWL AAAEAADYGD PGNCADYGTC DYQKRLIEAV LTHRYSCNAN VRILAQDMTV PQFQSACQSV AQEEDYFHRM MKTGHVPVAN DHNDTIEIVV FGDYDNYRKY ASVIYGISTD NGGMYVEGDP SAPGNQARFI AHEASWLRPE FKVWNLEHEF THYLDGRYDM AGDFAASTAK PTVWWIEGLA EYISRKNDDQ ESIDAVRTNA YRLSDVLQTT YSSGDYVTRA YRWGYMATRF MFERHRTDVD AIVSRFRVGD YDGYADYVAY MGNRYDSEFV DWARGATTTG EPPLPPTKAG H
|
| |