Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A0900 |
Symbol | |
ID | 4904764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 883495 |
End bp | 885432 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640144006 |
Product | collagenase |
Protein accession | YP_001074936 |
Protein GI | 126456492 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATT CCCACAATGT CGTCAATCGC TTTATTGTCG CCGCTTCTAT TATCATTGGA GTCGTTCTTT ACAGTTCCGC TTGGGCAAAT CCGCAGCCCA TGCATACGAA GCAGGCACGT ATGCCGCGTA TCCCGCAGAA TCTCCCGCTT TCACCAGACC AAGCCAAATA CGACCTGCCG CTCAGCAAGT ATGACCGCGC AACGCTGATG GAGCCGTTGC GGCGGAAGCA ATCAGCGAAA CCCGACAGGC GCACCCGGCC TGGAGCAGAT TGCCGCGACA TGTCAATAAT GACGCAATAT CACGGTACGG CGCTTGCTGA TTACATAGCA AACCTCCCCG ATTATGAGTG CCACTACGGA CTATTCTCGA TTGACAGGGC GATGGCCGCG CAGATTTTCA ATTCTGAAAA CGTGTGGGCT GTTGCCAGCC GTCTCACTCA AGAAATCAAT CGTTACGACG CAACAAATAT TACATTGGTA AATTTGCTTA TTTATCTGAG AGCCGCTTAT TTCCAATATG ACGCAGCCCA GCTTGCTGAT CCGATTCCCG GTCTCGTAGT CTGGCTGCGT CCGTATATTT TGCAGAGCCT CTCTGGCGAC GCGCTTTACC TCGAGAATTC ACGCGCGCCG AGCACCGCCA ACGAGCTGAT GATCCTAATC ACAAACATGA AGGACGAGGC GTACTACCTG CCAACGCTGA AGGACCGAAT CGCGTTCTAC ACCGCGAGCG CGACCAACCC TCAGGCTGCG GCGCCGCTAC TGCAGCGAAG CGCGGCGGGT GGCTTCACCG GCTTGCTCAC GGTGTTCTTC TACGCGCATC AGCGCAGCGG CGCTCAGCCG ATGCTCGATA GCGATGCGAC TCTGCCGGAG ACGCTCAACC GCTTCGTCAC GGCGAACCGC GCATACCTGT CGAACACCAG TGCCGCCTAT CAGCTCGCCG ATGCGGCGCG CGAAACGTAC CGCTTTCTCC GCTATCCGTC GCAGAAGCCG CGGGTGAAGA AAATGATTCA GGATATGCTC GCGTCGACTA CCATGACGGG CCCGGACAAC GACCTGTGGC TCGCGGCAGC GGAAGCAGCC GATTACGGCG ATCCCGGCAA CTGCGCAGAT TACGGCACGT GCGACTATCA GAAGCGGCTC ATCGAGGCAG TGCTCACGCA TCGGTACTCA TGCAATGCGA ACGTACGAAT TCTCGCGCAG GACATGACGG TGCCGCAATT CCAGTCGGCA TGCCAATCGG TCGCCCAGGA GGAGGACTAT TTCCACCGGA TGATGAAGAC AGGGCACGTA CCGGTCGCGA ACGATCACAA TGACACGATC GAAATAGTCG TATTCGGCGA CTACGACAAT TATCGGAAGT ACGCTTCGGT GATCTACGGA ATTAGCACCG ATAACGGCGG CATGTACGTT GAAGGCGATC CGTCGGCACC CGGCAATCAG GCGCGCTTCA TCGCGCACGA GGCTTCGTGG CTACGGCCGG AGTTCAAGGT CTGGAACCTT GAGCACGAGT TTACGCACTA TCTCGACGGC CGTTACGACA TGGCGGGCGA CTTCGCGGCG AGCACGGCGA AGCCCACCGT GTGGTGGATC GAAGGTCTTG CCGAATATAT CTCCAGAAAG AACGATGACC AGGAATCGAT CGACGCGGTG CGCACGAACG CATATCGGCT CTCGGACGTG CTTCAGACGA CTTATTCGTC CGGCGACTAT GTCACGCGTG CGTATCGATG GGGTTATATG GCGACGCGCT TCATGTTTGA ACGTCATCGC GCGGACGTCG ACGCGATCGT GTCACGTTTT CGCGTGGGCG ATTACGACGG TTACGCGGAC TATGTCGCGT ACATGGGCAA CCGCTATGAC AGCGAGTTTG TTGACTGGGC ACGCGGCGCG ACAACAACCG GTGAGCCGCC GTTGCCGCCA ACGAAAGCGG GGCATTGA
|
Protein sequence | MKNSHNVVNR FIVAASIIIG VVLYSSAWAN PQPMHTKQAR MPRIPQNLPL SPDQAKYDLP LSKYDRATLM EPLRRKQSAK PDRRTRPGAD CRDMSIMTQY HGTALADYIA NLPDYECHYG LFSIDRAMAA QIFNSENVWA VASRLTQEIN RYDATNITLV NLLIYLRAAY FQYDAAQLAD PIPGLVVWLR PYILQSLSGD ALYLENSRAP STANELMILI TNMKDEAYYL PTLKDRIAFY TASATNPQAA APLLQRSAAG GFTGLLTVFF YAHQRSGAQP MLDSDATLPE TLNRFVTANR AYLSNTSAAY QLADAARETY RFLRYPSQKP RVKKMIQDML ASTTMTGPDN DLWLAAAEAA DYGDPGNCAD YGTCDYQKRL IEAVLTHRYS CNANVRILAQ DMTVPQFQSA CQSVAQEEDY FHRMMKTGHV PVANDHNDTI EIVVFGDYDN YRKYASVIYG ISTDNGGMYV EGDPSAPGNQ ARFIAHEASW LRPEFKVWNL EHEFTHYLDG RYDMAGDFAA STAKPTVWWI EGLAEYISRK NDDQESIDAV RTNAYRLSDV LQTTYSSGDY VTRAYRWGYM ATRFMFERHR ADVDAIVSRF RVGDYDGYAD YVAYMGNRYD SEFVDWARGA TTTGEPPLPP TKAGH
|
| |