Gene Information |
Locus tag | BTH_II1578 |
Symbol | |
ID | 3844676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1849092 |
End bp | 1850900 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637838879 |
Product | collagenase, putative |
Protein accession | YP_439773 |
Protein GI | 83717827 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
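The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention, although both agree with the listed values. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1849092, 1850900)
print(length)                  # 1809, matching "Gene Length"
print(protein_length(length))  # 602, matching "Protein Length"
```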
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.285851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGCA TGCCGCAGAA TCTGCCGGTT TCACCCGAGC AGGCCGAGTA CAACCTGCCG CTCAGCGAGC AGGACCGGGC GGCGTTGACC AAGCCTTCGC AGCTCAAGCA GCAGGCCAAG CGCAGCAAGC GCAGCGCGCC GGGCGCCGAT TGCCGCGACA TGTCGGCGAT GACGCGGTAT CGCGGCGCGG CACTCGCCGA TTACATCGCG AATCTTCCCG ATTACGAATG CCATTACGGC TTGTTCTCAG TCGATAAAAC GCTGGCTCAG CAGATTTTCA ATGCCGAAAA CGTGCATGCC GTCGCGAGCC GTTTTGTGCA GGAAGTCTAT CGCTATGATG CGAGCAATTT GATTCTGGTC AATTTGCTGA TTTATCTGCG TTCCGCTTAT TACCAATATG ATGTATCGGG CATTGCCGAT CCGATTCCGG ATCTCGCCGT GTGGCTGCGT CCGTATATCA AGCAAAGCCT CGAAGGCGAG GCGCTCTATC GCGAGAACGA CCGCGCGCCG AGCACGGCGA ACGAGCTGAT GAAGCTCATC ACGAACATGA AGGACGAGGC GTACTATCTG CCGACGCTGA AGAACCGCAT CGCGTCCTAC ACGGCGAGCG CGACGAATCC GCAGGCGGCG GCGCCGCTGT TGCAGCGCAG CGCGGCGGGC GGCTTCACCG GCTTGCTCAC GGTGTTCTTC TACGCGCATC AGCGCAGCGG CGCGCGGCAG ATGCTCGACA GCGACGCGAC GCTGCCGGAG ACGCTCAACC GCTTCGTCAC CGCGAACCGA GCGAGCCTGT CGAATACGAG CGCCGCGTAT CAGCTTGCCG ACGCCGCACG CGAGACGTTT CGCTTTCTCC GTTATCCGAC GCAGAAGCCG CGCGTGAAGA AGATGATCCA GGACATCCTG GCATCGACGA GCCTGACGGG GGCGGACAAC GATCTGTGGC TCGCGGCGGC GGAAGCGGTC GACTACGGCG ACGCGGGCAA CTGCGCGGAC TACGGCACGT GCGACTACAA GAAGCGGCTG ACCGATGCGG TGCTCACGCA TCGTCACGCA TGCAACGCGA GCGTGCGCAT TCTCGCGCAG GACATGACGG CGCCGCAGTT GCAGTCGGTC TGCGCGGCGG TCGCGCAGCA GGACGATTAC TTCCACCGGA TGATGAAGAC CGGGCGCAAG CCGGTCGCGG GCGATCGCAA CGACACGATC GAACTCGTCA TCTTCGACGA CTACGCGAAC TATCGCAAAT ACGCTTCGGT GATCTACGGC ATCAGCACCG ACAACGGCGG CATGTACCTC GAAGGCGATC CGTCCGCGCC CGGCAACCAG GCGCGCTTCA TTGCCCATGA GGCGTCGTGG CTGCGGCCCG AGTTCAAGGT CTGGAACCTC GAGCACGAGT TCACGCACTA TCTGGACGGC CGCTACGACA TGGCGGGCGA CTTCGCGGCG AGCACCGCGA AGCCCACCGT CTGGTGGATC GAAGGCGTCG CCGAATATCT GTCGAGAAAG AACGGCAACC AGGAGTCGAT CGACGCGGCG CGCACGGGCG CGTACCGGTT CGCGGACGTG CTCGGCACGC TGTATTCGTC GAGCGACTAC GTCGCGCGCG CATACCGGTG GGGCTACATG GCGACGCGCT TCATGTTCGA GCGCCACCGC GCGGACGTCG ACACGATCGT GTCGCGCTTC CGGGCGGGCG ATTACGACGG CTACGCGAAC TACGTCGCGT ACATCGGCAA CCGCTACGAC AACGAGTTCG TCGACTGGGC GCGCAACGCG ACGACGGCGG GCGAGCCGCC GCTGCCGACG 
CAGCGCTGA
|
Protein sequence | MPRMPQNLPV SPEQAEYNLP LSEQDRAALT KPSQLKQQAK RSKRSAPGAD CRDMSAMTRY RGAALADYIA NLPDYECHYG LFSVDKTLAQ QIFNAENVHA VASRFVQEVY RYDASNLILV NLLIYLRSAY YQYDVSGIAD PIPDLAVWLR PYIKQSLEGE ALYRENDRAP STANELMKLI TNMKDEAYYL PTLKNRIASY TASATNPQAA APLLQRSAAG GFTGLLTVFF YAHQRSGARQ MLDSDATLPE TLNRFVTANR ASLSNTSAAY QLADAARETF RFLRYPTQKP RVKKMIQDIL ASTSLTGADN DLWLAAAEAV DYGDAGNCAD YGTCDYKKRL TDAVLTHRHA CNASVRILAQ DMTAPQLQSV CAAVAQQDDY FHRMMKTGRK PVAGDRNDTI ELVIFDDYAN YRKYASVIYG ISTDNGGMYL EGDPSAPGNQ ARFIAHEASW LRPEFKVWNL EHEFTHYLDG RYDMAGDFAA STAKPTVWWI EGVAEYLSRK NGNQESIDAA RTGAYRFADV LGTLYSSSDY VARAYRWGYM ATRFMFERHR ADVDTIVSRF RAGDYDGYAN YVAYIGNRYD NEFVDWARNA TTAGEPPLPT QR
|
| |
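The protein sequence can be reproduced from the gene sequence, and the GC content recomputed, with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard code, differing only in which codons may act as starts. Alternative start codons are not handled here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGCCGCGCA TGCCGCAGAA TCTGCCGGTT"))  # MPRMPQNLPV
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the "Protein sequence" and "GC content" fields.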