Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_0773 |
Symbol | |
ID | 4790330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | - |
Start bp | 806901 |
End bp | 808844 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | putative collagenase |
Protein accession | YP_001024588 |
Protein GI | 124382282 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.293551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATGA CGGAGGTGTT CCGAAAAAAC CGCCGCTGGT CCGCCGTGGC GGCGCTATCG GCATTCGTGG GGCTGGCCGG CGCCGCGTCG GCCAATACGC AGCCGATGCA ACCGACGCAG CAAAAGCAGG CGCGCCTGCC GCGCCTGCCG CAGAACCTGC CGGTTTCGCC CGAGCAGGCC GAATACAACC TGCCGCTCAG CGAGCAGGAT CGTGCGGCGC TCACCAGGCC TTCGCCGCTC AAGCAGCCGG CCAAGCGCGG CAAACGCAGC GCGCCGGGCG CCGATTGCCG CGACATGTCG GTGATGACTC AGTATCGCGG CGCCGCGCTC GCCGATTACA TCGCGAATCT TCCCGATTAT GAATGCCATT ACGGCCTGTT CTCGGTCGAT AAAACCCTGG CTGCGCAGAT TTTCAGTGCG GAAAATGTGT ATGCCGTCGC GAGCCGTTTC GTGCAGGATA TCTATCGCTA TGATGCGAGC AACTTGATTC TGGTCAATTT ACTGATTTAT CTGCGTTCCG CTTATTACCA ATATGATGTA TCGGGCATTG CCAATCCGAT TCCGAATCTC GCGGTATGGC TGCGCCCGTA TATCAAGCAG AGCCTGGAGG GCGCCGCGCT CTATCGAGAG AACGCGCGCG CGCCGAGCAC CGCGAACGAG CTGATGAAGC TCATCACGAA CATGAAGGAC GAGGCGTTCT ATCTGCCCAC GCTGAAGGCG CGCATTGCGT TCTACACGGC GAGCGCGACG AATCCGCAGG CGGCGGCGCC GCTGTTGCAG CCGAGCGCGG CGGGCGGCTT CACCGGCTTG CTCACGGTGT TCTTCTATGC GCATCAGCGC AGCGGCGCGC AGCCGATGCT CGATAGCGAC GCGACGCTGC CCGAGACGCT CAACCGCTTC GTCACCGCGA ACCGCGCGAG CCTGTCGAAC ACGAGCGCCG CGTACCAGCT CGCGGACGCG GCGCGCGAAA CGTTTCGCTT CCTGCGCTAC CCGGCGCAGA AGCCGCGCGT GAAGAAGATG ATCCAGGACA TGCTCGCGTC GACGAGCATG ACGGGCGCGG ACAGCGACCT GTGGCTCGCG GCGGCGGAAG CGGTCGACTA TGGCGATCCG GGCAACTGCG CGGACTACGG CACGTGCGAC TACAAGAAAC GGCTCACCGA CGCGGTGCTC ACGCATCGTT ACGCGTGCAA CGCGGGCGTG CGCATTCTCG CGCAGGACAT GACGATGCCG CAGTTGCAGT CGGTCTGCAC GTCGGTCGCG CAGCAGGACG ACTACTTCCA CCGGATGATG AAGACCGGGC GCAAGCCGGT GGCGGGCGAC CGCAACGATA CGATCGAGCT CGTCATCTTC GACGACTACG CGAACTATCG AAAATATGCT TCGGTGATCT ACGGCATCAG CACCGACAAC GGCGGCATGT ATCTCGAAGG CGATCCGTCC GCGCCCGGCA ACCAGGCGCG CTTCATTGCG CACGAGGCGT CGTGGTTGCG GCCCGAGTTC AAGGTCTGGA ACCTCGAGCA CGAGTTCACG CACTATCTCG ACGGCCGCTA CGACATGGCG GGCGATTTCG CGGCGAGCAC CGCGAAGCCG ACCGTCTGGT GGATCGAGGG TCTCGCCGAA TATCTGTCGA GAAAGAACGA CAATCAGGAG TCGATCGACG CGGCGCGCAC GGGCGCGTAC CGCTTCTCGG ACGTGCTCGG CACGCTGTAT TCGTCGAGCG ATTACGTCGC GCGCGCCTAC CGTTGGGGCT ACATGGCGAC ACGCTTCATG TTCGAGCGCC ATCGCGCGGA CGTGGATACG ATCGTGTCGC GCTTCCGGGT GGGCGACTAC GACGGCTACG CGAACTATGT CGCGTACATC GGCAACCGCT ACGACGGCGA GTTCGTCGAT TGGGCGCGCG CGGCGACCAC GGCGGGCGAG CCGCCGCTGC CGACGAAGCG TTGA
|
Protein sequence | MPMTEVFRKN RRWSAVAALS AFVGLAGAAS ANTQPMQPTQ QKQARLPRLP QNLPVSPEQA EYNLPLSEQD RAALTRPSPL KQPAKRGKRS APGADCRDMS VMTQYRGAAL ADYIANLPDY ECHYGLFSVD KTLAAQIFSA ENVYAVASRF VQDIYRYDAS NLILVNLLIY LRSAYYQYDV SGIANPIPNL AVWLRPYIKQ SLEGAALYRE NARAPSTANE LMKLITNMKD EAFYLPTLKA RIAFYTASAT NPQAAAPLLQ PSAAGGFTGL LTVFFYAHQR SGAQPMLDSD ATLPETLNRF VTANRASLSN TSAAYQLADA ARETFRFLRY PAQKPRVKKM IQDMLASTSM TGADSDLWLA AAEAVDYGDP GNCADYGTCD YKKRLTDAVL THRYACNAGV RILAQDMTMP QLQSVCTSVA QQDDYFHRMM KTGRKPVAGD RNDTIELVIF DDYANYRKYA SVIYGISTDN GGMYLEGDPS APGNQARFIA HEASWLRPEF KVWNLEHEFT HYLDGRYDMA GDFAASTAKP TVWWIEGLAE YLSRKNDNQE SIDAARTGAY RFSDVLGTLY SSSDYVARAY RWGYMATRFM FERHRADVDT IVSRFRVGDY DGYANYVAYI GNRYDGEFVD WARAATTAGE PPLPTKR
|
| |