Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_A1736 |
Symbol | |
ID | 4889462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | - |
Start bp | 1664563 |
End bp | 1666500 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640148001 |
Product | putative collagenase |
Protein accession | YP_001078919 |
Protein GI | 126445968 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAGG TGTTCCGAAA AAACCGCCGC TGGTCCGCCG TGGCGGCGCT ATCGGCATTC GTGGGGCTGG CCGGCGCCGC GTCGGCCAAT ACGCAGCCGA TGCAACCGAC GCAGCAAAAG CAGGCGCGCC TGCCGCGCCT GCCGCAGAAC CTGCCGGTTT CGCCCGAGCA GGCCGAATAC AACCTGCCGC TCAGCGAGCA GGATCGTGCG GCGCTCACCA GGCCTTCGCC GCTCAAGCAG CCGGCCAAGC GCGGCAAACG CAGCGCGCCG GGCGCCGATT GCCGCGACAT GTCGGTGATG ACTCAGTATC GCGGCGCCGC GCTCGCCGAT TACATCGCGA ATCTTCCCGA TTATGAATGC CATTACGGCC TGTTCTCGGT CGATAAAACC CTGGCTGCGC AGATTTTCAG TGCGGAAAAT GTGTATGCCG TCGCGAGCCG TTTCGTGCAG GATATCTATC GCTATGATGC GAGCAACTTG ATTCTGGTCA ATTTACTGAT TTATCTGCGT TCCGCTTATT ACCAATATGA TGTATCGGGC ATTGCCAATC CGATTCCGAA TCTCGCGGTA TGGCTGCGCC CGTATATCAA GCAGAGCCTG GAGGGCGCCG CGCTCTATCG AGAGAACGCG CGCGCGCCGA GCACCGCGAA CGAGCTGATG AAGCTCATCA CGAACATGAA GGACGAGGCG TTCTATCTGC CCACGCTGAA GGCGCGCATT GCGTTCTACA CGGCGAGCGC GACGAATCCG CAGGCGGCGG CGCCGCTGTT GCAGCCGAGC GCGGCGGGCG GCTTCACCGG CTTGCTCACG GTGTTCTTCT ATGCGCATCA GCGCAGCGGC GCGCAGCCGA TGCTCGATAG CGACGCGACG CTGCCCGAGA CGCTCAACCG CTTCGTCACC GCGAACCGCG CGAGCCTGTC GAACACGAGC GCCGCGTACC AGCTCGCGGA CGCGGCGCGC GAAACGTTTC GCTTCCTGCG CTACCCGGCG CAGAAGCCGC GCGTGAAGAA GATGATCCAG GACATGCTCG CGTCGACGAG CATGACGGGC GCGGACAGCG ACCTGTGGCT CGCGGCGGCG GAAGCGGTCG ACTATGGCGA TCCGGGCAAC TGCGCGGACT ACGGCACGTG CGACTACAAG AAACGGCTCA CCGACGCGGT GCTCACGCAT CGTTACGCGT GCAACGCGGG CGTGCGCATT CTCGCGCAGG ACATGACGAT GCCGCAGTTG CAGTCGGTCT GCACGTCGGT CGCGCAGCAG GACGACTACT TCCACCGGAT GATGAAGACC GGGCGCAAGC CGGTGGCGGG CGACCGCAAC GATACGATCG AGCTCGTCAT CTTCGACGAC TACGCGAACT ATCGAAAATA TGCTTCGGTG ATCTACGGCA TCAGCACCGA CAACGGCGGC ATGTATCTCG AAGGCGATCC GTCCGCGCCC GGCAACCAGG CGCGCTTCAT TGCGCACGAG GCGTCGTGGT TGCGGCCCGA GTTCAAGGTC TGGAACCTCG AGCACGAGTT CACGCACTAT CTCGACGGCC GCTACGACAT GGCGGGCGAT TTCGCGGCGA GCACCGCGAA GCCGACCGTC TGGTGGATCG AGGGTCTCGC CGAATATCTG TCGAGAAAGA ACGACAATCA GGAGTCGATC GACGCGGCGC GCACGGGCGC GTACCGCTTC TCGGACGTGC TCGGCACGCT GTATTCGTCG AGCGACTACG TCGCGCGCGC CTACCGTTGG GGCTACATGG CGACACGCTT CATGTTCGAG CGCCATCGCG CGGACGTGGA TACGATCGTG TCGCGCTTCC GGGTGGGCGA CTACGACGGC TACGCGAACT ATGTCGCGTA CATCGGCAAC CGCTACGACG GCGAGTTCGT CGATTGGGCG CGCGCGGCGA CCACGGCGGG CGAGCCGCCG CTGCCGACGA AGCGTTGA
|
Protein sequence | MTEVFRKNRR WSAVAALSAF VGLAGAASAN TQPMQPTQQK QARLPRLPQN LPVSPEQAEY NLPLSEQDRA ALTRPSPLKQ PAKRGKRSAP GADCRDMSVM TQYRGAALAD YIANLPDYEC HYGLFSVDKT LAAQIFSAEN VYAVASRFVQ DIYRYDASNL ILVNLLIYLR SAYYQYDVSG IANPIPNLAV WLRPYIKQSL EGAALYRENA RAPSTANELM KLITNMKDEA FYLPTLKARI AFYTASATNP QAAAPLLQPS AAGGFTGLLT VFFYAHQRSG AQPMLDSDAT LPETLNRFVT ANRASLSNTS AAYQLADAAR ETFRFLRYPA QKPRVKKMIQ DMLASTSMTG ADSDLWLAAA EAVDYGDPGN CADYGTCDYK KRLTDAVLTH RYACNAGVRI LAQDMTMPQL QSVCTSVAQQ DDYFHRMMKT GRKPVAGDRN DTIELVIFDD YANYRKYASV IYGISTDNGG MYLEGDPSAP GNQARFIAHE ASWLRPEFKV WNLEHEFTHY LDGRYDMAGD FAASTAKPTV WWIEGLAEYL SRKNDNQESI DAARTGAYRF SDVLGTLYSS SDYVARAYRW GYMATRFMFE RHRADVDTIV SRFRVGDYDG YANYVAYIGN RYDGEFVDWA RAATTAGEPP LPTKR
|
| |