Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1216 |
Symbol | colA |
ID | 4885883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1158037 |
End bp | 1159974 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640131155 |
Product | collagenase |
Protein accession | YP_001062213 |
Protein GI | 126444490 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000787421 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAGG TGTTCCGAAA AACCCGCCGC TGGTCCGCCG TGGCGGCGCT ATCGGCATTC GTGGGGCTGG CCGGCGCCGC GTCGGCCAAT ACGCAGCCGA TGCAACCGAC GCAACAAAAG CAGGCGCGCA TGCCGCGCCT GCCGCAGAAC CTGCCGGTTT CGCCCGAGCA GGCCGAATAC AATCTGCCGC TCAGCGAGCA GGATCGTGCG GCGCTCACCA GGCCTTCGCC GCTCAAGCAG CCGGCCAAGC GCGGCAAACG CAGCGCGCCG GGCGCCGATT GCCGCGACAT GTCGGTGATG ACTCAGTATC GCGGCGCCGC GCTCGCCGAT TACATCGCGA ATCTTCCCGA TTATGAATGC CATTACGGCC TGTTCTCGGT CGATAAAACC CTGGCCGCGC AGATTTTCAG TGCGGAAAAT GTGCATGCCG TCGCGAGCCG TTTCGTGCAG GATATCTATC GCTATGATGC GAGCAACTTG ATTCTGGTCA ATTTGCTGAT TTATCTGCGT TCCGCTTATT ACCAATATGA TGTATCGGGC ATTGCCAATC CGATTCCGAA TCTCGCGGTA TGGCTGCGCC CGTATATCAA GCAGAGCCTG GAGGGCGCCG CGCTCTATCG AGAGAACGCG CGCGCGCCGA GCACCGCGAA CGAGCTGATG AAGTTCATCA CGAACATGAA GGACGAGGCG TTCTATCTGC CCACGCTGAA GGCGCGCATT GCGTTCTACA CGGCGAGCGC GACGAATCCG CAGGCGGCGG CGCCGCTGTT GCAGCCGAGC GCGGCGGGCG GCTTCACCGG CCTGCTCACG GTGTTCTTCT ATGCGCATCA GCGCAGCGGC GCGCAGCCGA TGCTCGATAG CGACGCGACG CTGCCCGAGA CGCTCAACCG CTTCGTCACC GCGAACCGCG CGAGCCTGTC GAACACGAGC GCCGCGTACC AGCTCGCGGA CGCGGCGCGC GAAACGTTTC GCTTCCTGCG CTACCCGGCG CAGAAGCCGC GCGTGAAGAA GATGATCCAG GACATGCTCG CGTCGACGAG CATGACGGGC GCGGACAGCG ACCTGTGGCT CGCGGCGGCG GAAGCGGTCG ACTATGGCGA TCCGGGCAAC TGCGCGGACT ACGGCACGTG CGACTACAAG AAGCGGCTCA CCGACGCGGT GCTCACGCAT CGTTACGCGT GCAACGCGGG CGTGCGCATT CTCGCGCAGG ACATGACGCT GCCGCAGTTG CAGTCGGTCT GCACGTCGGT CGCGCAGCAG GACGACTACT TCCACCGGAT GATGAAGACC GGGCGCAAGC CGGTGGCGGG CGACCGCAAC GATACGATCG AGCTCGTCAT CTTCGACGAC TACGCGAACT ATCGAAAATA TGCTTCGGTG ATCTACGGCA TCAGCACCGA CAACGGCGGC ATGTATCTCG AAGGCGATCC GTCCGCGCCC GGCAACCAGG CGCGCTTCAT TGCGCACGAG GCGTCGTGGT TGCGGCCCGA GTTCAAGGTC TGGAACCTCG AGCACGAGTT CACGCACTAT CTCGACGGCC GCTACGACAT GGCGGGCGAT TTCGCGGCGA GCACCGCGAA GCCGACCGTC TGGTGGATCG AGGGTCTCGC CGAATATCTG TCGAGAAAGA ACGACAATCA GGAGTCGATC GATGCGGCGC GCACGGGCGC GTACCGCTTC TCGGACGTGC TCGGCACGCT GTATTCGTCG AGCGACTACG TCGCGCGCGC CTACCGTTGG GGCTACATGG CGACACGCTT CATGTTCGAG CGCCATCGCG CGGACGTGGA CACGATCGTG TCGCGCTTCC GGGTGGGCGA CTACGACGGC TACGCGAACT ACGTCGCGTA CATCGGCAAC CGCTACGACG GCGAGTTCGT CGATTGGGCG CGCGCGGCGA CCACGGCGGG CGAGCCGCCG CTGCCGACGA AGCGTTGA
|
Protein sequence | MTEVFRKTRR WSAVAALSAF VGLAGAASAN TQPMQPTQQK QARMPRLPQN LPVSPEQAEY NLPLSEQDRA ALTRPSPLKQ PAKRGKRSAP GADCRDMSVM TQYRGAALAD YIANLPDYEC HYGLFSVDKT LAAQIFSAEN VHAVASRFVQ DIYRYDASNL ILVNLLIYLR SAYYQYDVSG IANPIPNLAV WLRPYIKQSL EGAALYRENA RAPSTANELM KFITNMKDEA FYLPTLKARI AFYTASATNP QAAAPLLQPS AAGGFTGLLT VFFYAHQRSG AQPMLDSDAT LPETLNRFVT ANRASLSNTS AAYQLADAAR ETFRFLRYPA QKPRVKKMIQ DMLASTSMTG ADSDLWLAAA EAVDYGDPGN CADYGTCDYK KRLTDAVLTH RYACNAGVRI LAQDMTLPQL QSVCTSVAQQ DDYFHRMMKT GRKPVAGDRN DTIELVIFDD YANYRKYASV IYGISTDNGG MYLEGDPSAP GNQARFIAHE ASWLRPEFKV WNLEHEFTHY LDGRYDMAGD FAASTAKPTV WWIEGLAEYL SRKNDNQESI DAARTGAYRF SDVLGTLYSS SDYVARAYRW GYMATRFMFE RHRADVDTIV SRFRVGDYDG YANYVAYIGN RYDGEFVDWA RAATTAGEPP LPTKR
|
| |