Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1142 |
Symbol | colA |
ID | 4905947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1095475 |
End bp | 1097418 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640144248 |
Product | collagenase |
Protein accession | YP_001075177 |
Protein GI | 126455591 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATGA CGGAGGTGTT CCGAAAAACC CGCCGCTGGT CCGCCGTGGC GGCGCTATCG GCATTCGTGG GGCTGGCCGG CGCCGCGTCG GCCAATACGC AGCCGATGCA ACCGACGCAG CAAAAGCAGG CGCGCATGCC GCGCCTGCCG CAGAACCTGC CGGTTTCGCC CGAGCAGGCC GAATACAACC TGCCGCTCAG CGAGCAGGAT CGTGCGGCGC TCACCAGGCC TTCGCCGCTC AAGCAGCCGG CCAAGCGCGG CAAACGCAGC GCGCCGGGCG CCGATTGCCG CGACATGTCG GTGATGACTC AGTATCGCGG CGCCGCGCTC GCCGATTACA TCGCGAATCT TCCCGATTAT GAATGCCATT ACGGCCTGTT CTCGGTCGAT AAAACCCTGG CTGCGCAGAT TTTCAGTGCG GAAAATGTGT ATGCCGTCGC GAGCCGTTTC GTGCAGGATA TCTATCGCTA TGATGCGAGC AACTTGATTC TGGTCAATTT ACTGATTTAT CTGCGTTCCG CTTATTACCA ATATGATGTA TCGGGCATTG CCAATCCGAT TCCGAATCTC GCGGTATGGC TGCGCCCGTA TATCAAGCAG AGCCTGGAGG GCGCCGCGCT CTATCGAGAG AACGCGCGCG CGCCGAGCAC CGCGAACGAG CTGATGAAGC TCATCACGAA CATGAAGGAC GAGGCGTTCT ATCTGCCCAC GCTGAAGGCG CGCATTGCGT TCTACACGGC GAGCGCGACG AATCCGCAGG CGGCGGCGCC GCTGTTGCAG CCGAGCGCGG CGGGCGGCTT CACCGGCCTG CTCACGGTGT TCTTCTATGC GCATCAGCGC AGCGGCGCGC AGCCGATGCT CGATAGCGAC GCGACGCTGC CCGAGACGCT CAATCGCTTC GTCACCGCGA ACCGCGCGAG CCTGTCGAAC ACGAGCGCCG CGTACCAGCT CGCGGACGCG GCGCGCGAAA CGTTTCGCTT CCTGCGCTAC CCGGCGCAGA AGCCGCGCGT GAAGAAGATG ATCCAGGACA TGCTCGCGTC GACGAGCATG ACGGGCGCGG ACAGCGACCT GTGGCTCGCG GCGGCGGAAG CGGTCGACTA TGGCGATCCG GGCAACTGCG CGGACTACGG CACGTGCGAC TACAAGAAGC GGCTCACCGA TGCGGTGCTC ACGCATCGTT ACGCGTGCAA CGCGGGCGTG CGCATTCTCG CGCAGGACAT GACGCTGCCG CAGTTGCAGT CGGTCTGCAC GTCGGTCGCG CAGCAGGACG ACTACTTCCA CCGGATGATG AAGACCGGGC GCAAGCCGGT GGCGGGCGAC CGCAACGATA CGATCGAGCT CGTCATCTTC GACGACTACG CGAACTATCG AAAATATGCT TCGGTGATCT ACGGCATCAG CACCGACAAC GGCGGCATGT ATCTCGAAGG CGATCCGTCC GCGCCCGGCA ACCAGGCGCG CTTCATTGCG CACGAGGCGT CGTGGTTGCG GCCCGAGTTC AAGGTCTGGA ACCTCGAGCA CGAGTTCACG CACTATCTCG ACGGCCGCTA CGACATGGCG GGCGATTTCG CGGCGAGCAC CGCGAAGCCG ACCGTCTGGT GGATCGAGGG TCTCGCCGAA TATCTGTCGA GAAAGAACGA CAATCAGGAG TCGATCGATG CGGCGCGCAC GGGCGCGTAC CGCTTCTCGG ACGTGCTCGG CACGCTGTAT TCGTCGAGCG ACTACGTCGC GCGCGCCTAC CGTTGGGGCT ACATGGCGAC ACGCTTCATG TTCGAGCGCC ATCGCGCGGA CGTGGATACG ATCGTGTCGC GCTTCCGGGT GGGCGACTAC GACGGCTACG CGAACTATGT CGCGTACATC GGTAACCGCT ACGACGGCGA GTTCGTCGAT TGGGCGCGCG CGGCGACCAC GGCGGGCGAG CCGCCGCTGC CGACGAAGCG TTGA
|
Protein sequence | MPMTEVFRKT RRWSAVAALS AFVGLAGAAS ANTQPMQPTQ QKQARMPRLP QNLPVSPEQA EYNLPLSEQD RAALTRPSPL KQPAKRGKRS APGADCRDMS VMTQYRGAAL ADYIANLPDY ECHYGLFSVD KTLAAQIFSA ENVYAVASRF VQDIYRYDAS NLILVNLLIY LRSAYYQYDV SGIANPIPNL AVWLRPYIKQ SLEGAALYRE NARAPSTANE LMKLITNMKD EAFYLPTLKA RIAFYTASAT NPQAAAPLLQ PSAAGGFTGL LTVFFYAHQR SGAQPMLDSD ATLPETLNRF VTANRASLSN TSAAYQLADA ARETFRFLRY PAQKPRVKKM IQDMLASTSM TGADSDLWLA AAEAVDYGDP GNCADYGTCD YKKRLTDAVL THRYACNAGV RILAQDMTLP QLQSVCTSVA QQDDYFHRMM KTGRKPVAGD RNDTIELVIF DDYANYRKYA SVIYGISTDN GGMYLEGDPS APGNQARFIA HEASWLRPEF KVWNLEHEFT HYLDGRYDMA GDFAASTAKP TVWWIEGLAE YLSRKNDNQE SIDAARTGAY RFSDVLGTLY SSSDYVARAY RWGYMATRFM FERHRADVDT IVSRFRVGDY DGYANYVAYI GNRYDGEFVD WARAATTAGE PPLPTKR
|
| |