Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1804 |
Symbol | |
ID | 3848939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2022908 |
End bp | 2023942 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841473 |
Product | U32 family peptidase |
Protein accession | YP_442336 |
Protein GI | 83720783 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAA GCAGCCACTT CGCGACGGGC GCCGCGCCGA TCGAACTCGT GTGCCCGGCG GGCAGCCTGC CCGCGCTGAA GGCCGCGGTC GACAACGGCG CGGACTGCGT GTATCTCGGT TTTCGCGACG CGACGAACGC GCGCAACTTC GCCGGCCTGA ACTTCGACGC GCAGGCGATC GACGCCGGCA TTCGCTACGC GCGCGAGCGC GGCCGCAAGG TGCTCGTCGC GCTCAACACG TATCCGCAGC CGGACGGCTG GGCCGCATGG CGGGAAGCGG TCGGCCGCGC GGCCGACGCG GGCGTCGACG CGATCATCGT CGCCGATCCG GGGCTCATGC GTTTCGCGCG CGAGCGCTAT CCGGACCTGC GGCTGCACCT GTCGGTGCAG GGCTCGGCGA CGAACTACGA GGCGATCAAC TTCTATCACG AGCACTTCGG CATATCGCGC GCGGTGCTGC CGCGCGTGCT GTCGCTCGCG CAGGTCGAGC AGGTGACCGA AAACACGCCG GTCGAAATCG AGGTGTTCGG CTTCGGCAGT CTGTGCGTGA TGGTCGAGGG GCGCTGCGCG CTGTCGTCGT ACGCGACGGG CGAATCGCCG AACACGCGCG GCGTGTGCTC GCCCGCGAAG GCGGTGCGCT GGCAGAAGAC GCCGGACGGC CTCGAATCGC GGCTGAACGG CGTGCTGATC GACCGTTACG AAGACGGCGA GAACGCCGGC TATCCGACGC TCTGCAAGGG GCGCTTCACG GTGGCCGACG AGAGCTACTA CGCGATCGAG GAGCCGACGA GCCTGAACAC GCTCGAACTG CTGCCGAAGC TGATGCAGAT CGGCATACGG GCGATCAAGA TCGAAGGCCG TCAGCGCAGC CCCGCGTACG TCGCGCAGGT GACGCGCGTC TGGCGCGACG CGATCGATCA GTGCGCATCG AACCTCGCGC GCTACTACGT GAAGCCCGCG TGGATGACGG AACTGAACAA GGTCGCGGAA GGGCAGCAGC ATACGCTCGG CGCCTACCAC CGGCCGTGGA AATGA
|
Protein sequence | MTQSSHFATG AAPIELVCPA GSLPALKAAV DNGADCVYLG FRDATNARNF AGLNFDAQAI DAGIRYARER GRKVLVALNT YPQPDGWAAW REAVGRAADA GVDAIIVADP GLMRFARERY PDLRLHLSVQ GSATNYEAIN FYHEHFGISR AVLPRVLSLA QVEQVTENTP VEIEVFGFGS LCVMVEGRCA LSSYATGESP NTRGVCSPAK AVRWQKTPDG LESRLNGVLI DRYEDGENAG YPTLCKGRFT VADESYYAIE EPTSLNTLEL LPKLMQIGIR AIKIEGRQRS PAYVAQVTRV WRDAIDQCAS NLARYYVKPA WMTELNKVAE GQQHTLGAYH RPWK
|
| |