Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2749 |
Symbol | |
ID | 4903054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2714243 |
End bp | 2715277 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640135976 |
Product | U32 family peptidase |
Protein accession | YP_001067000 |
Protein GI | 126454060 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.254012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAA GCAGCCACTT CGCGACGGGC GCCGCGCCGA TCGAACTCGT GTGCCCGGCG GGCAGCCTGC CCGCGCTGAA GGCCGCGGTC GACAACGGCG CGGACTGCGT GTATCTCGGT TTTCGCGACG CGACGAACGC GCGCAACTTC GCCGGCCTGA ACTTCGACGC GCAGGCGATC GCGGCCGGCA TCCGCTATGC GCGCGAGCGC GGCCGCAAGG TGCTCGTCGC GCTCAACACG TATCCGCAGC CGGACGGCTG GGCCGCGTGG CGGGAGGCGG TGGGCCGCGC GGCCGACGCG GGCGTGGACG CGATCATCGT CGCCGATCCG GGGCTCATGC GCTTCGCGCG CGAGCGCTAC CCGGAGCTGC GACTGCACCT GTCGGTGCAG GGCTCGGCGA CGAACTACGA GGCGATCAAC TTCTATCACG AGCACTTCGG CGTTTCGCGC GCGGTGCTGC CGCGCGTGCT GTCGCTCGCG CAGGTCGAAC AGGTGGCCGA GAACACGCCG GTCGAAATCG AGGTGTTCGG CTTCGGCAGT CTGTGCGTGA TGGTCGAGGG GCGCTGCGCG CTGTCGTCGT ATGCAACGGG CGAATCGCCG AACACGCGCG GCGTGTGCTC GCCCGCGAAG GCGGTGCGCT GGCAGAAGAC GCCGGACGGC CTCGAATCGC GGCTGAACGG CGTGCTGATC GACCGCTACG AAGACGGCGA GAACGCCGGC TATCCGACGC TCTGCAAGGG GCGCTTCACG GTGGCCGACG AGAGCTACTA CGCGATCGAG GAACCGACGA GCCTGAACAC GCTCGAGCTG CTGCCGAAGC TGATGCAGAT CGGCATACGC GCGATCAAGA TCGAAGGCCG TCAGCGCAGC CCCGCGTACG TCGCGCAGGT GACGCGCGTG TGGCGCGATG CGATCGACCA GTGCACGGCG AACCTCGCGC GCTACTACGT GAAGCCCGCG TGGATGACGG AACTGAACAA GGTCGCGGAA GGGCAGCAGC ATACGCTCGG CGCCTACCAC CGGCCGTGGA AATGA
|
Protein sequence | MTQSSHFATG AAPIELVCPA GSLPALKAAV DNGADCVYLG FRDATNARNF AGLNFDAQAI AAGIRYARER GRKVLVALNT YPQPDGWAAW REAVGRAADA GVDAIIVADP GLMRFARERY PELRLHLSVQ GSATNYEAIN FYHEHFGVSR AVLPRVLSLA QVEQVAENTP VEIEVFGFGS LCVMVEGRCA LSSYATGESP NTRGVCSPAK AVRWQKTPDG LESRLNGVLI DRYEDGENAG YPTLCKGRFT VADESYYAIE EPTSLNTLEL LPKLMQIGIR AIKIEGRQRS PAYVAQVTRV WRDAIDQCTA NLARYYVKPA WMTELNKVAE GQQHTLGAYH RPWK
|
| |