Gene Information |
Locus tag | BMA10229_1173 |
Symbol | pip |
ID | 4790465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | - |
Start bp | 1226014 |
End bp | 1227303 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | proline iminopeptidase |
Protein accession | YP_001024976 |
Protein GI | 124382091 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCGC GCGTCGAGCC GGCGCCCGCG CGCGCAAGCA TTCATGCATG CATCCATGCG TCGATGCATT CGTGCGCGCC TTCGGCAGTC GACGCGCCAC GCCGCGCCGC GACGAACGGC GCGCGAGGCG GCCGGGCGGC GGCTCGGCCA CCTCCGGTGC AATTGCGTCC CCGCGTTTTC CGGCGACTCG GCATAATGAA GCGTCGCTTT CGTCGCCGGC GCCGCATCGG CGCGAGCCAA CGCGGCCGGC GCATCGCATG GGGCGCACGC ATGCGCCGCG CGGCCGTTCC ATTCGTCGCG TTCGGCGAGG CGCCCCCAGT CGTCTTCTTC CATTCAACCG GAGCGTCTCT CTTGTATCCA CCGATCGAAC CTTATGCACA CGGCTTCCTC GATACCGGCG ACGGCCATCG CGTGTACTGG GAGCTGTGCG GCAACCCCAA CGGCAAGCCG GCCGTCTTCC TGCACGGCGG CCCCGGCAGC GGCTGCAGCG CCGATCACCG TCGCCTCTTC GATCCCGCGC GCTACAACGT GCTGCTGTTC GACCAACGCG GCTGCGGCCG CTCGACGCCG CACGCGAGCC TCGAGAACAA CACGACATGG CACCTCGTCG ACGACATCGA GCGGCTGCGC GCGATGCTCG GCGTCGAGCG CTGGCTCGTG TTCGGCGGCT CGTGGGGCAG CGCGCTCGCG CTCGCATATG CGCAAACGCA CCCGGCGCGC GTGGCCGAGC TCGTCGTGCG CGGCATCTTC ACGGTGCGCC GGTCCGAGCT GCTCTGGTAC TACCAGGAAG GCGCGTCGTG GCTGTTTCCG GATCTGTGGG AAGACTTCAT CGCGCCCATT CCGAGCGCCG AGCGCGCGGA TCTGATCGCC GCGTATCGCC GCCGGCTGAC GGGCGACGAC GAGGCGGCCA AGCGCGAGGC CGCGCGCGCG TGGAGCGTCT GGGAGGGCCG GACGATCGCG CTGCTGCCGA ACGCCGCGCA CGAAACGTAT TTCGGCGACG CGCATTTCGC GCTCGCGTTC GCCCGCATCG AAAACCACTA CTTCGTTCAT CAAGGTTTCA TGGAAGACGG GCAGTTGCTG CGCGATGCGC ATCGTCTCGC GGACATCCCG GGCGTGATCG TTCAGGGGCG CTACGACGTC GCGACGCCGG CGCGCACCGC GTGGGAACTC GCGAAGGCGT GGCCGCGCGC GTCGCTCGAG ATCGTGCCCG ACGCGGGCCA CGCATACGAC GAGCCGGGCA TTCTGCGCGC GCTGATCGCG GCGACCGACC GCTTCGCGCG CGAGCGCTGA
Protein sequence | MRARVEPAPA RASIHACIHA SMHSCAPSAV DAPRRAATNG ARGGRAAARP PPVQLRPRVF RRLGIMKRRF RRRRRIGASQ RGRRIAWGAR MRRAAVPFVA FGEAPPVVFF HSTGASLLYP PIEPYAHGFL DTGDGHRVYW ELCGNPNGKP AVFLHGGPGS GCSADHRRLF DPARYNVLLF DQRGCGRSTP HASLENNTTW HLVDDIERLR AMLGVERWLV FGGSWGSALA LAYAQTHPAR VAELVVRGIF TVRRSELLWY YQEGASWLFP DLWEDFIAPI PSAERADLIA AYRRRLTGDD EAAKREAARA WSVWEGRTIA LLPNAAHETY FGDAHFALAF ARIENHYFVH QGFMEDGQLL RDAHRLADIP GVIVQGRYDV ATPARTAWEL AKAWPRASLE IVPDAGHAYD EPGILRALIA ATDRFARER
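The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 1226014
END_BP = 1227303
GENE_LENGTH_BP = 1290
PROTEIN_LENGTH_AA = 429


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS
    whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1290
print(expected_protein_length(GENE_LENGTH_BP))  # 429
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the gene sequence ending in the stop codon TGA.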